Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Speech synthesis method, device, system and storage medium

A technology of speech synthesis and speech, applied in speech synthesis, speech analysis, instruments, etc., can solve problems such as difficult to achieve completely satisfactory effects, achieve the effect of improving the quality of speech synthesis and improving user experience

Active Publication Date: 2022-03-15
标贝(青岛)科技有限公司
View PDF11 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Therefore, in some scenarios, high-quality synthesis is still difficult to achieve completely satisfactory results

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech synthesis method, device, system and storage medium
  • Speech synthesis method, device, system and storage medium
  • Speech synthesis method, device, system and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0061] In order to make the objects, technical solutions and advantages of the present invention more apparent, exemplary embodiments according to the present invention will be described in detail below with reference to the accompanying drawings. Apparently, the described embodiments are only some embodiments of the present invention, rather than all embodiments of the present invention, and it should be understood that the present invention is not limited by the exemplary embodiments described here. Based on the embodiments of the present invention described in the present invention, all other embodiments obtained by those skilled in the art without creative effort shall fall within the protection scope of the present invention.

[0062] In order to improve the quality of speech synthesis, the embodiment of the present application trains a text analysis model based on the user's trial listening experience, and then uses the trained text analysis model to perform subsequent sp...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

Embodiments of the present invention provide a speech synthesis method, device, system, and storage medium. The method includes: using a text analysis model to analyze the text to be processed, and converting the text to be processed into a first text containing one or more control elements. Editing text; generating a first voice corresponding to the first editing text; receiving an editing instruction from the user to edit the text to be processed; modifying the control element in the first editing text according to the editing instruction to generate a second editing text; generate a second voice corresponding to the second edited text; receive a confirmation instruction from the user for the second voice; use the text to be processed and the second edited text as training samples to analyze the text The model is trained; the subsequent speech synthesis is performed using the trained text analysis model. The embodiments of the present invention can generate voices more in line with user needs in subsequent voice synthesis, thereby improving user experience and improving the quality of voice synthesis.

Description

technical field [0001] The present invention relates to the technical field of text-to-speech conversion (TTS), and more particularly relates to a speech synthesis method, device, system and storage medium. Background technique [0002] Speech synthesis technology is the process of converting text into speech output. Speech synthesis technology can make the machine sound, which is an important link in the realization of human-computer interaction. With the rapid development of speech technology, high-quality speech synthesis technology solutions such as wavenet and waveglow are constantly emerging, creating conditions for users to obtain high-quality synthetic speech. With the gradual maturity of high-quality sound solutions, synthetic speech can reach a level close to that of real ones, which can greatly speed up the development of new synthetic application scenarios. [0003] However, even with a high-quality speech synthesis system, mistakes in pronunciation, pauses, to...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G10L13/04G10L13/033G10L13/08G10L13/10
CPCG10L13/04G10L13/033G10L13/08G10L13/10
Inventor 李秀林钟彩桂边会康
Owner 标贝(青岛)科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products