Speech synthesis method and device, equipment and storage medium

A speech synthesis technology, applied in speech synthesis, speech analysis, speech recognition, etc. It addresses problems such as error-prone synthetic speech and poor synthesis quality, and improves speech synthesis quality by enriching the reference information available during synthesis.

Pending Publication Date: 2021-05-14
IFLYTEK CO LTD

AI Technical Summary

Problems solved by technology

[0004] Existing speech synthesis schemes refer only to the original text during synthesis, which makes the synthetic speech error-prone and the synthesis quality poor.



Examples


Detailed description of embodiments

[0099] Next, with reference to Figure 1, the speech synthesis method of the present application may comprise the following steps:

[0100] Step S100, obtaining the original text to be synthesized.

[0101] Specifically, the original text is the text that is to be synthesized into speech. It may be provided by the user, or by other devices or applications that need text synthesized into speech.

[0102] Step S110, acquiring auxiliary synthesis features corresponding to a matching text, where the matching text contains a text fragment that matches the original text.

[0103] Here, the matching text can be text that matches the whole original text or a text fragment of the original text. For example, if the original text is "these pants are not discounted", the matching text can be "these pants are not discounted" or "discount". In addition, the matching text may also be text that contains a text fragment matching a text fragment in the original text. Still taking the ...
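The three matching cases described in [0103] can be illustrated with a small sketch. This is not the patent's matching procedure, which is not shown in this excerpt: the function names, the `min_len` threshold, the use of the longest shared substring as the matching criterion, and the third candidate sentence are all assumptions made for illustration.

```python
from difflib import SequenceMatcher


def shared_fragment(original: str, candidate: str) -> str:
    """Return the longest contiguous fragment that appears in both texts."""
    matcher = SequenceMatcher(None, original, candidate, autojunk=False)
    match = matcher.find_longest_match(0, len(original), 0, len(candidate))
    return original[match.a:match.a + match.size]


def is_matching_text(original: str, candidate: str, min_len: int = 4) -> bool:
    """Treat `candidate` as a matching text if it shares a long enough fragment.

    `min_len` is a hypothetical threshold that filters out trivial overlaps
    such as single letters or spaces.
    """
    return len(shared_fragment(original, candidate)) >= min_len


original = "these pants are not discounted"
candidates = [
    "these pants are not discounted",  # matches the whole original text
    "discount",                        # matches a fragment of the original text
    "is there a discount today",       # hypothetical: contains a matching fragment
    "good morning",                    # unrelated text
]
matching_texts = [c for c in candidates if is_matching_text(original, c)]
```

A real system would more likely match on word or phoneme units than on raw characters; character-level matching is used here only to keep the sketch self-contained.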


Abstract

The invention discloses a speech synthesis method, device, equipment and storage medium. When performing speech synthesis on an original text to be synthesized, the method refers to auxiliary synthesis features corresponding to a matching text, that is, a text containing a text fragment that matches the original text. The auxiliary synthesis features are determined from the pronunciation audio corresponding to the matching text and are used to assist speech synthesis. By referring to them, the pronunciation information in that audio can assist speech synthesis of the original text, which enriches the reference information available during synthesis and improves the quality of the synthesized speech. The method is suitable for speech synthesis systems both with and without front-end preprocessing: the auxiliary synthesis features can serve as front-end text analysis results, or can directly assist the speech synthesis system in synthesis.
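The two usage modes named in the abstract (systems with and without front-end preprocessing) can be sketched roughly as follows. Everything here is hypothetical: the `AuxFeatures` type, the choice of a phoneme sequence as the feature, the `build_model_input` helper, and the simplification that the auxiliary features replace the front-end result only when the matching text equals the whole original text. The abstract does not specify any of these details.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional


@dataclass
class AuxFeatures:
    """Hypothetical container for auxiliary synthesis features."""
    matching_text: str   # text whose pronunciation audio was analysed
    phonemes: List[str]  # assumed feature type: phonemes derived from that audio


def build_model_input(
    original_text: str,
    aux: Optional[AuxFeatures],
    front_end: Optional[Callable[[str], List[str]]] = None,
) -> Dict[str, object]:
    """Assemble the inputs handed to a (not shown) acoustic model."""
    if front_end is not None:
        # System with front-end preprocessing: the auxiliary features stand in
        # for the front-end text analysis result when they cover the text.
        analysis = front_end(original_text)
        if aux is not None and aux.matching_text == original_text:
            analysis = aux.phonemes
        return {"phonemes": analysis}
    # System without front-end preprocessing: the raw text and the auxiliary
    # features are both passed on, so the features assist synthesis directly.
    return {
        "text": original_text,
        "aux_phonemes": aux.phonemes if aux is not None else None,
    }


def toy_front_end(text: str) -> List[str]:
    """Placeholder text analysis: one pseudo-phoneme per word."""
    return text.upper().split()


aux = AuxFeatures("not discounted", ["N", "AA", "T", "D", "IH", "S"])
with_front_end = build_model_input("not discounted", aux, toy_front_end)
without_front_end = build_model_input("not discounted", aux)
```

In both branches the extra pronunciation information reaches the synthesizer. The difference is only whether it enters as a replacement for text analysis or as an additional input alongside the raw text.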

Description

Technical field

[0001] The present application relates to the technical field of speech processing, and more specifically to a speech synthesis method, device, equipment and storage medium.

Background

[0002] In recent years, with the development of information technology and the rise of artificial intelligence, human-computer interaction has become more and more important. Speech synthesis is a focus of human-computer interaction research at home and abroad. Speech synthesis is the process of converting an input original text to be synthesized into speech output.

[0003] The traditional speech synthesis model is generally based on an end-to-end speech synthesis scheme: the training text and the corresponding speech data or waveform data are used directly to train the speech synthesis model, and the trained model then works from the input original text to be synthesized. That is, synthesized speech or waveform data can be ...


Application Information

IPC(8): G10L13/02, G10L13/08, G10L15/16
CPC: G10L13/02, G10L13/08, G10L15/16
Inventor 周良, 孟廷, 侯秋侠, 刘丹, 江源, 胡亚军
Owner IFLYTEK CO LTD