Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Speech synthesis method and device based on rhythm feature prediction, terminal and medium

A prosodic feature and speech synthesis technology, applied in speech synthesis, speech analysis, instruments, etc., can solve problems such as insufficient speech synthesis effect, insufficient speech effect, and insufficient accuracy of prosody prediction, so as to improve user experience, improve effect, and improve The effect of accuracy

Pending Publication Date: 2020-06-02
UBTECH ROBOTICS CORP LTD
View PDF5 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, there is a certain error between the prosodic feature prediction result obtained through the above scheme and the real prosodic feature, which leads to insufficient accuracy of prosody prediction, resulting in insufficient effect of speech synthesis
[0005] That is to say, in the above speech synthesis scheme, the effect of the synthesized speech is insufficient due to the insufficient accuracy of prosody prediction

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech synthesis method and device based on rhythm feature prediction, terminal and medium
  • Speech synthesis method and device based on rhythm feature prediction, terminal and medium
  • Speech synthesis method and device based on rhythm feature prediction, terminal and medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0051] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0052] figure 1 It is an application environment diagram of a speech synthesis method based on prosodic feature prediction in an embodiment. refer to figure 1 , the speech synthesis method based on prosodic feature prediction can be applied to the speech synthesis system. The speech synthesis system includes a terminal 110 and a server 120 . The terminal 110 and the server 120 are connected through a network. The terminal 110 may specifically be a desktop termin...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a speech synthesis method based on rhythm feature prediction. The speech synthesis method comprises the steps of obtaining a to-be-synthesized text; inputting the to-be-synthesized text into a preset rhythm prediction model, obtaining rhythm features of the to-be-synthesized text as first rhythm features, and determining target rhythm features according to the first rhythmfeatures, the rhythm features of the to-be-synthesized text including rhythm word features, rhythm phrase features and rhythm intonation phrase features; and performing voice synthesis according to the target rhythm feature to generate a target voice corresponding to the to-be-synthesized text. In addition, the invention also discloses a speech synthesis device based on rhythm feature prediction,an intelligent terminal and a computer readable storage medium. By adopting the method and the device, the accuracy of text rhythm feature prediction can be improved, and the speech synthesis effect is improved.

Description

technical field [0001] The present application relates to the technical field of artificial intelligence, and in particular to a speech synthesis method, device, intelligent terminal and computer-readable storage medium based on prosodic feature prediction. Background technique [0002] With the rapid development of mobile Internet and artificial intelligence technology, there are more and more voice synthesis scenarios such as voice broadcast, listening to novels, listening to news, and intelligent interaction. Speech synthesis can convert text, text, etc. into natural speech output. [0003] In the process of speech synthesis, it is necessary to predict the prosody of the text. Prosody affects the naturalness and fluency of pronunciation. A good prosody prediction result will make the synthesized speech more similar to the pauses of human speech, thus making the synthesized speech more natural. [0004] However, in the existing prosody prediction schemes, the training an...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L13/08G10L13/10G10L13/047G10L25/30
CPCG10L13/08G10L13/10G10L13/047G10L25/30G10L2013/083
Inventor 李贤黄东延丁万张皓熊友军
Owner UBTECH ROBOTICS CORP LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products