Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Correction method for Chinese speech synthesis tone

A speech synthesis and tone technology, applied in speech synthesis, speech analysis, instruments, etc., can solve the problems of uneven fundamental frequency, intelligibility and naturalness decline, etc.

Active Publication Date: 2012-06-13
北京宇音天下科技有限公司 +1
View PDF3 Cites 25 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] Since the pitch accuracy of a single syllable plays a vital role in the intelligibility and naturalness of the synthesized speech in Chinese synthesis, the Hidden Markov Model belongs to a segmented model segmented by state. Each segment are independent of each other, causing the fundamental frequency within a syllable to appear uneven, resulting in a significant decline in intelligibility and naturalness

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Correction method for Chinese speech synthesis tone
  • Correction method for Chinese speech synthesis tone
  • Correction method for Chinese speech synthesis tone

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0054] The present invention will be further described below in conjunction with the accompanying drawings and examples, and the steps and processes for realizing the present invention will be better described through a detailed description of each key step of the method in conjunction with the accompanying drawings. It should be pointed out that the described examples are only considered for the purpose of illustration, not limitation of the present invention.

[0055] attached figure 1 It is a schematic diagram of the tone correction method for Chinese speech synthesis proposed by the present invention. The implementation method is written in standard C language, and can be compiled and run under both windows platform and unix platform. in the attached figure 1 In the preferred embodiment of the present invention, the method is divided into two parts: an offline training module 2 and a parametric speech synthesis module 6 . Wherein, the offline training module 2 is not co...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a correction method for Chinese speech synthesis tone. According to the invention, a text analysis module receives optional text information to be synthetized; integral synthesis tagging information is outputted according to the syllable and rhythm hierarchical structure; a parameter voice synthesis module receives the synthesis tagging information of the text analysis module; synthetic voice signal is outputted through a parameter generation method of reftone; an off-line training module is responsible for the training of hidden Markov models; a reftone model is used for generating individual syllabic reference base frequency envelope; and a synthesis parameter model is used for gaining synthetic parameter sequence. The invention can solve the problem that the Chinese speech synthesis middle tone based on the hidden Markov model is unstable, thereby greatly improving the natural degree and rhythm of the synthetic speech.

Description

technical field [0001] The invention designs a parameterized speech synthesis method, and in particular relates to a tone correction method for Chinese speech synthesis. Background technique [0002] The goal of speech synthesis technology is to make electronic devices sound like humans. With the development of speech synthesis technology, the sound quality, naturalness, and intelligence of synthesized voices have been greatly improved, and the most rapid development is speech synthesis technology based on parametric statistical models. Parametric statistical speech synthesis technology based on Hidden Markov Model is a representative of this kind of method. The synthesized sound quality has high coherence and flexibility, and the required resources occupy less space, which has great practicality and research value. This method is divided into two parts, one is the offline model training part, and the other is the online speech synthesis part. In the offline training part...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L13/02
Inventor 那兴宇王朝民谢湘何娅玲
Owner 北京宇音天下科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products