Speech synthesis system based on mixed hidden Markov model

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A hidden Markov, speech synthesis technology, applied in speech synthesis, speech analysis, instruments, etc., can solve problems such as time domain over-flatness and inability to describe

Inactive Publication Date: 2009-07-01

INST OF AUTOMATION CHINESE ACAD OF SCI

View PDF0 Cites 22 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

If a certain state lasts for a long time, only relying on the mean value of the Gaussian function corresponding to the state cannot describe the details of the speech parameter changes in the state, which causes a serious time-domain over-smoothing problem

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0037] The present invention will be further described below with reference to the accompanying drawings and examples, and the steps and processes for realizing the present invention will be better described through the detailed description of each component of the system with reference to the accompanying drawings. It should be noted that the described examples are to be considered for illustrative purposes only and are not intended to limit the invention.

[0038] figure 1 It is a schematic diagram of the speech synthesis system based on the hybrid hidden Markov model of the present invention. The system is written in C language, and can be compiled and run using visual studio under the windows platform, and can be compiled and run under the linux platform using gcc. in the attached figure 1 In a preferred embodiment of the present invention, the system is divided into four parts: a spectrum information generation module 1, a fundamental frequency information generation mod...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention relates to a voice synthesis system based on a mixing hidden Markov model, wherein a frequency spectrum information generating module receives any text information, selects the codebook vector which represents frequency spectrum information and outputs the frequency spectrum information, a base frequency information generating module receives the text information, takes charge of predicting the pitch change of a to-be synthetic sentence and outputs a base frequency curve, a parameter voice synthesizer module receives the frequency spectrum information of the frequency spectrum information generating module and the base frequency information of the base frequency information generating module, outputs the synthesized voice results, an off-line training module takes charge of training various hidden Markov models, a discrete hidden Markov model obtains the output probability of the real frequency spectrum vector, guarantees the accuracy of the frequency spectrum information, and the frequency spectrum guaranteed by the codebook choosing arithmetic can not generate the oversmoothing phenomenon of time-domain. Using the system to improve the articulation of the output voice of the parameter voice synthesis system, the fidelity of the output voice is greatly improved, which is almost close to the voice quality based on a splicing voice synthesis system.

Description

technical field [0001] The present invention relates to a speech synthesis system, in particular to a speech synthesis system based on a hybrid hidden Markov model. Background technique [0002] Speech synthesis system, also known as text-to-speech conversion system (TTS system), its main function is to convert any text string received or input by the computer into speech output. The traditional speech synthesis system is based on unit splicing, and its sound quality is good, but the required sound library resources are relatively large, which causes its application in embedded devices to encounter bottlenecks. The speech synthesis system based on Hidden Markov Model is essentially a parametric synthesis system, which has the advantages of high flexibility and small storage resources. However, due to the nature of its parameterization, its sound quality performance is usually much worse than that of the splicing-based synthesis system, which is also the bottleneck of the cu...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G10L13/02G10L13/06G10L13/08G10L13/027

Inventor 陶建华于剑张蒙

Owner INST OF AUTOMATION CHINESE ACAD OF SCI

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Speech synthesis system based on mixed hidden Markov model

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology