Vocal organ visible speech synthesis system

A technology for vocal organs and speech synthesis, applied in speech synthesis, speech analysis, instruments, etc., can solve the problems of difficult mapping relationship, high computational complexity, no real-time performance, etc., achieving high conversion sensitivity, small calculation amount, and rich details. Effect

Active Publication Date: 2012-12-12
中科极限元(杭州)智能科技股份有限公司
View PDF3 Cites 11 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] Among the existing voice-driven articulation organ movement technologies, one is based on a large number of voices and corresponding motion databases, according to the input voice, with the help of data retrieval and matching technology to find the most suitable motion to drive the computer model or mechanical model motion , the synthesis effect produced by this kind of method is realistic, but there are many organs involved in the pronunciation process, it is difficult to use a unified method to describe the mapping relationship between the movement of different organs and speech; the other is to establish a biophysical model of the pronunciation organ, and analyze the Physiological changes over time, driving model movement, such methods are usually computationally complex and do not have good real-time performance

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Vocal organ visible speech synthesis system
  • Vocal organ visible speech synthesis system
  • Vocal organ visible speech synthesis system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0028] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be described in further detail below in conjunction with specific embodiments and with reference to the accompanying drawings.

[0029] It should be noted that, in the drawings or descriptions of the specification, similar or identical parts all use the same figure numbers. And in the accompanying drawings, it is marked for simplification or convenience. Furthermore, implementations not shown or described in the accompanying drawings are forms known to those of ordinary skill in the art. Additionally, while illustrations of parameters including particular values ​​may be provided herein, it should be understood that the parameters need not be exactly equal to the corresponding values, but rather may approximate the corresponding values ​​within acceptable error margins or design constraints.

[0030] In the visual speech synthesis system of the vo...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a vocal organ visible speech synthesis system which comprises a voice frequency analysis module, a parameter mapping module, an animation drive module and a motion analysis module; wherein, the voice frequency analysis module is used for receiving the input speech signal of a speaker, judging a mute section according to energy information, coding non-mute section of speech and outputting a speech line spectrum pair parameter; the parameter mapping module is used for receiving the speech line spectrum pair parameter transmitted in real time from the voice frequency analysis module, converting the speech line spectrum pair parameter into a model motion parameter by using the trained Gaussian mixture model; the animation drive module is used for receiving the model motion parameter generated in real time by the parameter mapping module, driving the motion of key points of a virtual vocal organ model so as to drive the motion of the whole virtual vocal organ model. According to the vocal organ visible speech synthesis system, the motion of the model is driven by the corresponding motion parameter generated directly by a frequency domain parameter of the input speech, and therefore, the vocal organ visible speech synthesis system has the advantage of being free from limitations of an online database and a physiological model.

Description

technical field [0001] The invention relates to the technical field of simulated reality in the information technology industry, in particular to a visual speech synthesis system for articulation organs. Background technique [0002] Visual speech synthesis technology is an important part of human-computer interaction technology, and it is also a technology that people have been paying attention to. The visualization of pronunciation organs is an important part of visual speech synthesis technology. It can process and analyze a person's voice to generate Corresponding motion parameters of the human articulation organ during pronunciation, and drive the motion of the graphics model. Its research results are of great significance to the fields of human-computer speech interaction, speech teaching, and treatment of articulatory disorders. [0003] Among the existing voice-driven articulation organ movement technologies, one is based on a large number of voices and correspondin...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L13/00
Inventor 陶建华杨明浩李昊刘斌
Owner 中科极限元(杭州)智能科技股份有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products