Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Construction method of cross-syllable Chinese speech synthesis element with spectrum stable boundary

A technology for stabilizing boundaries and speech synthesis. It is applied in speech synthesis, speech analysis, instruments, etc. It can solve the problems of syllable co-pronunciation destruction, unnatural connection, and increase algorithm complexity, and achieve the effect of improving naturalness and coherence.

Inactive Publication Date: 2015-01-28
BEIJING INSTITUTE OF TECHNOLOGYGY
View PDF4 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

For the former, the synthesis primitive is usually a diphone, which generally contains two phonemes, and its boundary is a stable segment in the two phonemes. Although this method also takes into account the influence of co-pronunciation, the structure of diphones leads to There are a large number of splicing points in synthetic speech, which not only increases the complexity of the algorithm, but also easily leads to unnatural cohesion; for the latter, syllables are usually selected as the synthesis primitive, which ensures the internal coherence of syllables, but the synergy between syllables Pronunciation is broken

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Construction method of cross-syllable Chinese speech synthesis element with spectrum stable boundary
  • Construction method of cross-syllable Chinese speech synthesis element with spectrum stable boundary
  • Construction method of cross-syllable Chinese speech synthesis element with spectrum stable boundary

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0033] The present invention will be described in detail below in conjunction with accompanying drawing and embodiment, also described the technical problem and beneficial effect that the technical solution of the present invention solves simultaneously, it should be pointed out that described embodiment is only intended to facilitate the understanding of the present invention, and It has no limiting effect on it.

[0034] The present invention defines that the primitive starts from the central vowel of a syllable and ends with the central vowel of the next syllable adjacent to it. The central vowel here refers to the part of the final part of a syllable that is relatively stable in pronunciation and lasts for a long time, and it usually corresponds to the vowel in the syllable that can be marked with tone. The present invention takes the center of the central vowel as the boundary to construct a kind of cross-syllable primitive, which includes the central vowel of the previou...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a construction method of a cross-syllable Chinese speech synthesis element with a spectrum stable boundary, and belongs to the field of speech processing. Element segmentation on speech flow data starts from the center vowel of a syllable and ends at the center vowel of an adjacent syllable, and a cross-syllable element obtained from the segmentation is composed of two parts, i.e., the part of the center vowel of the final of a previous syllable and thereafter, and the part of the center vowel of the final of a current syllable and therebefore so as to obtain the cross-syllable element. According to the invention, by using such a method, coarticulation in a syllable and between syllables can be maintained, enormous splicing caused by a too short element can be prevented, the naturalness and continuity of synthetic speech are effectively improved, and the timbre performance of the synthetic speech is not affected.

Description

technical field [0001] The invention relates to a definition and construction method of a Chinese speech synthesis primitive, in particular to an automatic construction method of a cross-syllable Chinese speech synthesis primitive with spectrum stability boundaries. It belongs to the field of speech processing. Background technique [0002] The selection of speech synthesis primitives is a crucial link in speech synthesis. Reasonable selection of speech primitives and construction of primitive databases are of great significance to speech synthesis. There is no uniform and absolute evaluation standard for the selection of primitives, and it will be limited by conditions such as language, application field, training data volume, and storage requirements. Common speech synthesis primitives include phonemes, diphones, triphones, demisyllables, syllables, words, etc. (see Taylor, Paul. Text-to-speech synthesis. Cambridge University Press, 2009). The selection of these synthet...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L13/06
Inventor 谢湘焦祎姗
Owner BEIJING INSTITUTE OF TECHNOLOGYGY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products