Speech synthetic text processing method based on rhythm structure

A text processing technology for speech synthesis, applied in speech synthesis, speech analysis, instruments, etc. It addresses the problem that synthesized speech has low naturalness and cannot reach a level acceptable to users, and it achieves the effect of simplifying prosody control.

Status: Inactive. Publication date: 2007-07-18
HEILONGJIANG UNIV
0 cites, 54 cited by
AI Technical Summary

Problems solved by technology

Speech synthesis methods that segment text by character or by grammatical word produce synthesized speech with relatively low naturalness and an overly strong "machine smell". Regarding the large-scale market entry of this technology, speech synthesis and its prosody control have the following problems: ① the naturalness of continuous synthetic speech needs further improvement; ② the text analysis process should reflect the rhythm changes of natural speech, to enrich the expressiveness of synthetic speech; ③ the prosody control process of speech synthesis should conform to the prosody of natural speech.

Embodiment Construction

[0042] The present invention is described in further detail below with reference to the accompanying drawing and specific embodiments:

[0043] In conjunction with Fig. 1, the present invention includes the following computer-implementable steps:

[0044] The text regularization step converts the input text string into a legal pronunciation string according to the preset special symbol table, and outputs the legal pronunciation string to the prosodic structure analysis step;
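The text regularization step in [0044] can be pictured as a table lookup. The following is a minimal sketch under stated assumptions: the patent's actual special symbol table is not shown in this excerpt, so `SPECIAL_SYMBOL_TABLE`, its entries, and the function name `normalize_text` are hypothetical, and digits are simply read one by one.

```python
# Hypothetical special symbol table (not taken from the patent): each symbol
# that has no direct pronunciation maps to a pronounceable Chinese string.
SPECIAL_SYMBOL_TABLE = {
    "+": "加", "=": "等于", "℃": "摄氏度",
    "0": "零", "1": "一", "2": "二", "3": "三", "4": "四",
    "5": "五", "6": "六", "7": "七", "8": "八", "9": "九",
}


def normalize_text(text, table=SPECIAL_SYMBOL_TABLE):
    """Replace every character found in the table; keep all others unchanged."""
    return "".join(table.get(ch, ch) for ch in text)


if __name__ == "__main__":
    print(normalize_text("1+2=3"))
```

A real table would also need context-dependent readings (for example, multi-digit numbers or dates), which this character-by-character sketch does not attempt.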

[0045] In the prosodic structure analysis step, the received legal pronunciation string is sent to the prosodic structure analysis module, which annotates it with prosodic structure information according to preset word segmentation rules and prosodic structure analysis rules, and outputs the annotated string to the linguistic processing step;
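One way to picture the prosodic structure analysis step in [0045] is segmentation followed by regrouping into prosodic words. This is a sketch only: the excerpt does not give the patent's segmentation or prosodic rules, so the lexicon, the forward maximum matching strategy, the `ENCLITICS` set, the three-syllable limit, and the `|` boundary marker are all illustrative assumptions.

```python
# Hypothetical lexicon and rule data; none of this comes from the patent text.
LEXICON = {"我们", "祖国", "美丽"}
ENCLITICS = {"的", "了", "吗"}  # assumed to attach to the preceding word


def segment(text, lexicon=LEXICON, max_len=4):
    """Forward maximum matching: take the longest lexicon entry at each position."""
    words, i = [], 0
    while i < len(text):
        for n in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + n]
            if n == 1 or candidate in lexicon:  # single characters always match
                words.append(candidate)
                i += n
                break
    return words


def group_prosodic_words(words, max_syllables=3):
    """Merge an enclitic into the previous unit if the result stays short enough.

    Each Chinese character is one syllable, so len() counts syllables here.
    """
    groups = []
    for word in words:
        if groups and word in ENCLITICS and len(groups[-1]) + len(word) <= max_syllables:
            groups[-1] += word
        else:
            groups.append(word)
    return groups


def annotate(text):
    """Return the string with '|' marking assumed prosodic word boundaries."""
    return "|".join(group_prosodic_words(segment(text)))


if __name__ == "__main__":
    print(annotate("我们的祖国很美丽"))
```

The point of the regrouping pass is that the grammatical words "我们" and "的" end up in one prosodic unit, which matches the document's claim that speakers segment by prosodic word rather than by grammatical word.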

[0046] In the linguistic processing step, the received tagged character strings a...

Abstract

A text processing method for speech synthesis based on prosodic structure comprises: converting the input text, by lookup in a preset special symbol table, into a legal pronunciation string; annotating the legal pronunciation string according to word segmentation rules and prosodic structure analysis rules, to output a tagged string carrying prosodic structure information; and comparing the tagged string, word by word, against preset prosody rules and a phonetic table, to output a phonetic code string annotated with prosodic information.
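The last stage named in the abstract, the word-by-word comparison against prosody rules and a phonetic table, might look like the sketch below. Everything specific here is an assumption: the pinyin-with-tone-digit encoding, the tiny `PHONETIC_TABLE`, and the use of Mandarin third-tone sandhi as the sample prosody rule are chosen for illustration and are not stated in the patent excerpt.

```python
# Hypothetical phonetic table: character -> pinyin with a trailing tone digit.
PHONETIC_TABLE = {"你": "ni3", "好": "hao3", "世": "shi4", "界": "jie4"}


def apply_tone_sandhi(syllables):
    """Simplified third-tone sandhi inside one prosodic word.

    A third tone followed by another third tone is changed to a second tone.
    This left-to-right pass ignores the finer grouping rules of real Mandarin.
    """
    out = list(syllables)
    for i in range(len(out) - 1):
        if out[i].endswith("3") and out[i + 1].endswith("3"):
            out[i] = out[i][:-1] + "2"
    return out


def to_phonetic_codes(tagged, table=PHONETIC_TABLE, boundary="|"):
    """Convert a boundary-tagged string to phonetic codes, one prosodic word at a time.

    Raises KeyError for characters missing from the table.
    """
    coded_words = []
    for word in tagged.split(boundary):
        syllables = [table[ch] for ch in word]
        coded_words.append(" ".join(apply_tone_sandhi(syllables)))
    return " | ".join(coded_words)


if __name__ == "__main__":
    print(to_phonetic_codes("你好|世界"))
```

Applying the rule per prosodic word, rather than across the whole sentence, is the design choice that ties this stage to the boundaries produced by the earlier analysis step.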

Description

(1) Technical field

[0001] The invention relates to the technical field of speech signal processing, in particular to a prosodic structure-based text processing method for speech synthesis.

(2) Background technology

[0002] Existing Chinese speech synthesis methods perform either character-to-sound conversion, using the character as the segmentation unit, or phrase-based text-to-speech conversion, using the grammatical word as the segmentation unit. In fact, when people speak, they use neither characters nor grammatical words as the segmentation unit, but prosodic words. Both of these methods lead to relatively low naturalness in the synthesized speech output by computers and speech synthesis devices, and the "machine smell" is too strong. Regarding the large-scale market entry of this technology, speech synthesis and its prosody control have the following problems: ① The naturalness of continuous synthetic spee...

Application Information

Patent type & authority: Applications (China)
IPC(8): G10L13/00, G10L13/08, G10L13/02
Inventors: 张鹏, 王丽红
Owner: HEILONGJIANG UNIV