Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Speech synthetic method based on rhythm character

A prosody feature and speech synthesis technology, which is applied in speech synthesis, speech analysis, instruments, etc., can solve the complex prosody model and prosody control, cannot meet the requirements of speech synthesis prosodic features and its prosody control, and cannot meet the requirements acceptable to users. degree, etc.

Inactive Publication Date: 2007-07-18
HEILONGJIANG UNIV
View PDF0 Cites 67 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The first two speech synthesis methods will lead to relatively low naturalness of the synthesized speech output by the computer and the speech synthesis device, and the "machine smell" is too strong. This technology enters the market on a large scale
The reason is that speech synthesis and its prosody control have the following problems: ① The naturalness of continuous synthetic speech needs to be further improved; ② The text analysis process should be able to reflect the rhythm changes in natural speech to enrich the expressiveness of synthetic speech; ③ The prosody control process of speech synthesis should conform to the prosody of natural speech
In practice, it is found that the prosodic model and prosodic control of Chinese are extremely complex, and the primitive selection method based on one method cannot meet the requirements of speech synthesis for prosodic features and prosodic control in the current situation.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech synthetic method based on rhythm character
  • Speech synthetic method based on rhythm character
  • Speech synthetic method based on rhythm character

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0115] 1. Text processing program

[0116] 1.1. Text regularization

[0117] In conjunction with Figures 2-5, the present invention processes the input text through a text regularization step, with the purpose of correcting information with special symbols such as dates, numbers, weather forecasts, and house numbers in the input text according to the correct reading method. Enter text to mark; for example: date "2000-12-12" is marked as "December 12, 2000", "minimum temperature at night -12°C" is marked as "minimum temperature at night -12°C", etc. The output of the text regularization device is a legal pronunciation character sequence, as shown in Table 1.

[0118] Table 1 Relationship between special symbols and input text

[0119]

character type

input character format

special symbol reading

Character order of legal pronunciation

List

date

2000-12-12

The first "-" is read as "year"

The second "-" is ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

A method for synthesizing voice based on rhythm character includes text processing program formed by text standardizing step, rhythm structure analysis step and language treatment step, synthetic element selecting program formed by element confirming step, matching step, pasting-up step, optimizing and screening step; voice synthesization processing program formed by base frequency outline generating step of phrase unit, base frequency outline generating step of syllable unit and intonation superposing step.

Description

(1) Technical field [0001] The invention relates to the technical field of speech signal processing, in particular to a speech synthesis method based on prosodic features in the speech synthesis technology. (2) Background technology [0002] The existing Chinese speech synthesis method is a word-to-sound conversion using a character as a segmentation unit, or a phrase-based text-to-speech conversion using a grammatical word as a segmentation unit. In fact, when people speak, they do not use words or grammatical words as the segmentation unit, but use prosodic words as the segmentation unit. The first two speech synthesis methods will lead to relatively low naturalness of the synthesized speech output by the computer and the speech synthesis device, and the "machine smell" is too strong. This technology has entered the market on a large scale. The reason is that speech synthesis and its prosody control have the following problems: ① The naturalness of continuous synthetic s...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L13/00G10L13/02G10L13/04G10L13/08
Inventor 张鹏王丽红
Owner HEILONGJIANG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products