
Rhythm phrase labeling method and device

A prosodic phrase labeling technology, applied to prosodic phrase labeling methods and devices, that addresses the low efficiency of manual labeling, its inability to meet demand, and the low accuracy of the resulting prosodic phrase boundaries.

Pending Publication Date: 2020-10-30
BEIJING SINOVOICE TECH CO LTD

AI Technical Summary

Problems solved by technology

Manually labeling the prosodic phrases of speech data is inefficient, cannot keep up with demand, and is highly subjective, so the prosodic phrase boundaries obtained by manual labeling have low accuracy.

Method used

The prosodic phrase boundaries of the speech data are determined from the PPGs and the fundamental frequency value of each audio frame in the speech data, so the prosodic phrases do not have to be labeled manually.



Embodiment Construction

[0051] Exemplary embodiments of the present invention will be described in more detail below with reference to the accompanying drawings. Although exemplary embodiments of the present invention are shown in the drawings, it should be understood that the invention may be embodied in various forms and should not be limited to the embodiments set forth herein. Rather, these embodiments are provided for more thorough understanding of the present invention and to fully convey the scope of the present invention to those skilled in the art.

[0052] To introduce the present invention more clearly, the related technologies of prosodic labeling are introduced first.

[0053] Speech is sound with social significance produced by the human vocal organs, and voice data is the audio data obtained by sampling and recording speech. In phonetics and phonology, speech can be divided into units at different levels, such as prosodic words, prosodic phrases, prosodic phras...



Abstract

The invention provides a prosodic phrase labeling method and device, and relates to the technical field of speech synthesis. During labeling, the prosodic phrase boundaries of the voice data are determined from the PPGs and the fundamental frequency value of each audio frame in the voice data. This avoids manual labeling of the prosodic phrases, allows the boundaries to be determined quickly, and improves labeling efficiency. Moreover, because the boundaries are determined from the PPGs and fundamental frequency values of the voice data rather than from an annotator's subjective judgment, their accuracy can be improved.
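The abstract does not say how the PPGs and fundamental frequency values are combined, so the following is only a minimal sketch under stated assumptions, not the patented method. It assumes that PPGs are per-frame posterior probability vectors over phonetic classes, that one class index (the hypothetical `silence_index`) stands for silence, and that an unvoiced frame has a fundamental frequency of 0. A run of at least `min_pause_frames` silent, unvoiced frames is treated as a candidate prosodic phrase boundary. The function name, parameters, and threshold are all illustrative.

```python
from typing import List, Sequence, Tuple


def find_phrase_boundaries(
    ppgs: Sequence[Sequence[float]],
    f0: Sequence[float],
    silence_index: int = 0,      # assumption: which PPG class means "silence"
    min_pause_frames: int = 3,   # assumption: shortest pause counted as a boundary
) -> List[Tuple[int, int]]:
    """Return (start, end) frame spans, end exclusive, of candidate boundaries."""
    if len(ppgs) != len(f0):
        raise ValueError("ppgs and f0 must have one entry per audio frame")

    spans: List[Tuple[int, int]] = []
    run_start = None  # first frame of the pause currently being tracked
    for i, (posterior, pitch) in enumerate(zip(ppgs, f0)):
        # Most probable phonetic class for this frame.
        top_class = max(range(len(posterior)), key=posterior.__getitem__)
        # Illustrative rule: silence in the PPG and no voicing in F0.
        is_pause = top_class == silence_index and pitch <= 0.0
        if is_pause and run_start is None:
            run_start = i
        elif not is_pause and run_start is not None:
            if i - run_start >= min_pause_frames:
                spans.append((run_start, i))
            run_start = None
    # Close a pause that runs to the end of the utterance.
    if run_start is not None and len(f0) - run_start >= min_pause_frames:
        spans.append((run_start, len(f0)))
    return spans


# Toy input with two classes: index 0 = silence, index 1 = speech.
toy_ppgs = (
    [[0.1, 0.9]] * 4 + [[0.9, 0.1]] * 3 + [[0.2, 0.8]] * 2
    + [[0.8, 0.2]] * 1 + [[0.1, 0.9]] * 2
)
toy_f0 = [120.0] * 4 + [0.0] * 3 + [110.0] * 2 + [0.0] * 1 + [100.0] * 2
print(find_phrase_boundaries(toy_ppgs, toy_f0))  # [(4, 7)]
```

In the toy input, the three-frame pause at frames 4 to 6 is reported, while the single-frame pause at frame 9 is too short to count. A real system would need tuned thresholds and would probably also use the shape of the fundamental frequency contour, which this sketch ignores.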

Description

Technical field

[0001] The invention relates to the technical field of speech synthesis, in particular to a prosodic phrase labeling method and device.

Background

[0002] As computing performance has grown, speech synthesis technology has increasingly favored waveform concatenation based on a large corpus. How effectively the corpus covers phonetic structures and phonetic units becomes the key to improving the quality of synthesized speech. Detailed prosodic annotation of the speech data in the corpus is the basis for checking the data coverage of the corpus.

[0003] Prosodic labeling is the process of dividing speech data into units at different levels, such as prosodic words, prosodic phrases, prosodic phrases, and intonation phrases, and determining the boundaries of these units in the speech data. Among them, since each prosodic phrase is a sense of hearing with a s...


Application Information

IPC(8): G10L13/08, G10L13/10, G10L13/047
CPC: G10L13/08, G10L13/10, G10L13/047
Inventors: 王愈, 李健, 武卫东
Owner: BEIJING SINOVOICE TECH CO LTD