Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Voice annotation quality consistency detection method

A technology for voice tagging and detection methods, applied in voice analysis, instruments, etc., can solve problems such as lack of consistency detection methods, and achieve the effect of wide application, flexible application, and increased T0

Inactive Publication Date: 2015-07-22
BEIHANG UNIV
View PDF7 Cites 14 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

At present, voice annotation at home and abroad is carried out according to their own research needs, and the consistency detection method is even more lacking. There is no unified annotation standard and detection method.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice annotation quality consistency detection method
  • Voice annotation quality consistency detection method
  • Voice annotation quality consistency detection method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0022] In order to make the purpose, technical solution and advantages of the present invention clearer, a method for checking the consistency of voice annotation quality of the present invention will be further described in detail in conjunction with the accompanying drawings, taking a annotation file containing six layers of annotation information as an example. .

[0023] According to the marked documents, from top to bottom are: the first layer is the syllable layer (PY layer), the second layer is the consonant layer (SY layer), the third layer is the unvoiced sound mute voiced sound layer (SUV layer), the fourth The first layer is the paralinguistic information layer (PARAL layer), the fifth layer is the emotional layer (EMO layer), and the sixth layer is the stress index layer (ST layer). The following describes the specific calculation methods for the consistency of each layer:

[0024] 1) PY layer, SY layer. The marked content of the PY layer is regular syllables and...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a voice annotation quality consistency detection method. The voice annotation quality consistency detection method comprises the following steps of (1) annotating a file to be in a TextGrid format, dividing annotation forms into three types and designing three corresponding consistency detection formulas according to the three different annotation forms; (2) detecting consistency of a file comprising six layers of annotation information, wherein the six layers of information respectively is a polysyllable layer (PY layer), an initial consonant and simple vowel layer (SY layer), an unvoiced sound and voice sound mute layer (SUV layer), an auxiliary language information layer (PARAL layer), an emotion layer (EMO layer), a stress index layer (ST layer); (3) selecting the corresponding consistency detection formulas according to the annotation forms of the annotation layer; (4) setting the time error T0, enabling annotation results to be consistent if annotation contents of two annotator are identical and the time error or less than and equal to T0, enabling the annotation results to be inconsistent and sequentially calculating the consistency of the layers. The voice annotation quality consistency detection method provides basis for speech database annotation quality detection and is effectively for detection of automatic voice annotation quality and manual annotation quality.

Description

technical field [0001] The invention relates to a consistency detection method for checking the quality of voice annotations. The method can compare two marked files to check the consistency of the marking results, and belongs to the field of voice signal processing. Background technique [0002] Speech annotation means that the annotator divides and annotates the speech according to certain annotation rules, including syllables, initials, finals, unvoiced sounds, voiced sounds, mutes, paralinguistic information, stress information, etc. Consistency detection means that different voice annotators annotate the same voice, then compare the annotated results, observe and analyze the sameness and difference of the annotated results. The consistency of manual voice annotation is an important index to evaluate the voice quality and the completeness of the annotation system. In order to ensure the labeling quality of the emotional voice database and to check the integrity of the l...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L25/60
Inventor 毛峡景少玲陈立江王岚张娜娜
Owner BEIHANG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products