Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Resonance peak automatic matching method for voiceprint identification

A formant and voiceprint technology, applied in speech analysis, speech recognition, instruments, etc., can solve problems affecting accuracy, similarity score changes, and voiceprint recognition system, so as to avoid analysis deviation and improve processing The effect of improving efficiency and accuracy

Active Publication Date: 2014-04-09
ANHUI IFLYTEK INTELLIGENT SYST
View PDF4 Cites 23 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0012] 3) Since the vocalization of the human vocal tract has a slow-changing characteristic, and the pronunciation of a single phoneme is affected by the front and rear phonemes, the trend of the formant will also change greatly, and the selected materials and samples for manual comparison may be in the same vocalization stage, which affects the accuracy of judgment
[0019] 1) The voiceprint recognition system is greatly affected by the channel, noise and other phonemes. When the channel of the sample and the inspection material are greatly different, the similarity score given by the system will change greatly;
[0020] 2) The voiceprint recognition system can only give a similarity score, and it needs to set a threshold to give a deterministic judgment result of yes or no. It is difficult to set the threshold in actual identification tasks;
[0021] 3) In order to set a more reliable threshold, multiple sample speaker voices that are similar to the channel, noise, and content of the sample voice are required. In practice, it is difficult to obtain multiple sample voices that meet the requirements, making the voiceprint recognition system give The likelihood score is difficult to use as an evaluation reference for identity determination

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Resonance peak automatic matching method for voiceprint identification
  • Resonance peak automatic matching method for voiceprint identification
  • Resonance peak automatic matching method for voiceprint identification

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0051] The present invention will be further described below in conjunction with the accompanying drawings and specific embodiments.

[0052] The principle block diagram of the present invention is as figure 1 As shown, it is mainly composed of a training link and a testing link, and the steps to be implemented are as follows:

[0053] 1) Label the inspection materials and samples with comparison fragments

[0054] 101) Massive speech training to obtain the acoustic model required for phoneme segmentation;

[0055] 102) Select the speech segment to be compared from the inspection material and the sample speech file;

[0056] 2) Segmentation of material and sample voice files at phoneme boundaries

[0057] 201) Extracting the acoustic features required for speech recognition from specific segments of inspection materials and sample speech files;

[0058] 202) Using the FA technology and the acoustic model to perform speech recognition on the acoustic features to obtain the ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a resonance peak automatic matching method for voiceprint identification. The method comprises the following steps that: phoneme boundary positions in an inspection material and a sample in the voiceprint identification can be automatically marked through using continuous speech recognition-based forced alignment (FA) technology; as for identical vowel phoneme segments of the inspection material and the sample, whether a current phoneme is a valid analysable phoneme is automatically judged through using fundamental frequencies, resonance peaks and power spectrum density parameters; and deviation ratios of corresponding resonance peak time-frequency areas can be automatically rendered through using a dynamic time warping (DTW) algorithm and are adopted as analysis basis of final manual voiceprint identification. With the resonance peak automatic matching method for the voiceprint identification of the invention adopted, the boundaries of phonemes can be automatically marked, and whether the pronunciation of the phonemes is valid is judged, and therefore, processing efficiency can be greatly improved; and at the same time, an automatic resonance peak deviation alignment algorithm is performed on effective phoneme pairs, and therefore, the accuracy of resonance peak alignment can be improved.

Description

technical field [0001] The invention relates to the technical field of voiceprint identification, in particular to an automatic formant matching method for voiceprint identification. Background technique [0002] Voiceprint identification technology (see literature [1] Beigi, Homayoon. Voice: Technologies and Algorithms for Biometrics Applications [M]. http: / / ieee-elearning.org / course.2010) is an application of voice in judicial identification. Texture recognition technology (see literature [2] X.D.Huang, A.Acero and H.Hon, Spoken Language Processing, Prentice Hall, 2000 and literature [3] L.Rabiner and B.H.Juang, Fundamentals of speech recognition, Prentice Hall PTR, 1993), which means that the appraiser uses scientific technology or specialized knowledge to compare the speech of the sample with the speech of the inspection material, and draws an appraisal conclusion on whether the speaker of the sample and the speaker of the building material are the same. At present, voi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L25/15G10L15/04
Inventor 柳林李敬阳陈涛胡国平邱志超冯祥张友国胡少云汤蕾蕾汤东梅
Owner ANHUI IFLYTEK INTELLIGENT SYST
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products