Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Speaker recognition method combining Gaussian mixture model and quantum neural network

A Gaussian mixture model and speaker recognition technology, applied in the field of speaker recognition, can solve problems such as affecting the recognition accuracy rate and division error, and achieve the effects of reducing practicability, improving recognition rate, and overcoming training and recognition data.

Inactive Publication Date: 2011-09-28
PLA UNIV OF SCI & TECH
View PDF0 Cites 36 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Due to the randomness of the distribution of the signal in the vector space, there is a certain error in the division of the input feature vector space by the learned neural network, which affects the accuracy of recognition.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speaker recognition method combining Gaussian mixture model and quantum neural network
  • Speaker recognition method combining Gaussian mixture model and quantum neural network
  • Speaker recognition method combining Gaussian mixture model and quantum neural network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0018] Below in conjunction with accompanying drawing and specific embodiment, further illustrate the present invention, should understand that following specific embodiment is only for illustrating the present invention and is not intended to limit the scope of the present invention, after having read the present invention, those skilled in the art will understand the present invention Modifications in various equivalent forms fall within the scope defined by the appended claims of the present application.

[0019] The invention provides a speaker recognition method combining a Gaussian mixture model and a quantum neural network, which is used for automatic identification of the speaker's identity. The realization of the function of the system designed by the method is divided into two stages of training and recognition. In the training phase, first perform parameter processing on the training voice signals one by one, and store the results in the database, and then take out a...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a speaker recognition method combining a Gaussian mixture model and a quantum neural network. The method provided by the invention comprises the following steps: at the training stage, framing input training voice signals, extracting characteristic parameters and generating characteristic parameter vectors; then using a K mean value method and an EM (expectation-maximization) algorithm to obtain the Gaussian mixture model parameters of the characteristic parameter vectors, and finally utilizing the Gaussian mixture model parameters of all the training voice signals to train the quantum neural network; and at the recognition stage, obtaining the Gaussian mixture model parameters of a recognized speaker, then inputting the model parameters into the trained neural network, and obtaining a recognition result. The speaker recognition method is applicable to recognition of the speaker under the condition of less sample data and unbalanced sample data, and simultaneously the capability of the quantum neural network which can carries out effective judgment on voice data with cross data and fuzzy boundary of the speaker is utilized, so that the correct recognition rate of a system can be improved.

Description

technical field [0001] The invention relates to a speaker identification method, in particular to a speaker identification method combining a Gaussian mixture model and a quantum neural network. Background technique [0002] At present, the methods adopted by the speaker recognition system mainly include the following: [0003] 1. A speaker recognition method based on Vector Quantization (VQ). In the training phase, this method first extracts the feature vector from the training speech, and then uses this feature vector to generate a speaker template through clustering; when recognizing, first extracts the feature vector of the speech to be recognized, and then calculates this feature vector and the existing ones in the system in turn. template distance, and select the speaker corresponding to the template with the smallest distance as the result of this recognition. Each template only describes the statistical distribution of the speaker's speech feature vector in the vec...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L17/00G10L17/04G10L17/18
Inventor 王金明张雄伟徐志军王耿
Owner PLA UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products