Voiceprint recognition method based on RNN

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology of voiceprint recognition and subtraction method, which is applied in speech analysis, instruments, etc., can solve the problems of not considering the context correlation of speech data, not using deep learning to extract features, and features not being able to represent the speaker well, etc. To achieve the effect of improving the recognition performance

Active Publication Date: 2018-02-23

王华锋

View PDF14 Cites 17 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0009] The technical problem solved by the present invention is: overcoming the context correlation of voice data that is not considered in the existing voiceprint recognition method, the extracted features cannot represent the speaker well, and the powerful feature extraction ability of deep learning has not been brought into play And other issues

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0035] figure 1 The overall processing flow of the present invention is given. The present invention provides a method for voiceprint recognition based on RNN. The main steps are as follows: firstly, denoise the input voice data using spectral subtraction, and extract the MFCC characteristic parameters and first and second order differences of the obtained pure voice data by frame , and then concatenate the MFCC parameters and the first and second order differences of multiple consecutive frames to obtain a two-dimensional feature parameter matrix, which is used as the input of the cyclic neural network. The present invention uses the variant LSTM of the cyclic neural network for training, and the LSTM passes through a unique gate mechanism-a forget gate, an input gate and an output gate ( figure 1 The δ in is the sigmoid activation function, tanh is the hyperbolic tangent activation function) to retain the long-term information of the sequence, and thus complete the training...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention provides a voiceprint recognition method based on RNN. According to the voiceprint recognition method based on RNN, after MFCC features of denoised voice data and first-order and second-order differences are obtained, the advanced features of a speaker in the MFCC features are extracted by using a recurrent neural network, the extracted features are classified by using a softmax classifier, and finally, the speaker is recognized by using a naive bayes method. Different from mute elimination of a traditional method, the method provided by the invention has the advantages that a mute section in the voice data is reserved, the features related to the context can be extracted on the basis of the recurrent neural network, the advanced features of the voice, such as the speaking mode and the rhythm of the speaker can be extracted aiming at the voice data, so that the feature information is complete, and the speaker can be well represented. Compared with an existing voiceprint recognition method based on Gauss, the method provided by the invention has the advantages that the requirements for voice data are relatively low, the accuracy is higher, the accuracy is still kept ata very high level even when big data is aimed, and the running speed is not reduced.

Description

technical field [0001] The invention provides an RNN-based voiceprint recognition method, which relates to the fields of deep learning, pattern recognition, and voice signal processing. Background technique [0002] With the rapid development of information technology, how to accurately authenticate a person's identity, protect personal privacy and ensure information security has become an urgent problem to be solved. Compared with traditional identity authentication methods, biometric identity authentication technology has the characteristics that it will not be lost, stolen or forgotten during use; it is not only fast, convenient, but also accurate and reliable. As one of the most popular biometric identification technologies at present, voiceprint recognition has unique advantages in remote authentication and other application fields, and has attracted more and more attention. WeChat has enabled the voice lock verification login method, the world's first voice lock Lenov...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G10L17/02G10L17/04G10L17/18G10L17/14G10L25/24G10L25/18G10L21/0208

CPCG10L17/02G10L17/04G10L17/14G10L17/18G10L21/0208G10L25/18G10L25/24

Inventor 冯毅夫王华锋徐雷杜俊逸付明霞马晨南齐一凡潘海侠

Owner 王华锋

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Voiceprint recognition method based on RNN

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology