Residual error and batch normalization-based neural network noisy vocal print identification method

A neural network and recognition method technology, applied in the field of voiceprint recognition, can solve problems such as network gradient disappearance, poor robustness, degradation, etc., to improve the ability of network generalization, reduce information loss, and voiceprint recognition rate Improved effect

Pending Publication Date: 2021-06-18
GUIZHOU NORMAL UNIVERSITY
View PDF9 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The performance of the TDNN algorithm is stronger than that of I-Vector in all aspects, but when there is strong noise interference in the recognition environment, the robustness effect is not good, and as the network depth increases, the network is prone to gradient disappearance and degradation.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Residual error and batch normalization-based neural network noisy vocal print identification method
  • Residual error and batch normalization-based neural network noisy vocal print identification method
  • Residual error and batch normalization-based neural network noisy vocal print identification method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0014] Below in conjunction with the accompanying drawings and preferred embodiments, the specific implementation, structure, features and efficacy of a neural network noise pattern recognition method based on residual and batch normalization proposed according to the present invention will be described in detail as follows .

[0015] see figure 1 , a kind of neural network noise pattern recognition method based on residual error and batch normalization of the present invention, comprises the following steps:

[0016] (1) Data preparation: Randomly add reverberation, noise, and music into the data set to obtain a data set in a noisy environment, and perform data preprocessing, noise addition, mute deletion, and less than The process of 5s speech, extracts corresponding Mel cepstral coefficient respectively to described speech signal;

[0017] (2) Build a ResTDNN network:

[0018] Modify the input layer of the residual neural network so that it can correct the output layer. ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a neural network noisy vocal print recognition method based on residual error and batch normalization, which comprises the following steps: preparing a data set in a noisy environment, and respectively extracting corresponding Mel-frequency cepstral coefficients from voice signals; constructing a ResTDNN network structure: after the construction is completed, training the network by using a natural gradient random descent method, after the training is completed, capturing vocal print information of a speaker by using voice information about 10 seconds through x-vector extracted by dimension reduction, and adopting a cross entropy loss function; and performing dimension reduction by using linear discriminant analysis, performing length normalization on an x-vector after the dimension reduction, training linear probability analysis, respectively calculating likelihood functions of two voices from different spaces, and evaluating a recognition system by adopting equal error rate and minimum detection cost. According to the method, the robustness of the time delay neural network in the noise environment can be improved, the neural network degeneration and gradient disappearance phenomenon requirements are relieved, and the vocal print recognition rate is improved.

Description

technical field [0001] The invention belongs to the field of voiceprint recognition, and in particular relates to a neural network noise pattern recognition method based on residual error and batch normalization. Background technique [0002] Most of the traditional voiceprint recognition technologies are based on the identity factor (Identity Vector, I-Vector), but the modeling ability of this method needs to be optimized. In recent years, the use of deep neural network (DNN) to capture the speaker's speech features is a big boom, but this method increases the computational complexity while meeting the training requirements. Using time-delayed neural network (Time Delayed Neural Network, TDNN) embedding, and using a deep neural network to train the speaker recognizer to extract speaker information can effectively improve DNN. The performance of the TDNN algorithm is stronger than that of I-Vector in all aspects, but when there is strong noise interference in the recognitio...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L17/18G10L17/04G10L17/02G10L25/24
CPCG10L17/18G10L17/04G10L17/02G10L25/24
Inventor 杨乘雷涪茸罗娅娅张旺余萍王晓慧施香怡
Owner GUIZHOU NORMAL UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products