Speaker recognition method based on Gaussian mixture model embedded with time delay neural network
A speaker recognition technology based on a Gaussian mixture model, applied in the field of speaker recognition
Embodiment Construction
[0069] The technical solutions of the present invention will be further described below in conjunction with the drawings and embodiments.
[0070] Figure 1 shows the training and recognition model for speaker recognition with an embedded time delay neural network (TDNN). It differs from the baseline GMM model (in which only a GMM is used for speaker recognition) in both training and recognition.
[0071] 1. Preprocessing and feature extraction
[0072] First, silence is detected with a method based on energy and zero-crossing rate, and noise is removed by spectral subtraction. The signal is then pre-emphasized with the filter $f(z) = 1 - 0.97z^{-1}$ and divided into frames using a Hamming window with a length of 20 ms and a window shift of 10 ms. A 20th-order linear prediction (LPC) analysis is performed on each frame, and the 13th-order cepstral coefficients are derived from the 20th-order LPC coefficients to form the feature vector for speaker recognition.
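The feature extraction chain described above can be sketched in Python with NumPy. This is a minimal illustration under stated assumptions, not the patent's implementation: the sampling rate (8 kHz here) is not given in the text, silence detection and spectral subtraction are omitted, LPC is computed by the autocorrelation method with the Levinson-Durbin recursion (the text does not say which method is used), and the 13 cepstral coefficients are taken as c_1 to c_13 via the standard LPC-to-cepstrum recursion. The function names `pre_emphasize`, `lpc`, `lpc_to_cepstrum`, and `extract_lpcc` are hypothetical.

```python
import numpy as np


def pre_emphasize(x, coeff=0.97):
    """Apply f(z) = 1 - coeff * z^-1, i.e. y[n] = x[n] - coeff * x[n-1]."""
    x = np.asarray(x, dtype=float)
    return np.append(x[0], x[1:] - coeff * x[:-1])


def lpc(frame, order):
    """Predictor coefficients a[1..order] for x[n] ~ sum_k a_k x[n-k].

    Assumption: autocorrelation method solved with Levinson-Durbin.
    """
    n = len(frame)
    r = np.correlate(frame, frame, mode="full")[n - 1 : n + order]
    a = np.zeros(order + 1)  # a[0] is unused
    err = r[0]
    for i in range(1, order + 1):
        if err <= 1e-12:  # silent or perfectly predictable frame
            break
        k = (r[i] - np.dot(a[1:i], r[i - 1 : 0 : -1])) / err
        a_new = a.copy()
        a_new[i] = k
        a_new[1:i] = a[1:i] - k * a[i - 1 : 0 : -1]
        a = a_new
        err *= 1.0 - k * k
    return a[1:]


def lpc_to_cepstrum(a, n_ceps):
    """Cepstrum c_1..c_n_ceps of the all-pole model 1 / (1 - sum_k a_k z^-k)."""
    p = len(a)
    c = np.zeros(n_ceps)
    for n in range(1, n_ceps + 1):
        acc = a[n - 1] if n <= p else 0.0
        for k in range(1, n):
            if n - k <= p:
                acc += (k / n) * c[k - 1] * a[n - k - 1]
        c[n - 1] = acc
    return c


def extract_lpcc(signal, sample_rate=8000, frame_ms=20, shift_ms=10,
                 lpc_order=20, n_ceps=13):
    """Return an array of shape (n_frames, n_ceps) of LPC cepstral features."""
    frame_len = int(sample_rate * frame_ms / 1000)
    shift = int(sample_rate * shift_ms / 1000)
    y = pre_emphasize(signal)
    window = np.hamming(frame_len)
    n_frames = 1 + (len(y) - frame_len) // shift
    feats = np.zeros((n_frames, n_ceps))
    for t in range(n_frames):
        frame = y[t * shift : t * shift + frame_len] * window
        feats[t] = lpc_to_cepstrum(lpc(frame, lpc_order), n_ceps)
    return feats
```

Calling `extract_lpcc(signal)` on a one-dimensional array of samples yields one 13-dimensional vector per 10 ms frame shift, which is the kind of per-frame feature sequence the later training and recognition stages would consume.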
[0073] 2. Speaker model training
[0074] During training, the process o...