Speaker recognition method based on Gaussian mixture model embedded with time delay neural network
A Gaussian mixture model and speaker recognition technology, applied in the field of speaker recognition
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0069] The technical solutions of the present invention will be further described below in conjunction with the drawings and embodiments.
[0070] figure 1 It is a training and recognition model for speaker recognition embedded in TDNN network. It is different from the baseline GMM model (only GMM model is used as speaker recognition) in terms of training and recognition.
[0071] 1. Preprocessing and feature extraction
[0072] First, a method based on energy and zero-crossing rate is used for silence detection, and spectral subtraction is used to remove noise, and then f(Z)=1-0.97Z -1 The filter is pre-emphasized, and the Hamming window with a length of 20ms and a window shift of 10ms is used to divide the frame into a 20th-order linear prediction (LPC) analysis, and then the 13th-order cepstral coefficient is obtained from the 20th-order LPC coefficient for speaker recognition. eigenvectors of .
[0073] 2. Speaker model training
[0074] During training, the process o...
PUM

Abstract
Description
Claims
Application Information

- Generate Ideas
- Intellectual Property
- Life Sciences
- Materials
- Tech Scout
- Unparalleled Data Quality
- Higher Quality Content
- 60% Fewer Hallucinations
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2025 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com