Speaker recognition method based on convolutional neural network and spectrogram
The invention relates to convolutional neural networks and speaker recognition technology, and is applied in the field of speaker recognition based on convolutional neural networks. It addresses problems such as difficulty in training and short speech, and achieves lower hardware cost and resource use, easy implementation, and simple, fast calculation.
Embodiment Construction
[0023] The present invention will be further described below in conjunction with the accompanying drawings and specific embodiments.
[0024] The speaker audio data set contains 24 speakers, each of whom reads the digits 0-9. The following operations are performed on this data set.
[0025] S1: Spectrogram generation:
[0026] Step 1: Read the sound signal to obtain the sampling frequency and the left and right channels.
[0027] Step 2: Store the data in an array and calculate its length.
[0028] Step 3: Apply windowing to the framed data with an overlap ratio of 50%, and save the result.
[0029] Step 4: Perform a Fourier transform on the framed data.
[0030] Step 5: Display the spectrogram from the resulting array.
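The spectrogram steps above can be sketched in Python roughly as follows. This is an illustrative sketch, not the patent's own code: the function names `read_wav` and `spectrogram`, the 16-bit PCM input format, the frame length of 256 samples, the Hann window, and the decibel scaling are all assumptions, since the text specifies only the 50% overlap.

```python
import wave

import numpy as np


def read_wav(path):
    """Steps 1-2: read the sampling frequency and channels into an array.

    Hypothetical helper; assumes 16-bit PCM input.
    Returns (sample_rate, samples) with samples shaped (length, n_channels).
    """
    with wave.open(path, "rb") as wf:
        if wf.getsampwidth() != 2:
            raise ValueError("this sketch only handles 16-bit PCM")
        sample_rate = wf.getframerate()
        n_channels = wf.getnchannels()
        raw = wf.readframes(wf.getnframes())
    # Interleaved samples: column 0 is the left channel, column 1 the right.
    samples = np.frombuffer(raw, dtype="<i2").reshape(-1, n_channels)
    return sample_rate, samples


def spectrogram(signal, frame_len=256):
    """Steps 3-4: frame with 50% overlap, window each frame, apply the FFT.

    frame_len and the Hann window are assumed values.
    Returns magnitudes in dB, shaped (n_frames, frame_len // 2 + 1).
    """
    signal = np.asarray(signal, dtype=np.float64)
    if len(signal) < frame_len:
        raise ValueError("signal is shorter than one frame")
    hop = frame_len // 2  # 50% overlap between adjacent frames
    n_frames = 1 + (len(signal) - frame_len) // hop
    window = np.hanning(frame_len)
    frames = np.stack(
        [signal[i * hop : i * hop + frame_len] for i in range(n_frames)]
    )
    spectrum = np.fft.rfft(frames * window, axis=1)
    # Small offset avoids log(0) for silent bins.
    return 20.0 * np.log10(np.abs(spectrum) + 1e-10)
```

For Step 5, the returned array could be transposed and shown as an image, for example with `matplotlib.pyplot.imshow(spec.T, origin="lower", aspect="auto")`, so that time runs along the horizontal axis and frequency along the vertical axis.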
[0031] S2: Deep learning stage:
[0032] Step 1: Convert the voice signal of each audio file into a spectrogram through code;
[0033] Step 2: After getting these spectrograms, run GenerateTrainAnd...