Voice change detection method and system, mobile terminal and storage medium
A detection method and technology for speech detection, applied in speech analysis, instruments, etc., can solve problems such as low detection efficiency and poor detection accuracy, and achieve the effects of improving accuracy, reducing computational complexity, and improving resolution
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0040] see figure 1 , is a flow chart of the speech change detection method provided by the first embodiment of the present invention, including steps:
[0041] Step S10, acquiring sample speech data, and performing feature extraction on the sample speech data to obtain cqt speech features;
[0042] Wherein, the sample voice data includes positive sample data and negative sample data, specifically, the positive sample data is mainly the voice data of a real person, and the negative sample data is mainly voice change data, recording playback data and synthetic audio data, etc.;
[0043] Preferably, the voice-changing data can be collected by some mainstream voice-changing apps, and the sound in the audio can be converted into the voice of a specific person through a relevant conversion algorithm. The recording playback data can be collected by some recording equipment, In addition, the synthesized voice data can also be generated through any voice interface;
[0044] Step S20...
Embodiment 2
[0050] see figure 2, is a flow chart of the speech change detection method provided by the second embodiment of the present invention, including steps:
[0051] Step S11, obtaining sample speech data, and performing feature extraction on the sample speech data to obtain cqt speech features;
[0052] Wherein, the sample voice data includes positive sample data and negative sample data. Preferably, since human voices are mainly concentrated in low frequencies, they have higher resolution for low frequencies and lower resolution for high frequencies. Therefore, this step Through the extraction based on cqt features, the model obtained after subsequent training can better distinguish the difference between the altered voice and the normal voice, and can also reduce the amount of data calculation;
[0053] Step S21, performing rate-spectrum conversion on the cqt speech features to obtain a speech power spectrum, and obtaining the logarithm of the speech power spectrum;
[0054] ...
Embodiment 3
[0069] see image 3 , is a schematic structural diagram of the speech change detection system 100 provided by the third embodiment of the present invention, including: a feature extraction module 10, a model training module 11 and a speech detection module 12, wherein:
[0070] The feature extraction module 10 is configured to acquire sample speech data, and perform feature extraction on the sample speech data to obtain cqt speech features, and the sample speech data includes positive sample data and negative sample data.
[0071] The model training module 11 is used to optimize the cqt speech features to obtain the cqcc speech features, and input the cqcc speech features to a preset convolutional neural network for model training to obtain a speech detection model, wherein, The preset convolutional neural network includes three convolutional layers and two fully connected layers.
[0072] Wherein, the model training module 11 is also used to: control the preset convolutional...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com