Audio Type Detection Method Based on Bipolar Modeling of Pure Speech and Background Noise
A technology of background noise and pure speech, applied in speech analysis, speech recognition, instruments, etc., can solve problems such as large amount of calculation
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment
[0038]figure 1 It is the generation of background noise and pure speech bipolar model and the flow chart of classifier training in the present invention. The described method comprises the steps of:
[0039] (1) Pure speech and pure background noise model construction: based on enough suitable training data to train a pure speech model GMM of N Gaussian mixed elements s and a background noise model GMM of M Gaussian mixed elements n .
[0040] In this embodiment, the Gaussian mixture number of the pure speech model is 256, and a GMM model is constructed by using as many speakers as possible and pure speech with different language content; the number of speakers is not less than 20, and the male: female ratio is kept as far as possible balanced. Language content should also be diversified. In terms of completeness, language content should contain all basic phonetic units.
[0041] The Gaussian mixture number of the background noise model uses 512, and uses as many backgrou...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com