Short-time and long-time feature modeling fusion-based environmental sound recognition method and device
A technology of long-term features and short-term features, applied in speech recognition, speech analysis, instruments, etc., can solve problems such as insufficient use of algorithm information, and achieve the effect of improving recognition results
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0031] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with specific embodiments and with reference to the accompanying drawings.
[0032] In order to fully utilize the information of each scale of the audio in the process of environmental sound recognition, the present invention proposes a cascade fusion model based on the short-term and long-term features of the audio. The whole process uses GMM and SVM to model based on different features. The implementation of the GMM model is based on short-term features of audio. The input of the SVM classifier includes long-term features and the probability score of GMM. In this two-stage framework, firstly, the correct classification results of the first stage are retained by introducing confidence, and at the same time, the probability score of GMM is used as a part of the SVM input, so that the short-term d...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com