Improved voice reinforcement method based on multi-target criterion learning
A speech enhancement and multi-objective technology, which is applied in speech analysis, speech recognition, instruments, etc., can solve the problems of poor training target effect and achieve the effect of easy implementation, SNR improvement, and optimization of the target function
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
specific example
[0047] (1) Signal preprocessing
[0048] 1. Select data:
[0049] Select 600 sentences from the TIMIT standard corpus as training pure speech, and the sampling frequency is 16KHz; select Factory, F16, White and Pink four kinds of noise from the Noisex-92 standard noise library as training noises, and the pure speech and noise are respectively mixed SNR -5dB, -2dB, 0dB and 2dB are mixed to get the training data set.
[0050] Select 120 sentences from the remaining sentences of TIMIT as the test set of pure speech, and the sampling rate is still 16KHz; select Factory from the Noisex-92 standard noise library as the test noise, and use -5dB, -2dB, 0dB and 2dB mixed signal-to-noise Ratio is mixed with pure speech to obtain a test data set.
[0051] 2. Framing and windowing
[0052] When the speech signal is divided into frames, the frame length is 320 points, the frame shift is 160 points, and the window function is Hamming window.
[0053] (2) Calculate the logarithmic power ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com