Voice processing method and device based on generative adversarial network
A speech processing technology based on generative adversarial networks, applied in fields such as biological neural network models, speech analysis, and neural learning methods. It addresses two problems of existing approaches: they do not make full use of the strong correlation between adjacent states of speech, and they perform poorly at frequency band extension and packet loss compensation.
Examples
Embodiment 1
[0026] As shown in Figure 1, this embodiment of the present invention provides a speech processing method based on a generative adversarial network. The method is used to obtain a speech processing system composed of a packet loss compensation model and a frequency band extension model; the system processes the original speech to overcome packet loss or an overly narrow frequency band in that speech. In this embodiment, the method includes, but is not limited to, the following steps:
[0027] S101. Acquire voice training samples. The voice training samples include N groups of complete voice samples with their corresponding packet loss voice samples, and K groups of wideband voice samples with their corresponding narrowband voice samples, where N and K are positive integers.
[0028] In the above step S101, the voice training samples are voice d...
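The sample pairing described in step S101 can be sketched as follows. This is a minimal illustration, not the patent's procedure: the source does not say how the packet loss or narrowband samples are produced, so the sketch assumes they are simulated from the complete and wideband samples (zeroing whole frames, and a crude average-and-decimate by 2). All names here (`VoiceTrainingSamples`, `simulate_packet_loss`, `simulate_narrowband`, `build_training_samples`, `frame_len`, `loss_rate`) are hypothetical.

```python
import random
from dataclasses import dataclass
from typing import List, Optional, Tuple

Signal = List[float]


@dataclass
class VoiceTrainingSamples:
    # N pairs of (complete sample, corresponding packet loss sample)
    loss_pairs: List[Tuple[Signal, Signal]]
    # K pairs of (wideband sample, corresponding narrowband sample)
    band_pairs: List[Tuple[Signal, Signal]]


def simulate_packet_loss(complete: Signal, frame_len: int = 160,
                         loss_rate: float = 0.1,
                         rng: Optional[random.Random] = None) -> Signal:
    """Assumption: a lost packet is modeled as one zeroed frame."""
    rng = rng or random.Random(0)
    lossy = list(complete)
    for start in range(0, len(lossy), frame_len):
        if rng.random() < loss_rate:
            end = min(start + frame_len, len(lossy))
            lossy[start:end] = [0.0] * (end - start)
    return lossy


def simulate_narrowband(wideband: Signal) -> Signal:
    """Assumption: average adjacent sample pairs (crude low-pass),
    keeping one value per pair, which halves the sampling rate."""
    return [(wideband[i] + wideband[i + 1]) / 2.0
            for i in range(0, len(wideband) - 1, 2)]


def build_training_samples(complete_signals: List[Signal],
                           wideband_signals: List[Signal],
                           seed: int = 0) -> VoiceTrainingSamples:
    rng = random.Random(seed)
    loss_pairs = [(s, simulate_packet_loss(s, rng=rng)) for s in complete_signals]
    band_pairs = [(s, simulate_narrowband(s)) for s in wideband_signals]
    return VoiceTrainingSamples(loss_pairs, band_pairs)
```

Here N is `len(complete_signals)` and K is `len(wideband_signals)`; the two sets are independent, so N and K need not be equal.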
Embodiment 2
[0075] As shown in Figure 2, an embodiment of the present invention also provides a speech processing device 20 based on a generative adversarial network, including but not limited to the following modules:
[0076] The training sample acquisition module 21 is used to obtain voice training samples. The voice training samples include N groups of complete voice samples with their corresponding packet loss voice samples, and K groups of wideband voice samples with their corresponding narrowband voice samples, where N and K are positive integers;
[0077] The voice processing system training module 22 is used to feed the voice training samples into the generative adversarial network, train the packet loss compensation model on the packet loss voice samples and complete voice samples, and train the frequency band extension model on the wideband voice samples and narrowband voice samples, to obtain a speech pro...
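One way to picture how modules 21 and 22 fit together is the structural sketch below. It is an assumption-laden outline, not the patented device: the adversarial training itself is left as an injected, hypothetical `train_model` callable, and the order in which the two trained models are applied (packet loss compensation first, then band extension) is a choice made for illustration that the source does not state. The class and parameter names are likewise hypothetical.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

Signal = List[float]
Pair = Tuple[Signal, Signal]
Model = Callable[[Signal], Signal]


@dataclass
class SpeechProcessingSystem:
    packet_loss_compensation: Model
    band_extension: Model

    def process(self, speech: Signal) -> Signal:
        # Assumed order: repair lost packets, then extend the band.
        return self.band_extension(self.packet_loss_compensation(speech))


class SpeechProcessingDevice:
    """Stand-in for device 20, wiring module 21 to module 22."""

    def __init__(self,
                 acquire_samples: Callable[[], Tuple[List[Pair], List[Pair]]],
                 train_model: Callable[[List[Pair]], Model]) -> None:
        # Module 21: returns (complete, lossy) pairs and (wide, narrow) pairs.
        self.acquire_samples = acquire_samples
        # Module 22's trainer: takes (input, target) pairs, returns a model.
        # A real implementation would train a generator against a discriminator.
        self.train_model = train_model

    def build_system(self) -> SpeechProcessingSystem:
        loss_pairs, band_pairs = self.acquire_samples()
        # Each model maps the degraded signal (input) to the clean one (target).
        plc = self.train_model([(lossy, complete) for complete, lossy in loss_pairs])
        bwe = self.train_model([(narrow, wide) for wide, narrow in band_pairs])
        return SpeechProcessingSystem(plc, bwe)
```

Passing the trainer in as a callable keeps the sketch independent of any particular network architecture or deep learning library.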