Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Speech dereverberation method based on deep features of generative adversarial network

A deep feature, de-reverberation technology, applied in speech analysis, speech recognition, instruments, etc., can solve the problem that the effect is difficult to achieve, and the semantic information of speech cannot be learned well, so as to reduce the amount of calculation and improve the quality of speech. Recognition effect, the effect of good recognition effect

Active Publication Date: 2021-10-26
慧言科技(天津)有限公司
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Although the current neural network method can establish a good nonlinear mapping, it is difficult to achieve the desired effect only by using a fully connected neural network. Secondly, the use of the most basic feature mapping method cannot learn speech very well. Semantic information, constructing a good network structure and studying the deep semantic information of speech will produce a good recognition performance improvement for speech recognition, which has practical significance for speech recognition in complex scenes

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech dereverberation method based on deep features of generative adversarial network
  • Speech dereverberation method based on deep features of generative adversarial network
  • Speech dereverberation method based on deep features of generative adversarial network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0036] The functions and effects of the present invention will be described in detail below in conjunction with the accompanying drawings and accompanying tables.

[0037] This embodiment uses the Reverb Challenge data set as an example to give the implementation of the invention. The algorithm flow of the whole system is as follows: figure 1 As shown in the figure, it includes data feature extraction, WPE speech signal preprocessing, construction of generative confrontation network, feature fusion to deal with the over-fitting problem of speech in the real world, and speech recognition model training methods. Specific steps are as follows:

[0038] The present invention takes the data set of Reverb Challenge competition in 2014 as processing object, proposes a kind of far-field speech recognition system, and specific content comprises:

[0039] 1) Aiming at the problem of de-reverberation of speech and better learning of deep-level speech information in far-field speech rec...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present invention discloses a speech de-reverberation method based on the deep features of the generated confrontational network. The main steps of the method are: firstly, performing weighted prediction error WPE on the speech and performing signal preprocessing; secondly, data feature extraction: performing MFCC on the speech data Feature extraction and phoneme-specific bottleneck feature BNF extraction without reverberant speech; then WPE speech signal preprocessing: build a generative confrontation network, and the MFCC features containing reverberant speech are mapped to the deep features of clean speech through the generative confrontation network; finally through The Kaldi toolbox uses traditional GMM‑HMM for forced alignment, and then uses deep neural networks for acoustic model training and decoding. The present invention combines the signal processing method and the deep learning framework based on the generation confrontation network, so that the system can combine the respective advantages of the two to produce a better speech reverberation effect.

Description

technical field [0001] The invention relates to the field of speech signal processing, especially for the problem of recognition performance degradation caused by environmental reverberation in far-field speech recognition, and proposes a speech reverberation method based on deep features of generative adversarial networks. Background technique [0002] In recent years, emerging industries such as smart home, dialogue robots, and smart audio have flourished, which has brought about great changes in people's lifestyles and the way people interact with machines. Voice interaction, as a new way of interaction, has gained popularity in these emerging fields a wide range of applications. With the application of deep learning in speech recognition, the recognition performance has been greatly improved, the recognition rate has exceeded 95%, and the recognition effect has basically reached the level of human hearing. However, the above are limited to near-field conditions, and the...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G10L15/02G10L15/08G10L15/14G10L15/16G10L15/26G10L19/04G10L21/0208G10L25/24
Inventor 王龙标李楠党建武
Owner 慧言科技(天津)有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products