Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Speech dereverberation method based on depth characteristics of generative adversarial network

A deep feature, de-reverberation technology, applied in speech analysis, speech recognition, instruments, etc., can solve the problems of not being able to learn the semantic information of speech well, and the effect is difficult to achieve, and achieve good speech recognition effect, computing volume reduction effect

Active Publication Date: 2019-06-14
TIANJIN UNIV
View PDF5 Cites 13 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Although the current neural network method can establish a good nonlinear mapping, it is difficult to achieve the desired effect only by using a fully connected neural network. Secondly, the use of the most basic feature mapping method cannot learn speech very well. Semantic information, constructing a good network structure and studying the deep semantic information of speech will produce a good recognition performance improvement for speech recognition, which has practical significance for speech recognition in complex scenes

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech dereverberation method based on depth characteristics of generative adversarial network
  • Speech dereverberation method based on depth characteristics of generative adversarial network
  • Speech dereverberation method based on depth characteristics of generative adversarial network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0036] The functions and effects of the present invention will be described in detail below in conjunction with the accompanying drawings and accompanying tables.

[0037] In this embodiment, the embodiment of the invention is given based on the Reverb Challenge data set as an example. The algorithm flow of the entire system is as follows figure 1 As shown, these steps include data feature extraction, WPE speech signal preprocessing, construction of generative confrontation network, feature fusion to deal with the over-fitting problem of speech in the real world, and speech recognition model training methods. Specific steps are as follows:

[0038] The present invention takes the data set of the 2014 Reverb Challenge competition as the processing object, and proposes a far-field speech recognition system, the specific contents of which include:

[0039] 1) Aiming at the problem of both de-reverberation of speech and better learning of deep-level speech information in far-fie...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a speech dereverberation method based on depth characteristics of a generative adversarial network, the method mainly comprises the following steps of: carrying out signal preprocessing on speech through weighted prediction error (WPE); carrying out characteristic extraction of data: carrying out MFCC characteristic extraction on speech data and bottleneck characteristic BNF extraction aiming at phonemes without reverberation speech; then carrying out signal preprocessing on speech through WPE: constructing the generative adversarial network with depth characteristics containing the MFCC characteristics of reverberation speech mapped to clean speech through the generative adversarial network; finally carrying out forced alignment by using a traditional GMM-HMM through a Kaldi toolbox, and then carrying out acoustic model training and decoding by using a depth neural network. The speech dereverberation method combines a method of signal processing with a deep learning framework based on the generative adversarial network to enable a system to combine the respective advantages of both to produce a better speech dereverberation effect.

Description

technical field [0001] The invention relates to the field of speech signal processing, and in particular aims at the problem of recognition performance degradation caused by environmental reverberation in far-field speech recognition, and proposes a speech reverberation method based on deep features of generative adversarial networks. Background technique [0002] In recent years, emerging industries such as smart home, dialogue robots, and smart audio have flourished, which has brought about great changes in people's lifestyles and the way people interact with machines. Voice interaction, as a new way of interaction, has gained popularity in these emerging fields a wide range of applications. With the application of deep learning in speech recognition, the recognition performance has been greatly improved, the recognition rate has exceeded 95%, and the recognition effect has basically reached the level of human hearing. However, the above are limited to near-field conditio...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L15/02G10L15/08G10L15/14G10L15/16G10L15/26G10L19/04G10L21/0208G10L25/24
Inventor 王龙标李楠党建武
Owner TIANJIN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products