Speech Enhancement Method Based on Combination of Sparse Coding and Ideal Binary Mask

This speech enhancement technology, which builds on the binary mask and applies to speech analysis, instruments, and related fields, addresses residual noise and speech signal loss, improving the intelligibility and quality of the target speech signal.

Inactive Publication Date: 2017-04-26
HOHAI UNIV CHANGZHOU

AI Technical Summary

Problems solved by technology

In view of defects of the traditional ideal binary mask (IBM), such as speech signal loss and residual noise, the present invention combines signal sparse coding theory with the ideal binary mask algorithm to obtain a more intelligible speech signal.

Embodiment Construction

[0038] The speech enhancement method combining the sparse coding and the ideal binary mask algorithm of the present invention will be further elaborated below in conjunction with the accompanying drawings.

[0039] The speech enhancement framework designed by the present invention is shown in Figure 1. The speech signal is first denoised with the ideal binary mask (IBM) algorithm; sparsity theory is then used for fine speech extraction and fine denoising; finally, the target speech signal is reconstructed.
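As an illustration of the first stage of this framework, a minimal sketch of ideal binary masking follows. It assumes that the clean-speech and noise spectrograms are available separately (which is what makes the mask "ideal") and that a time-frequency unit is kept when its local SNR exceeds 0 dB. The function names, the threshold value, and the array layout are assumptions made for illustration and are not taken from the patent text.

```python
import numpy as np

def ideal_binary_mask(speech_spec, noise_spec, lc_db=0.0):
    """Return 1.0 where the local SNR exceeds lc_db, else 0.0.

    speech_spec, noise_spec: spectrograms of identical shape (assumed known).
    lc_db: local SNR threshold in dB (0 dB is an assumed, commonly used value).
    """
    eps = 1e-12  # avoids division by zero and log of zero
    speech_power = np.abs(speech_spec) ** 2 + eps
    noise_power = np.abs(noise_spec) ** 2 + eps
    snr_db = 10.0 * np.log10(speech_power / noise_power)
    return (snr_db > lc_db).astype(float)

def apply_mask(mixture_spec, mask):
    """Zero out the time-frequency units that the mask marks as noise-dominated."""
    return mixture_spec * mask
```

Units set to zero by the mask may still contain speech energy, and units that are kept may still contain noise; these are the losses and residues that the subsequent sparse coding stage is meant to address.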

[0040] Figures 2 and 3 are schematic diagrams of the ideal binary mask processing and the sparse coding processing of the present invention, respectively.

[0041] In the ideal binary mask processing shown in the block diagram of Figure 2, the speech signal (8 kHz sampling rate) is first divided into 32 ms frames with 75% overlap between adjacent frames, and then the discrete Fourier transform (computed with the FFT) is used to calcul...
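The framing parameters stated above work out to 256 samples per frame (8000 Hz × 0.032 s) and a hop of 64 samples (25% of a frame). A minimal sketch of this framing and transform step is given below. The Hann window and the function name `frame_and_fft` are assumptions for illustration, since the source text is truncated before it specifies them.

```python
import numpy as np

def frame_and_fft(signal, fs=8000, frame_ms=32, overlap=0.75):
    """Split a 1-D signal into overlapping frames and take the FFT of each.

    Returns a complex array of shape (n_frames, frame_len // 2 + 1).
    """
    frame_len = int(fs * frame_ms / 1000)   # 256 samples at 8 kHz
    hop = int(frame_len * (1 - overlap))    # 64 samples for 75% overlap
    window = np.hanning(frame_len)          # assumed window, not from the source
    n_frames = 1 + (len(signal) - frame_len) // hop
    frames = np.stack([
        signal[i * hop : i * hop + frame_len] * window
        for i in range(n_frames)
    ])
    # rfft keeps only the non-negative frequencies of a real-valued signal
    return np.fft.rfft(frames, axis=1)
```

For example, one second of audio at 8 kHz (8000 samples) yields 122 frames of 129 frequency bins each.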

Abstract

The invention discloses a speech enhancement method based on the combination of sparse coding and the ideal binary mask. The method is an improved algorithm that overcomes the residual noise and loss of speech components found in the traditional ideal binary mask algorithm. The method comprises the following steps: a time-domain speech signal is converted into a frequency-domain signal by the short-time Fourier transform; in the frequency domain, preliminary denoising is carried out on the speech signal according to the ideal binary mask method; further denoising is carried out on the preliminarily denoised speech signal using sparse coding theory, and effective speech components are extracted from the signal that was classified as interference, so that speech enhancement is achieved. Compared with the prior art, the method offers good denoising performance and high speech intelligibility, among other advantages.
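To make the sparse coding step more concrete, the sketch below represents a signal vector as a combination of a few dictionary atoms using orthogonal matching pursuit (OMP). The choice of OMP, the function name `omp`, and the assumption of a pre-learned dictionary `D` with unit-norm columns are illustrative only; the abstract does not state which sparse coding algorithm or dictionary the invention uses.

```python
import numpy as np

def omp(D, y, n_nonzero):
    """Greedy sparse coding: approximate y with at most n_nonzero columns of D.

    D: dictionary of shape (signal_dim, n_atoms), columns assumed unit-norm.
    y: signal vector of shape (signal_dim,).
    Returns a coefficient vector x of shape (n_atoms,) with D @ x close to y.
    """
    residual = np.asarray(y, dtype=float).copy()
    support = []               # indices of the atoms selected so far
    coef = np.zeros(0)
    for _ in range(n_nonzero):
        # pick the atom most correlated with the current residual
        k = int(np.argmax(np.abs(D.T @ residual)))
        if k in support:
            break              # no new atom improves the fit
        support.append(k)
        # refit all selected atoms jointly by least squares
        coef, *_ = np.linalg.lstsq(D[:, support], y, rcond=None)
        residual = y - D[:, support] @ coef
    x = np.zeros(D.shape[1])
    x[support] = coef
    return x
```

With a dictionary learned from speech, components that are well represented by a few atoms can be treated as speech and the remainder as noise, which is the general idea behind using sparsity for the fine denoising described here.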

Description

Technical field

[0001] The invention relates to a speech enhancement method combining sparse coding and the ideal binary mask, and in particular to a speech processing technology using sparse signal representation based on the ideal binary mask algorithm and dictionary learning.

Background technique

[0002] Speech enhancement technology, simply put, is a technology that suppresses and reduces noise interference and extracts the useful speech signal from the noise background when a clean speech signal is corrupted or even submerged by various noises. These noises mainly include interference such as background noise, reverberation, and other people's voices, which not only reduce the quality and intelligibility of speech but also degrade the speech signal in other applications. Effective speech enhancement is therefore necessary.

[0003] Representative traditional speech enhancement algorithms include spectral subtraction, ideal binary mask (IBM), W...

Application Information

Patent Type & Authority: Patents (China)
IPC: IPC(8): G10L21/0208
Inventor: 汤一彬, 谈雅文, 李旭斐, 蒋爱民, 徐宁, 殷澄
Owner: HOHAI UNIV CHANGZHOU