Speech enhancement method and device based on spatial features and electronic equipment

A speech enhancement technology for noisy speech, applied in speech analysis, instruments, etc. It can solve the problems of insufficient precision and low quantization efficiency, and it achieves the effect of avoiding speech distortion while reducing noise.

Active Publication Date: 2022-01-11
北京清微智能信息技术有限公司

AI Technical Summary

Problems solved by technology

[0004] To solve the problems of insufficient precision and low quantization efficiency in existing quantization methods, embodiments of the present invention provide a quantization method, device, and electronic equipment for neural networks.



Embodiment Construction

[0078] To make the objectives, technical solutions, and advantages of the present invention clearer, embodiments of the invention are described in further detail below with reference to the accompanying drawings.

[0079] Referring to Figure 1, an embodiment of the present invention provides a speech enhancement method based on spatial features. The method includes:

[0080] S100. Perform a Fourier transform on the dual-channel noisy speech to obtain the dual-channel complex spectrum, that is, the frequency-domain representation of the dual-channel noisy speech.
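Step S100 can be sketched as follows. This is a minimal illustration, assuming a short-time Fourier transform with a Hann window; the frame length, hop size, sample count, and the function names `stft` and `dual_channel_spectrum` are illustrative choices and do not come from the patent text.

```python
import numpy as np

def stft(x, frame_len=512, hop=256):
    """Short-time Fourier transform of a 1-D signal.

    Returns a complex array of shape (frames, frame_len // 2 + 1).
    Frame length and hop are assumed values, not taken from the patent.
    """
    window = np.hanning(frame_len)
    n_frames = 1 + (len(x) - frame_len) // hop
    frames = np.stack(
        [x[i * hop : i * hop + frame_len] * window for i in range(n_frames)]
    )
    return np.fft.rfft(frames, axis=1)

def dual_channel_spectrum(noisy, frame_len=512, hop=256):
    """Transform a (2, samples) recording into a (2, frames, bins) complex spectrum."""
    return np.stack([stft(channel, frame_len, hop) for channel in noisy])

# Synthetic stand-in for one second of dual-channel noisy speech at 16 kHz.
rng = np.random.default_rng(0)
noisy = rng.standard_normal((2, 16000))
spec = dual_channel_spectrum(noisy)
print(spec.shape, spec.dtype)
```

Each channel is transformed independently, so the inter-channel phase differences that beamforming relies on are preserved in `spec`.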

[0081] S110. Based on beamforming, obtain a first single-channel complex spectrum of the dual-channel complex spectrum in the target speech angle direction, and a second single-channel complex spectrum of the dual-channel complex spectrum in a direction that differs from the target speech angle by a predetermined angle.
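To illustrate the idea behind S110, the sketch below steers a generic frequency-domain delay-and-sum beamformer in two directions. It is not the patent's own beamforming formula. The far-field two-microphone model, the microphone spacing, the angles, and the names `steering_vector` and `delay_and_sum` are all assumptions made for this example.

```python
import numpy as np

SOUND_SPEED = 343.0  # m/s, assumed

def steering_vector(angle_deg, freqs, mic_distance):
    """Far-field steering vector of a 2-microphone array, shape (2, bins)."""
    delay = mic_distance * np.cos(np.deg2rad(angle_deg)) / SOUND_SPEED
    mic_delays = np.array([0.0, delay])  # microphone 0 is the reference
    return np.exp(-2j * np.pi * freqs[None, :] * mic_delays[:, None])

def delay_and_sum(spec, angle_deg, freqs, mic_distance):
    """Steer a (2, frames, bins) spectrum toward angle_deg -> (frames, bins)."""
    d = steering_vector(angle_deg, freqs, mic_distance)
    # Undo the per-microphone phase delay, then average the channels.
    return np.einsum("cf,ctf->tf", d.conj(), spec) / spec.shape[0]

# Hypothetical setup: 16 kHz audio, 512-point frames, 5 cm spacing.
freqs = np.fft.rfftfreq(512, d=1.0 / 16000)
mic_distance = 0.05
target_angle = 60.0   # assumed target speech direction
angle_offset = 90.0   # assumed "predetermined angle"

# Synthetic source arriving exactly from the target direction.
rng = np.random.default_rng(0)
source = rng.standard_normal((61, 257)) + 1j * rng.standard_normal((61, 257))
spec = steering_vector(target_angle, freqs, mic_distance)[:, None, :] * source[None]

first = delay_and_sum(spec, target_angle, freqs, mic_distance)
second = delay_and_sum(spec, target_angle + angle_offset, freqs, mic_distance)
```

In this toy setup the beam aimed at the target recovers the source, while the offset beam attenuates it. That difference in energy between the two beams is the spatial cue the method exploits.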

[0082] In implementation, the beamforming formula is shown in equation (1) below:

...


Abstract

The invention discloses a speech enhancement method and device based on spatial features, and electronic equipment. The method comprises: performing a Fourier transform on dual-channel noisy speech to obtain a dual-channel complex spectrum; obtaining, based on beamforming, a first single-channel complex spectrum and a second single-channel complex spectrum of the dual-channel complex spectrum; calculating the logarithmic power spectrum of the first single-channel complex spectrum; calculating a direction energy ratio from the energy of the first single-channel complex spectrum and the energy of the second single-channel complex spectrum, and taking its logarithm to obtain a logarithmic direction energy ratio; inputting the logarithmic power spectrum and the logarithmic direction energy ratio as features into a pre-trained speech enhancement neural network to obtain a masking value; and applying the masking value to the first single-channel complex spectrum, then performing an inverse Fourier transform on the masked first single-channel complex spectrum to obtain enhanced speech. The scheme provided by the embodiments of the invention reduces noise effectively while largely avoiding speech distortion.
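The feature and masking steps in the abstract can be sketched as follows. This is an illustration under stated assumptions: the natural logarithm, the small constant `EPS`, element-wise multiplication as the way the mask is applied, and the overlap-add inverse transform are choices made for this example. `placeholder_mask_network` is a hypothetical stand-in for the pre-trained neural network, which is not described in this excerpt.

```python
import numpy as np

EPS = 1e-8  # assumed constant to avoid log(0) and division by zero

def log_power_spectrum(spec):
    return np.log(np.abs(spec) ** 2 + EPS)

def log_direction_energy_ratio(first, second):
    return np.log((np.abs(first) ** 2 + EPS) / (np.abs(second) ** 2 + EPS))

def placeholder_mask_network(features):
    """Hypothetical stand-in for the pre-trained network: a sigmoid of the
    log direction energy ratio (the second half of the feature vector)."""
    n_bins = features.shape[1] // 2
    return 1.0 / (1.0 + np.exp(-features[:, n_bins:]))

def istft(spec, frame_len=512, hop=256):
    """Overlap-add inverse of a Hann-windowed STFT, spec shape (frames, bins)."""
    window = np.hanning(frame_len)
    frames = np.fft.irfft(spec, n=frame_len, axis=1) * window
    length = hop * (len(spec) - 1) + frame_len
    out = np.zeros(length)
    norm = np.zeros(length)
    for i, frame in enumerate(frames):
        out[i * hop : i * hop + frame_len] += frame
        norm[i * hop : i * hop + frame_len] += window ** 2
    return out / np.maximum(norm, EPS)

# Synthetic stand-ins for the two beamformed spectra (frames, bins).
rng = np.random.default_rng(0)
first = rng.standard_normal((61, 257)) + 1j * rng.standard_normal((61, 257))
second = rng.standard_normal((61, 257)) + 1j * rng.standard_normal((61, 257))

features = np.concatenate(
    [log_power_spectrum(first), log_direction_energy_ratio(first, second)], axis=1
)
mask = placeholder_mask_network(features)
masked = mask * first      # assumed: mask applied by element-wise multiplication
enhanced = istft(masked)   # time-domain enhanced signal
```

A log ratio above zero means the target-direction beam holds more energy than the offset beam in that time-frequency bin, which is why the placeholder maps large ratios to mask values near 1.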

Description

Technical field

[0001] The invention relates to the technical field of speech enhancement, and in particular to a speech enhancement method, device, and electronic equipment based on spatial features.

Background

[0002] Speech enhancement has always played an important role in the field of speech signal processing. Traditional speech enhancement methods mainly estimate the spectral information of the noise and then subtract the noise from the original speech spectrum. However, sudden noise and random noise make the spectral information difficult to estimate. Traditional methods also need to assume in advance that the signals are independent and that the feature distribution is Gaussian. These assumptions effectively set boundaries on speech enhancement, which limits the noise reduction that can be achieved.

[0003] Based on this, the neural network based on deep learning is widely used in the field of speech e...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G10L21/02, G10L21/0232, G10L25/18, G10L25/21, G10L25/30
CPC: G10L21/0232, G10L25/30, G10L25/21, G10L25/18
Inventor: 苏家雨, 王博, 欧阳鹏
Owner: 北京清微智能信息技术有限公司