Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Speech enhancement method based on deep learning assisted RLS filtering processing

A technology of filtering processing and speech enhancement, applied in speech analysis, instruments, etc., can solve problems such as reducing the processing power of microphone beam signals, enhancing speech signals, etc., to improve speech recognition rate and human-computer interaction experience, and enhance speech signals. , the effect of reducing noise residual

Active Publication Date: 2021-11-16
成都启英泰伦科技有限公司
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] In order to overcome the defects in the prior art, the present invention discloses a speech enhancement method based on deep learning assisted RLS filter processing, which effectively reduces the computing power of microphone beam signal processing, and can reduce the output without increasing distortion Noise residue in the signal, enhance the speech signal, thereby improving the speech recognition rate

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech enhancement method based on deep learning assisted RLS filtering processing
  • Speech enhancement method based on deep learning assisted RLS filtering processing
  • Speech enhancement method based on deep learning assisted RLS filtering processing

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0021] Specific embodiments of the present invention will be further described in detail below.

[0022] The speech enhancement method based on deep learning assisted RLS filter processing of the present invention, comprises the following steps:

[0023] S1. The beamforming method of generalized sidelobe cancellation is used to process the microphone array voice signal y(l,k) to obtain the fixed beamforming output signal y s (l,k) and noise reference signal u(l,k); l, k denote time and frequency index respectively;

[0024] S2. Randomly extract the feature signal of any microphone signal in the microphone array and send it to the GRU-Mask network to calculate the masking value mask (l, k) of the original microphone signal;

[0025] S3. Compare the masking value mask(l,k) output by the GRU-Mask network with the noise threshold thred:

[0026] When mask(l,k)0 (l,k) for a fixed beamforming output signal y s (l,k) do filtering processing, the processed signal is used as the fin...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

A speech enhancement method based on deep learning assisted RLS filtering processing comprises the following steps: S1, processing a speech signal by using a generalized sidelobe cancellation beam forming method to obtain a fixed beam forming output signal and a noise reference signal; S2, randomly extracting a feature signal of any microphone signal in a microphone array, sending the feature signal to a GRU-Mask network, and calculating a masking value of an original microphone signal; and S3, comparing the masking value output by the network with a noise threshold, calculating a noise canceller, and performing noise elimination by using the noise canceller. According to the invention, only a noise component dominant signal is filtered by using an RLS algorithm, so that the computing power of microphone beam signal processing is effectively reduced, the noise residue in the output signal can be reduced without increasing distortion, and the purposes of enhancing a voice signal and improving the voice recognition rate and the man-machine interaction experience are achieved.

Description

technical field [0001] The invention belongs to the technical field of artificial intelligence and relates to speech recognition, in particular to a speech enhancement method based on deep learning-assisted RLS filter processing. Background technique [0002] With the wide application of voice interaction technology, the traditional single-microphone voice enhancement method can no longer meet the needs of voice quality in interactive technology. For example, in far-field environments or noisy environments, the information captured by single-microphone methods is limited and the noise reduction performance is limited. At this time, using the microphone array signal can effectively use the directional information of the voice signal to capture the voice signal in the beam, suppress the signals of other beams, and obtain a better noise reduction effect. [0003] As one of the classic beamforming algorithms, the General Sidelobe Canceller (GSC) method is widely used. However,...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L21/0208G10L21/0216
CPCG10L21/0208G10L21/0216G10L2021/02166
Inventor 万东琴胡岸刘文通曾帆
Owner 成都启英泰伦科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products