Speech enhancement model, electronic device, storage medium and related method

A speech enhancement model technology, applied in speech analysis, instruments and related fields. It addresses poor speech enhancement of speech signals caused by the inability to effectively distinguish noise from the useful speech signal in speech data, and hence to filter that noise out, and thereby improves the speech enhancement effect.

Pending Publication Date: 2022-04-12
ALIBABA DAMO (HANGZHOU) TECH CO LTD

AI Technical Summary

Problems solved by technology

[0004] However, based on temporal features and short-term frequency features alone, it is impossible to effectively distinguish noise that differs greatly in frequency from the useful speech signal in the speech data. Such noise therefore cannot be filtered out of the useful speech signal, and the speech enhancement of the speech signal is less effective.

Detailed description of embodiments

[0024] To enable those skilled in the art to better understand the technical solutions in the embodiments of the present application, these technical solutions are described clearly and completely below with reference to the drawings in the embodiments of the present application. Obviously, the described embodiments are only some, not all, of the embodiments of the present application. All other embodiments obtained by persons of ordinary skill in the art based on the embodiments of the present application shall fall within the protection scope of the embodiments of the present application.

[0025] In the embodiment of the present application, in order to improve the effect of speech enhancement, the noisy speech data is converted into time-frequency domain feature data, and the masking value of the noisy speech data is generated according to the long-range correlation of the time-...

Abstract

An embodiment of the invention provides a speech enhancement model, an electronic device, a storage medium and a related method. The speech enhancement method comprises the following steps: converting noisy speech data into time-frequency domain feature data; generating a masking value for the noisy speech data according to the long-range correlation of the time-frequency domain feature data in the frequency direction; and generating enhanced speech data for the noisy speech data according to the masking value and the time-frequency domain feature data. This scheme can improve the effect of speech enhancement on speech signals.
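
The three steps above can be sketched roughly in code. This is only an illustrative sketch and not the patented model: the abstract does not say how the long-range correlation along the frequency direction is captured, so the example assumes single-head self-attention across all frequency bins of each frame, with random, untrained weights. All function names (`stft`, `istft`, `frequency_attention_mask`, `init_params`, `enhance`) and parameters (`n_fft`, `hop`, `dim`, `seed`) are hypothetical.

```python
import numpy as np

def stft(x, n_fft=256, hop=128):
    """Step 1: convert a waveform to time-frequency features, shape (frames, bins)."""
    window = np.hanning(n_fft + 1)[:-1]  # periodic Hann window
    n_frames = 1 + (len(x) - n_fft) // hop
    frames = np.stack([x[i * hop : i * hop + n_fft] * window for i in range(n_frames)])
    return np.fft.rfft(frames, axis=1)

def istft(spec, n_fft=256, hop=128):
    """Inverse STFT by weighted overlap-add with window-energy normalization."""
    window = np.hanning(n_fft + 1)[:-1]
    frames = np.fft.irfft(spec, n=n_fft, axis=1) * window
    length = n_fft + hop * (len(spec) - 1)
    out, norm = np.zeros(length), np.zeros(length)
    for i, frame in enumerate(frames):
        out[i * hop : i * hop + n_fft] += frame
        norm[i * hop : i * hop + n_fft] += window ** 2
    return out / np.maximum(norm, 1e-8)

def init_params(dim=8, seed=0):
    """Random stand-in weights (ASSUMPTION: a real model would learn these)."""
    rng = np.random.default_rng(seed)
    return (rng.normal(scale=0.3, size=dim),         # w_in: scalar -> dim embedding
            rng.normal(scale=0.3, size=(dim, dim)),  # w_q
            rng.normal(scale=0.3, size=(dim, dim)),  # w_k
            rng.normal(scale=0.3, size=(dim, dim)),  # w_v
            rng.normal(scale=0.3, size=dim))         # w_out: dim -> scalar logit

def frequency_attention_mask(mag, params):
    """Step 2 (ASSUMED mechanism): self-attention over the frequency axis.

    Every frequency bin attends to every other bin of the same frame, so the
    mask for one bin can depend on bins that are far away in frequency.
    """
    w_in, w_q, w_k, w_v, w_out = params
    feats = np.log1p(mag)[..., None] * w_in                    # (T, F, D)
    q, k, v = feats @ w_q, feats @ w_k, feats @ w_v            # (T, F, D) each
    scores = q @ k.transpose(0, 2, 1) / np.sqrt(q.shape[-1])   # (T, F, F)
    scores -= scores.max(axis=-1, keepdims=True)               # stable softmax
    attn = np.exp(scores)
    attn /= attn.sum(axis=-1, keepdims=True)
    logits = (attn @ v) @ w_out                                # (T, F)
    return 1.0 / (1.0 + np.exp(-logits))                       # sigmoid mask

def enhance(noisy, params, n_fft=256, hop=128):
    """Step 3: apply the mask to the time-frequency features and resynthesize."""
    spec = stft(noisy, n_fft, hop)
    mask = frequency_attention_mask(np.abs(spec), params)
    return istft(mask * spec, n_fft, hop), mask

if __name__ == "__main__":
    rng = np.random.default_rng(1)
    t = np.arange(4096) / 16000.0
    noisy = np.sin(2 * np.pi * 440 * t) + 0.3 * rng.normal(size=t.size)
    enhanced, mask = enhance(noisy, init_params())
    print(enhanced.shape, mask.shape)
```

Because the weights here are random, the output is not actually denoised; the sketch only shows the data flow from waveform to time-frequency features, to a per-bin mask that can draw on distant frequency bins, and back to a waveform.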

Description

Technical field

[0001] The embodiments of the present application relate to the technical field of speech processing, and in particular to a speech enhancement model, an electronic device, a storage medium and a related method.

Background

[0002] Speech enhancement is a speech processing technology that, after a speech signal has been interfered with or even submerged by various kinds of noise, extracts the useful speech signal from the noise background in order to suppress and reduce the noise interference. Speech enhancement is widely used in real-time communication (RTC) audio and video conferencing, courier classrooms and other scenarios.

[0003] At present, when speech enhancement is performed on speech data, the temporal features of the speech data along the time direction and the short-term frequency features along the frequency direction are extracted, and speech enhancement is then performed on the speech data according to the temporal features and short-term frequency features to suppress the speech dat...

Application Information

IPC(8): G10L21/0232, G10L21/0264
Inventor: 赵胜奎
Owner: ALIBABA DAMO (HANGZHOU) TECH CO LTD