Short-time voice signal processing method and device, equipment and storage medium

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A speech signal processing and time-domain signal technology, applied in the field of short-term speech signal processing, can solve problems affecting recognition rate, affecting speech quality, environmental noise, etc., and achieve the effect of improving clarity and suppressing residual echo and environmental noise

Active Publication Date: 2018-10-23

SHANGHAI XIAODU TECHNOLOGY CO LTD

View PDF18 Cites 11 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0004] The existing technology has the following defects: when the terminal uses the audio input and audio output functions at the same time, for example, when the speaker and the microphone of the smart device work at the same time, the echo signal in the preprocessed sound signal is not eliminated cleanly, and still contains residual Echo and ambient noise

In the short-term voice signal processing system of the terminal, the residual echo and environmental noise in the short-term voice signal will reduce the clarity of the voice signal and affect the normal operation of the system

For example, in the voice message application scenario, residual echo and environmental noise will affect the voice quality; for a speech recognition system with a small word size, residual echo and environmental noise will affect the recognition rate

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0029] figure 1 It is a flow chart of a short-term speech signal processing method provided by Embodiment 1 of the present invention. This embodiment is applicable to the case of processing speech signals, and the method can be performed by a speech signal processing device. The device It is implemented by software and / or hardware, and generally can be integrated in a voice signal processing device. Devices for processing voice signals include but are not limited to computers and the like. Exemplarily, the voice signal processing device includes a terminal device with a speaker-microphone circuit, which may be an audio collection device such as a smart phone, a smart bracelet, a smart speaker, or a smart TV. Especially for the short-term voice signal processing system of the voice signal processing equipment, the method can effectively suppress the residual echo and environmental noise in the short-time voice signal, improve the clarity of the short-time voice signal, and ens...

Embodiment 2

[0053] figure 2 It is a flow chart of a short-term speech signal processing method provided by Embodiment 2 of the present invention. This embodiment optimizes step 102 on the basis of the above-mentioned embodiments: The frequency domain signal corresponding to the domain signal and the error time domain signal respectively determines the audio acquisition status that matches the near-end time domain signal. The audio acquisition status includes: single-speak status or double-speak status, including: acquisition of the near-end frequency domain of the current frame signal and the far-end frequency domain signal, and determine the error frequency domain signal according to the near-end frequency domain signal and the far-end frequency domain signal, wherein, the near-end frequency domain signal, the far-end frequency domain signal and the error frequency domain signal are related to the near-end time domain signal, far-end time-domain voice signal and error time-domain signal...

Embodiment 3

[0083] image 3 It is a flow chart of a short-term speech signal processing method provided by Embodiment 3 of the present invention. This embodiment optimizes step 103 on the basis of the above embodiments: according to the remote time domain signal, the error time domain signal and Determine the amplitude spectrum of the residual echo and the amplitude spectrum of the ambient noise corresponding to the near-end time domain signal in the audio acquisition state, including: determine the noise threshold of the error time domain signal according to the error time domain signal and the audio acquisition state, wherein the noise includes the residual echo and the environment Noise: determine the residual echo amplitude spectrum according to the error time domain signal, the remote time domain signal, the audio collection status and the noise threshold; determine the environmental noise amplitude spectrum according to the error time domain signal, the audio collection status and th...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

Embodiments of the invention disclose a short-time voice signal processing method and device, equipment and a storage medium. The method comprises the following steps of acquiring a near-end time-domain signal, and determining a far-end time-domain signal and an error time-domain signal which are matched with the near-end time-domain signal; determining an audio acquisition state matched with thenear-end time-domain signal, wherein the audio acquisition state comprises a single-talk state or a double-talk state; determining a residual echo amplitude spectrum and an ambient noise amplitude spectrum which are corresponding to the near-end time-domain signal according to the far-end time-domain signal, the error time-domain signal and the audio acquisition state; and generating an output time-domain signal which is matched with the near-end time-domain signal according to the residual echo amplitude spectrum, the ambient noise amplitude spectrum and the error time-domain signal. According to the technical scheme of the embodiments of the present invention, the residual echo and the ambient noise in a voice signal can be effectively inhibited in an echo scene, and the definition of the voice signal is improved.

Description

technical field [0001] Embodiments of the present invention relate to audio processing technologies, and in particular to a short-term speech signal processing method, device, equipment, and storage medium. Background technique [0002] With the continuous development of terminals, more and more terminals have audio input and audio output functions, and the output audio is picked up by the audio input device again, forming an echo. For example, a smart device with a speaker and a microphone. The presence of an echo signal will affect the quality of the audio signal. [0003] In the prior art, the processing of the echo of the terminal generally adopts an adaptive filter to construct an echo canceller to cancel the echo. The adaptive filter is subtracted from the near-end audio signal picked up by the microphone to output an estimated echo signal, and the subtraction result is called an error signal. Ideally, the error signal is considered to be the effective speech signal...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): H04M9/08

CPCH04M9/08

Inventor 陈超邓滨宋晨枫

Owner SHANGHAI XIAODU TECHNOLOGY CO LTD

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Short-time voice signal processing method and device, equipment and storage medium

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

Embodiment 2

Embodiment 3

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology