Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Short-time voice signal processing method and device, equipment and storage medium

A speech signal processing and time-domain signal technology, applied in the field of short-term speech signal processing, can solve problems affecting recognition rate, affecting speech quality, environmental noise, etc., and achieve the effect of improving clarity and suppressing residual echo and environmental noise

Active Publication Date: 2018-10-23
SHANGHAI XIAODU TECHNOLOGY CO LTD
View PDF18 Cites 11 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The existing technology has the following defects: when the terminal uses the audio input and audio output functions at the same time, for example, when the speaker and the microphone of the smart device work at the same time, the echo signal in the preprocessed sound signal is not eliminated cleanly, and still contains residual Echo and ambient noise
In the short-term voice signal processing system of the terminal, the residual echo and environmental noise in the short-term voice signal will reduce the clarity of the voice signal and affect the normal operation of the system
For example, in the voice message application scenario, residual echo and environmental noise will affect the voice quality; for a speech recognition system with a small word size, residual echo and environmental noise will affect the recognition rate

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Short-time voice signal processing method and device, equipment and storage medium
  • Short-time voice signal processing method and device, equipment and storage medium
  • Short-time voice signal processing method and device, equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0029] figure 1 It is a flow chart of a short-term speech signal processing method provided by Embodiment 1 of the present invention. This embodiment is applicable to the case of processing speech signals, and the method can be performed by a speech signal processing device. The device It is implemented by software and / or hardware, and generally can be integrated in a voice signal processing device. Devices for processing voice signals include but are not limited to computers and the like. Exemplarily, the voice signal processing device includes a terminal device with a speaker-microphone circuit, which may be an audio collection device such as a smart phone, a smart bracelet, a smart speaker, or a smart TV. Especially for the short-term voice signal processing system of the voice signal processing equipment, the method can effectively suppress the residual echo and environmental noise in the short-time voice signal, improve the clarity of the short-time voice signal, and ens...

Embodiment 2

[0053] figure 2 It is a flow chart of a short-term speech signal processing method provided by Embodiment 2 of the present invention. This embodiment optimizes step 102 on the basis of the above-mentioned embodiments: The frequency domain signal corresponding to the domain signal and the error time domain signal respectively determines the audio acquisition status that matches the near-end time domain signal. The audio acquisition status includes: single-speak status or double-speak status, including: acquisition of the near-end frequency domain of the current frame signal and the far-end frequency domain signal, and determine the error frequency domain signal according to the near-end frequency domain signal and the far-end frequency domain signal, wherein, the near-end frequency domain signal, the far-end frequency domain signal and the error frequency domain signal are related to the near-end time domain signal, far-end time-domain voice signal and error time-domain signal...

Embodiment 3

[0083] image 3 It is a flow chart of a short-term speech signal processing method provided by Embodiment 3 of the present invention. This embodiment optimizes step 103 on the basis of the above embodiments: according to the remote time domain signal, the error time domain signal and Determine the amplitude spectrum of the residual echo and the amplitude spectrum of the ambient noise corresponding to the near-end time domain signal in the audio acquisition state, including: determine the noise threshold of the error time domain signal according to the error time domain signal and the audio acquisition state, wherein the noise includes the residual echo and the environment Noise: determine the residual echo amplitude spectrum according to the error time domain signal, the remote time domain signal, the audio collection status and the noise threshold; determine the environmental noise amplitude spectrum according to the error time domain signal, the audio collection status and th...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

Embodiments of the invention disclose a short-time voice signal processing method and device, equipment and a storage medium. The method comprises the following steps of acquiring a near-end time-domain signal, and determining a far-end time-domain signal and an error time-domain signal which are matched with the near-end time-domain signal; determining an audio acquisition state matched with thenear-end time-domain signal, wherein the audio acquisition state comprises a single-talk state or a double-talk state; determining a residual echo amplitude spectrum and an ambient noise amplitude spectrum which are corresponding to the near-end time-domain signal according to the far-end time-domain signal, the error time-domain signal and the audio acquisition state; and generating an output time-domain signal which is matched with the near-end time-domain signal according to the residual echo amplitude spectrum, the ambient noise amplitude spectrum and the error time-domain signal. According to the technical scheme of the embodiments of the present invention, the residual echo and the ambient noise in a voice signal can be effectively inhibited in an echo scene, and the definition of the voice signal is improved.

Description

technical field [0001] Embodiments of the present invention relate to audio processing technologies, and in particular to a short-term speech signal processing method, device, equipment, and storage medium. Background technique [0002] With the continuous development of terminals, more and more terminals have audio input and audio output functions, and the output audio is picked up by the audio input device again, forming an echo. For example, a smart device with a speaker and a microphone. The presence of an echo signal will affect the quality of the audio signal. [0003] In the prior art, the processing of the echo of the terminal generally adopts an adaptive filter to construct an echo canceller to cancel the echo. The adaptive filter is subtracted from the near-end audio signal picked up by the microphone to output an estimated echo signal, and the subtraction result is called an error signal. Ideally, the error signal is considered to be the effective speech signal...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): H04M9/08
CPCH04M9/08
Inventor 陈超邓滨宋晨枫
Owner SHANGHAI XIAODU TECHNOLOGY CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products