Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Voice activity detection in presence of background noise

A voice activity and background noise technology, applied in voice analysis, instruments, etc., to solve problems such as increased possibility of missed speech, increased possibility of voice detection, bad user experience, bad voice activity detection, etc.

Active Publication Date: 2014-09-24
QUALCOMM INC
View PDF5 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

At low SNR or under rapidly changing non-stationary noise, a smooth long-term SNR will produce an inaccurate VAD_THR, resulting in an increased probability of missed speech or an increased probability of false speech detection
Also, some VAD techniques (e.g., Adaptive Multi-Rate Wideband or AMR-WB) work well for stationary types of noise, such as car noise, but produce extremely high Voice activity factor (attributable to widespread false detections)
[0008] Therefore, false indications of voice activity can lead to processing and emission of noisy signals
The processing and emission of noisy signals can create a poor user experience, especially if the noise emission period is interrupted by periods of inactivity from time to time due to voice activity detectors indicating no voice activity
Conversely, poor voice activity detection can lead to loss of a substantial portion of the voice signal
Loss of the initial part of the voice activity can result in the user needing to repeat parts of the conversation on a regular basis, which is an undesirable situation

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice activity detection in presence of background noise
  • Voice activity detection in presence of background noise
  • Voice activity detection in presence of background noise

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0026] The following detailed description, which references and incorporates drawings, describes and illustrates one or more particular embodiments. These embodiments have been shown and described in sufficient detail (they are provided not to be limiting but merely to exemplify and teach) to enable those skilled in the art to practice what is claimed. Therefore, for the sake of brevity, the description may omit certain information known to those skilled in the art.

[0027] In many speech processing systems, voice activity detection is typically estimated from an audio input signal, such as a microphone signal (eg, that of a mobile phone). Voice activity detection is an important function in many speech processing devices such as vocoders and speech recognition devices. Voice activity detection analysis can be performed in the time or frequency domain. Frequency-domain VAD is generally preferred over time-domain VAD in the presence of background noise and at low SNR. Frequ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

In speech processing systems, compensation is made for sudden changes in the background noise in the average signal-to-noise ratio (SNR) calculation. SNR outlier filtering may be used, alone or in conjunction with weighting the average SNR. Adaptive weights may be applied on the SNRs per band before computing the average SNR. The weighting function can be a function of noise level, noise type, and / or instantaneous SNR value. Another weighting mechanism applies a null filtering or outlier filtering which sets the weight in a particular band to be zero. This particular band may be characterized as the one that exhibits an SNR that is several times higher than the SNRs in other bands.

Description

[0001] Cross References to Related Applications [0002] This application claims priority to Provisional Patent Application No. 61 / 588,729, filed January 20, 2012, on the basis of 35 U.S.C. § 119(e). This provisional patent application is expressly incorporated herein by reference in its entirety. Background technique [0003] For applications where communications occur in noisy environments, it may be desirable to separate the desired speech signal from background noise. Noise may be defined as the combination of all signals that interfere with or otherwise degrade a desired signal. Background noise may include many noise signals generated within an acoustic environment, such as background conversations of other people, as well as reflections and reverberations from any of the desired signal and / or other signals. [0004] A signal activity detector, such as a voice activity detector (VAD), can be used to minimize the amount of unnecessary processing in the electronic devic...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L25/84
CPCG10L25/84G10L21/0208
Inventor 芬卡特拉曼·斯里尼瓦沙·阿提文卡特什·克里希南
Owner QUALCOMM INC
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products