Method, device and electronic equipment for voice activity detection

A voice activity detection technique using sub-band parameters, applied in the field of communications. It addresses the tendency to misjudge, the lack of adaptive adjustment and the unsatisfactory overall performance of existing methods, and improves voice activity detection performance.

Active Publication Date: 2011-05-04
HUAWEI TECH CO LTD

AI Technical Summary

Problems solved by technology

[0006] VAD methods based on a single classification parameter are prone to misjudgment.
Because the coefficients in the 14 decision conditions are constants, the decision criterion cannot adjust adaptively to the input signal, and the overall performance of the method is therefore not ideal.



Examples


Embodiment 1

[0025] Embodiment 1: a voice activity detection method. The method is shown in Figure 1.

[0026] In Figure 1, S100: receive the audio frame currently to be detected.

[0027] S110: acquire time-domain parameters and frequency-domain parameters from the audio frame currently to be detected. There may be exactly one time-domain parameter and one frequency-domain parameter, although this embodiment does not rule out multiple time-domain parameters and multiple frequency-domain parameters.

[0028] The time-domain parameter in this embodiment may be the zero-crossing rate, and the frequency-domain parameter may be the spectral sub-band energy. The time-domain parameter may also be a parameter other than the zero-crossing rate, and the frequency-domain parameter may also be a parameter other than the spectral sub-band energy. In order to facilitate the description of...
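The two parameters named in [0028] can be illustrated with a minimal sketch. It is not the patent's implementation: the function names `zero_crossing_rate` and `subband_energies`, the equal-width band split and the naive DFT are assumptions chosen only to keep the example dependency-free.

```python
import cmath
import math


def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    if len(frame) < 2:
        return 0.0
    crossings = sum(
        1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0)
    )
    return crossings / (len(frame) - 1)


def subband_energies(frame, num_bands):
    """Sum the one-sided power spectrum over equal-width sub-bands.

    Assumption: bands are equal-width; bins left over when the half
    spectrum is not divisible by num_bands are ignored.
    """
    n = len(frame)
    half = n // 2
    power = []
    for k in range(half):
        # Naive DFT bin k, O(n) per bin; fine for short frames.
        acc = sum(
            x * cmath.exp(-2j * math.pi * k * t / n)
            for t, x in enumerate(frame)
        )
        power.append(abs(acc) ** 2 / n)
    width = half // num_bands
    return [sum(power[b * width:(b + 1) * width]) for b in range(num_bands)]


# A 64-sample sine placed on DFT bin 4 concentrates its energy in the
# lowest of four sub-bands (bins 0 to 7).
tone = [math.sin(2 * math.pi * 4 * t / 64) for t in range(64)]
energies = subband_energies(tone, 4)
```

A production implementation would normally use an FFT and a perceptually motivated band layout; the sketch only shows what each parameter measures.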

Embodiment 2

[0076] Embodiment 2: a voice activity detection device. The structure of the device is shown in Figure 2.

[0077] The voice activity detection device in Figure 2 includes a first acquiring module 210, a second acquiring module 220 and a judging module 230. Optionally, the device may also include a receiving module 200.

[0078] The receiving module 200 is configured to receive an audio frame currently to be detected.

[0079] The first acquiring module 210 is configured to acquire time-domain parameters and frequency-domain parameters from audio frames. If the device includes the receiving module 200, the first acquiring module 210 may acquire the time-domain parameter and the frequency-domain parameter from the audio frame currently to be detected, as received by the receiving module 200. The first acquiring module 210 may output the acquired time-domain parameters and frequency-domain parameters, and the time domain parameters and frequency...

Embodiment 3

[0123] Embodiment 3: electronic equipment. The structure of the electronic equipment is shown in Figure 3.

[0124] The electronic equipment in Figure 3 includes a transceiver device 300 and a voice activity detection device 310.

[0125] The transceiver device 300 is used for receiving or sending audio signals.

[0126] The voice activity detection device 310 can obtain the audio frame currently to be detected from the audio signal received by the transceiver device 300. The technical solution of the voice activity detection device 310 is as described in Embodiment 2 and is not repeated here.

[0127] The electronic device in the embodiment of the present invention may be a mobile phone, a video processing device, a computer, a server, and the like.

[0128] The electronic device provided by the embodiment of the present invention adopts at least one decision polynomial whose coefficient is a var...


Abstract

The embodiments of the invention disclose a method, device and electronic equipment for voice activity detection. The method comprises the following steps: acquiring a time-domain classification parameter and a frequency-domain classification parameter from an audio frame; acquiring a first distance between the time-domain classification parameter and the long-term moving average of that parameter over historical background noise frames; acquiring a second distance between the frequency-domain classification parameter and the long-term moving average of that parameter over historical background noise frames; and determining whether the audio frame is a foreground voice frame or a background noise frame according to the first distance, the second distance and a group of decision polynomials based on the two distances. At least one coefficient in the decision polynomial group is a variable that changes with the operating mode of the voice activity detection or with the characteristics of the input signal. This gives the decision criterion the ability to adjust adaptively, thereby improving the performance of voice activity detection.
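The decision flow summarized in the abstract can be sketched as follows. Everything concrete here is an assumption made for illustration: the class name `AdaptiveVad`, the absolute-difference distance, the single linear decision polynomial, the mode names and the numeric coefficients are hypothetical and are not taken from the patent, which describes a group of polynomials without giving their form in this excerpt.

```python
from dataclasses import dataclass

# Hypothetical mode-dependent offsets. A more negative offset means the
# distances must be larger before a frame is declared foreground.
MODE_OFFSET = {"quality": -0.5, "bandwidth_saving": -2.0}


@dataclass
class AdaptiveVad:
    noise_zcr: float      # long-term moving average, time-domain parameter
    noise_energy: float   # long-term moving average, frequency-domain parameter
    mode: str = "quality"
    alpha: float = 0.95   # assumed smoothing factor for the moving averages
    slope: float = 4.0    # assumed fixed coefficient on the first distance

    def is_foreground(self, zcr, energy):
        d_time = abs(zcr - self.noise_zcr)         # first distance
        d_freq = abs(energy - self.noise_energy)   # second distance
        offset = MODE_OFFSET[self.mode]            # the variable coefficient
        # One illustrative decision polynomial in the two distances.
        foreground = d_freq + self.slope * d_time + offset > 0
        if not foreground:
            # Only background noise frames update the long-term averages.
            self.noise_zcr = (
                self.alpha * self.noise_zcr + (1 - self.alpha) * zcr
            )
            self.noise_energy = (
                self.alpha * self.noise_energy + (1 - self.alpha) * energy
            )
        return foreground


# The same frame is classified differently depending on the operating mode.
lenient = AdaptiveVad(noise_zcr=0.3, noise_energy=1.0, mode="quality")
strict = AdaptiveVad(noise_zcr=0.3, noise_energy=1.0, mode="bandwidth_saving")
lenient_result = lenient.is_foreground(0.1, 2.0)
strict_result = strict.is_foreground(0.1, 2.0)
```

Making the offset depend on the mode is one simple way to let a coefficient vary; a coefficient could equally be driven by a measured property of the input signal, as the abstract allows.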

Description

Technical field

[0001] The invention relates to the technical field of communications, and in particular to a voice activity detection method, device and electronic equipment.

Background

[0002] A communication system can use Voice Activity Detection (VAD) technology to determine when a caller starts talking and when the caller stops talking. When the caller stops speaking, the communication system may stop transmitting signals, thereby saving channel bandwidth. Current VAD technology is not limited to detecting the caller's voice; it can also detect signals such as color ring-back tones.

[0003] A VAD method usually includes: extracting classification parameters from the signal to be detected and inputting them into a binary decision criterion, which makes a decision and outputs a decision result. The decision result can be: the input signal is a foreground signal or the input signal is background no...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G10L11/02, G10L25/09, G10L25/78
CPC: G10L25/09, G10L25/78
Inventor: 王喆
Owner: HUAWEI TECH CO LTD