Method and device for voice activity detection

A voice activity detection and speech technology, applied in speech analysis, speech recognition, instruments, etc., can solve problems such as poor detection robustness, and achieve the effect of improving performance and strong detection robustness.

Inactive Publication Date: 2014-09-10
HARBIN UNIV OF SCI & TECH
View PDF8 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0009] The purpose of the present invention is to provide a voice activity detection method and device to solve the problem of poor detection robustness of voice activity detection under changing noise conditions in the prior art

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for voice activity detection
  • Method and device for voice activity detection
  • Method and device for voice activity detection

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0056] The specific implementation manners of the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. The following examples are used to illustrate the present invention, but are not intended to limit the scope of the present invention.

[0057] The present invention proposes a voice activity detection method, such as figure 1 shown, including the following steps:

[0058] S101 extracting the signal features of the clean speech signal and the signal features of the noisy speech signal, specifically including: preprocessing the discrete-time signal of the clean speech; performing discrete Fourier transform on the signal frame of the preprocessed clean speech signal to obtain the clean speech signal The magnitude spectrum of the clean speech signal is used as the signal feature of the clean speech signal; the discrete-time signal of the noisy speech is preprocessed; the signal frame of the preprocessed noisy...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a method and a device for voice activity detection. The method comprises the steps of extracting the signal characteristics of clean voice signals and the signal characteristics of noise mixed voice signals, carrying out dictionary training according to the signal characteristics of the clean voice signals to obtain a voice dictionary, dynamically updating predetermined noise training data according to the signal characteristics of the noise mixed voice signals, extracting the signal characteristics of the updated noise training data and carrying out online dictionary training to obtain a noise dictionary; performing sparse representation on the signal frames of a noise mixed voice signal input according to the voice dictionary and the noise dictionary, extracting a sparse coefficient in the sparse representation, and detecting the signal frames of the input noise mixed voice signal according to the sparse coefficient. The method and the device are capable of accurately recognizing the voice part and the non-voice part of a voice signal in a noise environment, and the performance of the voice activity detection in the varying noise environment is improved.

Description

technical field [0001] The invention relates to the technical field of voice signal processing, in particular to a voice activity detection method and device. Background technique [0002] One of the most important problems to be solved in analyzing and processing speech is to detect speech and non-speech in the speech signal. This task is called Voice activity detection (VAD). This technology plays an important role in the field of speech processing and largely affects the performance of other application technologies, typically robust speech recognition, speaker recognition, speech programming and transmission, and joint noise reduction and echo cancellation Wait. [0003] The basic methods of traditional VAD include G.729 standard, etc. G.729 standard calculates line spectral frequency, full-band energy, low-frequency energy (<1khz), and zero-crossing rate. Then set the threshold to simply classify each frame of the signal, and at the same time use smooth and adaptiv...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G10L15/20G10L21/0308
Inventor 何勇军孙广路谢怡宁郑云龙
Owner HARBIN UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products