Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Voice activity detection system based on fundamental frequency and calculation method thereof

A technology of endpoint detection and calculation method, which is applied in the direction of speech analysis, instrumentation, etc., to achieve high robustness effect

Active Publication Date: 2014-10-08
PACHIRA INFORAMTION TECH BEIJING CO LTD
View PDF5 Cites 19 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The technical problem to be solved by the present invention is how to provide an endpoint detection calculation method, so that the endpoint detection system has high robustness, and can maintain high detection accuracy even in the case of poor signal-to-noise ratio

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice activity detection system based on fundamental frequency and calculation method thereof
  • Voice activity detection system based on fundamental frequency and calculation method thereof
  • Voice activity detection system based on fundamental frequency and calculation method thereof

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0028] Embodiments of the present invention will be further described in detail below in conjunction with the accompanying drawings and examples. The following examples are used to illustrate the present invention, but should not be used to limit the scope of the present invention.

[0029] This embodiment provides a fundamental frequency-based endpoint detection system, including a framing module, which performs framing on an input signal;

[0030] The voice enhancement module enhances the voice data before calculating the formant, so as to avoid the influence of the spectrum leakage of the frequency band other than the pitch frequency on the low frequency after the FFT calculation; and combines the time domain energy information and the frequency domain information, using The low energy in the time domain is used as the background energy threshold to filter the silent part;

[0031] The formant calculation module determines the corresponding data segment in the autocorrelat...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a voice activity detection system based on the fundamental frequency and a calculation method thereof. All possible positions where a fundamental frequency appears are acquired through a fundamental frequency extraction algorithm, and the fundamental frequency is determined by cost. However, because the fundamental frequency may be interfered by low-frequency noise, whether a point is voice is judged by aid of the fact that a position where a fundamental frequency appears has a harmonic structure. Meanwhile, the speed of voice activity detection is increased and the detection accuracy is improved according to adaption of background energy. According to the voice activity detection system based on the fundamental frequency and the calculation method thereof of the invention, the voice activity detection system is enabled to have high robustness under the condition of low signal-to-noise ratio. When noise is difficult to distinguish in a time domain, the method enables noise to be correctly distinguished in a frequency domain according to significantly different characteristics of spectral distribution of noise signals and voice signals from time-domain distribution. The method can be widely applied to the field of voice signal processing.

Description

technical field [0001] The invention relates to an endpoint detection technology of a voice signal, in particular to an endpoint detection technology of a voice signal based on a fundamental frequency. Background technique [0002] The main purpose of the endpoint detection technology (Voice Activity Detection) is to detect a segment containing a voice signal from a given input voice signal, and give its start and end points. In recent years, with the development of computers, speech has gradually become the main way of human-computer interaction. Endpoint detection technology plays an important role in speech recognition, speech analysis and semantic understanding. A better speech endpoint detection result is very important to improve the accuracy of speech recognition and processing speed. [0003] Currently, endpoint detection techniques include methods such as time-domain energy, speech correlation, frequency-domain entropy, and model matching. These methods can achieve...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L25/87G10L21/0232
Inventor 赵茂祥贾昌辉李全忠蒲瑶何国涛
Owner PACHIRA INFORAMTION TECH BEIJING CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products