Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

High-efficiency voice detecting method

A speech detection and non-speech technology, applied in speech analysis, instruments, etc., can solve problems such as speech detection system performance degradation, achieve the effect of improving efficiency and robustness, and improving communication efficiency

Active Publication Date: 2014-03-19
中科极限元(杭州)智能科技股份有限公司
View PDF4 Cites 56 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

At present, various mainstream voice detection methods can quickly and accurately detect voice signals in various quiet environments; voice detection systems have high accuracy in stable noise environments and various non-stationary noise environments with high signal-to-noise ratios. rate; however, in the face of various non-stationary random noises in various complex environments, the performance of the speech detection system degrades seriously

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • High-efficiency voice detecting method
  • High-efficiency voice detecting method
  • High-efficiency voice detecting method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0020] In order to make the objectives, technical solutions, and advantages of the present invention clearer, the following further describes the present invention in detail in conjunction with specific embodiments and with reference to the accompanying drawings.

[0021] It should be noted that in the drawings or description of the specification, similar or identical parts use the same drawing numbers. The implementations not shown or described in the drawings are those known to those of ordinary skill in the art. In addition, although this article may provide demonstrations of parameters including specific values, it should be understood that the parameters need not be exactly equal to the corresponding values, but can be approximated to the corresponding values ​​within acceptable error tolerances or design constraints.

[0022] The present invention proposes an efficient voice detection mechanism. This mechanism performs two-stage voice detection on the audio stream. First, t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a high-efficiency voice detecting method. The method comprises the following steps: analyzing the short-time energy and the short-time zero-crossing rate of an original audio frequency on a time domain and removing parts of non-voice signals; analyzing the spectral envelop characteristic and the entropy characteristic of a preserved audio frequency signal subband on a frequency domain and further removing parts of non-voice signals; forming an audio frequency segment by continuous frames with similar characteristics in each preserved frame of audio frequency signals; calculating the average value of Mel-frequency Cepstral coefficient of each frame in each audio frequency, respectively inputting the average values into a voice gaussian mixture model and various non-voice gaussian mixture models, and performing band-level judgment on whether the audio frequency segment contains voice data according to the output probability of each model, thereby finally obtaining a voice detecting result. The method is capable of detecting voice signals from audio frequency data streams under various complex environments, and positioning the boundary between voice segment data and non-voice segment data relatively correctly.

Description

Technical field [0001] The invention relates to the field of intelligent information processing, in particular to an efficient voice detection method. Background technique [0002] Voice is one of the main means for humans to communicate information. Voice detection technology has always occupied an important position in the field of voice signal processing. As a preprocessing module for voice recognition, speaker recognition, and voice coding, the voice detection system will be robust It directly affects the performance of other voice processing modules. In the face of random noise in various complex environments, how to accurately locate the voice segment data through an efficient method and effectively distinguish voice and non-voice signals has become a research hotspot at home and abroad, and has attracted more and more attention. . The voice detection system has great practical value. The high-quality robust voice detection technology has been widely used in various commu...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L25/78
Inventor 陶建华刘斌
Owner 中科极限元(杭州)智能科技股份有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products