Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Double-threshold limited place name speech endpoint detection method

An endpoint detection and double-threshold technology, applied in speech analysis, speech recognition, instruments, etc., can solve the problem of low accuracy of endpoint detection, achieve the effect of avoiding loss of voice signals, improving accuracy, and reducing environmental requirements

Active Publication Date: 2017-06-13
SOUTH CHINA UNIV OF TECH
View PDF6 Cites 13 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] In speech recognition technology, endpoint detection technology is an extremely important link in speech recognition, and its effect directly affects the final recognition result. The traditional endpoint detection method based on short-term energy and zero-crossing rate is an ideal environment. It can only be applied in the medium, and for the place name speech signal of isolated words, the accuracy of endpoint detection is relatively low

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Double-threshold limited place name speech endpoint detection method
  • Double-threshold limited place name speech endpoint detection method
  • Double-threshold limited place name speech endpoint detection method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0030] The present invention will be further described in detail below in conjunction with the embodiments and the accompanying drawings, but the embodiments of the present invention are not limited thereto.

[0031] In the endpoint detection process of the place-name speech signal, if a section of place-name speech is first in the speech segment, then in the silent segment, and then enters the normal speech segment, then the traditional endpoint detection method will consider the preceding segment of the normal speech segment as a noise segment, and then Re-cutting the voice signal will lead to the loss of the voice signal. For example, in the pronunciation of "Shijiazhuang", the pronunciation of "Shi" is very light and short, which is difficult to recognize.

[0032] And the double-threshold place-name speech endpoint detection method that present embodiment provides, based on the improved short-term average energy and zero-crossing rate, by adding the variable silence1 that ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a double-threshold limited place name speech endpoint detection method. Starting from a first frame signal, the energy of each frame voice signal and the minimum energy threshold value and the maximum energy threshold value are judged, and The zero crossing rate and the zero crossing rate threshold are determined to determine how the next frame signal is detected; in the case of possible access to voice status, by increasing the variable, the voice signals that appear in the speech section before the light time are reserved. Thedouble door limited place name speech endpoint detection method combines the characteristics of names of speech signal isolated words, improves double threshold of the traditional method, guarantee the first part of the speech signal light and short duration is not be judged as noise, so as to avoid the loss of speech signal, therefore the accuracy of endpoint detection and the adaptability of field application environment are improved, and the requirement of environment is reduced.

Description

technical field [0001] The invention belongs to the field of voice endpoint detection, in particular to a double-threshold place name voice endpoint detection method. Background technique [0002] With the rapid economic development and the increasingly prominent trend of globalization, the modern logistics industry has achieved unprecedented development in developed countries, and has produced huge economic and social benefits. Logistics resources include transportation, warehousing, sorting, packaging, distribution, etc. These resources are scattered in many fields, including manufacturing, agriculture, and distribution. [0003] In the sorting process, sorting is basically carried out manually at this stage. Since the workers are in a noisy working environment for a long time, they will inevitably have a certain sense of fatigue in their minds and bodies, and the singleness and repetition of work tasks will also make Their working conditions are too relaxed, which will i...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/22G10L15/05
CPCG10L15/05G10L15/22
Inventor 谢巍董万里
Owner SOUTH CHINA UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products