Double-threshold limited place name speech endpoint detection method

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
An endpoint detection and double-threshold technology, applied in speech analysis, speech recognition, instruments, etc., can solve the problem of low accuracy of endpoint detection, achieve the effect of avoiding loss of voice signals, improving accuracy, and reducing environmental requirements

Active Publication Date: 2017-06-13

SOUTH CHINA UNIV OF TECH

View PDF6 Cites 13 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0005] In speech recognition technology, endpoint detection technology is an extremely important link in speech recognition, and its effect directly affects the final recognition result. The traditional endpoint detection method based on short-term energy and zero-crossing rate is an ideal environment. It can only be applied in the medium, and for the place name speech signal of isolated words, the accuracy of endpoint detection is relatively low

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0030] The present invention will be further described in detail below in conjunction with the embodiments and the accompanying drawings, but the embodiments of the present invention are not limited thereto.

[0031] In the endpoint detection process of the place-name speech signal, if a section of place-name speech is first in the speech segment, then in the silent segment, and then enters the normal speech segment, then the traditional endpoint detection method will consider the preceding segment of the normal speech segment as a noise segment, and then Re-cutting the voice signal will lead to the loss of the voice signal. For example, in the pronunciation of "Shijiazhuang", the pronunciation of "Shi" is very light and short, which is difficult to recognize.

[0032] And the double-threshold place-name speech endpoint detection method that present embodiment provides, based on the improved short-term average energy and zero-crossing rate, by adding the variable silence1 that ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a double-threshold limited place name speech endpoint detection method. Starting from a first frame signal, the energy of each frame voice signal and the minimum energy threshold value and the maximum energy threshold value are judged, and The zero crossing rate and the zero crossing rate threshold are determined to determine how the next frame signal is detected; in the case of possible access to voice status, by increasing the variable, the voice signals that appear in the speech section before the light time are reserved. Thedouble door limited place name speech endpoint detection method combines the characteristics of names of speech signal isolated words, improves double threshold of the traditional method, guarantee the first part of the speech signal light and short duration is not be judged as noise, so as to avoid the loss of speech signal, therefore the accuracy of endpoint detection and the adaptability of field application environment are improved, and the requirement of environment is reduced.

Description

technical field [0001] The invention belongs to the field of voice endpoint detection, in particular to a double-threshold place name voice endpoint detection method. Background technique [0002] With the rapid economic development and the increasingly prominent trend of globalization, the modern logistics industry has achieved unprecedented development in developed countries, and has produced huge economic and social benefits. Logistics resources include transportation, warehousing, sorting, packaging, distribution, etc. These resources are scattered in many fields, including manufacturing, agriculture, and distribution. [0003] In the sorting process, sorting is basically carried out manually at this stage. Since the workers are in a noisy working environment for a long time, they will inevitably have a certain sense of fatigue in their minds and bodies, and the singleness and repetition of work tasks will also make Their working conditions are too relaxed, which will i...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G10L15/22G10L15/05

CPCG10L15/05G10L15/22

Inventor 谢巍董万里

Owner SOUTH CHINA UNIV OF TECH

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Double-threshold limited place name speech endpoint detection method

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology