Phonetic recognition system

A speech recognition and speech technology, applied in speech recognition, speech analysis, instruments, etc., can solve the problems of unsatisfactory practical use, slow response speed of speech recognition system, large amount of calculation, etc., to reduce the amount of decoding calculation and storage capacity , Improve the effect of recognition speed

Inactive Publication Date: 2005-03-23
INST OF ACOUSTICS CHINESE ACAD OF SCI +1
View PDF0 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, if CDHMM is used in a speech recognition system with a large vocabulary, the Gaussian probability needs to be calculated multiple times when the decoding operation unit performs decoding. Usually, the amount of calculation required in the decoding process is concentrated on the calculation of the Gaussian probability, which requires a lot of Calculations
When large-vocabulary speech recognition is performed on embedded hardware platforms with limited resources such as mobile phones, the response speed of the speech recognition system will be very slow, which cannot meet the needs of actual use.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Phonetic recognition system
  • Phonetic recognition system
  • Phonetic recognition system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0053] The method for compressing the feature vector set used in the speech recognition system will be described first below.

[0054] There are many kinds of speech features, such as LPC coefficients, cepstral coefficients, filter bank coefficients, Mel filter frequency coefficients (MFCC), etc. The commonly used feature parameter is MFCC. Here we don't care about which parameters, the present invention is applicable to any kind of characteristic parameters. For the convenience of understanding, the method for compressing the feature vector set of the speech recognition system according to the present invention will be described below by taking MFCC coefficients as an example.

[0055] Assuming that each frame of speech uses L MFCC parameters, L first-order difference MFCC parameters and L second-order difference MFCC parameters are combined into 3*L=X dimension vectors as feature parameters, forming a speech feature set of X dimensions, correspondingly The dimension of the ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a speech sound recognition system and includes modulus shift conversion unit, which transfers the analogue signal of the speech sound into the digital signals, feature extraction unit, which carries out the frame treatment to the digital signals and extracts the feature of every frame to get the feature vector sequence, feature codes, which is composed of certain number of codon, quantization enencoding unit, which transfers the feature vector sequence of the input spoken words into the feature codes sequence in according to the feature codes, probability table, which stores probability value of every codon corresponding to the gauss codes in the feature codes and deencoding arithmetic element, which decodes the feature codes sequence to get the recognized results and searches for the gauss codes with the highest matching probability directly in the probability table. The speech sound recognition system can increases the recognition speed of the system on the basis of guaranteeing the performance of the speech sound recognition system.

Description

technical field [0001] The invention relates to a speech recognition system. Background technique [0002] Almost all current speech recognition systems use methods based on statistical pattern recognition. In all speech recognition systems, it is necessary to convert the time-domain sound waves of speech input into a digital vector feature to describe and distinguish different pronunciations, which we call Speech features, based on which a sound model is established for all pronunciations, which is usually called an acoustic model in the field of speech recognition. All speech recognition systems must have an acoustic model; at the same time, for large vocabulary continuous speech recognition systems, a language model is also required. The purpose of speech recognition is to give a string of sound feature sequences as input conditions, use acoustic models and language models, and use search algorithms to output recognition results, such as words, words or sentences. In th...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/00G10L15/02G10L15/06G10L15/08
Inventor 潘接林韩疆刘建颜永红庹凌云张建平
Owner INST OF ACOUSTICS CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products