Method and system for automatic speech recognition

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
a speech recognition and automatic technology, applied in speech recognition, speech analysis, instruments, etc., can solve the problems of difficult to achieve a better recognition and lower recognition accuracy of obscure words

Active Publication Date: 2014-07-31

TENCENT TECH (SHENZHEN) CO LTD

View PDF25 Cites 26 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Benefits of technology

The patent describes an automatic speech recognition system that uses a combination of linguistic and acoustic models to recognize speech. The system includes a computer with processors and memory, a classifying process module, a language model training module, a weight merging module, a resource construction module, and a decoder. The system obtains speech corpus categories by classifying and calculating raw speech corpus, and then uses these categories to train language models that correspond to them. The system also uses an interpolation language model that is obtained through a weighted interpolation of the language models. The system decodes input speech using the trained language models and outputs a character string with the highest probability as a recognition result. The technical effect of this system is that it improves the accuracy and efficiency of speech recognition.

Problems solved by technology

However, most of the conventional speech recognition technology is based on the universal speech recognition application that constructs the model for the common speech recognition, in this situation, the training corpus of language model is based on the data collection and actual input of users, though it reflects well the speech habits of the users to some extent and often has a better recognition effect for the daily expression, because of less frequent obscure words in the training corpus of the language model, such as medicine name, place name, etc., it can't form an effective probability statistics model, the probability value of the character string corresponding to the obscure words in the language model is very low, so when it needs to recognize the obscure words spoken by the user, a problem of data offset often happens, it means the recognized character string is not the words spoken by the user, in other words, the recognition accuracy for the speech of the obscure words is lower, thus it is difficult to achieve a better recognition result.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0034]Reference will now be made in detail to embodiments, examples of which are illustrated in the accompanying drawings. In the following detailed description, numerous specific details are set forth in order to provide a thorough understanding of the subject matter presented herein. But it will be apparent to one skilled in the art that the subject matter may be practiced without these specific details. In other instances, well-known methods, procedures, components, and circuits have not been described in detail so as not to unnecessarily obscure aspects of the embodiments.

[0035]The following will make further detailed explanation to the present invention combining with attached drawings and specific embodiment.

[0036]FIG. 2 is a processing flowchart diagram of automatic speech recognition method mentioned in the present invention. Refer to FIG. 2, this flow includes:

[0037]Step 201, carry out the corpus classification calculation for the raw corpus so as to obtain different catego...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

An automatic speech recognition method includes at a computer having one or more processors and memory for storing one or more programs to be executed by the processors, obtaining a plurality of speech corpus categories through classifying and calculating raw speech corpus; obtaining a plurality of classified language models that respectively correspond to the plurality of speech corpus categories through a language model training applied on each speech corpus category; obtaining an interpolation language model through implementing a weighted interpolation on each classified language model and merging the interpolated plurality of classified language models; constructing a decoding resource in accordance with an acoustic model and the interpolation language model; and decoding input speech using the decoding resource, and outputting a character string with a highest probability as a recognition result of the input speech.

Description

RELATED APPLICATIONS[0001]This application is a continuation application of PCT Patent Application No. PCT / CN2013 / 086707, entitled “METHOD AND SYSTEM FOR AUTOMATIC SPEECH RECOGNITION” filed on Nov. 7, 2013, which claims priority to Chinese Patent Application No. 201310033201.7, “METHOD AND SYSTEM FOR AUTOMATIC SPEECH RECOGNITION,” filed on Jan. 29, 2013, both of which are hereby incorporated by reference in their entirety.FIELD OF THE INVENTION[0002]The present invention relates to the technical field of Automatic Speech Recognition (ASR), especially relates to a method and system for automatic speech recognition.BACKGROUND OF THE INVENTION[0003]Automatic speech Recognition technology is a sort of technology which transforms the lexical content of human speech into input characters that can be read by computers. The speech recognition has a complicated processing flow, mainly including four processes of acoustic model training, language model training, decoding resource constructing...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(United States)

IPC IPC(8): G10L15/06

CPCG10L15/063G10L15/183G10L15/197G10L15/26

Inventor RAO, FENGLU, LICHEN, BOYUE, SHUAIZHANG, XIANGWANG, ERYUXIE, DADONGLI, LOULU, DULING

Owner TENCENT TECH (SHENZHEN) CO LTD

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Method and system for automatic speech recognition

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Benefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology