Speech recognition by statistical language using square-rootdiscounting
A language and training corpus technology, applied in speech recognition, speech analysis, instruments, etc., can solve the problems of underestimating the probability of events with large counts, prolonging calculation time, rounding errors, etc., and achieve the effect of reliable statistical language modeling
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0073] According to the example shown in Fig. 1, the speaker utters a sentence including three consecutive words a, b and c. The utterance is detected by the microphone 1 and the corresponding microphone signal has been digitized and input into the speech recognition device. The speech recognition device accesses a database comprising training corpus, such as trigrams and / or bigrams found in a large number of novels or radio news broadcast scripts.
[0074] Assume that the speech recognition device has recognized two initial words a and b2 uttered by the speaker. Then the task is to predict the next word of the considered trigram. Based on the training corpus, N possible trigrams (events) e starting from words a and b are known 1 to eN . Every trigram e j (j=1,..,N) is found in the corpus with a frequency of (number of counts) c j .
[0075] In order to predict word c to complete the trigram, the speech recognition device calculates the probability of different candidate...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com