
Dictionary generation and speech recognition method and device

A dictionary generation and speech recognition technology, applied in speech recognition, speech analysis, instruments, etc., that addresses problems such as high latency, the inability to display results on screen in real time, and word fragments that do not correspond to the pronunciation of English words.

Pending Publication Date: 2021-08-13
BEIJING BAIDU NETCOM SCI & TECH CO LTD

AI Technical Summary

Problems solved by technology

[0002] In the prior art, a dictionary of word fragments is usually generated from an English dictionary based on the adjacent sequences with the highest co-occurrence frequency in English words. Word fragments generated in this way are not guaranteed to correspond to the pronunciation of the English words, which leads to heavy computation, high latency, and an inability to display results on screen in real time when recognizing English words with such a dictionary.



Detailed Description of Embodiments

[0019] Exemplary embodiments of the present disclosure are described below in conjunction with the accompanying drawings, which include various details of the embodiments of the present disclosure to facilitate understanding, and they should be regarded as exemplary only. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the disclosure. Also, descriptions of well-known functions and constructions are omitted in the following description for clarity and conciseness.

[0020] Figure 1 is a schematic diagram according to the first embodiment of the present disclosure. As shown in Figure 1, the method for generating a dictionary in this embodiment may specifically include the following steps:

[0021] S101. Obtain an English dictionary;

[0022] S102. Segment each English word according to the pronunciation of each English word...
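As a rough illustration of steps S101 and S102, the following Python sketch builds a fragment dictionary in which every word is cut into exactly as many fragments as it has pronunciation segments. The toy lexicon, the pronunciation notation, and the names `TOY_LEXICON`, `split_evenly`, and `build_fragment_dictionary` are all assumptions made for this example. In particular, the even-length split is only a placeholder; the excerpt does not state the rule the disclosure uses to choose the cut points.

```python
from typing import Dict, List

# Hypothetical toy lexicon (invented for illustration):
# English word -> list of pronunciation segments.
TOY_LEXICON: Dict[str, List[str]] = {
    "hello": ["HH AH", "L OW"],
    "water": ["W AO", "T ER"],
    "cat": ["K AE T"],
}


def split_evenly(word: str, n: int) -> List[str]:
    """Split `word` into `n` contiguous, non-empty fragments of near-equal length.

    Placeholder rule only: it guarantees the fragment count, not that the
    cut points follow the pronunciation.
    """
    if not 1 <= n <= len(word):
        raise ValueError("n must be between 1 and len(word)")
    base, extra = divmod(len(word), n)
    fragments: List[str] = []
    start = 0
    for i in range(n):
        # The first `extra` fragments receive one additional character.
        end = start + base + (1 if i < extra else 0)
        fragments.append(word[start:end])
        start = end
    return fragments


def build_fragment_dictionary(lexicon: Dict[str, List[str]]) -> Dict[str, str]:
    """Map each pronunciation segment to a word fragment.

    For every word, the number of fragments equals the number of
    pronunciation segments, so the two lists can be paired one-to-one.
    """
    dictionary: Dict[str, str] = {}
    for word, pron_segments in lexicon.items():
        fragments = split_evenly(word, len(pron_segments))
        for fragment, segment in zip(fragments, pron_segments):
            # Keep the first fragment seen for a given pronunciation segment.
            dictionary.setdefault(segment, fragment)
    return dictionary


fragment_dictionary = build_fragment_dictionary(TOY_LEXICON)
```

Keying the dictionary by pronunciation segment is one possible design choice; it makes the later lookup a single hash access per segment.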


Abstract

The invention discloses a dictionary generation and speech recognition method, and relates to the technical field of natural language processing and speech processing. The dictionary generation method comprises the following steps: acquiring an English dictionary; segmenting each English word according to the pronunciation of each English word in the English dictionary so that the number of word segments obtained by segmenting each English word is equal to the number of pronunciation segments of each English word; and generating a dictionary according to the word segments of the English words. The speech recognition method comprises the following steps: acquiring an input audio; and searching a target word segment corresponding to the pronunciation segment of the input audio in a dictionary, and generating a recognition result of the input audio according to the searched target word segment. The word segments in the generated dictionary are in one-to-one correspondence with the pronunciation segments of the English words so that the accuracy and efficiency of speech recognition can be improved.
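The recognition step in the abstract can be sketched as a per-segment dictionary lookup. The Python example below assumes that an upstream component (not shown, and not specified in this excerpt) has already converted the input audio into a sequence of pronunciation segments. The dictionary contents and the names `FRAGMENT_DICT` and `stream_recognize` are hypothetical.

```python
from typing import Dict, Iterable, Iterator

# Hypothetical fragment dictionary (invented for illustration):
# pronunciation segment -> word fragment.
FRAGMENT_DICT: Dict[str, str] = {
    "HH AH": "hel",
    "L OW": "lo",
    "W AO": "wa",
    "T ER": "ter",
}


def stream_recognize(
    pron_segments: Iterable[str],
    dictionary: Dict[str, str],
    unknown: str = "<unk>",
) -> Iterator[str]:
    """Yield the partial recognition result after each pronunciation segment.

    Each segment is looked up once, so a partial result is available as
    soon as the segment arrives.
    """
    partial = ""
    for segment in pron_segments:
        # Segments missing from the dictionary are marked rather than dropped.
        partial += dictionary.get(segment, unknown)
        yield partial


partials = list(stream_recognize(["HH AH", "L OW"], FRAGMENT_DICT))
```

Because each pronunciation segment maps to one word fragment, the partial text can grow segment by segment, which is the kind of behavior needed to display results on screen as the audio arrives.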

Description

Technical field

[0001] The present disclosure relates to the technical field of data processing, in particular to the technical fields of natural language processing and speech processing. Provided are a method, an apparatus, an electronic device, and a readable storage medium for dictionary generation and speech recognition.

Background

[0002] In the prior art, a dictionary of word fragments is usually generated from an English dictionary based on the adjacent sequences with the highest co-occurrence frequency in English words. Word fragments generated in this way are not guaranteed to correspond to the pronunciation of the English words, which leads to heavy computation, high latency, and an inability to display results on screen in real time when recognizing English words with such a dictionary.

Summary of the invention

[0003] According to the first aspect of the present disclosure, a met...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G10L15/06, G10L15/22
CPC: G10L15/063, G10L15/22, G10L2015/0633
Inventors: 张辽, 臧启光, 付晓寅, 蒋正翔, 赵银楼
Owner: BEIJING BAIDU NETCOM SCI & TECH CO LTD