Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Chinese speech recognition method based on cyclic neural network language model and deep neural network acoustic model

A technology of cyclic neural network and deep neural network, which is applied in the field of speech recognition technology and deep learning, can solve the problems of high time delay and low accuracy of acoustic models, and achieve high accuracy and good real-time performance in line with the trend of technological development Effect

Active Publication Date: 2018-09-04
SOUTH CHINA UNIV OF TECH
View PDF10 Cites 32 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0014] The main purpose of the present invention is to overcome the shortcoming and deficiency of prior art, provide a kind of speech recognition method based on recurrent neural network language model and deep neural network acoustic model, combine the accuracy of recurrent neural network and the low delay of deep neural network , which solves the shortcomings of the low accuracy of the existing n-gram language model and the high latency of the long-short-term memory network (LSTM) acoustic model

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Chinese speech recognition method based on cyclic neural network language model and deep neural network acoustic model
  • Chinese speech recognition method based on cyclic neural network language model and deep neural network acoustic model
  • Chinese speech recognition method based on cyclic neural network language model and deep neural network acoustic model

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0070] Such as figure 1 As shown, the present invention is mainly divided into three parts: step 101-step 105 training of recurrent neural network language model, step 201-step 206 training of deep neural network acoustic model, step 301-step 303 recognition and decoding.

[0071] 1. Train a language model based on a recurrent neural network:

[0072] Step 101, using web crawler scripts such as srcapy to crawl a large amount of Chinese Internet texts;

[0073] Step 102, use the BeautifulSoup toolkit to analyze the crawled Internet text, and delete English letters, garbled symbols, title numbers, curly braces, parentheses, angle brackets, square brackets, spaces, commas, commas, double quotation marks, single quotation marks, etc. in the text content, replace periods, semicolons, question marks, and exclamation marks with carriage returns, leaving pure Chinese text;

[0074] Step 103, utilizing the Jieba Chinese word segmentation tool to carry out word segmentation to the pur...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a Chinese speech recognition method based on a cyclic neural network language model and a deep neural network acoustic model. The Chinese speech recognition method mainly includes the following steps: S1, training the language model based on a cyclic neural network; S2, training the acoustic model based on a deep neural network; and S3, employing a Viterbi search scheme bya decoder for Chinese speech recognition based on the cyclic neural network language model and the deep neural network acoustic model. The Chinese speech recognition method based on a cyclic neural network language model and a deep neural network acoustic model combines the accuracy of the cyclic neural network and the low delay of the deep neural network, solves the shortcomings that a current n-gram language model has low accuracy and a long and short time memory acoustic model has high time delay, and achieves Chinese speech recognition with low delay and higher accuracy.

Description

technical field [0001] The invention relates to speech recognition technology and deep learning technology, in particular to a speech recognition method based on a recurrent neural network language model and a deep neural network acoustic model. Background technique [0002] With the popularity of smart hardware products such as Amazon Echo smart speakers, as the most important means of human-computer interaction, the market for voice recognition is also rising. According to the "2015-2020 Global and China Voice Industry Report" released by the internationally renowned market research company Research and Markets in 2016, as the application of voice in the smart industry continues to deepen, by 2020, the global voice market is expected to reach 19.17 billion US dollars . [0003] The mainstream representative of traditional continuous speech recognition technology is GMM-HMM. Around 2011, Microsoft, Google and other companies began to apply deep neural networks to speech re...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L15/02G10L15/06G10L15/14G10L15/16G10L15/18G10L15/26G06F17/27
CPCG10L15/02G10L15/063G10L15/144G10L15/16G10L15/18G10L15/26G10L2015/025G06F40/279
Inventor 贺前华吴俊峰汪星庞文丰
Owner SOUTH CHINA UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products