Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Voice recognition method and system

A sound recognition and sound signal technology, applied in speech recognition, speech analysis, instruments, etc., can solve the problem of low recognition accuracy, and achieve the effect of improving the accuracy and maintaining the amount of calculation.

Pending Publication Date: 2020-03-17
ALIBABA GRP HLDG LTD
View PDF3 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The embodiment of the present application provides a voice recognition method and system to at least solve the technical problem of low recognition accuracy of voice recognition methods in complex environments in the prior art

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice recognition method and system
  • Voice recognition method and system
  • Voice recognition method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0031] According to the embodiment of the present application, an embodiment of a voice recognition method is also provided. It should be noted that the steps shown in the flow chart of the accompanying drawings can be executed in a computer system such as a set of computer-executable instructions, and, Although a logical order is shown in the flowcharts, in some cases the steps shown or described may be performed in an order different from that shown or described herein.

[0032] The method embodiment provided in Embodiment 1 of the present application may be executed in a mobile terminal, a computer terminal, or a similar computing device. figure 1A block diagram of a hardware structure of a computer terminal (or mobile device) for realizing the voice recognition method is shown. Such as figure 1 As shown, the computer terminal 10 (or mobile device 10) may include one or more (shown by 102a, 102b, ..., 102n in the figure) processor 102 (the processor 102 may include but not...

Embodiment 2

[0082] According to an embodiment of the present application, a voice recognition device for implementing the above voice recognition method is also provided, such as Figure 8 As shown, the apparatus 800 includes: an acquisition module 802 , an extraction module 804 and an identification module 806 .

[0083] Among them, the acquisition module 802 is used to obtain the sound signal; the extraction module 804 is used to extract the feature of the sound signal to obtain the acoustic feature information of the sound signal; the recognition module 806 is used to identify the acoustic feature information by using the acoustic model and the language model to obtain The recognition result of the sound signal, wherein the acoustic model includes: LC-BLSTM model and DFSMN model.

[0084] Specifically, the above-mentioned sound signal may be a voice uttered by the user, and the voice uttered by the user may be collected by a voice collection device such as a microphone. Since the soun...

Embodiment 3

[0107] According to an embodiment of the present application, a voice recognition system is also provided, such as Figure 9 As shown, the system includes:

[0108] The acoustic feature extraction module 92 is configured to perform feature extraction on the acquired sound signal to obtain acoustic feature information of the sound signal.

[0109] Specifically, the above-mentioned sound signal may be a voice uttered by the user, and the voice uttered by the user may be collected by a voice collection device such as a microphone. Since the sound signal collected by the speech collection device is an analog signal, it can first be converted into a digital signal through a recorder, so that the feature extraction of the digital signal can be performed.

[0110] In order to ensure the accuracy of recognition, the above-mentioned acoustic feature information can better distinguish the modeling units of the acoustic model. Under the premise of text information in the signal, interf...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a voice recognition method and system. The method comprises the following steps: acquiring a sound signal; performing feature extraction on the sound signal to obtain acousticfeature information of the sound signal; and identifying the acoustic characteristic information by using an acoustic model and a language model to obtain an identification result of the sound signal,wherein the acoustic model comprises an LC-BLSTM model and a DFSMN model. According to the invention, the technical problem of low recognition accuracy of a voice recognition method in a complex environment in the prior art is solved.

Description

technical field [0001] The present application relates to the field of voice recognition, in particular, to a voice recognition method and system. Background technique [0002] Acoustic model training is the core part of a speech recognition system, which occupies most of the computing overhead and determines the recognition performance of the system to a large extent. It uses the training speech features and their corresponding annotation information for supervised acoustic model building. mold. With the application of deep learning in acoustic model modeling, the accuracy of speech recognition continues to increase, such as DNN (Deep Neural Network, Deep Neural Network), LSTM (Long Short-Term Memory Network, Long Short-Term Memory), BLSTM (Bidirectional Long-Short-Term Memory) Memory network, Bidirectional Long Short-Term Memory), CNN (Convolutional Neural Network, Convolutional Neural Network), FSMN (Feed-forward Sequential Memory Network, Feed-forwardSequential Memory N...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L15/08G10L15/16G10L15/02
CPCG10L15/08G10L15/16G10L15/02
Inventor 薛少飞张仕良
Owner ALIBABA GRP HLDG LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products