Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Speech recognition model training method and device and speech recognition method and device

A speech recognition model and training method technology, applied in speech recognition, speech analysis, instruments, etc., can solve the problems of high time and labor costs, cumbersome processes, etc.

Pending Publication Date: 2021-04-23
BEIJING YUANLI WEILAI SCI & TECH CO LTD
View PDF10 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] However, in the above speech recognition method, the output of the convolutional neural network uses the normalized exponential function softmax function to perform classification prediction. The dimension of the softmax function is the number of given keywords. When adding keywords, the dimension of softmax also To increase accordingly, the model needs to be retrained. Since the training model needs to collect a certain amount of voice data of the newly added keyword, the time and labor costs are high.
In addition, after dividing the speech into frames and predicting the probability that each speech frame belongs to each given keyword, it is necessary to use a sliding window larger than the speech frame window to smooth the probability of the speech frame to obtain that the speech frame belongs to each given keyword. Given the confidence of the keyword, the specific smoothing strategy method and the window used for smoothing need to be adjusted according to the actual situation, and the process is relatively cumbersome

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech recognition model training method and device and speech recognition method and device
  • Speech recognition model training method and device and speech recognition method and device
  • Speech recognition model training method and device and speech recognition method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0054] In the following description, numerous specific details are set forth in order to provide a thorough understanding of the specification. However, this specification can be implemented in many other ways different from those described here, and those skilled in the art can make similar extensions without violating the connotation of this specification, so this specification is not limited by the specific implementations disclosed below.

[0055] Terms used in one or more embodiments of this specification are for the purpose of describing specific embodiments only, and are not intended to limit one or more embodiments of this specification. As used in one or more embodiments of this specification and the appended claims, the singular forms "a", "the", and "the" are also intended to include the plural forms unless the context clearly dictates otherwise. It should also be understood that the term "and / or" used in one or more embodiments of the present specification refers t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a voice recognition model training method and device and a voice recognition method and device. The voice recognition model training method comprises the steps: acquiring a voice sample pair; obtaining a first sample vector of the first voice sample and a second sample vector of the second voice sample through a full connection layer of the voice recognition model; calculating a loss value of the speech recognition model according to the similarity between the first sample vector and the second sample vector and the sample pair label; and according to the loss value of the speech recognition model, training the speech recognition model until a training stop condition is met, and obtaining a trained speech recognition model. According to the method, the loss value of the speech recognition model based on the sample pair is introduced, and the loss value enables the speech recognition model to have the capability of accurately extracting the speech content information, so that when the to-be-recognized keywords are newly added, the speech recognition model does not need to be retrained, and only a small amount of standard speech of the keywords needs to be prepared.

Description

technical field [0001] This specification relates to the technical field of speech processing, in particular to a speech recognition model training method and device, and a speech recognition method and device. Background technique [0002] With the rapid development and wide application of computer technology and network technology, there are many scenarios that require speech recognition. For example, in a voice word recognition game, it is necessary to detect whether there is a word (target keyword) displayed on the screen based on the user's voice; or, the device can be controlled by voice, so the device needs to recognize the user's voice to obtain the target keywords, and then trigger corresponding operation instructions based on the target keywords to operate the device. [0003] In the prior art, it is generally based on a convolutional neural network (CNN) to predict which category of a speech speech belongs to a given keyword category, so as to determine whether t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/06G10L15/16G10L15/22G10L15/26G10L25/51G10L25/87
Inventor 吴凡贾杨卓邦声林倩倩郭涵涛李振权夏龙郭常圳
Owner BEIJING YUANLI WEILAI SCI & TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products