Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Voice recognition method and device, computer readable storage medium and computer equipment

A speech recognition and recognition technology, applied in speech recognition, speech analysis, instruments, etc., can solve the problems of increased computational complexity and low speech recognition efficiency, and achieve the effect of reducing the amount of computation and improving efficiency

Pending Publication Date: 2021-12-21
TENCENT TECH (SHENZHEN) CO LTD
View PDF0 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] However, under the self-attention mechanism, as the length of the input sequence increases, the computational complexity will greatly increase, resulting in low speech recognition efficiency

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice recognition method and device, computer readable storage medium and computer equipment
  • Voice recognition method and device, computer readable storage medium and computer equipment
  • Voice recognition method and device, computer readable storage medium and computer equipment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0055] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the drawings in the embodiments of the present invention. Apparently, the described embodiments are only some of the embodiments of the present invention, but not all of them. Based on the embodiments of the present invention, all other embodiments obtained by those skilled in the art without creative efforts fall within the protection scope of the present invention.

[0056]Embodiments of the present invention provide a voice recognition method, device, computer-readable storage medium, and computer equipment. Wherein, the voice recognition method can be used in a voice recognition device. The voice recognition device can be integrated in a computer device, and the computer device can be a terminal or a server. Among them, terminals include but are not limited to mobile phones, computers, intelligent voice interaction devices, smart...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The embodiment of the invention discloses a voice recognition method and device, a computer readable storage medium and computer equipment. The method comprises the steps: carrying out the feature extraction of to-be-recognized voice information, and obtaining a plurality of feature vectors; calculating the sparseness value of each feature vector, wherein the sparseness value is the relative entropy between the distribution of the self-attention score sequence of each feature vector and the uniform distribution of the self-attention score sequence; determining a first feature vector of which the sparseness value is greater than a preset threshold value and a second feature vector of which the sparseness value is not greater than the preset threshold value; determining a target matrix according to the self-attention calculation result of the first feature vector and the second feature vector; and inputting the target matrix and the feature matrix corresponding to the tag sequence into a classification network for classification processing to obtain a recognition result corresponding to the to-be-recognized voice information. Therefore, the deep learning method is adopted, the calculation amount of the self-attention mechanism in the speech recognition process is reduced, and the speech recognition efficiency is improved.

Description

technical field [0001] The present invention relates to the technical field of speech recognition, in particular to a speech recognition method, device, computer-readable storage medium and computer equipment. Background technique [0002] Automatic Speech Recognition (ASR) technology is a technology that allows machines to convert voice signals into corresponding text or commands through the process of recognition and understanding. Speech recognition technology mainly includes three aspects: feature extraction technology, pattern matching criteria and model training technology. [0003] In recent years, automatic speech recognition technology has developed rapidly, and its application has penetrated into various fields of people's lives. Among them, end-to-end (End-to-End, E2E) automatic speech recognition technology is widely favored for its simplified architecture and excellent performance. Transfer machines and attention-based codecs are two popular E2E frameworks. Th...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L15/02G10L15/08
CPCG10L15/02G10L15/08
Inventor 孙思宁
Owner TENCENT TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products