Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Streaming coding and speech recognition method and device, electronic equipment and storage medium

An encoding method and speech recognition technology, applied in the computer field, can solve problems such as poor computing power and low computing processing efficiency, and achieve the effects of reducing computing power requirements, reducing data processing volume, and improving data processing efficiency

Pending Publication Date: 2022-05-24
C SKY MICROSYST CO LTD
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] However, applying speech recognition models such as transformers to devices with poor computing power still has low computational efficiency

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Streaming coding and speech recognition method and device, electronic equipment and storage medium
  • Streaming coding and speech recognition method and device, electronic equipment and storage medium
  • Streaming coding and speech recognition method and device, electronic equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0031] In order to make those skilled in the art better understand the technical solutions in the embodiments of the present invention, the technical solutions in the embodiments of the present invention will be described clearly and in detail below with reference to the accompanying drawings in the embodiments of the present invention. The embodiments described above are only a part of the embodiments of the present invention, but not all of the embodiments. All other embodiments obtained by persons of ordinary skill in the art based on the embodiments in the embodiments of the present invention should fall within the protection scope of the embodiments of the present invention.

[0032] The specific implementation of the embodiments of the present invention is further described below with reference to the accompanying drawings of the embodiments of the present invention.

[0033] Figure 1A A schematic diagram of the streaming encoding process of an example speech recognitio...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The embodiment of the invention provides a streaming coding and speech recognition method and device, electronic equipment and a storage medium. The streaming coding method comprises the following steps: determining a third transformation sequence of a historical reference frame sequence based on a second transformation sequence of a previous fused frame sequence; performing splicing processing based on the first transformation sequence and the third transformation sequence to obtain a fourth transformation sequence of the current fusion frame sequence; determining at least one of a source sequence and a context sequence for an attention mechanism based on the fourth transformed sequence; and performing streaming coding on the current frame sequence based on the source sequence and the context sequence. The third transformation sequence of the historical reference frame sequence is determined based on the second transformation sequence of the previous fusion frame sequence, so that the data processing amount required by the change processing of the historical reference frame sequence in the linear transformation processing process is reduced, the data processing efficiency is improved, and the computing power requirement of equipment is reduced.

Description

technical field [0001] Embodiments of the present invention relate to the field of computer technologies, and in particular, to a method, apparatus, electronic device, and storage medium for streaming encoding and speech recognition. Background technique [0002] Automatic Speech Recognition (ASR) systems are widely used in various interface applications, such as voice command-based search or knowledge question answering applications. [0003] Sequence to Sequence based neural network models have gained widespread popularity in ASR techniques. The input of an end-to-end ASR system is usually a speech sequence, and the output is usually a text sequence. Compared with traditional ASR systems, it can simplify the system structure and avoid the linguistic expert knowledge required to build ASR systems. This end-to-end ASR system can directly learn the various parts and links of the speech recognizer. [0004] Sequence-to-sequence models for end-to-end ASR systems are mainly ba...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L15/06G10L15/16G10L15/183
CPCG10L15/063G10L15/16G10L15/183
Inventor 方菲菲
Owner C SKY MICROSYST CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products