Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Speech recognition method, electronic equipment and storage device

A speech recognition and to-be-recognized technology, applied in speech recognition, speech analysis, instruments, etc., can solve the problems of low speech recognition accuracy, ambiguous pronunciation, and restricting the application of speech recognition technology.

Pending Publication Date: 2021-05-11
IFLYTEK CO LTD
View PDF0 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] However, the higher accuracy of speech recognition depends on the speaker being able to speak clearly. For people who cannot speak clearly, such as patients with sequelae of cerebral apoplexy, the accuracy of speech recognition will be lower due to their ambiguous pronunciation. High, making them unable to use speech recognition technology normally, which greatly affects the user experience and limits the application of speech recognition technology

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech recognition method, electronic equipment and storage device
  • Speech recognition method, electronic equipment and storage device
  • Speech recognition method, electronic equipment and storage device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0016] The solutions of the embodiments of the present application will be described in detail below in conjunction with the accompanying drawings.

[0017] In the following description, for purposes of illustration rather than limitation, specific details, such as specific system architectures, interfaces, and techniques, are set forth in order to provide a thorough understanding of the present application.

[0018] The terms "system" and "network" are often used interchangeably herein. The term "and / or" in this article is just an association relationship describing associated objects, which means that there can be three relationships, for example, A and / or B can mean: A exists alone, A and B exist simultaneously, and there exists alone B these three situations. In addition, the character " / " in this article generally indicates that the contextual objects are an "or" relationship. In addition, "many" herein means two or more than two.

[0019] refer to figure 1 , figure ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a speech recognition method, electronic equipment and a storage device. The method comprises the following steps: acquiring to-be-recognized data when a user speaks, wherein the to-be-recognized data comprises audio data and video data of the mouth of the user; extracting a first feature representation by using the video data, and extracting a second feature representation by using the audio data; performing the following identification steps on the to-be-recognized data for several times: acquiring a fused context representation of the video data and the audio data by using the first feature representation, the second feature representation and the predicted text identified last time, and performing prediction by using the fused context representation to obtain the predicted text identified this time; and taking a combination of the predicted characters identified for several times as a final identification text of the to-be-recognized data. According to the scheme, the accuracy of speech recognition can be improved.

Description

technical field [0001] The present application relates to the technical field of voice recognition, in particular to a voice recognition method, electronic equipment and a storage device. Background technique [0002] Voice recognition is to recognize the input voice data to obtain the recognized text content corresponding to the voice. The application of speech recognition technology has greatly promoted people's input efficiency, making it more convenient and faster for people to input information. [0003] However, the higher accuracy of speech recognition depends on the speaker being able to speak clearly. For people who cannot speak clearly, such as patients with sequelae of cerebral apoplexy, the accuracy of speech recognition will be lower due to their ambiguous pronunciation. High, making them unable to use speech recognition technology normally, greatly affecting the user experience, and also limiting the application of speech recognition technology. In view of th...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/26G10L15/16G10L15/02
CPCG10L15/26G10L15/16G10L15/02
Inventor 王孟之万根顺高建清刘聪王智国胡国平
Owner IFLYTEK CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products