Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Cross-language speech recognition method and device

A speech recognition and cross-language technology, applied in speech recognition, speech analysis, instruments, etc., can solve the problems of low accuracy, high cost, and long training time, and achieve a wide range of support, high accuracy, and high recognition rate Effect

Active Publication Date: 2019-10-18
AISPEECH CO LTD
View PDF17 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] In the process of realizing the present invention, the inventors found that there are at least the following problems in the prior art: (1) training a language model for a language from scratch requires a large amount of manually labeled data, which is not only expensive, but also takes a lot of time to obtain; building separate language models for each language hinders smooth recognition and increases the cost of recognizing mixed-language speech
(2) The training takes a long time, and the input of manpower and material resources is large. The range of other language recognition is based on the field of investment. The overall support range is relatively narrow, and there are similar pronunciations, which may easily cause misidentification and low accuracy.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Cross-language speech recognition method and device
  • Cross-language speech recognition method and device
  • Cross-language speech recognition method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0033] Exemplary embodiments of the present invention are described below in conjunction with the accompanying drawings, which include various details of the embodiments of the present invention to facilitate understanding, and they should be regarded as exemplary only. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the invention. Also, descriptions of well-known functions and constructions are omitted in the following description for clarity and conciseness.

[0034] figure 1 is a schematic diagram of the main flow of the cross-language speech recognition method according to an embodiment of the present invention, such as figure 1 As shown, the method includes:

[0035] Step S101: Obtain cross-language sample data, use the sample data as input data of a preset neural network model for training, and obtain a language class disc...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses cross-language speech recognition method and device, and relates to the technical field of speech processing. A concrete implementation mode of the method comprises the steps of acquiring cross-language sample data, training the sample data as input data of a preset neural network model, and obtaining a language category discriminator; inputting audio to be recognized intothe language category discriminator, and segmenting the audio to be recognized according to a language category determined by the language category discriminator; and utilizing a recognition engine corresponding to the determined language category for recognizing the segmented audio to be recognized. According to the implementation mode, an existing speed recognition engine has no need to be modified, so that the cost is low, the recognition rate is high, and the accuracy is high.

Description

technical field [0001] The invention relates to the field of speech processing, in particular to a cross-language speech recognition method and device. Background technique [0002] The intelligence and integration of electronic equipment are getting higher and higher, and the traditional information retrieval and menu operation methods are increasingly unable to meet the requirements. There is an urgent need for a more convenient information retrieval and command operation method to replace the traditional button operation. Speech recognition technology came into being. However, in most traditional automatic speech recognition systems, only the most commonly used language of the country is supported, and other languages ​​are less supported or not supported. For this situation, the conventional approach is: (1) Different languages ​​are considered independently, and a language model is trained from scratch for each language. (2) In the acoustic model, the most commonly us...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L15/00G10L15/05G10L15/06G10L15/16G10L15/26
CPCG10L15/005G10L15/05G10L15/063G10L15/16G10L15/26
Inventor 朱森
Owner AISPEECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products