Cross-language speech recognition method and device

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A speech recognition and cross-language technology, applied in speech recognition, speech analysis, instruments, etc., can solve the problems of low accuracy, high cost, and long training time, and achieve a wide range of support, high accuracy, and high recognition rate Effect

Active Publication Date: 2019-10-18

AISPEECH CO LTD

View PDF17 Cites 12 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0003] In the process of realizing the present invention, the inventors found that there are at least the following problems in the prior art: (1) training a language model for a language from scratch requires a large amount of manually labeled data, which is not only expensive, but also takes a lot of time to obtain; building separate language models for each language hinders smooth recognition and increases the cost of recognizing mixed-language speech

(2) The training takes a long time, and the input of manpower and material resources is large. The range of other language recognition is based on the field of investment. The overall support range is relatively narrow, and there are similar pronunciations, which may easily cause misidentification and low accuracy.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0033] Exemplary embodiments of the present invention are described below in conjunction with the accompanying drawings, which include various details of the embodiments of the present invention to facilitate understanding, and they should be regarded as exemplary only. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the invention. Also, descriptions of well-known functions and constructions are omitted in the following description for clarity and conciseness.

[0034] figure 1 is a schematic diagram of the main flow of the cross-language speech recognition method according to an embodiment of the present invention, such as figure 1 As shown, the method includes:

[0035] Step S101: Obtain cross-language sample data, use the sample data as input data of a preset neural network model for training, and obtain a language class disc...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses cross-language speech recognition method and device, and relates to the technical field of speech processing. A concrete implementation mode of the method comprises the steps of acquiring cross-language sample data, training the sample data as input data of a preset neural network model, and obtaining a language category discriminator; inputting audio to be recognized intothe language category discriminator, and segmenting the audio to be recognized according to a language category determined by the language category discriminator; and utilizing a recognition engine corresponding to the determined language category for recognizing the segmented audio to be recognized. According to the implementation mode, an existing speed recognition engine has no need to be modified, so that the cost is low, the recognition rate is high, and the accuracy is high.

Description

technical field [0001] The invention relates to the field of speech processing, in particular to a cross-language speech recognition method and device. Background technique [0002] The intelligence and integration of electronic equipment are getting higher and higher, and the traditional information retrieval and menu operation methods are increasingly unable to meet the requirements. There is an urgent need for a more convenient information retrieval and command operation method to replace the traditional button operation. Speech recognition technology came into being. However, in most traditional automatic speech recognition systems, only the most commonly used language of the country is supported, and other languages are less supported or not supported. For this situation, the conventional approach is: (1) Different languages are considered independently, and a language model is trained from scratch for each language. (2) In the acoustic model, the most commonly us...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G10L15/00G10L15/05G10L15/06G10L15/16G10L15/26

CPCG10L15/005G10L15/05G10L15/063G10L15/16G10L15/26

Inventor 朱森

Owner AISPEECH CO LTD

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Cross-language speech recognition method and device

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology