Method and system for optimizing speech recognition acoustic model, equipment and storage media

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
An acoustic model and speech recognition technology, applied in the computer field, can solve the problems that affect the quality of the annotation, and the optimization of the acoustic model has not yet been found, and achieve the effect of improving the quality of the annotation, optimizing the acoustic model, and improving the accuracy.

Active Publication Date: 2018-08-10

GUANGZHOU SHIYUAN ELECTRONICS CO LTD

View PDF8 Cites 25 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

Annotated text is usually achieved based on a large number of manual annotations or obtained through identification by a third-party recognition system. However, there are often certain errors in the annotated text obtained through the above methods, which affect the quality of annotations.

[0004] For the speech recognition acoustic model, improving the annotation quality of the annotation text is equivalent to one of the means to optimize the acoustic model, but there is no technical solution to realize the optimization of the acoustic model by improving the quality of the annotation text.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0028] figure 1 It is a schematic flowchart of a method for optimizing an acoustic model for speech recognition provided by Embodiment 1 of the present invention. The method is applicable to the situation of optimizing and upgrading the acoustic model for speech recognition, and the method can be executed by a device for optimizing the acoustic model for speech recognition, which can be implemented by hardware and / or software, and is generally integrated in a device with a speech recognition function in computer equipment.

[0029] Such as figure 1 As shown, a method for optimizing an acoustic model for speech recognition provided by Embodiment 1 of the present invention includes the following operations:

[0030] S101. Obtain the labeled text of the sample speech, and obtain the recognition text of the sample speech based on the current acoustic model.

[0031] It can be understood that the sample speech is equivalent to a piece of speech data in the speech data set requir...

Embodiment 2

[0043] figure 2 It is a schematic flowchart of a method for optimizing an acoustic model for speech recognition provided by Embodiment 2 of the present invention. The embodiment of the present invention is optimized on the basis of the above-mentioned embodiments. In this embodiment, the marked text and the recognized text are further compared, and when the comparison result is a mismatch, it is determined that the marked text is relatively The wrong annotation information of the recognized text is embodied as: comparing the marked text and the recognized text, obtaining the edit distance between the marked text and the recognized text, and determining the comparison result when the edit distance is non-zero No match; when the comparison result is no match, according to the edit distance, determine the total number of wrong labels of the labeled text relative to the recognized text, the location of the wrong labels and the error type of each wrong label ; Record the total nu...

Embodiment 3

[0098] image 3 A structural block diagram of a device for optimizing an acoustic model for speech recognition provided in Embodiment 3 of the present invention, the device is suitable for optimizing and upgrading the acoustic model for speech recognition, and the device can be implemented by hardware and / or software, And generally integrated in the computer equipment with speech recognition function. Such as image 3 As shown, the device includes: a text acquisition module 31 , a wrong label determination module 32 , a label text update module 33 and an acoustic model optimization module 34 .

[0099] Wherein, the text obtaining module 31 is used to obtain the marked text of the sample speech, and obtain the recognition text obtained based on the current acoustic model of the sample speech;

[0100] An incorrect label determination module 32, configured to compare the labeled text with the recognized text, and determine the wrong labeled information of the labeled text rela...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

Embodiments of the present invention disclose a method and a system for optimizing a speech recognition acoustic model, equipment and a storage media. The method includes: obtaining a labeled text ofa sample speech, and acquiring an identification text obtained by the sample speech based on the current acoustic model; comparing the labeled text with the identification text, and determining errorlabeling information of the labeled text relative to the identification text when a comparison result is not matched; updating the labeled text of the sample speech according to a text update decisioncondition corresponding to the error labeling information; and retraining and optimizing the current acoustic model based on sample speeches of the set amount and current corresponding labeled texts.By using this method, the labeling quality of the labeled text corresponding to the sample speech can be effectively improved, thereby achieving the purpose of optimizing the acoustic model.

Description

technical field [0001] The invention relates to the field of computer technology, in particular to a method, system, device and storage medium for optimizing an acoustic model for speech recognition. Background technique [0002] With the continuous expansion of the application range of speech recognition, speech recognition technology has become a new high-tech industry, and has attracted more and more technical personnel's attention. At present, one of the important components in the speech recognition system is the acoustic model. The quality of the acoustic model largely determines the quality of the speech recognition results. Therefore, it is necessary to continuously optimize the speech recognition acoustic model. [0003] Generally, the training of the acoustic model requires the support of a large amount of sample data, and the sample data often includes voice data and marked text corresponding to the voice data (text content included in the voice data). Annotated ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G10L15/06

CPCG10L15/063G10L2015/0635G10L2015/0636

Inventor 雷延强

Owner GUANGZHOU SHIYUAN ELECTRONICS CO LTD

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Method and system for optimizing speech recognition acoustic model, equipment and storage media

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

Embodiment 2

Embodiment 3

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology