Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Spoken language evaluation method and device

An evaluation method and oral language technology, applied in the field of language recognition, can solve the problems of low accuracy of tone discrimination, small distinction between tones, and large gaps in the scores of initials and finals, so as to improve the accuracy of judgment, reduce the large difference in judgment effect, The effect of improving accuracy

Pending Publication Date: 2021-02-05
BEIJING YUANLI WEILAI SCI & TECH CO LTD
View PDF0 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] With the development of natural language processing (Natural Language Processing, NLP), the deep neural network model (Deep Neural Network, DNN) can convert audio into the posterior probability of phoneme pronunciation at the frame level. Based on this, the good pronunciation degree based on neural network (Neural Network-Goodness of Pronunciation, GOP-NN) is defined as the logarithmic phoneme posterior probability ratio between the standard phoneme and the phoneme with the highest posterior probability. The closer the ratio is to 1, the closer the GOP is to 0. Indicates that the pronunciation is better, but in the oral language evaluation of the prior art, only the alignment information of the phoneme and the good pronunciation are used, and the information of the whole word is not used. The problem that the same standard does not apply to the innate differences of different phonemes will lead to the problems of different judgments of different phonemes by the existing oral language evaluation system, large gaps in initial and final scores, and low accuracy of tone discrimination.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Spoken language evaluation method and device
  • Spoken language evaluation method and device
  • Spoken language evaluation method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0049] In the following description, numerous specific details are set forth in order to provide a thorough understanding of the present application. However, the present application can be implemented in many other ways different from those described herein, and those skilled in the art can make similar promotions without violating the connotation of the present application. Therefore, the present application is not limited by the specific implementation disclosed below.

[0050] The terminology used in one or more embodiments of the present application is for the purpose of describing a particular embodiment only, and is not intended to limit the one or more embodiments of the present application. As used in one or more embodiments of this application and the appended claims, the singular forms "a," "the," and "the" are intended to include the plural forms as well, unless the context clearly dictates otherwise. It will also be understood that the term "and / or" as used in one...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a spoken language evaluation method and device. The spoken language evaluation method comprises the steps: obtaining a to-be-evaluated audio and an evaluation text correspondingto the to-be-evaluated audio; determining an attribute characteristic value of each phoneme in the evaluation text and a posterior probability corresponding to each phoneme based on the to-be-evaluated audio and the evaluation text; extracting a pronunciation characteristic value corresponding to the evaluation text based on the evaluation text and the posterior probability corresponding to eachphoneme; generating a characteristic vector corresponding to each phoneme according to the attribute feature value and the pronunciation feature value of each phoneme; and inputting the characteristicvector corresponding to each phoneme into a spoken language evaluation model to obtain an evaluation result output by the spoken language evaluation model. According to the spoken language evaluationmethod provided by the invention, the pronunciation characteristic value corresponding to each phoneme is introduced, and the potential error of the current pronunciation can be accurately explored.Multi-dimensional characteristic information is provided for a spoken language evaluation model, and the judgment accuracy of initial consonants, final consonants and tones is improved.

Description

technical field [0001] The present application relates to the technical field of language recognition, and in particular, to a spoken language evaluation method and apparatus, a computing device, and a computer-readable storage medium. Background technique [0002] Spoken language assessment refers to the overall evaluation of a person's speaking level from different dimensions by using speech recognition technology given text and audio. Features such as the Goodness of Pronunciation of each phoneme are used to judge the speaking level of the speaker according to these features. [0003] With the development of Natural Language Processing (NLP), the Deep Neural Network (DNN) model can convert audio into frame-level phoneme pronunciation posterior probability. Based on this, the neural network-based pronunciation goodness (Neural Network-Goodness of Pronunciation, GOP-NN) is defined as the log-phoneme posterior probability ratio between the standard phoneme and the phoneme w...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L15/01G10L15/08
CPCG10L15/01G10L15/08G10L2015/025
Inventor 卓邦声吴凡夏龙高强王宏伟郭常圳
Owner BEIJING YUANLI WEILAI SCI & TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products