Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Semi-monitoring speaker self-adaption

An adaptive, speaker-based technology, applied in speech analysis, speech recognition, instruments, etc., can solve problems such as recognition performance degradation

Inactive Publication Date: 2006-03-29
SONY INT (EURO) GMBH +1
View PDF1 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

When this happens repeatedly, recognition performance drops dramatically

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Semi-monitoring speaker self-adaption
  • Semi-monitoring speaker self-adaption
  • Semi-monitoring speaker self-adaption

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0024] figure 2 A first adaptation method according to the invention is given, in which confidence measures are used to avoid adaptation to misrecognized words and to determine the degree of adaptation. The method is repeatedly executed in an infinite loop starting at step S21.

[0025] In said first step S21, the user's utterance is recognized in a similar manner to speech recognition systems according to the prior art. In the following step S22, the confidence measure is used in the recognition result of step S21. In this step, the confidence measure is used to measure the confidence of the recognition result. When the confidence measure is less than a certain threshold, the identified word is considered untrustworthy and will not be used for adaptation, so that the adaptation process starts anew in step S21, in which the next Recognition of user pronunciation. If, on the other hand, the confidence measure is greater than the threshold, the recognition result is conside...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

To prevent adaptation to misrecognized words in unsupervised or on-line automatic speech recognition systems confidence measures are used or the user reaction is interpreted to decide whether a recognized phoneme, several phonemes, a word, several words or a whole utterance should be used for adaptation of the speaker independent model set to a speaker adapted model set or not and, in case an adaptation is executed, how strong the adaptation with this recognized utterance or part of this recognized utterance should be performed. Furtheron, a verification of the speaker adaptation performance is proposed to secure that the recognition rate never decreases (significantly), but only increases or stays at the same level.

Description

technical field [0001] The present invention relates to Automatic Speech Recognition (ASR), and in particular to a method of performing unsupervised or on-line adaptation of an Automatic Speech Recognition system and a speech recognition system capable of implementing the method of the invention. Background technique [0002] State-of-the-art speech recognizers include a set of statistical distributions that model the acoustic properties of certain speech segments. These acoustic properties are encoded as feature vectors. As an example, a Gaussian distribution can be made for each syllable. These distributions are hooked to a certain state. A (stochastic) state transition network (usually a Hidden Markov Model) defines probabilities for state sequences and feature vector sequences. Passing through a state requires the use of a feature vector that covers a frame of speech signal with a length of, for example, 10 ms. [0003] The stochastic parameters of such a recognizer ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L15/00G10L15/07G10L15/06G10L15/065
CPCG10L2015/0638G10L15/065G10L15/063G10L15/07
Inventor S·戈伦兹R·科姆佩P·布赫纳岩桥直人
Owner SONY INT (EURO) GMBH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products