Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Voice keyword retrieval method based on audio template

A keyword and voice technology, which is applied in the field of voice keyword retrieval based on audio templates, can solve problems such as limiting the accuracy of keyword retrieval, keyword retrieval restrictions, and increasing application costs

Active Publication Date: 2017-01-04
INST OF ACOUSTICS CHINESE ACAD OF SCI +1
View PDF5 Cites 31 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] However, this method requires a well-performing large-vocabulary continuous speech recognition system. Building such a system requires a large amount of annotated corpus, which significantly increases the cost of application in a new language.
Furthermore, if keywords are given in the form of speech fragments, this method requires first identifying isolated speech fragments as preferred texts, and this process usually has limited precision, further limiting the accuracy of keyword retrieval
Therefore, traditional keyword retrieval methods are usually only applicable to well-understood languages, which limits the application of keyword retrieval

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice keyword retrieval method based on audio template
  • Voice keyword retrieval method based on audio template
  • Voice keyword retrieval method based on audio template

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0065] The present invention will be further described below.

[0066] The voice keyword retrieval method of the present invention first converts the voice sample template and the voice to be retrieved into a sequence of probability distributions through the front end of the acoustic model, and then performs a dynamic time warping (Dynamic Time Warping) algorithm on the voice sample template and the voice to be retrieved. Matching to obtain the acoustic confidence score of the start and end time points of the keywords in the speech to be retrieved and each occurrence position, and finally the scores obtained by different speech sample templates are regularized, and the retrieval results are obtained after sorting. In an ideal situation, it would not use language-specific data at all. refer to figure 1 , the specific description of the inventive method is as follows:

[0067] Step 1), perform feature extraction on the speech sample template and the speech segment to be retrie...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a voice keyword retrieval method based on an audio template. The method comprises the following steps: to begin with, converting a voice sample template and voice to be retrieved into a probability distribution sequence; then, carrying out matching on the voice sample template and the voice to be retrieved through dynamic time warping to obtain keyword starting and ending time point in the voice to be retrieved and acoustic confidence measure scores of each appearance position; and finally, carrying out warping on the scores obtained by different voice sample templates, and carrying out ranking to obtain a retrieval result. In the retrieval process, information of a specific language is not required at all, thereby maximizing universality and transportability; and meanwhile, operation amount in the retrieval process is reduced, and keyword retrieval speed is accelerated.

Description

technical field [0001] The invention relates to the field of voice retrieval, in particular to a voice keyword retrieval method based on an audio template. Background technique [0002] The keyword retrieval task is to quickly find the location of a given keyword from large-scale and diverse speech data. The current mainstream keyword retrieval method is to convert the speech to be retrieved into text through a continuous speech recognition system with a large vocabulary. Considering the recognition accuracy of the continuous speech recognition system with a large vocabulary, the error rate of the preferred result is relatively high. Therefore, a word map containing multiple candidate information and time information is usually used, and then the text or pronunciation of the keyword to be retrieved is searched on the word map. Search and calculate the confidence level to obtain the keyword retrieval results (Shao Jian, Chinese Speech Retrieval for Large-Scale Telephone Conv...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L15/08
Inventor 徐及张舸潘接林颜永红
Owner INST OF ACOUSTICS CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products