Voice keyword detection method based on complementary model score fusion

A keyword detection and keyword technology, applied in speech analysis, speech recognition, instruments, etc., can solve problems such as unreliable judgment and insufficient information expression of a single model, and achieve the effect of reducing confusion and improving accuracy

Active Publication Date: 2020-05-08
SOUTH CHINA UNIV OF TECH
View PDF6 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The technical problem to be solved by the present invention is to solve the problem of insufficient information expression of a single model, which leads to unreliable judgment, by using model evaluation and fusion with certain information expression complementary

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice keyword detection method based on complementary model score fusion
  • Voice keyword detection method based on complementary model score fusion
  • Voice keyword detection method based on complementary model score fusion

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0048] The present invention will be further described in detail below in conjunction with the embodiments and the accompanying drawings, but the embodiments of the present invention are not limited thereto.

[0049] Such as figure 1 As shown, a kind of voice keyword detection method based on complementary model score fusion of the present embodiment comprises the following steps:

[0050] 1) On the basis of keyword modeling in audio feature space, keyword modeling based on i-vector (identity vector) is introduced to obtain two modeling methods; The distribution information in as its class attribute;

[0051] Such as figure 2 As shown, the speech feature space is used to model keywords, and the keywords are modeled based on i-vector.

[0052] Using the speech feature space to model keywords is specifically to use the speech data in the aishell Chinese corpus as an unlabeled speech sample set to train the speech feature space, and extract the 12-dimensional MFCC (Mel freque...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a voice keyword detection method based on complementary model score fusion. The method comprises the following steps of 1), introducing keyword modeling based on i-vector on thebasis of keyword modeling in an audio feature space, (2) self-adaptive segmented window shift: for a voice sample to be detected, intercepting a voice segment from the starting signal, obtaining thedistribution expression of the current segment in the voice feature space, calculating the similarity between the distribution expression and the keyword class attributes to obtain a class score sequence of the current segment, obtaining the window shift of the next segment according to the score of the current segment, processing segment by segment until the signal is ended, and dividing a voicesample to be detected into K segments, and 3) conducting score fusion by using the position of a keyword candidate point. According to the method, a keyword detection algorithm with certain complementarity is realized by adopting two different models, the score results of the two models are fused, the problem of voice keyword detection under the condition of a small training sample amount can be solved, and meanwhile, the keyword detection accuracy can be improved.

Description

technical field [0001] The invention relates to the field of continuous speech keyword recognition, in particular to a speech keyword detection method based on complementary model score fusion. Background technique [0002] Speech keyword detection is a special application of speech recognition technology, and its purpose is to detect whether a specific keyword is contained in continuous speech. At present, continuous speech keyword recognition technology has achieved rapid development and has been successfully popularized in many scenes in life. It has become an important research topic in the fields of voice control, voice retrieval, and security monitoring. [0003] There are three main traditional speech keyword detection techniques: sliding matching model, garbage model based on hidden Markov model and syllable grid network. The sliding matching model uses sliding windows to search for keywords on continuous speech, and uses dynamic time warping for matching calculatio...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G10L15/02G10L15/06G10L15/08G10L15/10G10L15/26G10L25/24
CPCG10L15/02G10L15/063G10L15/08G10L15/10G10L25/24G10L15/26G10L2015/0631Y02D10/00
Inventor 贺前华李黎晗
Owner SOUTH CHINA UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products