Voice keyword detection method based on complementary model score fusion

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A keyword detection and keyword technology, applied in speech analysis, speech recognition, instruments, etc., can solve problems such as unreliable judgment and insufficient information expression of a single model, and achieve the effect of reducing confusion and improving accuracy

Active Publication Date: 2020-05-08

SOUTH CHINA UNIV OF TECH

View PDF6 Cites 3 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0005] The technical problem to be solved by the present invention is to solve the problem of insufficient information expression of a single model, which leads to unreliable judgment, by using model evaluation and fusion with certain information expression complementary

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0048] The present invention will be further described in detail below in conjunction with the embodiments and the accompanying drawings, but the embodiments of the present invention are not limited thereto.

[0049] Such as figure 1 As shown, a kind of voice keyword detection method based on complementary model score fusion of the present embodiment comprises the following steps:

[0050] 1) On the basis of keyword modeling in audio feature space, keyword modeling based on i-vector (identity vector) is introduced to obtain two modeling methods; The distribution information in as its class attribute;

[0051] Such as figure 2 As shown, the speech feature space is used to model keywords, and the keywords are modeled based on i-vector.

[0052] Using the speech feature space to model keywords is specifically to use the speech data in the aishell Chinese corpus as an unlabeled speech sample set to train the speech feature space, and extract the 12-dimensional MFCC (Mel freque...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention provides a voice keyword detection method based on complementary model score fusion. The method comprises the following steps of 1), introducing keyword modeling based on i-vector on thebasis of keyword modeling in an audio feature space, (2) self-adaptive segmented window shift: for a voice sample to be detected, intercepting a voice segment from the starting signal, obtaining thedistribution expression of the current segment in the voice feature space, calculating the similarity between the distribution expression and the keyword class attributes to obtain a class score sequence of the current segment, obtaining the window shift of the next segment according to the score of the current segment, processing segment by segment until the signal is ended, and dividing a voicesample to be detected into K segments, and 3) conducting score fusion by using the position of a keyword candidate point. According to the method, a keyword detection algorithm with certain complementarity is realized by adopting two different models, the score results of the two models are fused, the problem of voice keyword detection under the condition of a small training sample amount can be solved, and meanwhile, the keyword detection accuracy can be improved.

Description

technical field [0001] The invention relates to the field of continuous speech keyword recognition, in particular to a speech keyword detection method based on complementary model score fusion. Background technique [0002] Speech keyword detection is a special application of speech recognition technology, and its purpose is to detect whether a specific keyword is contained in continuous speech. At present, continuous speech keyword recognition technology has achieved rapid development and has been successfully popularized in many scenes in life. It has become an important research topic in the fields of voice control, voice retrieval, and security monitoring. [0003] There are three main traditional speech keyword detection techniques: sliding matching model, garbage model based on hidden Markov model and syllable grid network. The sliding matching model uses sliding windows to search for keywords on continuous speech, and uses dynamic time warping for matching calculatio...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G10L15/02G10L15/06G10L15/08G10L15/10G10L15/26G10L25/24

CPCG10L15/02G10L15/063G10L15/08G10L15/10G10L25/24G10L15/26G10L2015/0631Y02D10/00

Inventor 贺前华李黎晗

Owner SOUTH CHINA UNIV OF TECH

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Voice keyword detection method based on complementary model score fusion

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology