A voice wake-up implementation method based on a small training set to optimize the decoding network

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A decoding network and voice wake-up technology, applied in speech analysis, voice recognition, instruments, etc., can solve problems such as low false wake-up effect, and achieve the effect of simplifying complexity, improving adaptability, and reducing false wake-up.

Inactive Publication Date: 2021-09-24

武汉水象电子科技有限公司

View PDF8 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0008] The technical problem to be solved by the present invention is to overcome the existing overall idea of voice wake-up or the method of voice recognition using a large-scale vocabulary, which needs to use a large amount of training corpus to achieve good wake-up and low false wake-up effect defects, providing A voice wake-up implementation method based on a small training set and an optimized decoder that simplifies the algorithm implementation and improves the operation efficiency under the same wake-up rate, thereby realizing the purpose of voice wake-up and easy transplantation

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment

[0031] A voice wake-up implementation method based on small training set optimization decoding network, is characterized in that, comprises the following steps:

[0032] S1 extracts speech eigenfeatures

[0033] According to the analysis of the stability and correlation of the wake-up word data, the time window is designed to obtain the frame feature signal. The time window design involves the window length, shape, the amplitude of each point, and the weight between adjacent frame energies. Obtain the eigenvectors with obvious distinction between wake-up words and non-wake-up words;

[0034] S2 combines feature vectors to obtain feature phoneme alignment files

[0035] The time window is selected according to the distribution of wake word phonemes, and the mapping between features and phonemes is classified to obtain labeled acoustic data; the alignment algorithm between features and phonemes in this step is mainly obtained by using the context-dependent three-factor phoneme ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a voice wake-up implementation method based on a small training set to optimize a decoding network, comprising the following steps: S1 extracting speech intrinsic features to obtain a feature vector with obvious distinction between wake-up words and non-wake-up words; S2 combining the feature vectors to obtain The feature phoneme alignment file selects a time window according to the distribution of the wake word phonemes, classifies the mapping between features and phonemes, and obtains labeled acoustic data; S3 combines the labeled acoustic data to calculate the frame-by-frame posterior probability model S4 combines to obtain acoustic data The probability model obtains the posterior probability confidence level of the phoneme level and the re-confirmation network of the wake-up word calculation network S5; the present invention can be easily implemented on general-purpose processors such as arm and dsp through simple model training strategies and steps such as optimizing the decoding network Voice wake-up and other functions.

Description

technical field [0001] The invention relates to a voice wake-up implementation method based on a small training set to optimize the decoding network. The lightweight model refers to a small training data set, and the trained model occupies a small space, and can also be used on mobile terminals with few hardware resources. The post-decoding network is optimized to reduce the offline false wake-up rate without increasing the complexity of the algorithm. Background technique [0002] Voice is the most convenient and quick means for human beings to communicate and communicate with each other. It is the dream goal of human beings to allow machines to understand voices and perform related operations in accordance with human instructions. As a result, speech recognition technology came into being. Speech recognition technology is currently an important means of human-computer interaction, and voice wake-up is an important entrance for human-computer interaction. Smart voice devi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Patents(China)

IPC IPC(8): G10L15/02G10L15/06G10L15/08G10L15/16G10L15/187G10L15/26

CPCG10L15/02G10L15/063G10L15/083G10L15/16G10L15/187G10L15/26G10L2015/025

Inventor 赵升

Owner 武汉水象电子科技有限公司

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

A voice wake-up implementation method based on a small training set to optimize the decoding network

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology