A voice wake-up implementation method based on a small training set to optimize the decoding network

A decoding network and voice wake-up technology, applied in speech analysis, voice recognition, instruments, etc., can solve problems such as low false wake-up effect, and achieve the effect of simplifying complexity, improving adaptability, and reducing false wake-up.

Inactive Publication Date: 2021-09-24
武汉水象电子科技有限公司
View PDF8 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0008] The technical problem to be solved by the present invention is to overcome the existing overall idea of ​​voice wake-up or the method of voice recognition using a large-scale vocabulary, which needs to use a large amount of training corpus to achieve good wake-up and low false wake-up effect defects, providing A voice wake-up implementation method based on a small training set and an optimized decoder that simplifies the algorithm implementation and improves the operation efficiency under the same wake-up rate, thereby realizing the purpose of voice wake-up and easy transplantation

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A voice wake-up implementation method based on a small training set to optimize the decoding network
  • A voice wake-up implementation method based on a small training set to optimize the decoding network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0031] A voice wake-up implementation method based on small training set optimization decoding network, is characterized in that, comprises the following steps:

[0032] S1 extracts speech eigenfeatures

[0033] According to the analysis of the stability and correlation of the wake-up word data, the time window is designed to obtain the frame feature signal. The time window design involves the window length, shape, the amplitude of each point, and the weight between adjacent frame energies. Obtain the eigenvectors with obvious distinction between wake-up words and non-wake-up words;

[0034] S2 combines feature vectors to obtain feature phoneme alignment files

[0035] The time window is selected according to the distribution of wake word phonemes, and the mapping between features and phonemes is classified to obtain labeled acoustic data; the alignment algorithm between features and phonemes in this step is mainly obtained by using the context-dependent three-factor phoneme ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a voice wake-up implementation method based on a small training set to optimize a decoding network, comprising the following steps: S1 extracting speech intrinsic features to obtain a feature vector with obvious distinction between wake-up words and non-wake-up words; S2 combining the feature vectors to obtain The feature phoneme alignment file selects a time window according to the distribution of the wake word phonemes, classifies the mapping between features and phonemes, and obtains labeled acoustic data; S3 combines the labeled acoustic data to calculate the frame-by-frame posterior probability model S4 combines to obtain acoustic data The probability model obtains the posterior probability confidence level of the phoneme level and the re-confirmation network of the wake-up word calculation network S5; the present invention can be easily implemented on general-purpose processors such as arm and dsp through simple model training strategies and steps such as optimizing the decoding network Voice wake-up and other functions.

Description

technical field [0001] The invention relates to a voice wake-up implementation method based on a small training set to optimize the decoding network. The lightweight model refers to a small training data set, and the trained model occupies a small space, and can also be used on mobile terminals with few hardware resources. The post-decoding network is optimized to reduce the offline false wake-up rate without increasing the complexity of the algorithm. Background technique [0002] Voice is the most convenient and quick means for human beings to communicate and communicate with each other. It is the dream goal of human beings to allow machines to understand voices and perform related operations in accordance with human instructions. As a result, speech recognition technology came into being. Speech recognition technology is currently an important means of human-computer interaction, and voice wake-up is an important entrance for human-computer interaction. Smart voice devi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G10L15/02G10L15/06G10L15/08G10L15/16G10L15/187G10L15/26
CPCG10L15/02G10L15/063G10L15/083G10L15/16G10L15/187G10L15/26G10L2015/025
Inventor 赵升
Owner 武汉水象电子科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products