Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Edge computing-oriented lightweight voice keyword recognition method

An edge computing and recognition method technology, applied in speech recognition, speech analysis, instruments, etc., can solve the problems of limiting the practicality of edge computing equipment, poor performance, high computing power requirements, etc., to reduce the probability of user privacy leakage, model The effect of reducing the number of parameters and enriching speech features

Active Publication Date: 2020-06-12
SOUTH CHINA NORMAL UNIVERSITY
View PDF5 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

This model not only has high production costs, but also has the problem of user privacy leakage
[0006] In addition, the existing technology has serious limitations in the application of edge devices. This is because the performance of hardware such as the CPU and memory of the edge devices is relatively poor. For example, the CPU of the Raspberry Pi 3B+ edge device is a single-core ARMs7I chip. , the main frequency is only 1.2GHz
However, the inference process of the deep learning model is particularly demanding on computing power. When running the deep learning model on the edge computing device, there are often situations such as freezes, full CPU usage and shutdown, etc., which limits its application on the edge computing device. practicality

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Edge computing-oriented lightweight voice keyword recognition method
  • Edge computing-oriented lightweight voice keyword recognition method
  • Edge computing-oriented lightweight voice keyword recognition method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0063] The technical problem to be solved by the present invention is to prevent leakage of user privacy in the mode of data collection by edge computing terminal and recognition of voice keywords by the server, and minimize the consumption of resources such as CPU and memory in the process of model reasoning.

[0064] The paper "An experimental analysis of the power consumption of convolutional neural networks for keyword spotting" analyzed a set of convolutional neural networks used in speech keyword recognition tasks. The paper believes that the CNN model has a simple architecture, which is relatively easy to tune, and It is implemented in multiple deep learning frameworks, such as Tensorflow, Pytorch and other frameworks. The keyword recognition model recognition process is as follows: figure 1 .

[0065] The feature extraction adopts Mel-Frequency Cepstrum Coefficient (MFCC) method, the input voice data adopts the frequency of 16KHz, the frame length is 30ms, and the movi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an edge computing-oriented lightweight voice keyword recognition method, which comprises the following steps of: preprocessing a signal, and removing a noise signal; performingacoustic feature extraction; constructing a lightweight speech keyword recognition model EdgeCRNN adopting a first-layer feature enhancement method and a lightweight component, wherein the lightweight component comprises a depth separable convolution and residual structure; constructing a basic module Base-Block and a down-sampling module CRNN-Block suitable for a voice keyword recognition task,and constructing an EdgeCRNN based on the basic module and the down-sampling module; and inputting the features into an EdgeCRNN model to carry out speech recognition. According to the method, a lightweight voice keyword recognition model is designed by adopting a feature enhancement method, depth separable convolution and a depth residual structure, so that the hardware resource consumption is greatly reduced, the model can stably and smoothly run on resource-limited equipment, and the leakage of user privacy is avoided.

Description

technical field [0001] The invention belongs to the technical field of speech recognition, and in particular relates to a lightweight speech keyword recognition method oriented to edge computing. Background technique [0002] Speech keyword spotting (Keyword Spotting, KWS) applications usually use the mode of terminal data collection and cloud server recognition. Although the cloud server has sufficient storage space and powerful computing power, it can store and process a large amount of data, but this mode has potential delays. Moreover, with the rapid growth of data, the pressure on the server to process data and the network bandwidth consumed by transmitting data will increase exponentially, which puts forward higher requirements for server computing power and network bandwidth, and the delay will also become more severe. Big, this will be a very bad user experience for applications based on the KWS model. In addition, when user data is uploaded to the cloud server, the...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/02G10L15/16G10L15/18G10L15/20G10L15/26G10L15/34G10L25/03
CPCG10L15/02G10L15/20G10L15/16G10L15/18G10L15/26G10L15/34G10L25/03Y02D10/00
Inventor 龚征魏运根杨顺志叶开
Owner SOUTH CHINA NORMAL UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products