Method of recognizing ear speech in normal speech flow under condition of small database

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A recognition method and speech recognition technology, applied in speech recognition, speech analysis, instruments, etc., can solve the problems of small sample data and difficulty in improving the recognition rate, and achieve the effect of improving the recognition rate, ensuring the recognition rate, and improving the recognition performance

Active Publication Date: 2017-01-11

SUZHOU UNIV

View PDF7 Cites 7 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0005] The purpose of the present invention is to provide a method for recognizing middle ear speech in normal speech streams under the condition of a small database, so as to solve the difficulty in improving the recognition rate due to the small amount of sample data when identifying isolated words in ear speech in normal speech streams. The problem

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0028] Embodiment one: see attached figure 2 Shown, the recognition method of middle ear speech of normal speech flow under a kind of small database condition, comprises the following steps:

[0029] (1) Construct speech recognition classification system, described speech recognition classification system comprises:

[0030] Digital voice input module, used for sampling or reading the voice stream signal containing ear voice;

[0031] The feature extraction module is used to extract spectral features; the selected spectral features include 12-order MFCC, logarithmic energy, 0-order cepstral coefficient, first-order derivative, and second-order derivative, with a frame length of 25 milliseconds and a frame shift of 10 milliseconds.

[0032] Deep Neural Networks, see attached image 3 As shown, it consists of a deep belief network and a Softmax output layer;

[0033] The deep belief network is composed of a plurality of restricted Boltzmann machine stacks from bottom to top,...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a method of recognizing ear speech in a normal speech flow under the condition of a small database, comprising the following steps: building a speech recognition and classification system which comprises a digital speech input module, a feature extraction module, and a deep neural network composed of a deep belief network and a Softmax output layer, wherein the deep belief network is composed of restricted Boltzmann machines stacked from bottom to top, spectrum feature is the input feature of the deep belief network, and the Softmax output layer correspondingly outputs a target value of the deep neural network; acquiring training data set samples and processing the samples in at least one of the following ways: (1) artificially extending the data set 8 to 24 times by repetition; and (2) scrambling the data set, and training the speech recognition and classification system; and using the trained classification system to recognize a to-be-recognized speech flow. The recognition performance of the system can be improved in a small database, and the recognition rate of normal speech is guaranteed while ear speech recognition is realized.

Description

technical field [0001] The invention relates to a speech signal processing technology, in particular to a method for recognizing ear speech appearing in Chinese normal speech flow under the condition of low resource and small database. Background technique [0002] Otospeech is a special form of communication in which the sound is low and the vocal cords do not vibrate at all. It is significantly less perceptual and intelligible than normal speech. Nevertheless, ear speech is also a natural speech form of people's daily communication. It is often used for communication in a quiet or confidential environment, such as when the speaker does not want to disturb others, or there is some private information to be exchanged. With the development of communication technology, the human-computer interaction interface of ear voice is getting more and more attention, such as the use of handheld devices such as smart phones in company meetings or public places, and digital password s...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G10L15/06G10L15/08G10L15/16G10L15/20

CPCG10L15/063G10L15/08G10L15/16G10L15/20

Inventor 陈雪勤刘正赵鹤鸣

Owner SUZHOU UNIV

Who we serve

R&D Engineer
R&D Manager
IP Professional

Why Patsnap Eureka

Industry Leading Data Capabilities
Powerful AI technology
Patent DNA Extraction

Social media

Patsnap Eureka Blog

Learn More

PatSnap group products

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Method of recognizing ear speech in normal speech flow under condition of small database

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology