Gain processing method and device for speech recognition system

A technology of speech recognition and processing methods, applied in speech recognition, speech analysis, instruments, etc., can solve problems such as performance degradation of the recognition system, and achieve the effect of improving robustness

Active Publication Date: 2020-01-07
BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD
View PDF14 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In addition, the speech recognition system usually requires the audio amplitude received by the microphone to be higher than a certain threshold. Once the audio amplitude is lower than the threshold, the performance of the recognition system will be greatly reduced.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Gain processing method and device for speech recognition system
  • Gain processing method and device for speech recognition system
  • Gain processing method and device for speech recognition system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0020] Embodiments of the present application are described in detail below, examples of which are shown in the drawings, wherein the same or similar reference numerals denote the same or similar elements or elements having the same or similar functions throughout. The embodiments described below by referring to the figures are exemplary, and are intended to explain the present application, and should not be construed as limiting the present application.

[0021] The gain processing method and device for a speech recognition system according to the embodiments of the present application will be described below with reference to the accompanying drawings.

[0022] figure 1 It is a flowchart of a gain processing method for a speech recognition system according to an embodiment of the present application.

[0023] Such as figure 1 As shown, the gain processing method for the speech recognition system includes:

[0024] Step 101, from the input first audio data with a preset fr...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The application provides a gain processing method and a gain processing device for a speech recognition system, wherein the method comprises the following steps: acquiring a peak value of each audio section according to a preset division length in inputted first audio data of a preset frame length; according to the peak value of each audio section and a preset expected audio amplitude, acquiring a block gain of each audio section, wherein the audio expected amplitude is matched with training data in the speech recognition system; selecting M pieces of preset block gain values in all block gains from small to large and conducting median filtering treatment, and acquiring expected gains of the first audio data; and adjusting amplitudes of the first audio data by virtue of the expected gains. The automatic gain adjustment on the audio data is achieved, so that the amplitude of a received audio signal is more than a threshold value of the speech recognition system ad is matched with the training data; therefore, the stability of the speech recognition system is enhanced.

Description

technical field [0001] The present application relates to the technical field of speech recognition processing, in particular to a gain processing method and device for a speech recognition system. Background technique [0002] With the development of speech recognition technology, the application fields of speech recognition system are becoming wider and wider. Existing speech recognition systems usually use massive audio data to train a general model for speech recognition. [0003] However, when the speech recognition system is actually used, there will inevitably be a mismatch between the statistical characteristics of the audio data to be recognized and the training data, and this mismatch is especially reflected in the amplitude of the audio signal. In addition, speech recognition systems generally require that the audio amplitude received by the microphone be higher than a certain threshold, and once the audio amplitude is lower than the threshold, the performance of...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G10L15/02G10L15/06G10L15/26
CPCG10L15/02G10L15/06G10L15/26
Inventor 徐杨飞魏建强崔玮玮
Owner BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products