Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Training and identification method and system of bidirectional neural network model

A neural network model and neural network technology, applied in the field of speech signal processing, can solve problems such as mismatching training and testing conditions, and achieve the effect of increasing the amount of training data and reducing mismatching

Active Publication Date: 2018-08-17
AISPEECH CO LTD
View PDF12 Cites 18 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] In the process of realizing the present invention, the inventors found that although the neural network-based method superficially surpasses the cluster-based method, it is easy to cause a mismatch between training and testing conditions
This mismatch arises because NN-based hidden value estimators can only be trained on parallel simulated data implemented with real data

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Training and identification method and system of bidirectional neural network model
  • Training and identification method and system of bidirectional neural network model
  • Training and identification method and system of bidirectional neural network model

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0026] In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments It is a part of embodiments of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0027] In the following, the embodiment of the present application will be introduced first, and then the experimental data will be used to verify the difference between the solution of the present application and the prior art, and what beneficial effects can be achieved.

[0028] Please refer to figure 1 , which shows a flowchart of an embodime...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses the training and identification method and system of a bidirectional neural network model used for processing a noisy voice. The method comprises the followings steps of acquiring simulation noisy data and real noisy data; calculating the time-frequency hidden value label of the simulation noisy data, setting the label of the simulation noisy data to be a training target ina bidirectional neural network, and inputting the simulation noisy data after preset processing into the bidirectional neural network so as to carry out training; using a clustering mode to estimatethe soft time-frequency hidden value label of the real noisy data, setting the soft label of the real noisy data in the bidirectional neural network to be the training target, and inputting the real noisy data after preset processing into the bidirectional neural network so as to carry out training; and outputting the neural network parameters of the trained bidirectional neural network. In the invention, through introducing the real and non-simulated training data, the neural network model is trained. On one hand, a training data size is increased; on the other hand, the mismatching of the simulation data and the real data is reduced.

Description

technical field [0001] The invention belongs to the field of speech signal processing, and in particular relates to a training and recognition method, system and electronic equipment of a bidirectional neural network model for processing noisy speech. Background technique [0002] In recent years, significant progress has been made in Automatic Speech Recognition (ASR) due to the introduction of deep neural networks into acoustic modeling. ASR systems based on deep neural networks still perform poorly in many real-world far-field microphone scenarios. The main reason for poor performance is background interference, such as additive noise, channel distortion and reverberation, which reduces the signal-to-noise ratio and degrades the performance of ASR. Beamforming has proven to be a useful front-end approach to improve system performance under these conditions. While traditional beamforming methods usually rely on inaccurate prior knowledge, such as array geometry or plane ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L21/0216G10L25/30G06N3/04G06N3/08
CPCG06N3/049G06N3/084G06N3/088G10L21/0216G10L25/30G10L2021/02166
Inventor 俞凯周瑛
Owner AISPEECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products