A method and device for language recognition and classification based on denoising autoencoder

A noise-reducing automatic encoding and classification method technology, applied in speech recognition, speech analysis, instruments, etc., can solve the problems of increasing model complexity and performance degradation, and achieve the effect of alleviating the imbalance of phoneme distribution and solving the length mismatch

Active Publication Date: 2022-05-03
INST OF ACOUSTICS CHINESE ACAD OF SCI +1
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

At present, the existing language recognition system has a high recognition rate when the length of the training speech and the test speech match; however, when the length of the training speech and the test speech do not match, its performance also decreases
The existing language recognition system, for the length mismatch problem, trains the matching models for different lengths of test speech, which greatly increases the complexity of the model

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A method and device for language recognition and classification based on denoising autoencoder
  • A method and device for language recognition and classification based on denoising autoencoder
  • A method and device for language recognition and classification based on denoising autoencoder

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0062] The present invention proposes a TV i-vector language recognition system based on DAE to compensate the language characteristics of different lengths of test speech, which is specifically divided into the following steps: first, the speech is framed and transformed to obtain the underlying acoustic features; second, the original i-vector is extracted -vector, and calculate its phoneme vector at the same time; then, splice the original i-vector and phoneme vector, and send it to the DAE-based compensation network to obtain the compensated i-vector; finally, combine the compensated i-vector and the original i- The vectors are respectively sent to the back-end classifier to obtain two score vectors, which are then judged after the fusion of the score fields.

[0063] Such as figure 1 As shown, the present invention provides a method for language recognition and classification based on a denoising automatic encoder, which specifically includes:

[0064] Step 1) extract the...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The present invention provides a method for language recognition and classification based on a noise-reducing automatic encoder, which includes: step 1) extracting the speech signal to be recognized from the speech segment to be recognized, and obtaining the underlying acoustic features; step 2) obtaining from step 1) Extract the original i‑vector of the underlying acoustic features; step 3) calculate and obtain the phoneme vector p c (u); step 4) combine the original i‑vector with the phoneme vector p c (u) splicing, and inputting it into the DAE-based i-vector compensation network to obtain the compensated i-vector; step 5) the original i-vector obtained in step 2) and the compensated i-vector obtained in step 4) respectively Input the i‑vector to the pre-trained logistic regression classifier to obtain the corresponding score vector; step 6) perform score fusion on the corresponding score vector obtained in step 5) to obtain the final score vector, and then obtain the probability of each language category , and determine the language category it belongs to.

Description

technical field [0001] The invention belongs to the technical field of language recognition, and in particular relates to a method and device for language recognition and classification based on a noise reduction automatic encoder. Background technique [0002] Language Identification (LID) refers to the process of automatically determining a given speech segment, extracting the difference information of each language from the speech signal of the speech segment, and judging the language type. Language recognition technology has important applications in multilingual speech processing, such as spoken language translation systems, multilingual speech recognition systems, speech and text processing, etc. [0003] At present, the traditional language recognition technology includes two methods: the first method is the language recognition technology based on the features of the phoneme layer; among them, the language recognition technology based on the features of the phoneme l...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G10L15/00G10L15/02G10L15/08
CPCG10L15/005G10L15/02G10L15/08
Inventor 周若华苗晓晓颜永红
Owner INST OF ACOUSTICS CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products