Language identification and classification method and device based on noise reduction automatic encoder

A technology for automatic coding and classification of noise reduction, applied in speech recognition, speech analysis, instruments, etc., can solve problems such as increasing model complexity and performance degradation

Active Publication Date: 2020-03-03
INST OF ACOUSTICS CHINESE ACAD OF SCI +1
View PDF4 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

At present, the existing language recognition system has a high recognition rate when the length of the training speech and the test speech match; however, when the length of the training speech and the test speech do not match, its performance also decreases
The existing language recognition system, for the length mismatch problem, trains the matching models for different lengths of test speech, which greatly increases the complexity of the model

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Language identification and classification method and device based on noise reduction automatic encoder
  • Language identification and classification method and device based on noise reduction automatic encoder
  • Language identification and classification method and device based on noise reduction automatic encoder

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0062] The present invention proposes a TV i-vector language recognition system based on DAE to compensate the language characteristics of different lengths of test speech, which is specifically divided into the following steps: first, the speech is framed and transformed to obtain the underlying acoustic features; second, the original i-vector is extracted -vector, and calculate its phoneme vector at the same time; then, splice the original i-vector and phoneme vector, and send it to the DAE-based compensation network to obtain the compensated i-vector; finally, combine the compensated i-vector and the original i- The vectors are respectively sent to the back-end classifier to obtain two score vectors, which are then judged after the fusion of the score fields.

[0063] Such as figure 1 As shown, the present invention provides a method for language recognition and classification based on a denoising automatic encoder, which specifically includes:

[0064] Step 1) extract the...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a language identification and classification method based on a noise reduction automatic encoder. The method includes: step 1) extracting a to-be-identified speech signal from ato-be-identified speech segment to obtain underlying acoustic characteristics; step 2) extracting an original i-vector from the underlying acoustic characteristics obtained in step 1); step 3) calculating and obtaining a phoneme vector pc (u); step 4) splicing the original i-vector with the phoneme vector pc (u), and then inputting to a DAE-based i-vector compensation network to obtain a compensated i-vector; step 5) inputting the original i-vector obtained in step 2) and the compensated i-vector obtained in step 4) respectively into a pre-trained logistic regression classifier to obtain corresponding score vectors; and step 6) performing score fusion on the corresponding score vectors obtained in step 5) to obtain a final score vector, further obtaining a probability of each language category, and determining the language category to which it belongs.

Description

technical field [0001] The invention belongs to the technical field of language recognition, and in particular relates to a method and device for language recognition and classification based on a noise reduction automatic encoder. Background technique [0002] Language Identification (LID) refers to the process of automatically determining a given speech segment, extracting the difference information of each language from the speech signal of the speech segment, and judging the language type. Language recognition technology has important applications in multilingual speech processing, such as spoken language translation systems, multilingual speech recognition systems, speech and text processing, etc. [0003] At present, the traditional language recognition technology includes two methods: the first method is the language recognition technology based on the features of the phoneme layer; among them, the language recognition technology based on the features of the phoneme l...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G10L15/00G10L15/02G10L15/08
CPCG10L15/005G10L15/02G10L15/08
Inventor 周若华苗晓晓颜永红
Owner INST OF ACOUSTICS CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products