Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Speech emotion classifying method for emotion-based characteristic optimization

A technology of feature optimization and emotion classification, applied in speech analysis, speech recognition, instruments, etc., can solve the problems of not being able to minimize the misrecognition rate, not being able to achieve the optimal distinction of emotion categories, and being single

Inactive Publication Date: 2010-11-24
SOUTHEAST UNIV
View PDF0 Cites 44 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] A problem in the feature optimization of speech emotion is that general speech emotion classifiers use a single set of optimal features to classify all emotion categories, and using this set of emotion features cannot achieve the classification of any two emotion categories. optimal distinction between
For example, a set of features is selected to best distinguish the five types of speech emotions A, B, C, D, and E, but this set of features often cannot minimize the misrecognition rate between A and B in an optimal sense

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech emotion classifying method for emotion-based characteristic optimization
  • Speech emotion classifying method for emotion-based characteristic optimization
  • Speech emotion classifying method for emotion-based characteristic optimization

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0042] The speech emotion recognition system block diagram among the present invention is as figure 1 shown.

[0043] 1. Speech emotion database

[0044] (1) On the type of emotion analyzed, the present invention selects and analyzes joy, anger, surprise, sadness, and five basic emotional states of tranquility.

[0045] (2) The Chinese speech emotion library used in the present invention is obtained by the method of performing speech (Acted Speech). Sentence material is recorded by people with performance or broadcasting experience (three men and three women, aged between 20 and 30, no recent cold, standard Mandarin). Recording takes place in a quiet studio. AKGWMS300 recording equipment and microphone, Creative sound card, and Cool Edit recording software are used for recording. Monophonic, 16bit quantization, and 11.025kHz sampling rate are used for recording. Statements are stored in WAV format.

[0046] (3) In the selection of sentence materials, two principles are f...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a speech emotion classifying method for emotion-based characteristic optimization, which comprises the following steps: (1), acquiring speech data of basic emotion states: a happy state, an angry state, an alarmed state and a sorrowful state; (2), extracting speed emotion characteristics; (3) matching emotion pairs; (4) compacting and selecting the characteristics: (4-1) reducing dimensions of linear discriminant analysis and performing respective LDA conversion by using the respective projection vector of each emotion pair, and (4-2) selecting a characteristic selecting method which is based on fisher discriminant criterion; and (5) fusing discriminations on the basis of two types of classifiers: (5-1) recording input emotion speech data, (5-2) outputting Ci and j according to the discriminations of the two types of classifiers, (5-3) obtaining the confidence coefficients wi and j of the two types of classifiers by using a formula (3), and (5-4) making a discrimination by using a correlation decoding method, wherein the discrimination result is the emotion type corresponding to the maximum correlation value.

Description

technical field [0001] The invention relates to a speech recognition method, in particular to an automatic speech emotion recognition method based on emotion-to-feature optimization. Background technique [0002] Speech emotion recognition is to recognize the speaker's emotional state from the speech signal. General speech emotion classifiers can distinguish basic emotion categories such as joy, anger, surprise, sadness, and fear to a certain extent. To build a speech emotion classifier, you first need to determine the emotion category to be recognized, establish a corresponding emotion corpus, and then find suitable speech emotion features based on the speech data in the emotion corpus, usually based on pitch, short-term energy, formant, etc. Acoustic features constructed by parameters, and finally a speech emotion model is established using statistical methods. The quality of speech emotion features has a key impact on the performance of the classifier. [0003] The per...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L15/02G10L15/06G10L15/08
Inventor 赵力黄程韦邹采荣余华王开
Owner SOUTHEAST UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products