A multi-modal emotion recognition method and system

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
An emotion recognition and multi-modal technology, applied in the field of emotion recognition, can solve the problems of not considering the emotion distinguishability of facial expression images, low emotion recognizability of expression images, poor model performance, etc., to achieve effective feature learning, The effect of performance improvement and small amount of calculation

Active Publication Date: 2022-06-17

UNIV OF JINAN +1

View PDF7 Cites 2 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

Under the slow-changing characteristics of facial expressions, this non-discrimination and non-filtering method of facial expression image collection ignores the connection between different modal emotional expressions, and does not consider the emotional differentiation of facial expression images, resulting in the collected The emotional recognizability of facial expression images is low and the redundancy is large, which leads to the poor performance of the model trained and learned in the follow-up emotion recognition research.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0057] The purpose of this embodiment is to provide a multimodal emotion recognition method.

[0058] A multimodal emotion recognition method, comprising:

[0059] Extract the emotional speech components and emotional image components in the emotional video, and store them separately;

[0060] Utilize the emotional voice residual conditional entropy difference endpoint detection method to perform endpoint detection on the emotional voice component, and obtain the endpoint detection result of each frame of voice;

[0061] Screen the emotional images in the emotional image component based on the endpoint detection results of the emotional speech component, and remove the emotional images with silent segments in the emotional image component;

[0062] Perform feature extraction on the reconstructed emotional speech components and the filtered emotional image components respectively;

[0063] The features of the emotional speech components and the features of the emotional image...

Embodiment 2

[0101] The purpose of this embodiment is to provide a multimodal emotion recognition system.

[0102] A multimodal emotion recognition system, comprising:

[0103] a data acquisition module, which is used to extract emotional speech components and emotional image components in the emotional video, and store them respectively;

[0104] An endpoint detection module, which is used to perform endpoint detection on the emotional speech component by utilizing the emotional speech residual conditional entropy difference endpoint detection method to obtain the endpoint detection result of each frame of speech;

[0105] an image screening module, which is used for screening emotional images in the emotional image components based on the endpoint detection results of the emotional speech components, and eliminating emotional images of silent segments in the emotional image components;

[0106] a feature extraction module, which is used to perform feature extraction on the reconstructed...

Embodiment 3

[0110] This embodiment provides a method for detecting the working state of customer service personnel in a call center, and the detection method utilizes the above-mentioned multimodal emotion recognition method.

[0111] When dealing with problems, customer service personnel need to communicate with customers and keep answering various questions from customers. This kind of work is characterized by cumbersome content and high pressure. At the same time, in some cases, the attitude of customers is not friendly. In the working environment, customer service personnel will have certain negative emotions, and if the customer service personnel have certain negative emotions such as disgust or anger, it will seriously affect the quality of service, and it is also very detrimental to the psychological health of the customer service personnel. The multimodal emotion recognition method proposed in the present disclosure can be effectively applied to the detection of the working state o...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The present disclosure provides a multi-modal emotion recognition method and system. The scheme uses a novel and robust endpoint detection algorithm for speech components in emotional video samples, and utilizes the prediction residue generated during the sample reconstruction process under the compressed sensing theory. The difference conditional entropy parameter is used to calculate the difference value of the residual conditional entropy during the iterative process of the Orthogonal Matching Pursuit (OMP) algorithm, to complete the endpoint detection according to the empirical threshold, and to complete the feature learning of the emotional speech of the vocal segment based on the reconstruction sample; at the same time, through the emotional The endpoint detection results of the voice, the facial expression images are screened, and only the facial expression images with active emotional voices in the same time period are retained to achieve the purpose of enhancing the emotional separability of the facial expression data set and reducing redundancy; emotional voice Features and facial expression features are fused through features to train an effective multi-modal emotion recognition model to achieve effective multi-modal emotion recognition.

Description

technical field [0001] The present disclosure belongs to the technical field of emotion recognition, and in particular, relates to a multimodal emotion recognition method and system. Background technique [0002] The statements in this section merely provide background information related to the present disclosure and do not necessarily constitute prior art. [0003] Emotion recognition is a research hotspot in the field of emotional computing. The emotional signals of the two modalities, emotional speech and facial expression images, have the characteristics of convenient collection and large amount of emotional information. They are important and highly correlated in emotion recognition research. Two types of data sources. [0004] The inventor found that the emotional speech data and facial expression images currently used in the multi-emotion recognition field are usually obtained by separately saving the emotional speech component and the image component in the emotion...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Patents(China)

IPC IPC(8): G06V40/16G06V10/80G06V10/82G06N3/08G06N20/10G10L15/05

CPCG06N3/08G06N20/10G10L15/05G06V40/174G06V40/168

Inventor 姜晓庆陈贞翔杨倩郑永强

Owner UNIV OF JINAN

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

A multi-modal emotion recognition method and system

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

Embodiment 2

Embodiment 3

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology