A multi-modal emotion recognition method and system
An emotion recognition and multi-modal technology, applied in the field of emotion recognition, can solve the problems of not considering the emotion distinguishability of facial expression images, low emotion recognizability of expression images, poor model performance, etc., to achieve effective feature learning, The effect of performance improvement and small amount of calculation
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0057] The purpose of this embodiment is to provide a multimodal emotion recognition method.
[0058] A multimodal emotion recognition method, comprising:
[0059] Extract the emotional speech components and emotional image components in the emotional video, and store them separately;
[0060] Utilize the emotional voice residual conditional entropy difference endpoint detection method to perform endpoint detection on the emotional voice component, and obtain the endpoint detection result of each frame of voice;
[0061] Screen the emotional images in the emotional image component based on the endpoint detection results of the emotional speech component, and remove the emotional images with silent segments in the emotional image component;
[0062] Perform feature extraction on the reconstructed emotional speech components and the filtered emotional image components respectively;
[0063] The features of the emotional speech components and the features of the emotional image...
Embodiment 2
[0101] The purpose of this embodiment is to provide a multimodal emotion recognition system.
[0102] A multimodal emotion recognition system, comprising:
[0103] a data acquisition module, which is used to extract emotional speech components and emotional image components in the emotional video, and store them respectively;
[0104] An endpoint detection module, which is used to perform endpoint detection on the emotional speech component by utilizing the emotional speech residual conditional entropy difference endpoint detection method to obtain the endpoint detection result of each frame of speech;
[0105] an image screening module, which is used for screening emotional images in the emotional image components based on the endpoint detection results of the emotional speech components, and eliminating emotional images of silent segments in the emotional image components;
[0106] a feature extraction module, which is used to perform feature extraction on the reconstructed...
Embodiment 3
[0110] This embodiment provides a method for detecting the working state of customer service personnel in a call center, and the detection method utilizes the above-mentioned multimodal emotion recognition method.
[0111] When dealing with problems, customer service personnel need to communicate with customers and keep answering various questions from customers. This kind of work is characterized by cumbersome content and high pressure. At the same time, in some cases, the attitude of customers is not friendly. In the working environment, customer service personnel will have certain negative emotions, and if the customer service personnel have certain negative emotions such as disgust or anger, it will seriously affect the quality of service, and it is also very detrimental to the psychological health of the customer service personnel. The multimodal emotion recognition method proposed in the present disclosure can be effectively applied to the detection of the working state o...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com