Emotion recognition method and device based on LSTM audio and video fusion and storage medium

An emotion recognition technology, applied in character and pattern recognition, acquisition/recognition of facial features, voice analysis, etc. It addresses the previously unsolved problem of emotion recognition when the acoustic channel cannot obtain a signal, and improves the accuracy and robustness of emotion recognition.

Pending Publication Date: 2020-02-21
南京励智心理大数据产业研究院有限公司
Cites: 3. Cited by: 19.

AI Technical Summary

Problems solved by technology

[0005] Zhao Xiaoming and Zhang Shiqing proposed a robust speech emotion recognition method based on compressed sensing, intended for speech emotion recognition against a noisy background. The method considers the effectiveness of different types of feature parameters: feature extraction is extended from the two aspects of prosodic features and sound quality features to include Mel-frequency cepstral coefficients (MFCC), which further improves the noise robustness of the feature parameters. However, emotion recognition when the acoustic channel cannot obtain a signal has not yet been solved.

Embodiment Construction

[0066] The technical solutions in the embodiments of the present application are described clearly and completely below with reference to the drawings in the embodiments of the present application. The described embodiments are only some of the embodiments of the present application, not all of them. All other embodiments obtained by persons of ordinary skill in the art from the embodiments in this application without creative effort fall within the scope of protection of this application.

[0067] It should be understood that, when used in this specification and the appended claims, the terms "comprising" and "comprises" indicate the presence of the described features, integers, steps, operations, elements and/or components, but do not exclude the presence or addition of one or more other features, integers, steps, operations, elements, components and/or collections thereof.

[0068] It should also be understood that the terminology used in the specificati...

Abstract

The invention discloses an emotion recognition method and device based on LSTM (Long Short-Term Memory) audio and video fusion, and a storage medium. An LSTM model is adopted and trained on more detailed frame-level features, which yields more accurate emotion recognition. In addition, a method combining decision fusion with late fusion is adopted, so that the recognition results of the two modalities, obtained from the speech emotion recognition features and the facial expression recognition features, can be fused more effectively, and a more accurate emotion recognition result is obtained through calculation. With the method provided by the invention, the emotional state of the predicted subject can be obtained more accurately, and the accuracy and robustness of emotion recognition are improved.
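The decision-level fusion mentioned in the abstract can be illustrated with a minimal sketch. The excerpt does not give the patent's actual fusion rule, so this assumes a simple weighted average of per-class probabilities from the two modalities; the label set `EMOTIONS`, the function names, and the `audio_weight` parameter are all hypothetical. The fallback to the video result when no audio result is available is likewise an assumption, motivated by the stated problem of a missing acoustic signal.

```python
from typing import List, Optional, Sequence

# Hypothetical label set; the excerpt does not list the emotion classes.
EMOTIONS = ("angry", "happy", "neutral", "sad")


def fuse_decisions(
    audio_probs: Optional[Sequence[float]],
    video_probs: Sequence[float],
    audio_weight: float = 0.5,
) -> List[float]:
    """Fuse per-class probabilities from two modalities (assumed rule).

    Uses a weighted average, then renormalises so the result sums to 1.
    If audio_probs is None (no acoustic signal), only video is used.
    """
    if not 0.0 <= audio_weight <= 1.0:
        raise ValueError("audio_weight must lie in [0, 1]")
    if audio_probs is None:
        fused = list(video_probs)
    else:
        if len(audio_probs) != len(video_probs):
            raise ValueError("both modalities must score the same classes")
        fused = [
            audio_weight * a + (1.0 - audio_weight) * v
            for a, v in zip(audio_probs, video_probs)
        ]
    total = sum(fused)
    if total <= 0.0:
        raise ValueError("probabilities must have a positive sum")
    return [p / total for p in fused]


def predict_emotion(
    audio_probs: Optional[Sequence[float]],
    video_probs: Sequence[float],
    audio_weight: float = 0.5,
) -> str:
    """Return the label with the highest fused probability."""
    fused = fuse_decisions(audio_probs, video_probs, audio_weight)
    best = max(range(len(fused)), key=fused.__getitem__)
    return EMOTIONS[best]


if __name__ == "__main__":
    audio = [0.1, 0.6, 0.2, 0.1]   # made-up speech-branch output
    video = [0.2, 0.3, 0.4, 0.1]   # made-up face-branch output
    print(predict_emotion(audio, video))
    print(predict_emotion(None, video))
```

In such a setup the per-modality probabilities would come from the speech and facial-expression models; the values above are made up purely to exercise the function.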

Description

Technical field

[0001] The present application relates to the technical field of speech recognition, in particular to an emotion recognition method, system, device and storage medium based on LSTM audio-video fusion.

Background technique

[0002] Emotion is important information in human communication and is usually expressed through facial expressions, speech, text, body movements, etc. With the rapid development of information technology, demand for smart devices keeps growing, and intelligent technologies such as human-computer interaction are becoming increasingly important. Emotion recognition technology has broad applications and prospects in human-computer interaction, car and aircraft driving, and medical care.

[0003] The modalities of emotional expression include facial expressions, speech, text, physiological signals, postures, etc. The current mainstream affective computing methods are mainly divi...

Application Information

IPC(8): G06K9/00, G06K9/62, G06N3/04, G10L17/26, G10L25/24
CPC: G10L17/26, G10L25/24, G06V40/174, G06V20/40, G06N3/045, G06N3/044, G06F18/214, G06F18/253
Inventor: 李浩然, 傅杰, 赵力, 张玲
Owner: 南京励智心理大数据产业研究院有限公司