The invention relates to a multi-
modal fusion
emotion recognition system and method based on multi-
task learning and an attention mechanism and an experimental evaluation method, and aims to solve the problems that in the prior art, a multi-
modal emotion recognition process without introducing a multi-
modal fusion mechanism is low in efficiency and accuracy. The invention belongs to the field of human-computer interaction, and provides a multi-modal fusion
emotion recognition method based on combination of multi-
task learning and an attention mechanism, and compared with single-modal emotion recognition work, the multi-modal fusion emotion recognition method based on combination of multi-
task learning and the attention mechanism is wider in application. Multi-task learning is utilized to introduce an auxiliary task, so that the emotion representation of each mode can be more efficiently learned, and an interactive attention mechanism can enable the emotion representations among the
modes to mutually learn and complement each other, so that the recognition accuracy of the multi-mode emotion is improved; experiments are carried out on the multi-
modal data sets CMU-MOSI and CMU-MOSEI, the accuracy and the F1 value are both improved, and meanwhile the accuracy and efficiency of emotion information recognition are improved.