Mongolian multi-modal fine-grained sentiment analysis method fusing prior knowledge model

A technology of prior knowledge and sentiment analysis, applied in semantic analysis, neural learning methods, biological neural network models, etc., can solve problems such as unregistered words, insufficient emoji important features, limited feature data information, etc., to improve quality Effect

Pending Publication Date: 2021-11-05
INNER MONGOLIA UNIV OF TECH
View PDF4 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

First, due to the rich and varied forms of Mongolian vocabulary, there are serious unregistered words in the process of Mongolian text sentiment analysis, and the existence of a large number of unregistered words seriously affects the accuracy of sentiment analysis.
Second, current sentiment analysis mainly analyzes text data. For data containing emoticons, important information such as emoticons is generally deleted during the data cleaning stage, which cannot fully utilize emoticons, an important feature of sentiment analysis.
Third, the current single neural network model has limited classification efficiency when solving sentiment analysis, and is limited to certain feature data information

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Mongolian multi-modal fine-grained sentiment analysis method fusing prior knowledge model
  • Mongolian multi-modal fine-grained sentiment analysis method fusing prior knowledge model
  • Mongolian multi-modal fine-grained sentiment analysis method fusing prior knowledge model

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0051] The implementation of the present invention will be described in detail below in conjunction with the drawings and examples.

[0052] Such as Figure 1-2 As shown, a kind of Mongolian language multimodal fine-grained emotion analysis method of fusion prior knowledge model of the present invention, the process is as follows:

[0053] Step 1: Preprocess the Chinese and Mongolian emotional text corpora. Preprocessing is to clean the acquired corpus, including steps such as removing user name information, removing URLS, and removing special characters.

[0054] Step 2: Due to insufficient Mongolian corpus information, the obtained Chinese corpus containing emojis is machine translated into Mongolian corpus to achieve the purpose of expanding Mongolian predictions.

[0055] Step 3: Before model training, the emotional text corpus should be preprocessed. The present invention uses Chinese jieba word segmentation and regularized byte pair encoding technology (BPE) to segment...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a Mongolian multi-modal fine-grained sentiment analysis method fused with a prior knowledge model. The method comprises the following steps: preprocessing a Chinese and Mongolian sentiment corpus containing emoticons; converting the text words and the emoticons obtained through preprocessing into dynamic word vectors; introducing an attention mechanism to carry out fine-grained sentiment analysis on a word vector spliced by the text word quantity and the emoticon word vector; respectively creating a Mongolian emotion dictionary and an emoticon emotion dictionary, and taking features extracted by the Mongolian emotion dictionary and the emoticon emotion dictionary as emotion features finally extracted by the model; pre-training the model fused with the prior knowledge on large-scale Chinese and Mongolian linguistic data to obtain a Mongolian multi-modal fine-grained sentiment analysis model fused with the prior knowledge model; and comparing and evaluating the analysis result of the model and the analysis result of a single network analysis method according to the accuracy, the precision rate, the recall rate and the F1 value of each emotion category, so that the purpose of improving the Mongolian text emotion analysis performance is achieved.

Description

technical field [0001] The invention belongs to the technical field of artificial intelligence, in particular to a Mongolian language multimodal fine-grained emotion analysis method fused with prior knowledge models. Background technique [0002] With the rapid development of Internet technology, more and more people begin to express various opinions on social platforms such as Weibo, forums, film and television websites, shopping websites and other platforms to share their moods, views and opinions. In particular, with the rapid development of network technology, emoticons have gradually developed into a new data form different from text, images, and videos, and play an important role in the field of sentiment analysis. The texts, emoticons and other information published by users may contain different emotional colors: happy or favorite; sad or angry. The core of sentiment analysis is to accurately divide the emotion expressed by a piece of text into seven categories: hap...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/284G06F40/30G06F40/242G06F40/58G06N3/04G06N3/08
CPCG06F40/284G06F40/242G06F40/30G06F40/58G06N3/08G06N3/047G06N3/048G06N3/044
Inventor 仁庆道尔吉张倩张文静刘馨远张毕力格图郎佳珺苏依拉李雷孝
Owner INNER MONGOLIA UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products