Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Short text sentiment analysis method based on CNN bidirectional GRU attention mechanism

A technology of emotion analysis and attention, which is applied in instruments, biological neural network models, electrical digital data processing, etc., can solve a large number of manual labeling data sets, time-consuming and labor-intensive, sentence emotional polarity deviation affects the accuracy of emotional analysis results, etc. question

Active Publication Date: 2020-02-18
BEIJING TECHNOLOGY AND BUSINESS UNIVERSITY
View PDF3 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The method based on machine learning relies on selecting effective feature combinations and using classifiers for sentiment classification, but this method requires a large number of manually labeled data sets to train the model, which is time-consuming and laborious
The method based on deep learning can mine deep semantic and emotional meanings and is widely used. However, supervised deep learning still requires a large number of annotations, while unsupervised methods have higher requirements for the semantic association of text, among which negative words, degree adverbs, transition words The use of etc. may cause the emotional polarity shift of the sentence to affect the accuracy of the sentiment analysis results, and further development and improvement are needed

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Short text sentiment analysis method based on CNN bidirectional GRU attention mechanism
  • Short text sentiment analysis method based on CNN bidirectional GRU attention mechanism
  • Short text sentiment analysis method based on CNN bidirectional GRU attention mechanism

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0033] The present invention will be described below in conjunction with the accompanying drawings and specific embodiments.

[0034] Among them, attached figure 1 The processing process of short text sentiment analysis method based on CNN bidirectional GRU attention mechanism is described.

[0035] Such as figure 1 Shown, the concrete implementation steps of the present invention:

[0036] (1) Preprocessing of short texts, noise removal, word segmentation, part-of-speech tagging, and removal of stop words. The word segmentation system of the Chinese Academy of Sciences (NLPIR) is used to segment the sentences and part-of-speech tagging, and the stop words of Harbin Institute of Technology are used to remove stop words.

[0037] (2) Through continuous bag-of-words model (CBOW), the sentence is trained and represented as a word sequence in units of words, and mapped to a multi-dimensional vector to construct a word vector set. The continuous bag of words model is a kind o...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a short text sentiment analysis method based on a CNN (Convolutional Neural Network) bidirectional GRU attention mechanism. The method comprises the following steps: preprocessing a short text, denoising, segmenting words, labeling part of voice, and removing stop words; expressing sentences into a word sequence through negative sampling training by taking words as units through a continuous bag-of-words model (CBOW), and mapping the word sequence into a multi-dimensional vector to construct a word vector set; calculating the occurrence frequency of sentiment words in different data set documents, calculating sentiment scores, and converting the sentiment scores into a sentiment feature vector matrix; word embedding and feature embedding topology serves as input ofa convolutional neural network, and sentence representation is obtained through convolution and pooling; through a bidirectional GRU recurrent neural network, negative word turning words are set as parameter query items of an attention mechanism to obtain representation; and combining the two representation topologies to serve as full connection layer input, and outputting an emotion analysis result.

Description

technical field [0001] The invention belongs to the field of natural language processing, in particular to a short text sentiment analysis method based on CNN bidirectional GRU attention mechanism Background technique [0002] With the rapid development of the Internet, more and more users express their opinions and express their emotional comments through social media, most of which exist in the form of short texts. Hot events will arouse widespread concern and discussion in the society. It is an extremely challenging task to grasp the direction of public opinion in a timely manner and obtain user emotional tendencies. [0003] Sentiment analysis can understand the public's emotional changes on hot events by preprocessing, analyzing, and mining users' emotional tendencies. It is also one of the important directions of natural language processing. Sentiment analysis research methods are mainly divided into dictionary-based methods, machine learning-based methods, and deep l...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F40/284G06N3/04
CPCG06N3/045Y02D10/00
Inventor 刘新亮张腾高彦平高圣乔陈念洪坤明
Owner BEIJING TECHNOLOGY AND BUSINESS UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products