Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Text sentiment analysis method based on convolutional neural network based on Chinese character component features

A technology of convolutional neural network and Chinese character components, which is applied in the field of text sentiment analysis of convolutional neural network, can solve the problems of insufficient granularity of feature extraction, time-consuming and laborious, classification results rely on extraction, etc., and achieve the effect of improving the effect of emotional classification

Active Publication Date: 2021-11-26
SUN YAT SEN UNIV
View PDF7 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The former generally refers to the combination of grammatical rules and sentiment lexicon to identify words with emotional polarity to calculate the emotional orientation of the text. However, due to the dependence on the size and quality of the sentiment lexicon, it not only requires a lot of manual preprocessing, but also has poor generalization, especially when used in When cross-domain text; the latter is divided into two stages, the first stage mainly uses traditional machine learning techniques, such as support vector machine (SVM), naive Bayesian (NB) and random forest (RF) classification Algorithm, the main problem is that it needs to manually construct features, and Chinese emotional features are different from English, Chinese has no spaces between words, and Chinese words are usually composed of more than one Chinese character, so word segmentation is required first based on word extraction features; so Usually extracting features is not only time-consuming and laborious, but also the classification results are too dependent on the extracted features; in the second stage, deep learning technology appeared, mainly referring to the application of various deep neural networks CNN, RNN, etc. to achieve classification. This method does not require manual labeling. Emotional dictionaries do not need to manually construct features, and rely entirely on self-learning to extract features, but the granularity of feature extraction is not fine enough, and often requires a large amount of labeled corpus, which is scarce in the field of Chinese sentiment analysis

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text sentiment analysis method based on convolutional neural network based on Chinese character component features
  • Text sentiment analysis method based on convolutional neural network based on Chinese character component features
  • Text sentiment analysis method based on convolutional neural network based on Chinese character component features

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0041] Such as Figure 1-3 As shown, a text sentiment analysis method based on the convolutional neural network of Chinese character component features, including the following steps:

[0042] S1: Obtain the information of Chinese characters and their components: Crawl the data of all components and radicals of Chinese characters from the HTTPCN website, save all basic components in the form of a list, save all Chinese characters and their corresponding component sequences in the form of a dictionary, and finally generate word embeddings at the component level;

[0043] S2: Obtain the first input channel expression: for all Chinese texts of the sentiment classification corpus, the Chinese character is used as the unit, and the Chinese character-level component embedding expression is generated based on the dictionary of Chinese character components and component-level word embedding, that is, the input of the first channel ;

[0044] S3: Obtain the second input channel expre...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a text sentiment analysis method based on the convolutional neural network of Chinese character component features. The method first considers the emotional intensity of the sentiment word, and optimizes the weight of the sentiment dictionary in combination with Attention; The feature of granularity, because the basic morpheme of Chinese is a Chinese character component, which carries rich information such as phonetics and semantics, which is different from the 26 letters of English, and finally proposes a convolution based on a dual-channel word embedding of Chinese character components and sentiment lexicon Chinese text sentiment classification method based on neural network. Experiments on multiple public data sets prove that the model can significantly improve the sentiment classification effect of text.

Description

technical field [0001] The present invention relates to the field of natural language in the direction of computer technology artificial intelligence, and more specifically, relates to a text sentiment analysis method based on a convolutional neural network based on the features of Chinese character components. Background technique [0002] In today's society, Internet users interact with information explosion, and interactive platforms in multiple industries such as e-commerce shopping, entertainment, catering, transportation and culture have produced a large number of short texts (Short Text). The emotional orientation of these text data is important for both parties. communication bridge. For example, movie distributors and movie fans are more inclined to show and watch new movies that are well received by the public; merchants and potential customers pay close attention to buyers' satisfaction with newly purchased products. These are typical usage scenarios of short text...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F40/242G06F40/289G06F16/35G06N3/04
CPCG06F16/35G06F40/289G06F40/30
Inventor 熊绘龙冬阳余跃甘加升
Owner SUN YAT SEN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products