Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Short text sentiment analysis method based on sum product network depth autocoder

An auto-encoder, network depth technology, applied in the field of sum-product network and short text sentiment analysis, can solve the problems of cumbersome training, serious gradient dilution, no global optimization, etc., to achieve the effect of reducing the size

Inactive Publication Date: 2017-11-17
JILIN UNIV
View PDF6 Cites 11 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, deep autoencoders also have many limitations: 1. Its structure is stacked by different types of modeling units. These modeling units have theoretical flaws, so the deep structure they build is not perfect.
2. Its unsupervised layer-by-layer greedy training only solves the local minimum problem to a certain extent, without global optimization. As the number of hidden layers increases, the gradient dilution becomes more serious, and its training is extremely cumbersome and requires a lot of skills
3. The model structure is often fixed, and it does not consider the relevance of the short text data itself, and cannot perfectly fit different types of short text data

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Short text sentiment analysis method based on sum product network depth autocoder
  • Short text sentiment analysis method based on sum product network depth autocoder
  • Short text sentiment analysis method based on sum product network depth autocoder

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0025] Step 1. Preprocessing the short text data crawled online;

[0026] The collection of short text data is divided into two parts: the first part is to use Python's BeautifulSoup library with web crawling function to crawl unlabeled short text data online. Example: Use BeautifulSoup and Twitter API to crawl short text data of Weibo online, crawl short text data of movie reviews on IMDb movie review website, etc. The second part is to collect publicly available labeled short text data. Use the Porter algorithm to extract the stem of the obtained short text data; use the regular expression method to replace special text such as repeated characters, user handles, links, emoticons, and hashtags with concise representations; Negative words are detected by the distance between the nearest neighbor explicit negative words.

[0027] Step 2, using the doc2vec model to train sentence vectors;

[0028] Use the large amount of unlabeled short text data obtained in step 1 to train t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a short text sentiment analysis method based on a sum product network depth autocoder. The method comprises the following steps of 1, preprocessing short text data; 2, utilizing a doc2vec model to train sentence vectors; 3, utilizing the sum product network depth coder to code the sentence vectors, and obtaining layered abstract characteristics of the sentence vectors; 4, utilizing a maximum product network depth decoder to decode the layered abstract characteristics, comparing the decoded characteristics with the primary sentence vector characteristics, calculating a reconstruction error, adjusting parameters of the sum product network depth autocoder to make the reconstruction error smallest, obtaining an optimal sum product network depth coder, and obtaining an optimal layered abstract characteristic by the optimal sum product network depth coder; 5, utilizing the optimal layered abstract characteristic to conduct online structure learning to generate a sum product network structure, using a small amount of short text data with tags to finely adjust a sum product network, using an online parameter learning algorithm to continuously adjust network parameters, inputting a test set, and obtaining sentiment classifications through the trained sum product network.

Description

technical field [0001] The present invention relates to sum product networks and short text sentiment analysis. Background technique [0002] With the continuous development of social networks, people are more willing to express their opinions through social networks, so understanding users' emotional tendencies through online comments of users has received widespread attention in the academic community. But the online comments of these users are short texts with few words. Short texts provide less information and are more colloquial, which increases the difficulty of sentiment analysis. [0003] Most of the current research based on supervised learning short text sentiment analysis has achieved good results, relying on a large amount of manually labeled data, making the system based on supervised learning need to pay a high labeling cost. On the contrary, unsupervised learning does not require manual labeling of data to train the model, which is a solution to reduce the c...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/30
CPCG06F16/355
Inventor 王生生张航赖永
Owner JILIN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products