Part-of-speech-tagging-based dictionary construction method applied in shopping comment emotion analysis

A sentiment dictionary construction technique based on part-of-speech tagging, applied in the field of natural language processing or conversion. It addresses shortcomings of earlier methods (stop-word selection that skews experiments, failure to consider words of other parts of speech, and high algorithm complexity) and improves classification accuracy.

Inactive Publication Date: 2016-08-17
NANJING UNIV OF POSTS & TELECOMM
11 Cites, 30 Cited by

AI Technical Summary

Problems solved by technology

The algorithm used in this existing method is complex, and training its sentiment feature classifier requires a large, suitable, labeled corpus.
[0006] The main disadvantages of the invention patent with publication number CN104731923A, titled "Construction Method of Internet Commodity Review Mining Ontology Thesaurus", are as follows. First, it does not use a general stop-word list; instead, it calculates feature frequency and document frequency on the experimental data and treats high-valued words as stop words. This process is prone to bias and discards words that carry sentiment, which affects the experiment. Second, when constructing the thesaurus, it does not consider the impact of words with parts of speech other than nouns on product review analysis.
[0007] The main disadvantages of the invention patent with publication number CN103207855A, titled "Fine-Grained Sentiment Analysis System and Method for Product Review Information", are as follows. First, the sentiment analysis system must be trained on a large amount of labeled text and updated regularly, which costs considerable manpower and time. Second, it does not consider the impact of stop words and Internet buzzwords on sentiment analysis. Third, it relies too heavily on the collocation and combination dictionaries in its database, which complicates the calculation, and it does not consider the sentiment tendency of words of different parts of speech in review text.
[0008] In summary, the existing sentiment classification research for shopping reviews is not accurate enough to meet the needs of practical applications.

Examples

Embodiment Construction

[0049] The specific implementation of the present invention will be described in further detail below in conjunction with the accompanying drawings and embodiments. The embodiments described in the present invention are only a part of the embodiments of the present invention, not all the embodiments.

[0050] Figure 1 is a schematic flow diagram of an embodiment of the dictionary construction method based on part-of-speech tagging for shopping review sentiment analysis proposed by the present invention. The method includes the following steps:

[0051] A. Preprocess the shopping review data for hotels, books, and computers, including comment segmentation, word segmentation, and stop-word filtering.

[0052] Specifically, as shown in Figure 2, step A includes the following steps:

[0053] A1. Read each comment and use the Jieba word segmentation tool to split the comment into individual words;

[0054] A2. Filter the segmented words using a stop-word list;

[0055] B. Buil...
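
Steps A1 and A2 above can be illustrated with a minimal sketch. It is not the patent's implementation: the stop-word entries, the function names, and the regex-based stand-in tokenizer are all assumptions made so the example runs without extra dependencies. For Chinese reviews, a segmenter such as Jieba's `jieba.lcut` could be passed as the `tokenize` argument instead.

```python
import re
from typing import Callable, Iterable, List

# Hypothetical stop-word list; the patent text does not enumerate its entries.
STOP_WORDS = frozenset({"the", "a", "is", "and", "to"})


def simple_tokenize(sentence: str) -> List[str]:
    """Stand-in tokenizer: lowercased runs of word characters.

    Assumption for illustration only; the patent uses the Jieba tool here.
    """
    return re.findall(r"\w+", sentence.lower())


def split_comment(comment: str) -> List[str]:
    """Comment segmentation: split on common sentence-ending punctuation."""
    parts = re.split(r"[.!?;。！？；]+", comment)
    return [p.strip() for p in parts if p.strip()]


def preprocess(
    comment: str,
    tokenize: Callable[[str], List[str]] = simple_tokenize,
    stop_words: Iterable[str] = STOP_WORDS,
) -> List[str]:
    """Segment a comment into words (A1), then drop stop words (A2)."""
    stop = set(stop_words)
    words: List[str] = []
    for sentence in split_comment(comment):
        words.extend(w for w in tokenize(sentence) if w not in stop)
    return words


if __name__ == "__main__":
    print(preprocess("The room is clean. The staff is friendly!"))
```

Passing the tokenizer as a parameter keeps the stop-word filtering independent of the segmentation tool, so the same filter applies whichever segmenter is used.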


Abstract

The invention discloses a dictionary construction method based on part-of-speech tagging for sentiment analysis of shopping reviews. The method comprises: preprocessing the text data of shopping reviews, that is, segmenting each review, performing word segmentation, filtering out stop words, and partitioning the reviews by shopping domain; constructing a basic sentiment dictionary and an Internet buzzword sentiment dictionary; and, taking a shopping review corpus as the data set, performing part-of-speech tagging on it, extracting words tagged as habitually used words, adverbs, or adjectives as candidate words, selecting new sentiment words as domain sentiment words by calculating the PTF-IDF values of the candidate words, and adding them to a domain sentiment dictionary. The domain sentiment dictionary is combined with the basic sentiment dictionary and the Internet buzzword sentiment dictionary, sentiment features are screened and extracted from the shopping reviews, and the sentiment classification of the reviews is studied. Experiments show that the method achieves a high accuracy rate, is not limited to a particular shopping domain, and is better suited to practical application.
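
The candidate-selection step can be sketched as follows. This excerpt does not define PTF-IDF, so the code substitutes a plain corpus-level TF-IDF with smoothed IDF as a stand-in; it should not be read as the patent's formula. The tag names (`ADJ`, `ADV`, `IDIOM`), function names, and the `top_k` cutoff are hypothetical, and the input is assumed to be already part-of-speech tagged.

```python
import math
from collections import Counter
from typing import Dict, Iterable, List, Set, Tuple

TaggedDoc = List[Tuple[str, str]]  # one review as (word, POS tag) pairs

# Hypothetical tag names for adjectives, adverbs, and habitually used words.
CANDIDATE_TAGS = frozenset({"ADJ", "ADV", "IDIOM"})


def candidate_scores(docs: List[TaggedDoc]) -> Dict[str, float]:
    """Score POS-filtered candidate words with plain TF-IDF.

    Stand-in for PTF-IDF, whose definition is not given in this excerpt.
    """
    tf: Counter = Counter()  # corpus-wide term counts
    df: Counter = Counter()  # number of reviews containing each term
    for doc in docs:
        cands = [word for word, tag in doc if tag in CANDIDATE_TAGS]
        tf.update(cands)
        df.update(set(cands))
    total = sum(tf.values())
    if total == 0:
        return {}
    n = len(docs)
    return {
        w: (tf[w] / total) * (math.log((1 + n) / (1 + df[w])) + 1.0)
        for w in tf
    }


def select_domain_words(
    scores: Dict[str, float], known: Set[str], top_k: int
) -> List[str]:
    """Keep the top_k highest-scoring candidates not already in a dictionary."""
    ranked = sorted(scores, key=lambda w: (-scores[w], w))
    return [w for w in ranked if w not in known][:top_k]


def build_combined_dictionary(
    basic: Set[str], buzzwords: Set[str], domain: Iterable[str]
) -> Set[str]:
    """Combine the basic, buzzword, and domain sentiment dictionaries."""
    return set(basic) | set(buzzwords) | set(domain)
```

Excluding words that already appear in the basic or buzzword dictionaries is one way to keep only new sentiment words as domain entries, which is how the abstract describes the domain dictionary.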

Description

Technical field

[0001] The invention relates to natural language processing or conversion, within the field of data processing methods suited to specific functions, and in particular to a dictionary construction method based on part-of-speech tagging for sentiment analysis of shopping reviews.

Background

[0002] With the rapid development of the Internet, the rise of e-commerce has drawn more and more users to shop online and enjoy the benefits of shopping "without leaving home" and of goods that are "cheap and good". At the same time, users express their subjective views and opinions on commodities by commenting on their purchases in online shopping malls. However, because online shopping has no geographical restrictions, the same convenience also prevents users from directly handling the products and judging their quality, which may cause differences between the descriptions of the products in online shopping malls and the a...


Application Information

IPC(8): G06F17/27
CPC: G06F40/242
Inventor: 王磊, 吴潇, 周亮, 魏昕, 陈建新
Owner: NANJING UNIV OF POSTS & TELECOMM