Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Joint extraction method for viewpoints and viewpoint holders based on self-attention

A holder and attention technology, applied in natural language data processing, special data processing applications, instruments, etc., can solve the problem that the accuracy cannot meet the requirements, and achieve increased flexibility and coverage, high accuracy, and semantic representation. rich effects

Active Publication Date: 2018-10-09
NAT COMP NETWORK & INFORMATION SECURITY MANAGEMENT CENT +1
View PDF4 Cites 27 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The final extracted result of this method is a small paragraph composed of multiple sentences containing emotional tendencies, which does not necessarily contain evaluation, and the accuracy rate cannot meet the requirements

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Joint extraction method for viewpoints and viewpoint holders based on self-attention
  • Joint extraction method for viewpoints and viewpoint holders based on self-attention
  • Joint extraction method for viewpoints and viewpoint holders based on self-attention

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0044] The technical solution of the present invention will be further described below in conjunction with the accompanying drawings.

[0045] The inventive method has the following characteristics:

[0046] First, in the news corpus, generally only some sentences contain the opinions of institutions or experts. We designed a method for judging the opinion sentences of institutions and experts in the news corpus, which can quickly judge whether a paragraph contains opinion sentences.

[0047] Second, in order to realize the effective identification and extraction of evaluation holders and evaluation content in the news corpus, we constructed an end-to-end neural network model, which is based on self-attention and Pointer Network to identify evaluation content and its holders. The joint extraction performed.

[0048] In this way, we implement a joint extraction method based on self-attention and its holders.

[0049] The task of the present invention mainly includes three asp...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention is based on a joint extraction method for viewpoints and viewpoint holders based on self-attention. The method comprises the steps of S1, constructing a corpus for extracting the viewpoints and the viewpoint holders; S2, identifying statements containing the viewpoints; S3, conducting joint extraction on the viewpoints and the viewpoint holders. The method has the advantages that thesituation that extracted sentences do not contain the viewpoints is avoided through a text classification model; a joint extraction model for the viewpoints and the viewpoint holders is free from natural language processing links such as part-of-speech tagging, named entity recognition and syntactic dependency analysis, avoids the influence of errors in the links on the extraction effect of the model, and has high flexibility and coverage; the method comprises the steps of constructing the corpus for extracting the viewpoints and the viewpoint holders, identifying the statements containing the viewpoints and conducting joint extraction on the viewpoints and the viewpoint holders; self-attention is used on the basis of two-way LSTM, the advantages of the self-attention and the two-way LSTMare effectively combined, the representation semantics of word sequences is more abundant, and the accuracy of the trained model is higher.

Description

technical field [0001] The present invention relates to a natural language processing method, in particular to a joint extraction method based on self-attention (self-attention) viewpoints and their holders, which can automatically extract viewpoints and their holders in Chinese news corpus , belonging to the field of computer science and technology. Background technique [0002] With the development of Internet technology, a large amount of text information on the Internet has grown rapidly, electronic media has developed rapidly, traditional paper media is also joining the camp of electronic media, and news corpus has shown explosive growth. Opinion extraction from text has also attracted more and more attention from researchers, and has become one of the most active research areas in natural language processing. The explosive growth of news corpus on the Internet has, on the contrary, hindered access to information. In the past when the amount of news was small, it was ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/27G06F17/30
CPCG06F40/289G06F40/30
Inventor 李雄刘春阳张传新张旭王萌闫昊唐彬
Owner NAT COMP NETWORK & INFORMATION SECURITY MANAGEMENT CENT
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products