Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Unsupervised entity relationship extraction method based on zero-shot

An entity-relationship, unsupervised technology, applied in the computer field, can solve the problem of lack of large-scale and complete annotated corpus, and achieve the effect of improving accuracy, reducing cost and reducing cost

Active Publication Date: 2019-12-10
BEIJING UNIV OF TECH
View PDF5 Cites 13 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] Although the above methods can complete the task of entity relationship extraction, the existing entity relationship extraction methods still have the following challenges: (1) In many fields, the relationship trigger words between entities can not only be described by the entity pair nearby verbs , can also be described by adjectives or adverbs in the sentence
(2) Due to the late start of information technology in my country, there is a lack of large-scale and complete labeled corpora in most fields, such as the medical field

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Unsupervised entity relationship extraction method based on zero-shot
  • Unsupervised entity relationship extraction method based on zero-shot
  • Unsupervised entity relationship extraction method based on zero-shot

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0023] Features and exemplary embodiments of various aspects of the invention will be described in detail below

[0024] The invention extracts the entity relationship by matching the triple feature extracted from the data with the entity type feature extracted from the ontology. It is hoped to improve the accuracy of entity relationship extraction and reduce manual labeling. The overall structure is as figure 1 As shown, it is divided into a data preprocessing module (1), a feature extraction module (2), a training relationship extraction network model (3), and an entity relationship classifier module (4).

[0025] Data preprocessing module (1): Firstly, the electronic medical records are divided into sentences according to ".", ";" punctuation marks, and secondly, the Harbin Institute of Technology LTP-Cloud platform is used to segment the sentences, and extract part-of-speech tags and dependency syntax analysis.

[0026] Feature extraction module (2): This module can be d...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an unsupervised entity relationship extraction method based on zero-shot, and belongs to the field of computers. An entity relationship category is judged by extracting triplefeatures in text data and entity relationship type features in a domain knowledge graph and calculating the similarity between the triple features and the entity relationship type features, so that the dependence of a traditional entity relationship extraction method on manual annotation is reduced, and the accuracy of entity relationship extraction is improved. The method comprises the steps of data preprocessing, feature extraction, relation extraction network model training and entity relation classifier training. A convolutional neural network model which is good at capturing sentence information is adopted to respectively extract triple and relationship type features, and finally softmax is used to predict entity relationship type tags. In a model construction process, a sparse taggedcorpus can be used as a training set, and in a test process, the same parameters as those in the training process can also be used for predicting the type of an untagged triad.

Description

technical field [0001] The invention belongs to the field of computers and relates to a zero-shot based unsupervised entity relationship extraction method. Background technique [0002] In today's big data era, due to the rapid growth of data and the variety of types, the problem of information overload is becoming more and more serious. Therefore, how to quickly and accurately obtain the important information required is the main problem we are facing today. Information extraction technology extracts important information contained in the text by extracting specified types of factual information such as entities, relationships, and events from natural language texts. As an important subtask in information extraction technology, entity relationship extraction mainly identifies and classifies the relationship between concepts in sentences or texts. At the same time, it is also the basis for many tasks in the field of natural language processing, such as machine translation, q...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/28
CPCG06F16/285G06F16/288
Inventor 赵青王丹冯韦玮杜金莲付利华
Owner BEIJING UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products