Method and device for remote supervision relation extraction based on consistent text enhancement

A technology of remote supervision and relationship extraction, applied in neural learning methods, unstructured text data retrieval, text database clustering/classification, etc., can solve the problem that effective information cannot be fully utilized, model training direction deviation, instability, etc. question

Active Publication Date: 2022-06-17
WUHAN UNIV
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0008] 2) Many research methods reduce the weight of noise samples in the training set or directly filter them out, so that the effective information contained in these noise samples cannot be fully utilized;
[0009] 3) The disturbance added by methods such as confrontation generation, although it can increase the anti-disturbance ability of the model, it usually cannot provide disturbances that meet the actual situation, is not stable, and tends to deviate the direction of model training

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for remote supervision relation extraction based on consistent text enhancement
  • Method and device for remote supervision relation extraction based on consistent text enhancement
  • Method and device for remote supervision relation extraction based on consistent text enhancement

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0073] It should be understood that the specific embodiments described herein are only used to explain the present invention, but not to limit the present invention.

[0074] In a first aspect, an embodiment of the present invention provides a method for extracting long-distance supervision relationships based on consistent text enhancement.

[0075] In one embodiment, refer to figure 1 , figure 1 This is a schematic flowchart of an embodiment of a method for extracting a remote supervision relationship based on consistent text enhancement according to the present invention. like figure 1 As shown, the distantly supervised relation extraction method based on consistent text enhancement includes:

[0076] Step S10, obtaining multiple sentence instances, aligning each sentence instance to the knowledge base based on the assumption of remote supervision, determining the relation label corresponding to each sentence instance, and dividing the sentence instances with the same en...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The present invention provides a method and device for remote supervision relation extraction based on consistent text enhancement, the method comprising: dividing multiple sentence instances according to entity pairs and relation labels to obtain multiple sentence packages; Each sentence instance uses a different text enhancement method to obtain the strong enhancement samples and weak enhancement samples corresponding to each sentence instance in each sentence bag; determine the noise sample, and use the strong enhancement of the irrelevant sentence instance and the noise sample The samples and weakly enhanced samples train the relationship prediction model to obtain a trained relationship prediction model; use the trained relationship prediction model to predict the sentence package to be predicted, and obtain the corresponding relationship label. Through the present invention, the size of the data set can be increased through consistent text enhancement, the generalization learning ability of the model can be enhanced, and more supervision information can be learned by the "NA" category and the noise sample constraint model.

Description

technical field [0001] The present invention relates to the field of natural language processing, in particular to a method and device for extracting long-distance supervision relations based on consistent text enhancement. Background technique [0002] From the massive information on the Internet, a large amount of valuable knowledge and information can be extracted through the relevant technology of information extraction. As an important link in information extraction, Relation Extraction (RE) aims to extract the relationship between entities from text, for other natural language applications such as building knowledge graphs, search engines, dialogue generation, natural question answering, information retrieval, etc. Provided significant support. [0003] The training of relation extraction models requires a large number of labeled samples to provide supervision information. However, the same relation type may have different textual expressions, and at the same time, d...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/35G06F40/216G06N3/08G06N5/02
CPCG06F16/35G06F40/216G06N3/08G06N5/02
Inventor 彭敏罗娟胡刚廖庆文
Owner WUHAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products