Entity linking method based on entity context semantic interaction

A semantic interaction and context technology, applied in the field of data processing, can solve the problems of local similarity feature extraction and loss of text details in the interaction between query text and knowledge base text, and achieve the effect of enriching detailed semantic features and verifying effectiveness.

Active Publication Date: 2020-07-17
CHINA ELECTRONICS TECH CYBER SECURITY CO LTD
View PDF7 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Existing research usually uses CNN (Convolutional Neural Network) or LSTM (Long Short-Term Memory Network) to encode entity text, and does not perform further local similarity feature extraction on the interaction between query text and knowledge base text, resulting in possible loss of text detail features

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Entity linking method based on entity context semantic interaction
  • Entity linking method based on entity context semantic interaction
  • Entity linking method based on entity context semantic interaction

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0045] This embodiment proposes an entity linking method based on entity context semantic interaction, such as figure 1 As shown in , the link entity context and the candidate entity knowledge base text are encoded using different encoding methods, and the word-level attention matrix is ​​used to interact with the two encoded texts, and then the interactive representation is sent to the full connection and the largest pool The layer further extracts the interactive text vector, and finally uses the vector splicing for binary classification, as follows:

[0046] 1.1 Candidate Entity Generation

[0047] In the training phase, candidate entities are generated by exact matching with entities in the knowledge base, and all candidate entities are entities with the same name as the query entity;

[0048] In the data preprocessing stage, all entity names, aliases and corresponding library names in the knowledge base are stored in the form of a dictionary, so as to facilitate querying e...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to the technical field of data processing, and discloses an entity linking method based on entity context semantic interaction. The context information of the entity to be linkedwith the attribute description information of the knowledge base entity are combined, a Transformer structure is adopted to encode a knowledge base entity text, an LSTM network is adopted to encode and inquire the entity text, and fine-grained word-level attention interaction is adopted to capture local similar information of the text for semantic encoding of the knowledge base entity text and the inquired entity text. According to the invention, on the basis of using the LSTM and Transformer networks to encode two segments of texts respectively, word-level fine-grained semantic feature interaction is increased; the detail semantic features of the text are enriched, the accuracy rates of 89.1% and 88.5% are achieved on a verification set and a test set and exceed 2.1% and 1.7% of a current mainstream entity link coding model CNN and an LSTM network respectively, and the effectiveness of the entity link method is shown.

Description

technical field [0001] The invention relates to the technical field of data processing, in particular to an entity linking method based on entity context semantic interaction. Background technique [0002] Entity linking is the process of mapping entity references in natural language to correct candidate entities in a knowledge base. Unstructured natural language representations often have a large number of vague and irregular expressions. In the medical field, such diversity and ambiguity are more common. For example, "Lilac Polygonum" can refer to traditional Chinese medicine for treating diseases such as lung heat and cough, and can also refer to plants of the willow leaf family. As a plant, it is also called "small pomegranate tree", "small pomegranate leaf", "small therapeutic medicine" etc. Accurately understanding the specific entity referred to in the text and linking the entity to the existing knowledge base entity can greatly improve the effectiveness of informat...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/134G06F40/126G06F40/279G06F16/36G06N3/04G06N3/08
CPCG06F16/367G06F16/374G06N3/049G06N3/08G06N3/044G06N3/045
Inventor 王伟许峻峰张焱刘刚孙成胜敖佳
Owner CHINA ELECTRONICS TECH CYBER SECURITY CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products