
Nested named entity recognition method and system, electronic equipment and readable medium

A nested named entity recognition technology, applied in fields such as electrical digital data processing, instruments, and calculation, that addresses the difficulty and time cost of forming large datasets.

Pending Publication Date: 2020-04-03
INFORMATION SCI RES INST OF CETC

AI Technical Summary

Problems solved by technology

Constructing training datasets for named entity recognition in professional fields is time-consuming because the data must be annotated by people with professional knowledge, so large datasets are difficult to form.

Method used

The nested named entity recognition problem is converted into a non-nested one: the marked texts are clustered according to each named entity so that every text in a cluster set corresponds to exactly one named entity, and a named entity recognition model with adaptive data enhancement then recognizes the entities in each cluster set.



Detailed Description of the Embodiments

[0056] In order to enable those skilled in the art to better understand the technical solutions of the present invention, the present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0057] As shown in Figure 1 and Figure 2, a nested named entity recognition method includes the following steps:

[0058] Mark each text in the corpus based on a preset text marking method to obtain a mark set. The mark set includes the texts and their corresponding named entities, and at least one text corresponds to multiple named entities;

[0059] Based on a preset clustering method, cluster the mark set according to each named entity to obtain cluster sets. Each cluster set includes texts and the named entity uniquely corresponding to each text;
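The marking and clustering steps in [0058] and [0059] can be sketched as follows. This is only a minimal illustration, not the patent's implementation: the source does not specify the preset marking or clustering methods, so the `Mention` record, the `cluster_by_entity_type` function, and the choice to group mentions by entity type are all assumptions made for this example.

```python
from collections import defaultdict
from typing import NamedTuple


class Mention(NamedTuple):
    """One annotated span (hypothetical record layout)."""
    start: int   # inclusive character offset
    end: int     # exclusive character offset
    label: str   # entity type, e.g. "ORG" or "PER"


def cluster_by_entity_type(mark_set):
    """Split a mark set into one cluster per entity type (assumed rule).

    mark_set: list of (text, [Mention, ...]) pairs; one text may carry
    several, possibly nested, mentions.
    Returns {label: [(text, [Mention, ...]), ...]}.
    """
    clusters = defaultdict(list)
    for text, mentions in mark_set:
        # Group this text's mentions by their entity type.
        by_label = defaultdict(list)
        for mention in mentions:
            by_label[mention.label].append(mention)
        # The same text is added to every cluster whose type it contains.
        for label, group in by_label.items():
            clusters[label].append((text, group))
    return dict(clusters)


text = "Bethune Medical College"
mark_set = [(text, [Mention(0, 23, "ORG"), Mention(0, 7, "PER")])]
clusters = cluster_by_entity_type(mark_set)
```

In this sketch the nested text appears once in the `"ORG"` cluster and once in the `"PER"` cluster, so each cluster sees a single label per span. Nested entities of the same type would still collide inside one cluster; how the patent handles that case is not stated in the visible text.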

[0060] Based on a preset named entity recognition model with adaptive data enhancement, recognize the named entities in each cluster set separately.
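The abstract describes the adaptive part as raising the degree of data enhancement gradually according to the training effect. One way to picture such a controller is sketched below. The update rule, the integer intensity levels, and the names `next_level` and `run_schedule` are hypothetical; the visible text of the patent does not give the actual rule, the model, or the augmentation operations.

```python
def next_level(level, score, best_score, max_level=5):
    """Return (new_level, new_best_score) under an assumed rule.

    If the latest validation score beats the best score so far, raise the
    augmentation level by one; otherwise step back by one. The level is
    clamped to the range [0, max_level].
    """
    if score > best_score:
        return min(level + 1, max_level), score
    return max(level - 1, 0), best_score


def run_schedule(scores, max_level=5):
    """Replay validation scores and record the level used in each epoch."""
    level, best = 0, float("-inf")
    history = []
    for score in scores:
        history.append(level)  # level in effect when `score` was measured
        level, best = next_level(level, score, best, max_level)
    return history, level


# Made-up validation scores, for illustration only.
history, final_level = run_schedule([0.60, 0.65, 0.64, 0.70])
```

With these made-up scores the level climbs while the score improves, backs off after the dip at the third epoch, and rises again afterwards. A real training loop would plug the level into whatever augmentation the model uses.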

[0061] Thr...


Abstract

The invention provides a nested named entity recognition method and system, electronic equipment, and a readable medium. The method includes the following steps: marking all texts in a corpus based on a preset text marking method to obtain a mark set, wherein the mark set comprises texts and their corresponding named entities, and at least one text corresponds to multiple named entities; clustering the mark set according to each named entity, based on a preset clustering method, to obtain cluster sets, each cluster set comprising texts and the named entity uniquely corresponding to each text; and recognizing the named entities in each cluster set with a preset named entity recognition model with adaptive data enhancement (data augmentation). The nested named entity recognition problem is thereby converted into a non-nested one, which reduces the influence of entity nesting on recognition performance. The degree of data enhancement is raised gradually according to the training effect, so that its intensity is held at the optimal level; this improves training and adapts the method to nested named entity recognition when samples are insufficient.

Description

Technical Field

[0001] The invention belongs to the technical field of named entity recognition, and in particular relates to a nested named entity recognition method, a nested named entity recognition system, an electronic device, and a computer-readable storage medium.

Background

[0002] Named entity recognition (NER) is one of the basic research topics in natural language processing; its task is to identify language chunks in text. In practical applications, named entity recognition often faces two problems: nested named entities and insufficient training samples.

[0003] Nesting of named entities makes it impossible to establish a one-to-one relationship between text and entity labels. For example, "Bethune Medical College" is an organization-name entity, while "Bethune" is a person-name entity, so during text labeling "Bethune" carries two labels. The multi-label problem will increase the c...
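The "Bethune Medical College" example can be made concrete with a small sketch that collects the labels covering each token. The token-level span format and the `token_label_sets` helper are assumptions for illustration; the source does not describe its annotation format.

```python
def token_label_sets(tokens, spans):
    """Collect, for every token, the set of entity labels that cover it.

    spans: list of (start_token, end_token_exclusive, label) triples
    (hypothetical format).
    """
    labels = [set() for _ in tokens]
    for start, end, label in spans:
        for i in range(start, end):
            labels[i].add(label)
    return labels


tokens = ["Bethune", "Medical", "College"]
spans = [(0, 3, "ORG"), (0, 1, "PER")]  # the ORG span contains the PER span
labels = token_label_sets(tokens, spans)

# Tokens covered by more than one label cannot be described by a
# single-label tagging scheme.
multi_label_tokens = [tok for tok, ls in zip(tokens, labels) if len(ls) > 1]
```

Here only "Bethune" ends up with two labels, which is the one-to-many relationship between text and entity labels that paragraph [0003] describes.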


Application Information

IPC(8): G06F40/295
Inventor: 温秀秀, 刘佩云, 郭橙, 潘博文, 高原原
Owner: INFORMATION SCI RES INST OF CETC