Interactive relation labeling and extracting framework capable of being quickly started
A fast-start and relational extraction technology, applied in character and pattern recognition, biological neural network models, structured data retrieval, etc., can solve problems such as heavy labor cost input and high cold start cost
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0048] Embodiment 1: see Figure 1-Figure 3 , a fast-start interactive relation labeling and extraction framework, including the following steps:
[0049] S1: Pre-train the named entity recognition model using the general named entity recognition dataset;
[0050] S2: Use the general relation extraction dataset to pre-train the few-sample relation extraction model;
[0051] S3: Set the relationship to be extracted and a small amount of labeled data;
[0052] S4: Perform data preprocessing on the text to be extracted;
[0053] S5: Use the named entity recognition model to perform named entity recognition on the text to be extracted;
[0054] S6: Manually pair entities;
[0055] S7: Preliminary relationship extraction is performed on the matching result;
[0056] S8: Perform manual proofreading on the relationship extraction results;
[0057] S9: Fine-tune the few-sample relationship extraction model;
[0058] S10: Repeat S4 to S9 until all the texts to be extracted are p...
specific Embodiment
[0090] Specific examples: refer to figure 1 one image 3 , in this embodiment, the general named entity recognition datasets are MUC-6 and MUC-7 datasets, the general relation extraction dataset is FewRel, and the text segments in the text warehouse to be processed come from Wikipedia. The named entity recognition model is a sequence labeling model based on conditional random fields, and the relationship extraction model adopts the PrototypicalNetwork structure. Among them, the PCNN model is used to encode text sentences and entities. Its structure is as follows image 3 As shown, the GloVe word embedding vector is used as the pre-trained word vector to encode the words in the sentence.
[0091] In this embodiment, a fast-starting interactive relationship labeling and extraction framework provided by the present invention is applied, and its overall framework is as follows figure 2 As shown, it specifically includes the following steps:
[0092] Step 1) use the general nam...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com