Key information identification method based on hierarchical attention and label-guided learning
A key information identification technology, applied in character and pattern recognition, instruments, unstructured text data retrieval, etc., with broad application prospects
Embodiment
[0068] Taking the 2,230 papers on the subject of "covid-19" on the biomedical literature website PubMed as an example, the key information identification method based on hierarchical attention and label-guided learning, as shown in Figure 1, includes the following steps:
[0069] Step 1: Literature data collection.
[0070] Use a crawler built on the Selenium web automation toolkit to collect the papers published on the PubMed platform, and save them to the computer in PDF format;
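The collection step could be sketched as follows. This is a minimal illustration and not the patent's crawler: the helper names `search_url` and `collect_article_links`, the `page` query parameter, and the CSS selector `a.docsum-title` are assumptions that should be checked against the live PubMed pages. The sketch only gathers article links; downloading the PDF files from the publisher sites is left out.

```python
from urllib.parse import urlencode

PUBMED_BASE = "https://pubmed.ncbi.nlm.nih.gov/"  # PubMed search front end


def search_url(term: str, page: int = 1) -> str:
    """Build a PubMed search-results URL for one page of results (assumed URL layout)."""
    return PUBMED_BASE + "?" + urlencode({"term": term, "page": page})


def collect_article_links(term: str, pages: int) -> list[str]:
    """Visit each results page in a browser and collect the article links it lists."""
    # Imported lazily so the URL helper above works without Selenium installed.
    from selenium import webdriver
    from selenium.webdriver.common.by import By

    driver = webdriver.Chrome()
    links: list[str] = []
    try:
        for page in range(1, pages + 1):
            driver.get(search_url(term, page))
            # Assumed selector for result titles; verify it against the live page.
            for anchor in driver.find_elements(By.CSS_SELECTOR, "a.docsum-title"):
                href = anchor.get_attribute("href")
                if href:
                    links.append(href)
    finally:
        driver.quit()  # always release the browser, even on errors
    return links
```

For example, `collect_article_links("covid-19", pages=3)` would open three result pages and return the article URLs found on them, which a separate download step could then resolve to PDF files.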
[0071] Step 2: Document deconstruction and storage.
[0072] This step includes the following sub-steps:
[0073] Step 2.1: First, use the fitz toolkit (the import name of PyMuPDF) to read the English documents page by page, and segment the content of each document at the paragraph level according to the vertical distance between paragraphs to obtain text blocks.
[0074] Then, merge blocks that were split abnormally by page breaks or by inserted tables / pictures, remove irrelevant information including headers and footers, record their coordinates ...
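The paragraph-level segmentation in Step 2.1 could be sketched as follows. This is an illustration under stated assumptions, not the patent's implementation: the `Line` type, the helpers `read_page_lines` and `split_by_gap`, and the `max_gap` threshold are hypothetical, and the sketch assumes a single-column layout. Merging across page breaks and header/footer removal are not covered.

```python
from typing import NamedTuple


class Line(NamedTuple):
    y0: float  # top edge of the line on the page
    y1: float  # bottom edge of the line on the page
    text: str


def read_page_lines(pdf_path: str) -> list[list[Line]]:
    """Read a PDF page by page and return the text lines of each page, top to bottom."""
    import fitz  # PyMuPDF; imported lazily so split_by_gap works without it

    pages = []
    with fitz.open(pdf_path) as doc:
        for page in doc:
            lines = []
            for block in page.get_text("dict")["blocks"]:
                if block["type"] != 0:  # 0 = text block; skip image blocks
                    continue
                for ln in block["lines"]:
                    _, y0, _, y1 = ln["bbox"]
                    text = "".join(span["text"] for span in ln["spans"])
                    lines.append(Line(y0, y1, text))
            pages.append(sorted(lines))  # sort by top edge
    return pages


def split_by_gap(lines: list[Line], max_gap: float) -> list[str]:
    """Start a new text block whenever the vertical gap to the previous line exceeds max_gap."""
    blocks: list[str] = []
    current: list[str] = []
    prev_bottom = None
    for line in lines:
        if prev_bottom is not None and line.y0 - prev_bottom > max_gap:
            blocks.append(" ".join(current))
            current = []
        current.append(line.text.strip())
        prev_bottom = line.y1
    if current:
        blocks.append(" ".join(current))
    return blocks
```

With lines at vertical ranges 0 to 10, 12 to 22, and 40 to 50 and `max_gap=5.0`, the first two lines fall into one block and the third starts a new one, because only the second gap (18 units) exceeds the threshold. A suitable threshold depends on the font size and line spacing of the documents.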