Judgment document information extraction method
An information extraction and document technology, which is applied to metadata text retrieval, text database clustering/classification, unstructured text data retrieval, etc. and other problems to achieve high efficiency and improve the effect of the model
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0019] The specific embodiments of the present invention will be described in further detail below in conjunction with the accompanying drawings.
[0020] Such as Figure 1-2 As shown, the present invention provides an event-based method for extracting referee document information, the method comprising the following steps:
[0021] (1) Obtain the entire HTML of the referee document and parse the HTML of the referee document through the Python module BeautifulSoup, and extract the unformatted text from the HTML;
[0022] (2) Label the extracted unformatted text. In the labeling task of each event, a label is defined as an event type or an entity type. If a label has a relationship with other labels, the label is defined as an event type. , while other tags are defined as entity types, and the event structure in the referee document is defined as: event type-entity type-...-entity type, and the event type and its entity type corresponding to each event are marked from the unfo...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com