Rule and model combination-based legal instrument information extraction method and system
A technology of information extraction and rules, applied in the direction of instruments, electrical digital data processing, data processing applications, etc., can solve the problems of impossible enumeration of all rules, failure to obtain a large number of rules, difficult maintenance of rules, etc., to improve the extraction effect and transplant Strong performance and avoid cold start problem
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment
[0029] combined with figure 1 As shown, a legal document information extraction method based on the combination of rules and models,
[0030] First, collect legal industry terminology, business terminology, etc. to create a domain dictionary;
[0031] Secondly, sort out and extract entities according to business requirements, and then configure document entity extraction rules according to legal document writing rules;
[0032] Then, a rule-based method is used to extract legal document entities, and the accuracy and recall rate of extracted entities are used as indicators to evaluate the results, and the rules and dictionaries are modified and adjusted according to the evaluation results, and the extraction results are sent to the data as the initial labeling data The labeling module confirms and modifies, then trains the model, and releases the model; rule-based text paragraph classification processing, subject recognition processing, and rule-based element extraction proce...
Embodiment 2
[0037] combined with figure 2 , a legal document information extraction system based on the combination of rules and models, including:
[0038] The data acquisition module is used for the business data acquisition of the business application system and the legal document data acquisition. The collected data is used by the active learning text labeling module and the information extraction module. The data acquisition module collects data from three aspects. One is to use crawlers to obtain Internet public Data, the second is to obtain data from third parties, and the third is to obtain data from business systems.
[0039] The information extraction module mainly includes information extraction technology based on part-of-speech tagging rules and model-based information extraction technology, providing technical support for legal document extraction business. The processed result data is used by the active learning text labeling tool in the upper layer business application a...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com