Content extracting method and device
A content extraction method and device, applied in the field of communication, that addresses problems such as the large number of templates needed for extraction, and achieves strong adaptability together with fast and accurate extraction of content data.
Examples
Embodiment 1
[0030] Embodiment 1 is a content extraction method based on semantic analysis and rules. Figure 1 shows the processing flow of the method, which includes:
[0031] S01, perform semantic analysis on the sample data, and construct content extraction rules according to the semantic analysis results and target content;
[0032] S02, establish a rule base from the content extraction rules constructed from multiple sample data;
[0033] S03, perform semantic analysis on the data to be extracted, and match the corresponding content extraction rule in the rule base according to the semantic analysis results. If the matching succeeds, use that content extraction rule to extract the content. If the matching fails, record the semantic analysis results, establish a new content extraction rule, and add the newly established rule to the rule base.
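Steps S01 to S03 can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: the source does not give the details of the semantic analysis, so `analyze` is a hypothetical stand-in that reduces a text to the tuple of its "label: value" field names, and a "rule" is assumed to be a regular expression for one target field. All function names are invented for this example.

```python
import re

def analyze(text):
    """Hypothetical stand-in for semantic analysis: the ordered field labels."""
    return tuple(m.group(1).strip().lower()
                 for m in re.finditer(r"^([^:\n]+):", text, re.M))

def build_rule(sample, target):
    """S01: build one rule (signature -> pattern) from a sample and its target field."""
    pattern = re.compile(rf"^{re.escape(target)}:[ \t]*(.+)$", re.M | re.I)
    return analyze(sample), pattern

def build_rule_base(samples):
    """S02: collect the rules built from several (sample, target) pairs."""
    return dict(build_rule(sample, target) for sample, target in samples)

def extract(text, rule_base, unmatched):
    """S03: match a rule by analysis result; record the result when none matches."""
    signature = analyze(text)
    pattern = rule_base.get(signature)
    if pattern is None:
        unmatched.append(signature)  # kept so that a new rule can be built later
        return None
    found = pattern.search(text)
    return found.group(1).strip() if found else None

rule_base = build_rule_base([("Title: A\nAuthor: B", "Title")])
unmatched = []
print(extract("Title: Hello\nAuthor: Bob", rule_base, unmatched))
print(extract("Subject: Hi\nFrom: x", rule_base, unmatched))

# Establishing the new rule needs a target; here it is assumed to be supplied by hand.
rule_base.update([build_rule("Subject: Hi\nFrom: x", "Subject")])
print(extract("Subject: Later\nFrom: y", rule_base, unmatched))
```

Keying the rule base by the analysis result keeps matching a single dictionary lookup, which is one plausible reading of "match the corresponding content extraction rule according to the semantic analysis results".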
[0034] In this embodiment, the semantic analysis specifically includ...
Embodiment 2
[0047] Embodiment 2 builds on Embodiment 1 by combining it with the traditional template-based content extraction method. It includes the steps:
[0048] S00, perform template matching on the data to be extracted; if the matching succeeds, use the template to extract the content, and if the matching fails, perform steps S01 to S03;
[0049] S01, perform semantic analysis on the sample data, and construct content extraction rules according to the semantic analysis results and target content;
[0050] S02, establish a rule base from the content extraction rules constructed from multiple sample data;
[0051] S03, perform semantic analysis on the data to be extracted, match the corresponding content extraction rules in the rule base according to the semantic analysis results, if the matching is successful, use the content extraction rules to extract content, if the matching fails, record the semantic analysis results, and esta...
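The template-first control flow of step S00 can be sketched as below. This is an assumption-laden illustration: templates are modeled as compiled regular expressions with a named group `content`, and `rule_based_extract` is a hypothetical callable standing in for steps S01 to S03. Neither representation comes from the source.

```python
import re

# Hypothetical template set: each template captures the target in a group named "content".
TEMPLATES = [
    re.compile(r"<title>(?P<content>.*?)</title>", re.S),
]

def extract_template_first(text, templates, rule_based_extract):
    """S00: try each template; fall back to rule-based extraction if none matches."""
    for template in templates:
        found = template.search(text)
        if found:
            return "template", found.group("content").strip()
    # No template matched, so hand over to the S01-S03 path.
    return "rules", rule_based_extract(text)

def demo_fallback(text):
    """Placeholder for the rule-based path; it just returns the first line."""
    return text.splitlines()[0] if text else None

print(extract_template_first("<title> News </title>", TEMPLATES, demo_fallback))
print(extract_template_first("Headline\nBody text", TEMPLATES, demo_fallback))
```

Returning the path taken alongside the result makes it easy to see how often the templates suffice and how often the rule base is needed.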
Embodiment 3
[0054] Based on the method described in Embodiment 1, the present invention also proposes a content extraction device, including:
[0055] The rule building module is configured to perform semantic analysis on the sample data, and construct content extraction rules according to the semantic analysis results and target content;
[0056] The rule base module is configured to establish a rule base from the content extraction rules constructed from multiple sample data;
[0057] The content extraction module is configured to perform semantic analysis on the data to be extracted, and match the corresponding content extraction rule in the rule base according to the semantic analysis result. If the match succeeds, it uses that content extraction rule to extract the content. If the match fails, it records the semantic analysis result, establishes a new content extraction rule, and adds the newly established rule to the rule base.
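One way to picture the three modules is as three small classes, sketched here under stated assumptions. The class and method names are hypothetical, `analyze` is a toy substitute for the semantic analysis (it only lists "label: value" field names), and the optional `target` argument is an assumption about where the target content for a new rule comes from.

```python
import re

def analyze(text):
    """Toy substitute for semantic analysis: ordered, lower-cased field labels."""
    return tuple(line.split(":", 1)[0].strip().lower()
                 for line in text.splitlines() if ":" in line)

class RuleBuildingModule:
    def build(self, sample, target):
        """Build a (signature, pattern) rule from a sample and its target field."""
        pattern = re.compile(rf"^{re.escape(target)}:[ \t]*(.+)$", re.M | re.I)
        return analyze(sample), pattern

class RuleBaseModule:
    def __init__(self, builder, samples):
        self.rules = dict(builder.build(s, t) for s, t in samples)

    def match(self, signature):
        return self.rules.get(signature)

    def update(self, signature, pattern):
        self.rules[signature] = pattern

class ContentExtractionModule:
    def __init__(self, builder, rule_base):
        self.builder = builder
        self.rule_base = rule_base
        self.failures = []  # recorded analysis results of unmatched inputs

    def extract(self, text, target=None):
        signature = analyze(text)
        pattern = self.rule_base.match(signature)
        if pattern is None:
            self.failures.append(signature)
            if target is None:
                return None
            # Assumed: the caller names the target, so a new rule can be built now.
            signature, pattern = self.builder.build(text, target)
            self.rule_base.update(signature, pattern)
        found = pattern.search(text)
        return found.group(1).strip() if found else None

builder = RuleBuildingModule()
rule_base = RuleBaseModule(builder, [("Title: A\nDate: B", "Date")])
device = ContentExtractionModule(builder, rule_base)
print(device.extract("Title: X\nDate: 2024-01-01"))
print(device.extract("From: a\nBody: hi", target="Body"))
```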