Method for rapidly establishing full-text retrieval tool for common files
A full-text indexing and full-text technology, applied in semi-structured data retrieval, special data processing applications, semi-structured data mapping/conversion, etc., can solve problems such as difficulties in the completion process, database performance limitations, and database incompleteness, and achieve Easily manageable effects
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0027] The document parsing module is responsible for parsing files;
[0028] The Chinese word segmentation module is responsible for using the Chinese word segmentation algorithm to perform full-text word segmentation of the file content in order to establish a full-text index;
[0029] The full-text index building module is responsible for full-text indexing of the words after the word segmentation of the Chinese word segmentation module;
[0030] The full-text index library is responsible for data storage;
[0031] The retrieval module is responsible for various retrievals of users.
[0032] A method for quickly building a full-text search tool for commonly used documents, the specific steps are as follows
[0033] ①The document parsing module reads the word file and converts it into XML format after parsing, and parses each file into two attributes, which are the file name of the file and the full-text content of the file, where the file name includes the absolute path o...
Embodiment 2
[0040] The document parsing module is responsible for parsing files;
[0041] The Chinese word segmentation module is responsible for using the Chinese word segmentation algorithm to perform full-text word segmentation of the file content in order to establish a full-text index;
[0042] The full-text index building module is responsible for full-text indexing of the words after the word segmentation of the Chinese word segmentation module;
[0043] The full-text index library is responsible for data storage;
[0044] The retrieval module is responsible for various retrievals of users.
[0045] A method for quickly building a full-text search tool for commonly used documents, the specific steps are as follows
[0046] ①The document parsing module reads the PDF file and converts it into XML format after parsing, and parses each file into two attributes, which are the file name of the file and the full text content of the file, where the file name includes the absolute path of...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com