Spark SQL-based distributed full text retrieval system and method
A retrieval system and distributed technology, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve problems such as full-text retrieval that does not support massive data, and achieve the effect of small index storage
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0062] The present invention will be described in more detail below in conjunction with specific embodiments and accompanying drawings.
[0063] Such as figure 1 As shown, the present invention designs and implements a relational data-oriented distributed full-text retrieval system based on Spark SQL, and the system includes four parts: SQL translation layer, data source management layer, parallel computing layer, and distributed storage layer. In the SQL translation layer, the grammar of full-text retrieval based on SQL and the translation process of SQL statements in the SQL translation layer are proposed; in the data source management module, a parallel method for the full-text retrieval process is designed; the retrieval optimization module In the index building phase, two storage models and corresponding original table data restoration strategies are designed, namely, the full storage model and the index-specified column storage model, and a storage model for the original...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com