MapReduce calculation process optimization method based on B-tree data structure
A data structure and process optimization technology, applied in the field of MapReduce computing, can solve the problems of long time consumption and low work efficiency, and achieve the effect of reducing disk write operations, repeated addressing processes, and disk read operations.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0021] A method for optimizing the calculation process of MapReduce based on the B-tree data structure, such as figure 1 shown, including the following steps:
[0022] 1) Execute the Map task on the data input into the area;
[0023] 2) The output result after executing the Map task includes index file *.index and data file *.out;
[0024] 3) Store the index file *.index and data file *.out in a ring memory buffer;
[0025] 4) When the ring memory buffer is about to overflow, judge whether it is the last Map task. The criterion for judging whether the ring memory buffer is about to overflow is: when the storage capacity of the ring memory buffer reaches 80%, it is judged as Almost overflowing.
[0026] 5) If not, the data file *.out will be sorted and merged and written to the disk. At the same time, the process of merging the data file *.out before being stored on the disk will continuously perform sorting and compression operations, and the index file *.index will remain ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com