Data block balancing method during operation of HDFS (Hadoop Distributed File System)
A data block balancing technology, applied in the computer field, that addresses the uneven distribution of HDFS data blocks and the low data locality of map tasks, and achieves the effect of improving task balance, locality, and execution balance.
Embodiment Construction
[0044] The present invention will be described in detail below in conjunction with the accompanying drawings.
[0045] The specific implementation steps of the HDFS data block balancing strategy based on runtime data block movement are as follows:
[0046] The first step is preprocessing of each node's local task list. The local task list of each node is divided into a fully local task part and a non-fully local task part. Taken together, the fully local task parts of all nodes cover the entire input dataset, and no task appears in more than one of them. Ideally, if each node is assigned all of its fully local tasks at the same time, the distribution of HDFS data blocks matches the scheduler's allocation to each node, that is, the placement of HDFS data blocks is balanced. At this time, the conflicting task allocation can be determined by predicting the future task requests of the node, and the possible non-local task allocation can be judged...
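The partition described in the first step can be sketched as follows. This is a minimal illustration in Python, not the procedure claimed in the patent: the excerpt does not say how a task is chosen for one node's fully local part, so the greedy rule used here (give each task to the replica-holding node that currently has the fewest fully local tasks) is an assumption, as are the function name `partition_local_tasks` and the `replicas` input format.

```python
from collections import defaultdict


def partition_local_tasks(replicas):
    """Split each node's local task list into two parts.

    replicas: dict mapping a task id to the list of nodes that hold a
              replica of that task's input block (hypothetical input format).

    Returns (fully_local, non_fully_local), each a dict node -> list of tasks.
    Every task lands in exactly one node's fully local part; it is listed in
    the non-fully local part of every other node that also holds a replica.
    """
    fully_local = defaultdict(list)
    non_fully_local = defaultdict(list)

    # Assumed heuristic: place tasks with the fewest replica choices first.
    for task in sorted(replicas, key=lambda t: (len(replicas[t]), t)):
        nodes = replicas[task]
        if not nodes:
            raise ValueError(f"task {task!r} has no replica location")

        # Assumed heuristic: pick the least-loaded replica holder,
        # breaking ties by node name so the result is deterministic.
        owner = min(nodes, key=lambda n: (len(fully_local[n]), n))
        fully_local[owner].append(task)

        for node in nodes:
            if node != owner:
                non_fully_local[node].append(task)

    return dict(fully_local), dict(non_fully_local)


if __name__ == "__main__":
    example = {
        "t1": ["A", "B"],
        "t2": ["A", "B"],
        "t3": ["A", "C"],
        "t4": ["B", "C"],
    }
    fully, partial = partition_local_tasks(example)
    print("fully local:", fully)
    print("non-fully local:", partial)
```

The property that matters for the text above is that the fully local parts are disjoint and together cover every task once; any assignment rule with that property would fit the description equally well.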