Data processing and distribution method and system based on a Hadoop system
A data processing and distribution technology, applied in the field of big data processing
Embodiment Construction
[0015] As shown in Figure 1, this Hadoop-based data processing and distribution method includes the following steps:
[0016] (1) The massive data is numbered sequentially by multiple tasks, so that each data item receives a unique number;
[0017] (2) The massive data is transmitted concurrently by multiple tasks: several tasks are started, and each one transmits a part of the numbered data.
[0018] The present invention numbers the massive data sequentially and then transmits it concurrently with multiple tasks, so that even when the data scale is extremely large, task execution is not limited by system memory or bandwidth.
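Step (2) can be illustrated with a minimal Python sketch. It is not the patented implementation: threads stand in for Hadoop tasks, and the names `split_by_number`, `transmit_chunk`, `transmit_concurrently`, and the `send` callback are hypothetical, introduced only for illustration. The sketch assumes the data has already been numbered and that each task receives one contiguous range of numbers.

```python
from concurrent.futures import ThreadPoolExecutor


def split_by_number(numbered, num_tasks):
    """Split (number, record) pairs into contiguous chunks, one per task."""
    # Ceiling division; at least 1 so that an empty input does not give a zero step.
    chunk = max(1, -(-len(numbered) // num_tasks))
    return [numbered[i:i + chunk] for i in range(0, len(numbered), chunk)]


def transmit_chunk(chunk, send):
    """One task: transmit its own part of the numbered data."""
    for number, record in chunk:
        send(number, record)  # `send` is a hypothetical transport callback
    return len(chunk)


def transmit_concurrently(numbered, num_tasks, send):
    """Start several tasks, each transmitting a part of the numbered data."""
    chunks = split_by_number(numbered, num_tasks)
    with ThreadPoolExecutor(max_workers=num_tasks) as pool:
        counts = pool.map(lambda c: transmit_chunk(c, send), chunks)
        return sum(counts)
```

Because every record carries a unique number, the receiver can restore the original order regardless of which task delivers its part first.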
[0019] In addition, step (1) includes the following sub-steps:
[0020] (1.1) Start multiple tasks, each of which processes a part of the data, numbers its own part, and records the maximum number it assigned;
[0021] (1.2) Based on this partial numbering, scan the numbered data of each task, add the maximum value of the previous task to each number, output the data, and...
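Sub-steps (1.1) and (1.2) amount to a two-pass numbering scheme, sketched below in Python. This is an illustrative reading rather than the patent's own code: it assumes that "the maximum value of the previous task" means the running total of the maxima of all earlier tasks (an exclusive prefix sum), and the names `number_locally` and `assign_global_numbers` are hypothetical. The passes are run in a plain loop here; in a Hadoop job each partition would be handled by its own task.

```python
from itertools import accumulate


def number_locally(partition):
    """Pass 1 (sub-step 1.1): number one task's records from 1, record the maximum."""
    numbered = [(i, record) for i, record in enumerate(partition, start=1)]
    return numbered, len(numbered)


def assign_global_numbers(partitions):
    """Pass 2 (sub-step 1.2): shift each task's local numbers by an offset."""
    local_results = [number_locally(p) for p in partitions]
    maxima = [maximum for _, maximum in local_results]
    # Assumption: the offset of a task is the sum of the maxima of all earlier tasks.
    offsets = [0] + list(accumulate(maxima))[:-1]
    return [
        [(local + offset, record) for local, record in numbered]
        for (numbered, _), offset in zip(local_results, offsets)
    ]
```

For example, three partitions holding 2, 1, and 3 records get offsets 0, 2, and 3, so the six records are numbered 1 through 6 with no duplicates and no gaps.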