Method and device for processing bulk data
A technology of batch data and processing methods, applied in the direction of multi-programming devices, etc., can solve problems such as dynamic adjustment of processing order, and achieve the effect of reducing waiting time and improving utilization rate
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0033] figure 2 It is a flowchart of a batch data processing method according to an embodiment of the present invention. Such as figure 2 As shown, this embodiment includes:
[0034] Step S102: read the data to be processed into the cache, and obtain the occupied space of the data to be processed;
[0035] Step S104: Calculate the estimated weight of the data to be processed according to the preset unit weight and occupied space;
[0036] Step S106: Insert the data to be processed into the data sequence to be processed according to the estimated weight;
[0037] Step S108: put the data to be processed in the data sequence to be processed into the thread for processing.
[0038] In this embodiment, before step S102, it also includes: reading the batch data from the batch data source into the cache, where the batch data source can be stored in the form of a file separated by a certain format, or stored in the form of a database table. The weight value can be determined in...
Embodiment 2
[0043] In this embodiment, on the basis of the first embodiment, the unit weight is dynamically adjusted to more accurately reflect the size of the data to be processed, reduce the waiting time of the data to be processed, and improve the processing efficiency. This embodiment is applied to firstly setting the initial value of the unit weight, and then gradually approaching a reasonable unit weight according to the actual time for processing the data to be processed, that is, the execution weight. In this embodiment, after step S108 in the first embodiment, it also includes:
[0044] Step S202: Obtain the execution weight of the data to be processed;
[0045] Step S204: Correct the unit weight according to the execution weight of multiple data to be processed;
[0046] Step S206: Calculate the estimated weight of subsequent data to be processed according to the corrected unit weight.
[0047] In this embodiment, the execution weight is the actual time for processing the data t...
Embodiment 3
[0087] This embodiment will describe other contents in the batch data processing method on the basis of the second embodiment: including submission of processed data, rollback after processing errors, dynamic adjustment of cache, and dynamic adjustment of the number of execution threads.
[0088] 1. Submission of processed data
[0089] The number of preset thread submissions; when the number of processed data in a thread reaches the preset number of submissions, the processed data will be written to the database or file system.
[0090] Specifically, the CommitCount is set, the processed data is stored in the memory, and it is written to the database or file system when the commit amount is reached or the entire processing is completed, so as to avoid frequent I / O operations. The CommitCount should not be set too large, one is to occupy a large amount of memory, and the other is to take too long to roll back. Every time a submission is made, a statistical thread is triggered...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com