Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Data parallel job resource allocation method based on decision tree prediction

A resource allocation and decision tree technology, applied in resource allocation, electrical digital data processing, program control design, etc., can solve problems such as over-allocation of computing resources and inability to reduce job completion time, and achieve high prediction accuracy

Active Publication Date: 2019-10-11
NAT UNIV OF DEFENSE TECH
View PDF4 Cites 13 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

When the number of computing resources exceeds a threshold, the completion time of the job cannot be reduced, but increased
This leads to the problem of over-allocation of computing resources

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data parallel job resource allocation method based on decision tree prediction
  • Data parallel job resource allocation method based on decision tree prediction
  • Data parallel job resource allocation method based on decision tree prediction

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0057] Figure 1 to Figure 7 It shows an embodiment of the data parallel job resource allocation method based on decision tree prediction in the present invention. The whole allocation method includes three processes: the decision tree prediction model trains and predicts the initial resource of a single job; and then optimizes the initial resource allocation ; Finally, dynamically adjust resource allocation. Among them, after the user submits the job, the job characteristics will be extracted to predict its initial resource allocation estimate. The predicted results are output to the initial resource allocation algorithm to calculate the initial resource allocation for each job. After determining the initial resource allocation value for each job, the job will be submitted to the Spark cluster to start execution. In the process of job execution, the method of dynamic resource adjustment can change computing resources between iterations of iterative machine learning jobs, fu...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a data parallel job resource allocation method based on decision tree prediction. The method comprises the steps that 1, acquiring a completed job data set; 2, training a decision tree model by using the completed job data set; 3, estimating an initial resource allocation estimated value of each job; 4, using an initial resource allocation algorithm to perform resource initial allocation on the jobs with the obtained initial resource allocation estimated values to obtain an initial resource allocation value of each job; and 5, submitting the jobs of which the initial resource allocation values are obtained to a computer cluster to start execution. The job completion time difference under different computing resource allocations is predicted through the decision treemodel, the computing resource allocation enabling the job completion time to be minimum is found, and a better computing resource allocation estimated value is obtained. According to the method, an underlying network, disk read-write operation and the like are used as prediction features to reflect network transmission and disk input and output overhead of a distributed computing framework, so that higher prediction accuracy than modeling prediction is realized.

Description

technical field [0001] The invention belongs to the field of parallel and distributed computing, in particular to a data parallel operation resource allocation method based on decision tree prediction. Background technique [0002] Resource allocation or resource management is one of the main research issues in computer science, involving issues such as network systems, distributed systems, and cloud environments. Its purpose is to allocate (allocation) and assign (assignment) specific computing resources (such as the number of CPU cores), network resources, storage resources, etc. for the jobs submitted by users. Since resource allocation can optimize the job completion efficiency and resource utilization of data centers and cloud platforms, it is an important functional component of the distributed computing framework. Especially in distributed systems like Hadoop and Spark, a data parallel job often involves the concurrent execution of multiple tasks, which includes the ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F9/50
CPCG06F9/5005
Inventor 胡智尧李东升
Owner NAT UNIV OF DEFENSE TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products