Method, device, medium, equipment and system for predicting operation performance

A job performance prediction technology, applied in the fields of transmission systems, multi-program devices, program control design, etc. It addresses incomplete prediction models, degraded job performance, and the failure to consider the impact on data processing rate, with the effect of reducing the cost budget and guaranteeing performance.

Active Publication Date: 2020-12-22
EAST CHINA NORMAL UNIV
Cites: 3. Cited by: 0.

AI Technical Summary

Problems solved by technology

However, these models are often incomplete. Some establish only a simple general linear model of the input data volume and the number of cloud hosts, based on the computation and data transmission structure; some do not consider the impact of task parallelism on the data processing rate; and others do not consider the time spent on the intermediate data Shuffle.
In addition, existing performance prediction methods apply only to big data analysis jobs that run without instantaneous cloud host withdrawal. A model method for predicting job completion time is not yet...
Moreover, the large overhead of recomputation seriously degrades job performance. A suitable fault tolerance mechanism is therefore also needed to reduce the performance loss caused by the withdrawal of an instantaneous cloud host and to preserve job performance as far as possible when a host is withdrawn.


Examples

Embodiment

[0112] To verify the feasibility and accuracy of the present invention, the source code of open source Spark 2.0.1 was modified in a real environment, following the steps above, to implement the key RDD check backup mechanism. Event log analysis scripts, instantaneous cloud host parameter collection scripts, and a performance prediction model calculation program were also written. The modified Spark source code was compiled with Maven into a binary installation package so that it can be conveniently installed and deployed on instantaneous cloud hosts.
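The event log analysis scripts themselves are not reproduced in this excerpt. As a minimal sketch of what such a script might do, the following reads a Spark event log (one JSON object per line) and extracts the duration of each completed stage. The function name `stage_durations_ms` and the sample records are illustrative assumptions; the field names follow the JSON layout Spark writes for `SparkListenerStageCompleted` events and should be checked against the Spark version in use.

```python
import json


def stage_durations_ms(event_log_lines):
    """Return {stage_id: duration_ms} for every completed stage in the log.

    Hypothetical helper: not taken from the patent text.
    """
    durations = {}
    for line in event_log_lines:
        line = line.strip()
        if not line:
            continue
        event = json.loads(line)
        if event.get("Event") != "SparkListenerStageCompleted":
            continue
        info = event.get("Stage Info", {})
        submitted = info.get("Submission Time")
        completed = info.get("Completion Time")
        if submitted is None or completed is None:
            continue  # incomplete record, nothing to measure
        durations[info["Stage ID"]] = completed - submitted
    return durations


# Made-up sample records, trimmed to the fields the helper reads.
sample_log = [
    '{"Event": "SparkListenerJobStart", "Job ID": 0}',
    '{"Event": "SparkListenerStageCompleted", "Stage Info": '
    '{"Stage ID": 0, "Submission Time": 1000, "Completion Time": 4500}}',
    '{"Event": "SparkListenerStageCompleted", "Stage Info": '
    '{"Stage ID": 1, "Submission Time": 4500, "Completion Time": 6000}}',
]
print(stage_durations_ms(sample_log))
```

Per-stage durations of this kind are one plausible input to a completion time model; which parameters the invention actually collects is not stated in this excerpt.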

[0113] The prediction accuracy of the method is demonstrated by comparing the completion time it predicts for a big data analysis job under a given instantaneous cloud host configuration with the job's actual running time in a real environment. In addition, by comparing the use of the key RDD data checking fault-tolera...
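The excerpt does not say which accuracy metric was used. One common choice for comparing a predicted completion time with a measured one is the relative error, sketched below; the function name and the numbers in the usage line are assumptions for illustration only, not results from the patent.

```python
def relative_error(predicted_s, actual_s):
    """Relative prediction error |predicted - actual| / actual, times in seconds."""
    if actual_s <= 0:
        raise ValueError("actual running time must be positive")
    return abs(predicted_s - actual_s) / actual_s


# Illustrative values only.
print(f"{relative_error(110.0, 100.0):.1%}")
```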

Abstract

The invention discloses a job performance prediction method, comprising: acquiring job-related parameters of a big data job; collecting relevant feature parameters of the instantaneous cloud host; establishing a basic Spark job performance prediction model to obtain a Spark job completion time; and determining whether an instantaneous cloud host withdrawal event occurs. If no withdrawal occurs, the Spark job completion time under the resource configuration is predicted from the basic Spark job completion time and the resource configuration of the instantaneous cloud host. If a withdrawal occurs, the overhead is evaluated based on the critical RDD data check backup mechanism, and the Spark job completion time under the resource configuration is predicted from that overhead, the basic Spark job completion time, and the resource configuration of the instantaneous cloud host. The invention can predict job performance regardless of whether the instantaneous cloud host is withdrawn, and when a withdrawal event occurs it reduces the extra time overhead through the critical RDD data check backup mechanism, thereby helping the user reduce the cost budget. The invention also relates to a job performance prediction device, medium, equipment and system.
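The two-branch decision in the abstract can be sketched as follows. This is only an illustration of the control flow: the abstract does not give the actual prediction model or the overhead formula, so `HostConfig`, `base_completion_time`, its simple throughput formula, and the `recovery_overhead_s` parameter are all placeholder assumptions.

```python
from dataclasses import dataclass


@dataclass
class HostConfig:
    """Hypothetical resource configuration of the instantaneous cloud hosts."""
    num_hosts: int
    rate_per_host: float  # assumed processing rate, data units per second


def base_completion_time(input_size, config):
    # Placeholder for the basic performance model: total work divided by
    # aggregate processing rate. The patent's real model is not shown here.
    return input_size / (config.num_hosts * config.rate_per_host)


def predict_completion_time(input_size, config, withdrawal_occurred,
                            recovery_overhead_s=0.0):
    """Predict job completion time in seconds for the given configuration."""
    base = base_completion_time(input_size, config)
    if not withdrawal_occurred:
        # No withdrawal event: the base prediction is used directly.
        return base
    # Withdrawal event: add the overhead evaluated for recovering from the
    # backed-up RDD data (supplied by the caller in this sketch).
    return base + recovery_overhead_s


config = HostConfig(num_hosts=4, rate_per_host=25.0)
print(predict_completion_time(1000.0, config, withdrawal_occurred=False))
print(predict_completion_time(1000.0, config, withdrawal_occurred=True,
                              recovery_overhead_s=3.0))
```

Treating the overhead as an additive term is itself an assumption; the abstract only states that the prediction is based on the overhead, the base completion time, and the resource configuration.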

Description

Technical field

[0001] The invention belongs to the technical field of job performance prediction for big data processing platform applications, and in particular relates to a big data analysis job performance prediction method, device, medium, equipment and system for instantaneous cloud hosts.

Background technique

[0002] With the advent of the big data era, big data technology has been continuously developed and updated. Big data processing platforms such as Apache Spark, MapReduce and Dryad have become the main application platforms for big data analysis and processing. In distributed big data analysis, the DAG (Directed Acyclic Graph) is a very common computing structure. DAG-type computing decomposes a computing job internally into several subtasks and builds the computational dependencies between the subtasks into a DAG. A big data analysis ...
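To make the DAG structure concrete, the sketch below models a job as subtasks with dependencies and computes the earliest possible finish time, assuming unlimited parallelism so that only dependencies constrain the schedule. The task names, durations, and the function `critical_path_time` are made up for illustration and are not part of the patent text; `graphlib` requires Python 3.9 or later.

```python
from graphlib import TopologicalSorter


def critical_path_time(durations, depends_on):
    """Earliest finish time of a DAG job when only dependencies limit it.

    durations:  {task: running time}
    depends_on: {task: iterable of tasks that must finish first}
    """
    finish = {}
    # static_order() yields each task after all of its predecessors.
    for task in TopologicalSorter(depends_on).static_order():
        start = max((finish[dep] for dep in depends_on.get(task, ())),
                    default=0.0)
        finish[task] = start + durations[task]
    return max(finish.values(), default=0.0)


# Hypothetical four-subtask job: B and C both depend on A, D depends on both.
durations = {"A": 3.0, "B": 2.0, "C": 4.0, "D": 1.0}
depends_on = {"A": [], "B": ["A"], "C": ["A"], "D": ["B", "C"]}
print(critical_path_time(durations, depends_on))  # 8.0, via A -> C -> D
```

With limited hosts the real completion time would be longer than this bound, which is one reason task parallelism matters to a prediction model.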

Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06F11/34, G06F9/50, H04L29/08
CPC: G06F9/5027, G06F11/3447, H04L67/10
Inventors: 徐飞, 蒋欢
Owner: EAST CHINA NORMAL UNIV