Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Automatic parameter tuning method for iterative MapReduce operation

A job and parameter technology, applied in the field of automatic tuning of Hadoop operating parameters, can solve the problems of difficult parameter tuning, resource time waste, and time-consuming, etc., to achieve high efficiency, reduce waste, and improve operating efficiency

Active Publication Date: 2017-01-11
SOUTH CHINA NORMAL UNIVERSITY +1
View PDF4 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, a large number of related research work and practical applications have proved that the MapReduce Job running with the default parameter configuration is often not in the optimal performance state. Performance
However, for Hadoop operation parameter tuning work, it requires relevant tuning personnel to have a deep understanding of the entire Hadoop platform, be familiar with the function of the operating parameters provided by the Hadoop platform, and have certain practical tuning experience. Therefore, for For most people, this parameter tuning work is not only very difficult but also requires a lot of time and effort
At present, in order to solve the problem of Hadoop manual experience tuning is difficult, experts have proposed a variety of solutions, for example, the automatic parameter tuning scheme of the Hadoop simulator based on discrete events, the automatic parameter tuning based on machine training Strategy, etc.
However, these solutions are more suitable for a Hadoop cluster environment that runs a wide variety of applications, and for a Hadoop cluster that only runs a single application, using these existing automatic tuning solutions is obviously not worth the candle , because a large amount of pre-machine training or data collection work will cause a huge waste of resources and time

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Automatic parameter tuning method for iterative MapReduce operation
  • Automatic parameter tuning method for iterative MapReduce operation
  • Automatic parameter tuning method for iterative MapReduce operation

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0039] Such as figure 1 As shown in , an automatic parameter tuning method for iterative MapReduce jobs, the method includes:

[0040] A. Determine whether the iteration end condition is satisfied, if so, the iterative calculation task ends, otherwise, execute step B;

[0041] B. Through the parameter search algorithm, and then according to the given MapReduce parameter search space, search for a new MapReduce parameter configuration;

[0042] C. Use the searched new MapReduce parameter configuration to realize the operation of the current MapReduce job. After the operation is completed, judge the operation effect of the current MapReduce job, and perform corresponding data adjustment processing according to the judgment result;

[0043] D. Go back to step A.

[0044] Further as a preferred embodiment, the step B includes:

[0045] B1. Determine whether there is a historically optimal MapReduce parameter configuration, if so, execute step B2, otherwise, execute step B4;

...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an automatic parameter tuning method for an iterative MapReduce operation. The method comprises the following steps of A) judging whether to meet an iterative ending condition or not, if so, ending an iterative calculation task, otherwise, executing step B; B) searching to obtain new MapReduce parameter configuration according to a given MapReduce parameter search space by utilization of a parameter searching algorithm; C) realizing the running of the current MapReduce operation by utilization of the searched new MapReduce parameter configuration, judging the running effect of the current MapReduce operation after the operation is finished, and performing corresponding data adjustment treatment according to a judging result; D) returning to execute the step A. By utilization of the method disclosed by the invention, the running efficiency of each iterative MapReduce operation can be improved, the convenience is brought to a user, and the waste of time resources is greatly reduced. The method disclosed by the invention can be widely applied to a Hadoop running parameter tuning field.

Description

technical field [0001] The invention relates to computer information processing technology, in particular to an automatic tuning method for Hadoop operating parameters for iterative MapReduce jobs. Background technique [0002] technical term explanation [0003] Parameter configuration: This refers to a set of parameter values ​​that are composed of MapReduce parameters provided by Hadoop and can be configured by MapReduce jobs to form a set of parameter configurations; in two parameter configurations, if and only if each The parameter values ​​of a MapReduce parameter are the same, the two parameter configurations are the same, otherwise they are different parameter configurations. [0004] Parameter search space: refers to the set of all parameter configurations that can be used by the parameter search algorithm, called the parameter search space. [0005] Neighbor parameter configuration: refers to the new parameter configuration obtained by changing only one of the Ma...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F9/50
CPCG06F9/5077G06F9/5083
Inventor 赵淦森高晓杰唐华张海明王欣明聂瑞华汤庸朱佳廖智锐陈乐华涂继来
Owner SOUTH CHINA NORMAL UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products