System and method for saving and restoring process checkpoint in multi-task environment

A checkpoint and recovery technology, applicable to multiprogramming arrangements, responding to the occurrence of errors, and error detection by redundancy in operation. It addresses problems such as increased inter-process communication and complex recovery line calculation, with the aim of reducing checkpoint overhead, reducing recovery overhead, and improving efficiency.

Active Publication Date: 2015-04-15
XIDIAN UNIV

AI Technical Summary

Problems solved by technology

The recovery line is complex to calculate, because it must be derived from inter-process dependencies using graph theory.
Other communication-based checkpoint recording protocols require additional information to be attached to the messages sent by each process, which in turn increases the amount of inter-process communication.
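
To illustrate why recovery line calculation is costly when processes checkpoint independently, here is a minimal sketch of rollback propagation over message dependencies. It is a generic textbook-style illustration, not the method of the invention. The names `Message` and `recovery_line`, and the interval numbering (interval k is the execution between checkpoint k and checkpoint k+1, with checkpoint 0 as the initial state), are assumptions made for this example.

```python
from typing import Dict, List, NamedTuple


class Message(NamedTuple):
    """Hypothetical record of one message between two processes."""
    sender: str
    send_interval: int   # sender's checkpoint interval when the message was sent
    receiver: str
    recv_interval: int   # receiver's checkpoint interval when it was received


def recovery_line(latest: Dict[str, int], messages: List[Message]) -> Dict[str, int]:
    """Return, per process, the checkpoint index to restore.

    Restoring a process to checkpoint c keeps intervals 0..c-1 and undoes
    every interval >= c. A message is an orphan when its receive is kept
    but its send is undone; the receiver must then roll back further.
    """
    line = dict(latest)  # start from every process's latest checkpoint
    changed = True
    while changed:
        changed = False
        for m in messages:
            send_undone = m.send_interval >= line[m.sender]
            recv_kept = m.recv_interval < line[m.receiver]
            if send_undone and recv_kept:
                # Roll the receiver back far enough to undo the receive.
                line[m.receiver] = m.recv_interval
                changed = True
    return line
```

With two processes that each hold checkpoints 0 to 2 and exchange messages between every pair of checkpoints, the loop can cascade all the way back to checkpoint 0 for both, which is the domino effect the abstract refers to.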

Embodiment Construction

[0081] The present invention will be described in further detail below in conjunction with the accompanying drawings and embodiments.

[0082] As shown in Figure 1, the system of the present invention for saving and restoring process checkpoints in a multi-task environment comprises the following components: task processes, a coordinator process, a communication monitoring module, a blocking domain module, a checkpoint module, and a recovery line module.

[0083] Task process: A large task usually requires multiple processes executing in parallel. Here, a task process is each individually executing process of the task; these processes also need to communicate with one another while they run.

[0084] Coordinator process: This process mainly analyzes the running status of the entire task and the probability of failure in order to decide whether a checkpoint should be taken at a given moment, and at the same time manage...
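
The text available here does not say how the coordinator weighs failure probability against checkpoint cost. As a hedged illustration only, the sketch below substitutes a well-known first-order rule, Young's approximation, in which the checkpoint interval is the square root of twice the checkpoint cost times the mean time between failures. The function names and parameters are hypothetical and are not taken from the patent.

```python
import math


def young_interval(checkpoint_cost_s: float, mtbf_s: float) -> float:
    """Young's first-order approximation of a good checkpoint interval.

    checkpoint_cost_s: assumed time in seconds to write one checkpoint.
    mtbf_s: assumed mean time between failures in seconds.
    """
    return math.sqrt(2.0 * checkpoint_cost_s * mtbf_s)


def should_checkpoint(seconds_since_last: float,
                      checkpoint_cost_s: float,
                      mtbf_s: float) -> bool:
    """Hypothetical coordinator decision: checkpoint once the interval has elapsed."""
    return seconds_since_last >= young_interval(checkpoint_cost_s, mtbf_s)
```

For example, with a 2-second checkpoint cost and a one-hour mean time between failures, the rule suggests checkpointing roughly every 120 seconds. A shorter mean time between failures shortens the interval, which matches the idea of checkpointing more often when failure is more likely.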

Abstract

The invention discloses a system and a method for saving and restoring process checkpoints in a multi-task environment. The system comprises a task process unit, a coordinator process module, a communication monitoring module, a blocking area module, a checkpoint module, and a recovery line module. The coordinator process module is connected to the communication monitoring module, and the communication monitoring module is connected to the blocking area module to form a cycle. The coordinator process module is connected to the task process unit either through the checkpoint module and the recovery line module, respectively, or directly. The task process unit is connected to the communication monitoring module. Together, these components save and restore process checkpoints in the multi-task environment. The method includes two steps: A, forming the checkpoint; B, restoring from the checkpoint on failure. The invention solves problems such as the domino effect and livelock, provides a local blocking consistency protocol that optimizes the traditional blocking protocol, improves checkpointing efficiency, reduces overhead, and keeps process states consistent.
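
The two method steps can be pictured with a minimal single-threaded sketch of a traditional blocking coordinated checkpoint. This is the baseline the abstract says the invention optimizes; it is not the local blocking protocol itself, whose details are not given here. The classes `Task` and `Coordinator`, the `counter` state, and synchronous message delivery are all assumptions made for this example.

```python
import copy
from typing import Dict, List, Optional


class Task:
    """Hypothetical task process holding a small piece of state."""

    def __init__(self, name: str) -> None:
        self.name = name
        self.state: Dict[str, int] = {"counter": 0}
        self.blocked = False

    def send(self, other: "Task", amount: int) -> bool:
        """Deliver a message synchronously; refused while either side is blocked."""
        if self.blocked or other.blocked:
            return False
        self.state["counter"] -= amount
        other.state["counter"] += amount
        return True


class Coordinator:
    """Hypothetical coordinator using global blocking around each checkpoint."""

    def __init__(self, tasks: List[Task]) -> None:
        self.tasks = tasks
        self.saved: Optional[Dict[str, Dict[str, int]]] = None

    def checkpoint(self) -> None:
        # Step A: block communication, save every task's state, then unblock.
        for t in self.tasks:
            t.blocked = True
        self.saved = {t.name: copy.deepcopy(t.state) for t in self.tasks}
        for t in self.tasks:
            t.blocked = False

    def restore(self) -> None:
        # Step B: on failure, roll every task back to the saved checkpoint.
        if self.saved is None:
            raise RuntimeError("no checkpoint has been taken")
        for t in self.tasks:
            t.state = copy.deepcopy(self.saved[t.name])
```

Because all tasks are blocked while their states are saved, no message can cross the checkpoint, so the saved states are mutually consistent and a restore needs no recovery line search. The cost is that every task stalls during the checkpoint, which is the overhead a more local blocking scheme would aim to reduce.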

Description

Technical field

[0001] The invention belongs to the field of computer science and technology and relates to process-level checkpoint fault-tolerance technology, specifically a system and method for saving and restoring coordinated checkpoints of task processes in a multi-task communication environment. Compared with traditional methods, performance is improved.

Background technique

[0002] Computer performance has improved greatly, but some complex computational problems still need to run for several days or even longer. The longer a task takes to execute, the greater the chance that it fails during execution. Especially in distributed systems and cluster systems, the failure of one subtask often has a large impact on the normal operation of the entire task, and the resulting cost is high. Applying pure software-level fault-tolerant technology, without modifying the operating system, it is a common fault-tolerant method...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F9/46, G06F11/14
Inventor: 马建峰, 王时兴, 李金库, 马卓, 姚青松
Owner: XIDIAN UNIV