Method, device and system for monitoring working state of nodes in distributed cluster system

A distributed cluster, working state technology, applied in the field of distributed systems, can solve problems such as affecting the stability and performance of the cluster, unavailability, and difficulty in detecting suspended nodes

Inactive Publication Date: 2018-06-05
BEIJING HUAYUN WANGJI TECH
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

After the node dies, if it cannot be identified effectively and timely, it will seriously affect the stability and performance of the entire cluster, and will cause the upper-layer application to be temporarily unavailable
However, it is difficult to detect the false dead node. If the method is wrong, it will be misjudged

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method, device and system for monitoring working state of nodes in distributed cluster system
  • Method, device and system for monitoring working state of nodes in distributed cluster system
  • Method, device and system for monitoring working state of nodes in distributed cluster system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0024] Embodiments of the present invention are described in detail below, examples of which are shown in the drawings, wherein the same or similar reference numerals denote the same or similar elements or elements having the same or similar functions throughout. The embodiments described below by referring to the figures are exemplary only for explaining the present invention and should not be construed as limiting the present invention.

[0025] like figure 1 As shown, it is a monitoring method for the working state of nodes in a distributed cluster system according to the present invention, including:

[0026] Step 11, obtaining the number of times that each node in the distributed cluster system is judged as heartbeat detection timeout by other nodes within a predetermined period of time;

[0027] Step 12, selecting the node with the highest number of times from the various nodes;

[0028] Step 13, obtaining the network connection state of the selected node; this step sp...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The embodiment of the invention provides a monitoring method, device and system for the working state of nodes in a distributed cluster system. The monitoring method for the working state of the nodes in the distributed cluster system comprises the steps that the times number that each node in the distributed cluster system is judged as heartbeat detection timeout by the other nodes in a predetermined time is acquired; the node with the largest times number is selected from the nodes; the network connection state of the selected node is acquired; when the network connection state of the selected node is a smooth state, the selected node is judged as a suspended animation node; and when the network connection state of the selected node is a disconnected state, a judgment result that the selected node is a real death node is generated. According to the monitoring method, device and system, the suspended animation node can be effectively, reliably and quickly recognized in time, and therefore the cluster stability is improved.

Description

technical field [0001] The invention relates to the field of distributed systems, in particular to a method, device and system for monitoring the working status of nodes in a distributed cluster system. Background technique [0002] With the wide application of cloud computing in various fields and the increase of data volume, there are high demands on the scale, performance and reliability of distributed file systems. In large-scale clusters, small-probability events will become more frequent. Node suspended animation is one of the problems that needs to be solved. After a node dies, if it cannot be identified effectively and in a timely manner, it will seriously affect the stability and performance of the entire cluster, and will cause temporary unavailability of upper-layer applications. However, it is difficult to detect the false dead node, and if the method is wrong, it will be misjudged. Contents of the invention [0003] Embodiments of the present invention prov...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): H04L12/26
CPCH04L43/0811H04L43/10H04L43/50
Inventor 张俊峰游峰李纲彬金鑫鑫
Owner BEIJING HUAYUN WANGJI TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products