Method and device for handling faults

A fault handling method and processing unit technology, applied in the field of network communication, can solve problems such as Ceph cluster data recovery failure and reduce Ceph cluster processing performance, and achieve the effect of avoiding data recovery failure and OSD performance bottleneck

Active Publication Date: 2018-02-23
NEW H3C TECH CO LTD
View PDF7 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] The present invention provides a fault processing method and device to solve the problem in the prior art that memory faults of storage nodes may reduce the processing performance of the Ceph cluster and even lead to the failure of Ceph cluster data recovery

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for handling faults
  • Method and device for handling faults
  • Method and device for handling faults

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0027] In order to enable those skilled in the art to better understand the technical solutions in the embodiments of the present invention, and to make the above-mentioned purposes, features and advantages of the embodiments of the present invention more obvious and understandable, the following describes the technical solutions in the embodiments of the present invention in conjunction with the accompanying drawings For further detailed explanation.

[0028] See figure 1 , is a schematic flowchart of a fault handling method provided by an embodiment of the present invention, wherein the fault handling method can be applied to a storage node of a Ceph cluster, such as figure 1 As shown, the troubleshooting method may include the following steps:

[0029] It should be noted that, in the embodiment of the present invention, unless otherwise specified, the memory capacity is in G, and the number of OSDs of the storage node is the OSD in the UP (working) state of the storage nod...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a method and a device for handling faults. The method includes acquiring information of available memory capacity of storage nodes and the quantities of object storage devices OSD; rejecting OSD addition operation on target storage nodes when ratios of the available memory capacity of the target storage nodes to the quantities of the OSD are smaller than first preset thresholds and setting partial OSD of the target storage nodes in first types of non-working Down states to allow the ratios of the available memory capacity of the target storage nodes to the quantities ofthe other OSD to be larger than or equal to the first preset thresholds. The other OSD refer to the available OSD, except for the partial OSD set in the first types of Down states, among the OSD of the target storage nodes. The method and the device have the advantage that risks of possible OSD performance bottleneck and data recovery failure after faults of memories on the storage nodes occur canbe prevented by the aid of the method and the device.

Description

technical field [0001] The present invention relates to the technical field of network communication, in particular to a fault handling method and device. Background technique [0002] Ceph (Distributed Storage System) is an open source project that provides a software-defined, unified storage solution with the advantages of massive scalability, high performance, and no single point of failure. [0003] A typical Ceph cluster deployment will create an OSD (Object Storage Device, object storage device) for each physical hard disk in the cluster node. [0004] The failure domains of a Ceph cluster usually include disks, nodes (i.e. servers), racks, power circuits, etc. When any component in the fault domain fails, the corresponding OSDs deployed on it will fail, and the Ceph cluster will mark these OSDs as Down (non-working) and perform initialization operations to reorganize the faulty nodes. affected data. [0005] However, it has been found in practice that in the existi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F11/14G06F11/30
CPCG06F11/1464G06F11/3034
Inventor 顾雷雷乔辉
Owner NEW H3C TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products