A Link Fault Detection Method in HPC Indirect Network Environment

A link fault detection technology for network environments, applied in the fields of data switching networks, digital transmission systems, and electrical components. It addresses the very large time overhead of existing link fault detection and reduces both the execution overhead and the total time spent on delay measurement.

Active Publication Date: 2021-01-08
BEIHANG UNIV +1

AI Technical Summary

Problems solved by technology

However, in HPC interconnection networks, communication efficiency considerations often mean that sufficient communication protocol support is not available, so these methods often cannot be used directly to detect link failures in large-scale high-performance computer interconnection networks. Moreover, because HPC interconnection networks are often very large, the time overhead of existing link fault detection methods is very high at this scale.



Examples


Embodiment Construction

[0032] The present invention will be described in further detail below in conjunction with the accompanying drawings.

[0033] Figure 1 shows a flow chart of the link fault detection method in the HPC indirect network environment of the present invention.

[0034] A link fault detection method in an HPC indirect network environment, comprising the following steps:

[0035] (a) Query the routing information of the HPC interconnection network to obtain the link composition of the communication paths between nodes. The HPC indirect network contains n nodes, forming the node set N, and m links, forming the link set M, so there are n(n-1)/2 communication paths between the n nodes. For any communication path $L_i$, the set of links $M_i$ that compose it can be obtained through the route query interface provided by the network, and we have ...

[0036] (b) Based on the link composition of each communication path, determine the set of key communication paths whose delay needs to be measured; convert ...
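Steps (a) and (b) can be illustrated with a minimal Python sketch. Everything concrete in it is an assumption made for illustration: the four-node, two-switch toy topology, the link names, and the function `toy_query_route`, which stands in for the network's route query interface. The selection rule is also an assumption, since the text of step (b) is truncated here: the sketch keeps a path only if its row of the path-link incidence matrix is linearly independent of the rows already kept, which is one plausible way to obtain a small path set from which every link delay can still be solved.

```python
from fractions import Fraction
from itertools import combinations

# Hypothetical toy indirect network: four nodes attached to two switches.
UPLINK = {"n0": "s0", "n1": "s0", "n2": "s1", "n3": "s1"}


def toy_query_route(src, dst):
    """Hypothetical stand-in for the network's route query interface."""
    links = [f"{src}-{UPLINK[src]}", f"{dst}-{UPLINK[dst]}"]
    if UPLINK[src] != UPLINK[dst]:
        links.append("s0-s1")  # inter-switch link
    return links


def build_path_links(nodes, query_route):
    """Step (a): map each of the n(n-1)/2 node pairs to its link set M_i."""
    return {pair: frozenset(query_route(*pair)) for pair in combinations(nodes, 2)}


def select_key_paths(path_links, links):
    """Step (b), assumed rule: keep paths with linearly independent incidence rows."""
    basis = []      # (pivot column, reduced row) for each kept path
    key_paths = []
    for path, on_path in path_links.items():
        row = [Fraction(int(link in on_path)) for link in links]
        # Eliminate the pivot column of every row kept so far.
        for pivot, basis_row in basis:
            if row[pivot] != 0:
                factor = row[pivot]
                row = [x - factor * y for x, y in zip(row, basis_row)]
        pivot = next((i for i, v in enumerate(row) if v != 0), None)
        if pivot is None:
            continue  # this path adds no new information about link delays
        scale = row[pivot]
        basis.append((pivot, [v / scale for v in row]))
        key_paths.append(path)
        if len(basis) == len(links):
            break  # every link delay is now determined
    return key_paths


nodes = sorted(UPLINK)
path_links = build_path_links(nodes, toy_query_route)
links = sorted(set().union(*path_links.values()))
key_paths = select_key_paths(path_links, links)
print(len(path_links), "paths,", len(links), "links,", len(key_paths), "key paths")
```

In this toy network one of the six paths is redundant, so only five need to be measured. Exact `Fraction` arithmetic keeps the independence test free of floating-point rounding.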


Abstract

The invention provides a link fault detection method for an HPC indirect network environment. The method is based on measuring link delay information: link faults are detected through anomalies in link delay, so that a faulty link in the network can be located precisely within a relatively short time. The method comprises the steps of: (a) querying the routing information of the HPC interconnection network to obtain the link composition of the communication paths between nodes; (b) based on the link composition of each communication path, determining the set of key communication paths whose delay needs to be measured; (c) measuring the delay of the key paths in parallel and, from these measurements, solving for the delay of every link in the network; and (d) judging from the link delay information whether a link has failed: the expected value of link delay in the network is computed, and a link whose delay deviates substantially from that value is a faulty link.
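Steps (c) and (d) can be sketched in Python as follows. This is an illustration under stated assumptions, not the patented procedure: the key paths, link names, and measured delays are made up, the delays are hard-coded rather than measured in parallel, the link delays are recovered by exact Gauss-Jordan elimination on a square, full-rank path-link system, and the threshold `rel_tol` is a hypothetical parameter, since the abstract only says that a faulty link deviates substantially from the expected value.

```python
from fractions import Fraction

# Hypothetical key paths and their link sets for a toy two-switch network.
KEY_PATHS = {
    ("n0", "n1"): {"n0-s0", "n1-s0"},
    ("n0", "n2"): {"n0-s0", "n2-s1", "s0-s1"},
    ("n0", "n3"): {"n0-s0", "n3-s1", "s0-s1"},
    ("n1", "n2"): {"n1-s0", "n2-s1", "s0-s1"},
    ("n2", "n3"): {"n2-s1", "n3-s1"},
}

# Made-up path delay measurements in nanoseconds.
PATH_DELAY_NS = {
    ("n0", "n1"): 200,
    ("n0", "n2"): 700,
    ("n0", "n3"): 300,
    ("n1", "n2"): 700,
    ("n2", "n3"): 600,
}


def solve_link_delays(key_paths, path_delay):
    """Step (c): solve path delay = sum of link delays for every link.

    Assumes as many key paths as links and a full-rank system.
    """
    links = sorted(set().union(*key_paths.values()))
    n = len(links)
    # Augmented matrix: one incidence row per path, measured delay in the last column.
    rows = [
        [Fraction(int(link in on_path)) for link in links] + [Fraction(path_delay[path])]
        for path, on_path in key_paths.items()
    ]
    for col in range(n):
        pivot_row = next(r for r in range(col, n) if rows[r][col] != 0)
        rows[col], rows[pivot_row] = rows[pivot_row], rows[col]
        pivot = rows[col][col]
        rows[col] = [v / pivot for v in rows[col]]
        for r in range(n):
            if r != col and rows[r][col] != 0:
                factor = rows[r][col]
                rows[r] = [x - factor * y for x, y in zip(rows[r], rows[col])]
    return {link: rows[i][n] for i, link in enumerate(links)}


def find_faulty_links(link_delay, rel_tol=1.0):
    """Step (d): flag links far from the mean delay (rel_tol is an assumption)."""
    expected = sum(link_delay.values()) / len(link_delay)
    return sorted(
        link for link, delay in link_delay.items()
        if abs(delay - expected) > rel_tol * expected
    )


link_delay = solve_link_delays(KEY_PATHS, PATH_DELAY_NS)
faulty = find_faulty_links(link_delay)
print({link: int(delay) for link, delay in link_delay.items()}, faulty)
```

A plain mean is pulled toward the outliers it is meant to expose, so a real deployment would likely need a more robust estimate of the expected delay or a threshold tuned to the network.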

Description

Technical field

[0001] The present invention relates to a link fault detection method, and more specifically to a link fault detection method in the indirect network environment of a High Performance Computer (HPC).

Background art

[0002] High-performance computing refers to the process of using certain technologies to aggregate the computing power of a large number of processing units to solve complex problems. High-performance computing has gradually become an important means of addressing major challenges in national economic construction, social development, technological innovation, and national security, and it is a strategic high ground in competition among countries around the world. The scale of high-performance computing continues to grow and is now moving toward exascale (E-level). Computing at this scale requires very large machines. Take the Sunway TaihuLight high-performance computer, which ranks first in...


Application Information

Patent Type & Authority: Patents (China)
IPC(8): H04L12/24, H04L12/26
CPC: H04L41/064, H04L41/0677, H04L43/0852
Inventor: 肖利民, 刘成春, 杨章, 田泓蕴, 闫柏成, 王志昊
Owner: BEIHANG UNIV