Graph division method for distributed graph calculation

A graph computing and graph segmentation technology, applied in the field of distributed graph computing, can solve problems such as high overhead, and achieve the effects of improving performance, improving processing efficiency, and reducing communication overhead.

Active Publication Date: 2018-11-13
NAT UNIV OF DEFENSE TECH
View PDF4 Cites 10 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] Aiming at the technical problem that the existing graph segmentation method is too expensive for large-scale graph calculation, the present invention provides a distributed The graph segmentation method of graph computing, referred to as the TopoX partition method. The TopoX partition method is based on the open source framework PowerLyra. First, the input graph data is distributed to each node, and input from each node to the entire distributed framework for processing.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Graph division method for distributed graph calculation
  • Graph division method for distributed graph calculation

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0023] In order to make the purpose and technical solutions of the present invention clearer, the present invention will be further described in detail below in conjunction with specific examples. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0024] First, the basic concept involved in the present invention is given. The edge is the basic unit of the input graph data, and its format is (source point, target point, weight). A bag refers to a collection of partial edges with high locality, whose maximum value is modifiable. Adding an edge to a bag means placing the edge on the node where the bag resides. Other required data structures include related data structures such as the access status of each edge, which can be set by yourself. A vertex is an abstraction of an entity, an edge represents two entities and the relationship between entities, a source point represent...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a graph division method for distributed graph calculation in order to solve the technical problem that expenditure is excessively large when graph division is performed for large-scale graph calculation through an existing graph division method. According to the method, based on an open source framework PowerLyra, sides of graph data as input are scattered to all nodes according to target point hashing first, then all the nodes synchronously and concurrently process the sides distributed to themselves on the distributed framework, and a corresponding distributed algorithm is executed according to need. Through the method, the concept of ''package'' is proposed, wherein a package refers to a set of part of the sides with high locality, and the maximum value of the package can be modified; data locality characteristics are introduced into a metering standard of graph division through the package, so that divided sub-graphs have locality, the data locality principleis fully utilized, and the number of image vertexes of vertexes is effectively reduced; and meanwhile, load balance among the nodes is guaranteed, communication expenditure in a distributed system isreduced, processing efficiency of large-scale graph relevant applications is improved, and the performance of distributed graph calculation is improved.

Description

technical field [0001] The invention belongs to the field of distributed graph computing, in particular to a graph segmentation method for distributed graph computing. Background technique [0002] There are currently many distributed graph computing frameworks developed around the world, mainly including two parts: graph partitioning and graph computing. Graph partitioning refers to distributing large-scale graph data to each node in the cluster according to a certain strategy, and the graph computing part refers to performing distributed operations on the graph according to the needs of practical applications. [0003] In the early days, there was PowerGraph from Carnegie University, which introduced graph division from edge segmentation to the era of point segmentation. Before this, the principle of graph partitioning is that each point only exists on one node, and all edges related to this point are stored on this node, and each edge will be stored twice. In practical ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F9/50
CPCG06F9/5005G06F2209/548
Inventor 张一鸣王金岩李东升
Owner NAT UNIV OF DEFENSE TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products