A Load-Driven Distributed Graph Data Segmentation and Replication Method

A replication method and graph data technology, applied in the computer field, can solve problems such as expensive network and storage overhead, distributed graph data disaster tolerance, and achieve the effects of reducing the number of edge cuts, high throughput, and improving query efficiency

Active Publication Date: 2022-04-29
WUHAN UNIV
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] Another problem is the disaster recovery of distributed graph data. In order to prevent data loss due to system operation errors or system failures, we need to perform data backup, that is, data replication
The traditional solution is to perform a full backup of all data according to a fixed replication factor, resulting in expensive network and storage overhead

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Load-Driven Distributed Graph Data Segmentation and Replication Method
  • A Load-Driven Distributed Graph Data Segmentation and Replication Method
  • A Load-Driven Distributed Graph Data Segmentation and Replication Method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0100] The present invention is mainly based on computer graphics topology, considering dynamic graph characteristics and user workload characteristics, and proposes an experimental method and system for adaptive distributed graph data segmentation and replication. This method fully considers the situation of different users with different workloads. By adaptively adjusting the storage location of graph vertices, the user can visit as few server nodes as possible in a single query, improve query efficiency, and ensure query efficiency. Low latency and high throughput. The results obtained by the invention are more scientific and more accurate.

[0101] The method provided by the invention can use computer software technology to realize the process. see figure 1 , the embodiment of the cluster distributed graph data segmentation and replication as an example to carry out a specific elaboration of the process of the present invention, as follows:

[0102] A workload-driven di...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention proposes a workload-driven distributed graph data segmentation and replication method. In the present invention, the data in the graph data set are respectively stored in the server cluster through the hash table method; combined with the user's workload information, the active vertex set is constructed in the vertex data set combined with the load determination of the source server, and then each active vertex is combined with the server cluster According to the score function score, construct the target server score list, determine the target migration server through the target server load judgment; combine the active vertex set and the corresponding target migration server, judge whether the active vertex is a high-read vertex through the threshold, if the active vertex is not high The read vertex further dynamically adjusts the copy data of the active vertex through the maximum replication factor. The invention utilizes the characteristics of dynamic change of the workload to dynamically adjust the position of the graph vertex data, thereby improving the query efficiency and ensuring low delay and high throughput of the query.

Description

technical field [0001] The invention belongs to the field of computers, and in particular relates to a load-driven distributed graph data division and replication method. Background technique [0002] In recent years, with the continuous expansion of social network and Wanwei graph data, and the continuous expansion of multi-user access requirements, it is difficult for a common single database server to meet multi-user high-quality access services under existing resources. The traditional solution is to use vertical expansion and complete replication of data, and the resulting high cost is unsatisfactory, so a distributed data storage method emerges as the times require. Distributed data storage is based on cheap server clusters for horizontal partition expansion and partial replication backup, and provides concurrent graph data processing. In the face of distributed parallel data storage, how to reasonably store graph data in different nodes, so that users can improve the...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/22G06F16/2453G06F16/2455G06F16/27
CPCG06F16/2255G06F16/2453G06F16/2455G06F16/27
Inventor 涂宏伟刘梦赤
Owner WUHAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products