A method and system for selectively rewriting self-referencing blocks in the process of data deduplication

A technique for selectively rewriting self-referencing blocks, applicable to special data processing applications, electronic digital data processing, digital data information retrieval, etc. It addresses the inability to eliminate internal fragments and the resulting loss of recovery performance, and it achieves improved recovery performance, more comprehensive deduplication, and increased container utilization.

Active Publication Date: 2020-11-24
HUAZHONG UNIV OF SCI & TECH

AI Technical Summary

Problems solved by technology

[0005] In view of the defects of the prior art, the purpose of the present invention is to solve the technical problem that the HAR rewriting method in the prior art cannot eliminate the internal fragments caused by self-referencing blocks, which reduces recovery performance.


Embodiment Construction

[0031] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention. In addition, the technical features involved in the various embodiments of the present invention described below can be combined with each other as long as they do not constitute a conflict with each other.

[0032] Figure 1 is a flow chart of a method for selectively rewriting self-referencing blocks in the data deduplication process provided by an embodiment of the present invention. As shown in Figure 1, the method includes the following steps:

[0033] S1. Obtain the duplicate blocks in the data deduplication process and the total number of containers backed up last time;

[0034] S2...


Abstract

The invention discloses a method and a system for selectively rewriting self-referencing blocks in a data deduplication process. The method comprises the following steps: obtaining the duplicate blocks in the data deduplication process and the total number of containers backed up last time; judging whether the container ID of a duplicate block is greater than the total number of containers backed up last time, and if so, the duplicate block is a self-referencing block, otherwise it is not a self-referencing block; creating a buffer that simulates the recovery buffer and checking whether the container ID of the self-referencing block exists in that buffer, and if so, the self-referencing block is not an internal fragment block, otherwise it is an internal fragment block; judging whether the utilization of the container in which the internal fragment block is located is below a preset threshold, and if so, rewriting the internal fragment block to the container, otherwise the self-referencing block need not be rewritten to the container. By simulating the recovery buffer, the invention identifies internal fragment blocks and selectively rewrites only part of them to limit the amount of rewriting, so that container utilization increases and recovery performance improves significantly.
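The steps in the abstract can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the names `DuplicateBlock`, `SimulatedRestoreBuffer`, and `select_blocks_to_rewrite` are hypothetical, the simulated buffer is assumed to use LRU eviction with a fixed capacity counted in containers, only self-referencing lookups are assumed to touch the buffer, and container utilization is assumed to be supplied by the caller as a fraction between 0 and 1.

```python
from collections import OrderedDict
from dataclasses import dataclass


@dataclass(frozen=True)
class DuplicateBlock:
    """A duplicate block and the ID of the container that already stores it."""
    fingerprint: str
    container_id: int


class SimulatedRestoreBuffer:
    """Tracks container IDs only. LRU eviction is an assumption of this sketch."""

    def __init__(self, capacity):
        self.capacity = capacity
        self._ids = OrderedDict()

    def access(self, container_id):
        """Return True on a hit; on a miss, admit the ID and evict the oldest one."""
        if container_id in self._ids:
            self._ids.move_to_end(container_id)
            return True
        self._ids[container_id] = None
        if len(self._ids) > self.capacity:
            self._ids.popitem(last=False)
        return False


def select_blocks_to_rewrite(blocks, last_backup_containers, utilization,
                             buffer_capacity=2, threshold=0.5):
    """Return the duplicate blocks that the abstract's rule would rewrite."""
    buffer = SimulatedRestoreBuffer(buffer_capacity)
    to_rewrite = []
    for block in blocks:
        # A container ID above last backup's container count marks a
        # self-referencing block; other duplicates are skipped here.
        if block.container_id <= last_backup_containers:
            continue
        # A hit in the simulated buffer means the block is not an internal fragment.
        if buffer.access(block.container_id):
            continue
        # Internal fragment: rewrite only if its container is poorly utilized.
        # Unknown containers default to fully utilized (an assumption).
        if utilization.get(block.container_id, 1.0) < threshold:
            to_rewrite.append(block)
    return to_rewrite


if __name__ == "__main__":
    stream = [DuplicateBlock("a", 3), DuplicateBlock("b", 11),
              DuplicateBlock("c", 11), DuplicateBlock("d", 12),
              DuplicateBlock("e", 13), DuplicateBlock("f", 11)]
    chosen = select_blocks_to_rewrite(stream, 10, {11: 0.2, 12: 0.9, 13: 0.1})
    print([b.fingerprint for b in chosen])  # prints ['b', 'e', 'f']
```

In the usage example, block `c` hits the simulated buffer and is left alone, block `d` misses but sits in a well-utilized container, and block `f` is selected because container 11 has been evicted by the time it is referenced again.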

Description

Technical field

[0001] The invention belongs to the technical field of computer storage, and more specifically relates to a method and system for selectively rewriting self-referencing blocks in the process of data deduplication.

Background technique

[0002] Deduplication technology is the most effective way to eliminate redundant data and save space in modern backup systems. The deduplication technology first divides the file into equal-length or variable-length data blocks, then uses the hash method to calculate the fingerprint of each data block, and determines whether the data block is repeated by comparing the fingerprint values. It is a lossless compression technology that can effectively save system storage space.

[0003] Since the data blocks in a backup stream are scattered in different containers, two types of fragmentation are introduced: one is inter-version fragmentation, which is caused by the regular backup of similar data; the other is caused by the sam...
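The fingerprint comparison described in paragraph [0002] can be illustrated with a short Python sketch. It assumes equal-length chunking and SHA-1 as the hash method; the source names neither a chunk size nor a specific hash function, and the helper names `fixed_length_chunks`, `deduplicate`, and `restore` are hypothetical.

```python
import hashlib


def fixed_length_chunks(data, size):
    """Split data into equal-length blocks; the last block may be shorter."""
    for offset in range(0, len(data), size):
        yield data[offset:offset + size]


def deduplicate(data, chunk_size=4):
    """Store each distinct block once and record the order needed for recovery."""
    store = {}    # fingerprint -> block contents
    recipe = []   # fingerprints in stream order
    for chunk in fixed_length_chunks(data, chunk_size):
        fingerprint = hashlib.sha1(chunk).hexdigest()
        if fingerprint not in store:   # matching fingerprint means duplicate block
            store[fingerprint] = chunk
        recipe.append(fingerprint)
    return store, recipe


def restore(store, recipe):
    """Rebuild the original data; deduplication is lossless."""
    return b"".join(store[fp] for fp in recipe)


if __name__ == "__main__":
    store, recipe = deduplicate(b"abcdabcdxyz")
    print(len(store), len(recipe))  # prints 2 3
    print(restore(store, recipe) == b"abcdabcdxyz")  # prints True
```

The tiny chunk size is chosen only to keep the example readable. The sketch also leaves out containers, which is where the fragmentation discussed in [0003] arises.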

Application Information

Patent Type & Authority: Patents (China)
IPC: IPC(8): G06F16/174
Inventor: 王芳, 左春雪, 冯丹, 张宇成, 胡燏翀
Owner: HUAZHONG UNIV OF SCI & TECH