Rapid data deletion method based on source end deduplication

A deletion method and data technology, applied in fields such as redundant operations for data error detection, electrical digital data processing, and special data processing applications. It addresses the long running time and low performance of existing deletion schemes and their impact on backup performance, with the effects of simplifying database operations, improving deletion performance, and optimizing the backup process.

Pending Publication Date: 2020-05-08
STATE GRID CORP OF CHINA +2
3 Cites, 0 Cited by

AI Technical Summary

Problems solved by technology

This approach has the following disadvantages. First, recording a reference count for each block is relatively fine-grained. Every time a backup task runs, the database must be accessed regardless of whether a data block is new (a new fingerprint record must be inserted for a new block, and the reference count must be updated for an existing block), which affects backup performance. Even so, space in the deduplication library may not actually be released after a deletion operation: if a block's reference count is not 0, the block cannot be cleaned from the disk. Second, deleting a backup set requires traversing all the deduplicated blocks used in the backup set and adjusting the fingerprint table entries for all of those blocks, so the deletion task takes a long time.
Therefore, the existing deletion scheme has relatively low performance and is not suitable for scenarios where backup and deletion are frequent.
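
The cost of the per-block reference-count scheme criticized above can be illustrated with a minimal sketch. The class name `RefCountedStore`, the `db_writes` counter, and the fingerprint strings are hypothetical and only model the behavior described in the text; they are not taken from the patent.

```python
from collections import Counter


class RefCountedStore:
    """Hypothetical model of a per-block reference-count scheme."""

    def __init__(self):
        self.refcount = Counter()  # fingerprint -> number of references
        self.db_writes = 0         # simulated database accesses

    def backup(self, fingerprints):
        # Every block costs one database write: an insert for a new
        # fingerprint, or a reference-count update for an existing one.
        for fp in fingerprints:
            self.refcount[fp] += 1
            self.db_writes += 1
        return list(fingerprints)

    def delete(self, backup_set):
        # Deletion must visit every block the backup set used.
        freed = []
        for fp in backup_set:
            self.refcount[fp] -= 1
            self.db_writes += 1
            if self.refcount[fp] == 0:
                # Only blocks whose count reaches 0 can leave the disk.
                del self.refcount[fp]
                freed.append(fp)
        return freed


store = RefCountedStore()
set_a = store.backup(["f1", "f2", "f3"])
set_b = store.backup(["f2", "f3", "f4"])
freed = store.delete(set_a)
print(freed, store.db_writes)
```

In this sketch, deleting `set_a` touches three fingerprint records but frees only the one block that `set_b` does not share, which is the imbalance the text describes.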



Embodiment Construction

[0024] The present invention will be further described below in conjunction with the accompanying drawings and specific embodiments.

[0025] Figure 1 shows the storage structure of a backup set that uses the deduplication function in the backup device. After the source data is backed up to the backup device, a corresponding backup set is generated, and the data in the backup set is stored in the two databases and the two types of files shown in Figure 1. The guiddb table in the objdb database records all object information in the backup set; each object points to an objfile file, and the fingerprint index of each data block is stored sequentially in the object file. The dedupdb database includes the fingerdb table, the filedb table, and the guiddb table. The fingerdb table is the fingerprint table that records all fingerprints in the deduplication library; it records block fingerprints and the data file location ...
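
As a rough illustration of the layout in paragraph [0025], the two databases can be modelled as plain Python structures. The table names (objdb, dedupdb, guiddb, fingerdb, filedb) come from the text; the class names, the `offset` field, the object id, and the data file name are assumptions made for this sketch, and the tables whose contents the excerpt does not describe are left as empty placeholders.

```python
from dataclasses import dataclass, field


@dataclass
class BlockLocation:
    data_file: str  # data file that holds the block
    offset: int     # assumed field: position of the block in that file


@dataclass
class DedupDB:
    # fingerdb: block fingerprint -> location of the block on the medium
    fingerdb: dict = field(default_factory=dict)
    # filedb and guiddb are named in the text but not described in the
    # excerpt, so they stay as opaque placeholders here.
    filedb: dict = field(default_factory=dict)
    guiddb: dict = field(default_factory=dict)


@dataclass
class ObjDB:
    # guiddb: object id -> objfile, modelled as the ordered list of
    # fingerprint indexes stored in that object file
    guiddb: dict = field(default_factory=dict)


def locate_blocks(objdb, dedupdb, object_id):
    """Resolve an object's fingerprints, in order, to block locations."""
    return [dedupdb.fingerdb[fp] for fp in objdb.guiddb[object_id]]


dedupdb = DedupDB(fingerdb={
    "f1": BlockLocation("data_0001", 0),
    "f2": BlockLocation("data_0001", 4096),
})
objdb = ObjDB(guiddb={"obj-1": ["f2", "f1", "f2"]})
locations = locate_blocks(objdb, dedupdb, "obj-1")
```

The object file keeps fingerprints in sequence, so a repeated block such as `f2` appears twice in the object but only once in the fingerprint table.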


Abstract

The invention provides a rapid data deletion method based on source-side deduplication. The method no longer records a specific reference count for each data block; instead, the backup set object records the data files in which its referenced data blocks reside. The deletion function uses a delayed deletion strategy comprising two steps: deleting the backup set and cleaning the medium. This simplifies the deletion operation, improves deletion performance, keeps data blocks in the deduplication library for as long as possible, and avoids the resource waste caused by frequent backup and deletion.
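
The two-step delayed deletion described in the abstract might be sketched as follows. This is only one possible reading: the class `DelayedDeletionStore`, its method names, and the rule used in `clean_medium` (remove data files that no remaining backup set references) are assumptions for illustration, since the excerpt does not spell out how the medium is cleaned.

```python
class DelayedDeletionStore:
    """Hypothetical sketch of delete-backup-set plus clean-medium."""

    def __init__(self):
        self.backup_sets = {}    # backup set id -> data files it references
        self.data_files = set()  # data files currently on the medium

    def add_backup_set(self, set_id, files):
        # The backup set records data files, not per-block counts.
        self.backup_sets[set_id] = set(files)
        self.data_files |= set(files)

    def delete_backup_set(self, set_id):
        # Step 1: drop only the backup set record. No fingerprint
        # entries are touched, so this step is cheap.
        self.backup_sets.pop(set_id, None)

    def clean_medium(self):
        # Step 2, run later: remove data files that no remaining
        # backup set references (assumed cleaning rule).
        live = set().union(*self.backup_sets.values())
        stale = self.data_files - live
        self.data_files -= stale
        return sorted(stale)


store = DelayedDeletionStore()
store.add_backup_set("A", ["d1", "d2"])
store.add_backup_set("B", ["d2", "d3"])
store.delete_backup_set("A")
files_before_clean = set(store.data_files)
removed = store.clean_medium()
```

Between the two steps, the blocks of the deleted backup set remain in the deduplication library, so a backup taken in that window can still deduplicate against them.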

Description

Technical field

[0001] The invention belongs to the technical field of data deduplication, and in particular relates to a fast data deletion method based on source-side deduplication.

Background art

[0002] Backup devices are always filled with a lot of redundant data. To solve this problem and save more space, deduplication technology has naturally become a focus of attention. Deduplication can greatly reduce the amount of stored data and free up more backup space, so that backup data can be kept on disk for a longer period of time, and source-side deduplication can also save a lot of bandwidth during backup. A backup device for data protection must provide the basic functions of backup, restore, and delete.

[0003] The defining feature of deduplication is that only one copy of each data block is kept in the deduplication library that stores the data, so every stored data block is different and unique. The data backed up using the ...
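
The idea of keeping a single copy of each block, and of saving bandwidth by deduplicating at the source, can be shown with a small sketch. Fixed-size blocks, SHA-256 fingerprints, and the names `split_blocks` and `source_side_backup` are assumptions chosen for illustration; the excerpt does not state which chunking or hashing the invention uses.

```python
import hashlib


def split_blocks(data, block_size=4):
    # Assumed fixed-size chunking, kept tiny for readability.
    return [data[i:i + block_size] for i in range(0, len(data), block_size)]


def source_side_backup(data, library, block_size=4):
    """Send only blocks whose fingerprint the library lacks.

    Returns the ordered fingerprint list for the data and the number
    of bytes that had to be transferred.
    """
    recipe = []
    sent = 0
    for block in split_blocks(data, block_size):
        fp = hashlib.sha256(block).hexdigest()
        if fp not in library:
            library[fp] = block  # first and only stored copy
            sent += len(block)
        recipe.append(fp)
    return recipe, sent


library = {}
recipe1, sent1 = source_side_backup(b"AAAABBBBAAAA", library)
recipe2, sent2 = source_side_backup(b"BBBBCCCC", library)
restored = b"".join(library[fp] for fp in recipe1)
```

Here the second backup transfers only the block the library has not seen, while the first backup can still be restored in full from its ordered fingerprint list.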

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F11/14, G06F16/16
CPC: G06F11/1448, G06F16/162
Inventor: 佟芳, 周建华, 李晖, 秦浩, 徐铁军, 张文飞, 李国栋, 王婷, 王忠花, 马文珍
Owner: STATE GRID CORP OF CHINA