Data compression method, equipment and system

A data compression and data technology, applied in the field of communication, can solve the problems of low data deduplication efficiency, high overhead, coarse granularity, etc., and achieve the effect of improving the data deduplication rate

Active Publication Date: 2015-07-01
HUAWEI TECH CO LTD
View PDF9 Cites 13 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the granularity of the data blocks divided by the CDC algorithm mostly depends on the settings of the data blocks. If the set data blocks are smaller, the granularity will be finer, and the duplicate data search will be more accurate, but the overhead of data block indexing and comparison of data blocks will be relatively high. Large; if the set data block is larger, the granularity will be coarser, and the data deduplication efficiency will be lower

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data compression method, equipment and system
  • Data compression method, equipment and system
  • Data compression method, equipment and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0067] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0068] The data compression method in the embodiment of the present invention can be implemented in the local WOC (Wan Optimization Controllers, wide area network optimization equipment) according to the redundancy rate setting corresponding to the network application data flow to obtain the data block parameters of the target network data, according to the set The data block parameter divides the target network data into data blocks to obtain at least one target da...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The embodiment of the invention discloses a data compression method. The method comprises the following steps: acquiring target network data; setting a data segmentation parameter of the target network data according to the redundancy rate of a network application data stream to which the target network data belongs; performing data segmentation on the target network data according to the set data segmentation parameter to obtain at least one target data block; comparing the at least one target data block with data blocks in a database in sequence; and deleting the target data blocks which are the same as the data blocks in the database. The embodiment of the invention also discloses equipment and a system. The embodiment of the invention further discloses equipment and a system. Through adoption of the method, the equipment and the system, the data block partitioning granularity can be set according to the redundancy rate corresponding to the network application data stream, so that the data deduplication rate is increased under the situation of not influencing the throughput rate.

Description

technical field [0001] The present invention relates to the field of communication technology, in particular to a data compression method, device and system. Background technique [0002] Data deduplication technology is based on the principle of deduplication, and eliminates duplicate data between the same file or similar files through a certain algorithm. Data block-level deduplication refers to first dividing the file into data blocks and calculating the data fingerprints of each data block. By comparing the data fingerprints of the data blocks, it is judged whether the same data blocks are stored in the database. If the target data is detected If the data fingerprint of the block is the same as that in the database, delete the target data block. [0003] CDC (Content-Defined Chunking, content-based data chunking) algorithm is a variable-length chunking algorithm, which uses data fingerprints (such as Rabin fingerprints) to divide files into chunks of different lengths. ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): H04L1/00G06F17/30
CPCG06F16/1744
Inventor 张亮刘屹葛雄资陆承涛吴俊
Owner HUAWEI TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products