Data processing method and device

A data processing device and data processing technology, applied in the storage field, can solve problems such as deduplication performance degradation, and achieve the effect of improving the deduplication rate

Active Publication Date: 2020-09-08
HUAWEI TECH CO LTD
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The inventor found in the research that in the existing data deduplication, for example, the data received for the first time is stored as new data; when the data received for the second time is changed relative to the data received for the first time, The changed data will be stored separately as new data; and when the same data received for the third time is received for the second time, the data most similar to the data received for the third time is likely to be the data received for the first time , then compared to the data changed for the first time, the changed data will still be considered as new data and stored, but in fact, the changed data has already been stored, so it can be seen that the deduplication process of the prior art In , the more data is stored, the more storage areas the data will be distributed to, but the overall deduplication performance will decrease

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data processing method and device
  • Data processing method and device
  • Data processing method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0048] In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments It is a part of embodiments of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0049] The embodiment of the present invention is applicable to a storage system, and the storage system may include multiple physical nodes or only one physical node, which is not limited by the embodiment of the present invention. Wherein, the physical node with the deduplication engine may serve as the execution subject of the embodiment of the...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

Embodiments of the invention provide a data processing method and device. Through the data processing method and device, when a data hashed value in a currently received data flow exceeds a preset first threshold value, part or all of data in the data flow is directly stored without re-deletion, so that the data in the data flow is prevented from being dispersedly stored in a plurality of storage areas and is intensively stored in one storage area, and then the re-deletion rate is effectively improved on the whole under the scenes of large data storage amount.

Description

technical field [0001] Embodiments of the present invention relate to storage technology, and in particular, to a data processing method and device. Background technique [0002] Data deduplication (abbreviated as deduplication), also known as intelligent compression or single instance storage, is a method that can automatically search for duplicate data, keep only one copy of the same data, and replace other duplicate copies with pointers to the single copy , to achieve a storage technology that eliminates redundant data and reduces storage capacity requirements. [0003] In the prior art, in the data deduplication scheme, the received data is divided into blocks to obtain data blocks, and then the data blocks are formed into several data segments, and the characteristic value of each data segment is calculated using a certain method , using the computed eigenvalues ​​to represent data segments. Match the characteristic value of the data segment with the characteristic va...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/16
CPCG06F16/162
Inventor 钟延辉张宗全
Owner HUAWEI TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products