Method and system for carrying out IO deduplication on non-homologous data of storage system in operation process

A storage system technique for non-homologous data, applied in the field of data processing, that improves performance and reduces resource usage and storage pressure

Inactive Publication Date: 2016-03-30
北京云巢动脉科技有限公司

AI Technical Summary

Problems solved by technology

[0003] The purpose of the present invention is to provide a method and system for IO deduplication of non-homologous data in a storage system at run time, so as to solve the aforementioned problems in the prior art.



Examples


Specific Embodiment 1

[0042] Referring to Figure 1, Specific Embodiment 1 is a method for IO deduplication of non-homologous data in a storage system at run time, comprising the following steps:

[0043] S1. The virtual machine reads data and obtains the feature code of the data;

[0044] S2. Determine whether the feature code of the data exists in the dedicated cache; if it exists, go to S3; if it does not exist, go to S4;

[0045] S3. Call the data stored in the dedicated cache that corresponds to the feature code, and go to S5;

[0046] S4. Call the data in the image file that corresponds to the feature code, and go to S5;

[0047] Step S4, more specifically:

[0048] S4-1. When the data corresponding to the feature code is called from the image file, the called data and its feature code are stored in a dedicated cache in memory;

[0049] S4-2. The dedicated cache processes the received data and its feature code;

[0050] S5. The virtual ...
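The read path in steps S1 to S4-2 can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the source does not say how the feature code is computed or how the virtual machine obtains it before the data is fetched, so the sketch assumes a SHA-256 digest of each block and a per-image index of precomputed codes. The names `BLOCK_SIZE`, `feature_code`, `DedicatedCache`, `ImageFile`, and `vm_read` are all hypothetical.

```python
import hashlib
from typing import Dict, List

BLOCK_SIZE = 4096  # assumed block size; the source does not specify one


def feature_code(block: bytes) -> str:
    """Assumed feature code: SHA-256 digest of the block contents."""
    return hashlib.sha256(block).hexdigest()


class DedicatedCache:
    """Hypothetical in-memory cache keyed by feature code, shared by all VMs."""

    def __init__(self) -> None:
        self._blocks: Dict[str, bytes] = {}
        self.hits = 0
        self.misses = 0

    def contains(self, code: str) -> bool:
        return code in self._blocks

    def get(self, code: str) -> bytes:
        return self._blocks[code]

    def put(self, code: str, block: bytes) -> None:
        # Stands in for S4-1/S4-2; the source does not detail the processing.
        self._blocks[code] = block


class ImageFile:
    """Toy image file: a list of blocks plus an assumed feature-code index."""

    def __init__(self, blocks: List[bytes]) -> None:
        self.blocks = blocks
        self.index = [feature_code(b) for b in blocks]
        self.reads = 0  # counts reads that actually reach the image file

    def read_block(self, block_no: int) -> bytes:
        self.reads += 1
        return self.blocks[block_no]


def vm_read(image: ImageFile, block_no: int, cache: DedicatedCache) -> bytes:
    code = image.index[block_no]           # S1: obtain the feature code
    if cache.contains(code):               # S2: is it in the dedicated cache?
        cache.hits += 1
        return cache.get(code)             # S3: serve from the cache
    cache.misses += 1
    data = image.read_block(block_no)      # S4: fall back to the image file
    cache.put(code, data)                  # S4-1: store data and feature code
    return data                            # the caller then uses the data
```

In this sketch, two images that contain an identical block share one cache entry: after one virtual machine reads the block from its own image, a read of the same content from a different image is served from memory without touching the second image file.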


Abstract

The present invention discloses a method and system for IO deduplication of non-homologous data in a storage system at run time, and relates to the field of data processing. The method comprises: S1, a virtual machine reads data and acquires a feature code of the data; S2, determining whether the feature code of the data exists in a dedicated cache, proceeding to S3 if it does and to S4 if it does not; S3, calling the data that is stored in the dedicated cache and corresponds to the feature code, and proceeding to S5; S4, calling the data in an image file that corresponds to the feature code, and proceeding to S5; and S5, the virtual machine works using the acquired data. The system comprises: a physical machine system module, a physical machine caching module, a virtual machine image file module and a virtual machine system module. The method and system provided by the present invention solve the problem that the server system cache easily becomes a bottleneck when a plurality of virtual machines read their respective image files simultaneously.
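The cache bottleneck mentioned above can be illustrated by comparing two ways of keying a cache. The sketch below is an assumption-laden illustration rather than the disclosed system: `file_keyed_cache` models a cache that identifies entries by file and block position, `content_keyed_cache` models one that identifies entries by a feature code (assumed here to be a SHA-256 digest), and all function names are hypothetical.

```python
import hashlib
from typing import Dict, List, Tuple


def file_keyed_cache(images: Dict[str, List[bytes]]) -> Dict[Tuple[str, int], bytes]:
    """Cache keyed by (file name, block number): identical blocks in
    different image files occupy separate entries."""
    cache: Dict[Tuple[str, int], bytes] = {}
    for name, blocks in images.items():
        for block_no, block in enumerate(blocks):
            cache[(name, block_no)] = block
    return cache


def content_keyed_cache(images: Dict[str, List[bytes]]) -> Dict[str, bytes]:
    """Cache keyed by an assumed feature code (SHA-256 of the block):
    identical blocks collapse into a single entry."""
    cache: Dict[str, bytes] = {}
    for blocks in images.values():
        for block in blocks:
            cache[hashlib.sha256(block).hexdigest()] = block
    return cache


def cached_bytes(cache: Dict) -> int:
    """Total payload size held by a cache."""
    return sum(len(block) for block in cache.values())
```

For two four-block images that share three blocks, the file-keyed cache holds eight entries while the content-keyed cache holds five, so the memory saved grows with the amount of data the images have in common.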

Description

Technical field

[0001] The invention relates to the field of data processing, in particular to a method and a system for IO deduplication of non-homologous data in a storage system at run time.

Background

[0002] A virtual machine uses an image file to simulate a disk, and the image file stores the virtual machine system. For different virtual machines running the same type of system, most of the data stored in their image files is identical. Although the server system has a caching mechanism, it caches on a per-file basis only, not per data block. When multiple virtual machines read their respective image files at the same time, the server system cache easily becomes a bottleneck because it cannot deduplicate identical data held in different files, and this also puts heavy pressure on storage.

Contents of the invention

[0003] The object of the present invention is to provide a method and system for IO deduplication of non-homologous data in a stora...


Application Information

IPC IPC(8): G06F12/08, G06F3/06
Inventor 杨耀敏, 易乐天, 曲维杰
Owner 北京云巢动脉科技有限公司