System and method for data migration between high-performance computing architectures and data storage devices with increased data reliability and integrity

Active Publication Date: 2014-05-06
DATADIRECT NETWORKS
View PDF6 Cites 95 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Benefits of technology

The present invention provides a data migration system that uses software to determine the best order in which to transfer data between a generating system and storage disks. This is based on factors like the fullness of the storage layer, the time elapsed from previous activity, and minimizing the usage of storage disks. After data is written in the intermediate storage module, a signal is sent to the generating system to switch to a compute state, reducing its input / output cycle. The technical effect is an improved data migration process that optimizes data transfer and reduces storage usage.

Problems solved by technology

The ratio of computer elements to File Servers is often very large and may exceed 1000 in some implementations.
Disk drives do not favor the regime of satisfying random requests since the recording heads have to be moved to various sectors of the drive (aka “seeking”), and this “seeking” movement takes a larger amount of time when compared to the actual “write” or “read” operation.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • System and method for data migration between high-performance computing architectures and data storage devices with increased data reliability and integrity
  • System and method for data migration between high-performance computing architectures and data storage devices with increased data reliability and integrity
  • System and method for data migration between high-performance computing architectures and data storage devices with increased data reliability and integrity

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0087]Referring to FIGS. 2A-2B, the system 20 of the present invention includes a number of computer nodes (or client nodes) 22. The computer nodes may be arranged in computing groups, or computer clusters, to perform complex computations of various types. The operation of the computer nodes depends on the system application. They may function as servers, super computing clusters, etc., and have the capacity to “write” by outputting data, as well as “read” from an external memory or any other device. In the present description the above-presented devices will be referenced further herein as data generating entities.

[0088]The computer nodes 22 are connected through a high speed network 24 to file servers 26 which manage data migration from and to the computer nodes 22. The ratio of the computer nodes 22 to the servers 26 may be in excess of a thousand in some applications. The file servers 26 satisfy requests of the computer nodes 22 in the same order as the requests are received at ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

A system for data migration between high performance computing architectures and data storage disks includes an intermediate data migration handling system which has an intermediate data storage module coupled to the computer architecture to store data received, and a data controller module which includes data management software supporting the data transfer activity between the intermediate data storage module and the disk drives in an orderly manner independent of the random I / O activity of the computer architecture. RAID calculations are performed on the data prior to storage in the intermediate storage module, as well as when reading data from it for assuring data integrity, and carrying out reconstruction of corrupted data. The data transfer to the disk drives is actuated in sequence determined by the data management software based on minimization of seeking time, tier usage, predetermined time since the previous I / O cycle, or fullness of the intermediate data storage module. The storage controller deactivates the disk drives which are not needed for the data transfer.

Description

FIELD OF THE INVENTION[0001]The present invention is directed to data migration between high performance computing cluster architectures (data generating entities) and data storage disks, and particularly, the present invention relates to a data migration technique rendering an orderly access favored by data storage disks which is decoupled from a randomized input / output (I / O) activity of data generating entities.[0002]More in particular, the present invention is directed to a data migration system employing an intermediate data migration handling system that performs high speed data caching into an intermediate data storage layer from data generating devices, aggregates and organizes the ingress and further controls migration of the cached data to a target data storage device in a manner that is most efficient for the target storage devices. When the data is stored in the intermediate data storage layer, the computing entity may return from the I / O state to the compute state withou...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F12/02
CPCG06F3/0689G06F3/061G06F3/0659G06F11/1076G06F2211/1028
Inventor PISZCZEK, MICHAEL J.FERNANDES, CEDRIC T.FELLINGER, DAVE F.HARKER, WILLIAM JOSEPHMANNING, JOHN GORDONMCBRYDE, LEE DOUGLASUPPU, PAVAN KUMARMISHRA, MANJARIFUGINI, THOMAS EDWARDPANDIT, SHIVKUMARDE LEON, JOHN ALBERT
Owner DATADIRECT NETWORKS
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products