Eureka AIR delivers breakthrough ideas for toughest innovation challenges, trusted by R&D personnel around the world.

Organization method of client directory cache in distributed file system

A distributed file and client technology, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve the problem of low access efficiency of massive small files

Inactive Publication Date: 2012-07-04
DAWNING INFORMATION IND BEIJING
View PDF5 Cites 31 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The present invention aims at disclosing a method for organizing the cache of client directory items in a distributed file system, which can effectively solve the problem of low access efficiency of a large number of small files under a single directory in network applications

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0014] Describe in detail below in conjunction with embodiment:

[0015] (1) In the present invention, the directory entries in a single directory are first hashed according to their names and divided into several subsets, and each subset is distributed on a metadata server.

[0016] (2) The directory item cache structure on the client is organized according to the directory item subsets, that is, the directory items distributed on each metadata are managed separately and kept independent of each other.

[0017] (3) When the application needs to traverse a certain directory, the client first inquires whether the local cache exists, and if so, directly returns it to the user. If the cache does not exist, it needs to read from the metadata server. When reading, since all directory entries of a single directory are stored on different metadata servers according to subsets, the invention adopts a parallel method for reading, which can speed up the speed of directory entry reading...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an organization method of a client directory cache in a distributed file system, wherein the distributed file system adopts a multi-metadata server framework, i.e. contents of a single directory are distributed on a plurality of metadata servers. The reason why the multi-metadata framework is adopted is mainly that the pressure of metadata access can be decentralized and the concurrency can be improved. According to the method, aiming at the characteristic that people write less but read more in network application, contents of a directory item and corresponding index nodes are remained in the cache of a client side, thereby avoiding the client side from communicating with the servers for multiple times when reading repeatedly; meanwhile when a directory is accessed for the first time, the directory items of the directory distributed on different metadata servers are pre-read, and the file index nodes and the file contents are pre-read according to a default pre-reading strategy or a pre-reading strategy issued by an application program. Consequently when the application program needs to access a certain file under the directory items, the metadata and data of the file are pre-read into the local cache of the client side already, so that the execution speed of the application program is accelerated greatly.

Description

technical field [0001] The invention relates to directory item management in a distributed file system, in particular to a method for organizing client directory cache in a distributed file system. Background technique [0002] With the rapid development of computer technology, various applications have an increasing demand for storage, among which network applications are the most typical. The storage requirements of network applications are roughly divided into two types. One is the storage requirements mainly for large files, such as video and audio network applications. This type of application is characterized by a small number of files, but the size of a single file is usually at the GB or even TB level. ;Another type of storage demand mainly for small files, such as online shopping malls, portals, etc., is characterized by a single file is not large, but the number of files is huge, usually tens of millions of files are stored in a single directory, and this Class fi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
Inventor 杨浩常涛吕明强邵宗有刘新春苗艳超王勇
Owner DAWNING INFORMATION IND BEIJING
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Eureka Blog
Learn More
PatSnap group products