Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Suffix tree based catalog organizing method in distributed file system

A distributed file and suffix tree technology, which is applied in the fields of instruments, computing, and electrical digital data processing, etc., can solve the problems of large similarity, complex implementation, and low efficiency

Active Publication Date: 2011-04-20
DAWNING INFORMATION IND BEIJING +1
View PDF4 Cites 14 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

When the number of directory items in the directory is large, if the traditional directory organization method of the ext3-like file system is adopted, the time complexity of directory item search is 0(n), and the efficiency is low; if the directory organization method of B+ tree is adopted , on the one hand, the implementation is more complicated. On the other hand, due to the high similarity of each directory item, it is necessary to frequently adjust the balance of the tree when inserting, which also has no advantage in efficiency.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Suffix tree based catalog organizing method in distributed file system
  • Suffix tree based catalog organizing method in distributed file system
  • Suffix tree based catalog organizing method in distributed file system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0014] specific implementation plan

[0015] (1) The metadata server of the distributed file system is usually a powerful server equipped with multiple disks. According to the characteristics of the disk controller, different disks are located in different channels of the disk controller, and thus are independent of each other in terms of operation control. Therefore, multiple disks on the metadata server are actually independent of each other and can be accessed in parallel. In order to speed up the access speed to the super-large directory, the directory items are divided into several groups in the present invention and stored on different disks respectively. The grouping method adopts a simple string hash method. Given a string S and a total number of groups N, the group number n in which S is located is: n=hash(S)%N

[0016] (2) After directory items are grouped, each group is a collection of directories. Due to the particularity of directory item names in the applicati...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a suffix tree based catalog organizing method in a distributed file system. The method comprises the following steps of: grouping catalog items according to names, and storing different groups of catalog items on different discs on a storage server; and organizing and storing different groups of catalog items by adopting a suffix tree method.

Description

technical field [0001] The invention relates to file management in a distributed file system, in particular to a directory organization method based on a suffix tree in a distributed file system. Background technique [0002] With the rapid development of computer technology, the requirements for storage in the fields of network and scientific computing are getting higher and higher, so the distributed file system is gradually introduced into these fields to meet the storage needs of these fields. [0003] Applications in the Internet and other fields have relatively distinctive characteristics, one of which is that a single directory often stores millions or even hundreds of millions of files, such as storing mp3 files and picture files, etc. The characteristic of these files is that they are usually composed of numbers or letters name, such as 1.mp3, abc.jpg, etc. When the number of directory items in the directory is large, if the traditional directory organization metho...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
Inventor 杨浩邵宗有苗艳超王勇马照云
Owner DAWNING INFORMATION IND BEIJING
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products