Remote sensing data rapid concurrent read-write method based on distributed file system

A technology for distributed files and remote sensing data, which is applied in file systems, file system management, and electronic digital data processing. It can solve problems such as not supporting a large number of small file storage, reducing access efficiency, and seeking time exceeding reading time. , to achieve the effect of solving practical application inconvenience, improving user experience, and important market value

Active Publication Date: 2021-08-03
WUHAN UNIV
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] However, HDFS also has the following three disadvantages (HDFS is not applicable in these cases): 1. Low-latency data access cannot be achieved, such as reading and writing data at the millisecond level
HDFS is only suitable for high-throughput scenarios, that is, a large amount of data is written in a certain period of time, but it does not support reading data back immediately
2. Does not support a large number of small file storage
Storing a large number of small files will occupy a large amount of memory of the index service (NameNode) to store data block index information. However, the index service memory of HDFS is limited and cannot achieve massive expansion. In addition, a large number of indexes will cause the seek time to exceed the read time, which is extremely Greatly reduce access efficiency
3. Unable to concurrently write or modify files randomly
[0004] Therefore, HDFS containing the above three shortcomings cannot be applied to remote sensing big data processing. For this reason, the present invention proposes a method for improving the distributed file system to realize fast concurrent reading and writing of remote sensing big data

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Remote sensing data rapid concurrent read-write method based on distributed file system
  • Remote sensing data rapid concurrent read-write method based on distributed file system
  • Remote sensing data rapid concurrent read-write method based on distributed file system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0028] The technical solution of the present invention will be described in detail below in conjunction with the accompanying drawings and embodiments.

[0029] The invention provides a method for improving the distributed file system to realize fast concurrent reading and writing of remote sensing data, named as channel reading and writing technology (English Gate IO), which can realize fast concurrent reading and writing of massive data. In order to solve the three shortcomings of HDFS, the embodiment of the present invention transforms the original HDFS distributed file system:

[0030] First of all, it inherits the characteristics of high fault tolerance, high reliability, high scalability, high availability, and high throughput of the HDFS file system on the underlying physical structure to ensure the efficiency and stability of the distributed file system.

[0031] Then, a first-level encapsulation is performed on the HDFS business processing layer to take over the opera...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a remote sensing data rapid concurrent read-write method based on a distributed file system. A bottom physical structure inherits the characteristics of the HDFS file system, and the method comprises the steps that a Hadoop system is installed on each data server in a computer group, and the HDFS file system is established; then a part of space is divided on each data server to serve as a physical storage space of an own file system; primary packaging is carried out on an HDFS service processing layer, access of an operating system to a file system is taken over, when the operating system only requires to read a file and file data exists, an HDFS file system interface is directly referenced, and reading of the file data is completed by an HDFS; when the operation system requires that the access to the file comprises the file writing operation, the file operation is comprehensively taken over, the own file system realizes data reading and writing, and after the data reading and writing are completed, the data are synchronized to the HDFS; and the own file system only reads and writes one server. According to the invention, rapid concurrent reading and writing of mass remote sensing data can be realized.

Description

technical field [0001] The invention relates to the field of computer application technology and remote sensing big data processing, in particular to the concurrent reading and writing technology of large data files, and in particular to the fast concurrent reading and writing technology of remote sensing data based on a distributed file system. Background technique [0002] With the rapid development of remote sensing technology, more and more remote sensing satellites are launched by various countries, and the satellite data received every day has reached PB level. The processing of massive remote sensing data puts forward higher requirements for storage technology and processing speed. On the other hand, with the rapid development of computers, especially Internet technology, data storage technology has also made a qualitative leap, especially the emergence of cloud technology in recent years, which has raised data storage technology to an unprecedented level. Cloud techn...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F16/11G06F16/16G06F16/178G06F16/182
CPCG06F16/122G06F16/16G06F16/178G06F16/182Y02D10/00
Inventor 段延松张祖勋陶鹏杰柯涛张永军
Owner WUHAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products