Storage optimization method for Hadoop distributed file system
A storage optimization technology for distributed file systems, applied in the fields of file systems, file system types, and file/folder operations. It addresses problems such as the large proportion of stored data, the small proportion of duplicate data, and data screening and deletion.
Embodiment Construction
[0035] The present invention will be described in detail below in conjunction with the accompanying drawings and specific embodiments.
[0036] The present invention is a storage optimization method for the Hadoop distributed file system. As shown in Figure 1, the specific steps are as follows:
[0037] Step 1: Extract the file operation records, specifically:
[0038] Step 1.1: Select the INFO-level log files; the selected log files contain the specific execution timestamps and file name information.
[0039] HDFS stores a large volume of log files that record the various operations on the distributed file system. The logs are mainly divided into three levels, WARN, INFO, and DEBUG, with the level of detail increasing in that order. DEBUG-level logs sit at the bottom layer; their content is the most direct and detailed, but the data volume is large. WARN-level logs sit at the top layer, and only key information and information that may cause erro...
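Step 1.1 can be illustrated with a minimal Python sketch. It is not the patent's implementation. It assumes each log line follows a log4j-style layout (`<date> <time>,<millis> <LEVEL> <logger>: <message>`) and that the file name appears in a `src=` field with the operation in a `cmd=` field, as in HDFS audit-style entries. The function name `extract_file_operations` and both regular expressions are hypothetical and would need adjusting to the actual log layout.

```python
import re
from datetime import datetime

# Assumed layout: "<date> <time>,<millis> <LEVEL> <logger>: <message>".
LINE_RE = re.compile(
    r"^(?P<ts>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}),\d{3}\s+"
    r"(?P<level>[A-Z]+)\s+(?P<rest>.*)$"
)
# Assumed audit-style fields carrying the operation and the file path.
CMD_RE = re.compile(r"\bcmd=(\S+)")
SRC_RE = re.compile(r"\bsrc=(\S+)")


def extract_file_operations(lines):
    """Keep INFO-level lines and return (timestamp, operation, file name) tuples.

    Hypothetical helper: lines that do not match the assumed layout,
    are not INFO level, or carry no src= field are skipped.
    """
    records = []
    for line in lines:
        m = LINE_RE.match(line)
        if m is None or m.group("level") != "INFO":
            continue
        src = SRC_RE.search(m.group("rest"))
        if src is None:
            continue
        cmd = CMD_RE.search(m.group("rest"))
        ts = datetime.strptime(m.group("ts"), "%Y-%m-%d %H:%M:%S")
        records.append((ts, cmd.group(1) if cmd else None, src.group(1)))
    return records
```

Filtering on the level first keeps the volume manageable: the far larger DEBUG output is discarded, while the timestamp and file name needed for the operation records are retained.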