Data storage method and system
A data storage and data technology, applied in the field of data processing, can solve the problems of increasing investment in storage hardware equipment, increasing data replication time overhead, wasting physical storage space, etc., so as to improve storage space utilization, enhance security, and enhance The effect of the scope of application
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0034] Embodiment 1, a data storage method, such as figure 1 shown, including:
[0035] dividing each stored file into data segments of a predetermined size;
[0036] generating identification information uniquely corresponding to the data segment for each divided data segment, the identification information being used to carry attribute information of the corresponding data segment;
[0037] Compare the contents of each data segment to find duplicate data;
[0038] Two or more copies of data with the same content are regarded as a group; for each group of duplicate data, one of the data is retained, and the physical storage location of the data is saved as a redundant data watermark for other data in the group ; If there is duplicate data in a data segment, replace the duplicate data in the data segment with its redundant data watermark.
[0039] In this embodiment, the step of dividing each stored file into data segments of a predetermined size may be performed once when ...
Embodiment 2
[0052] Embodiment 2, a data storage system, such as figure 2 shown, including:
[0053] A segmentation module, configured to divide each stored file into data segments of a predetermined size;
[0054] An index module, configured to generate identification information uniquely corresponding to the data segment for each divided data segment, and the identification information is used to carry attribute information of the corresponding data segment;
[0055] A comparison module is used to compare the contents of each data segment and find out duplicate data;
[0056]The processing module is used to treat two or more copies of data with the same content as a group; for each group of repeated data, one of the data is retained, and the physical storage location of the data is saved as the other data in the group redundant data watermark; if there is duplicate data in a data segment, the duplicate data in the data segment will be replaced by its redundant data watermark.
[0057...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com