Cold and hot index identification and classification management method in data deduplication system
A classification management and hot indexing technology, which is applied in the fields of electronic digital data processing, special data processing applications, digital data information retrieval, etc., can solve the problems of reducing backup data performance, avoid frequent disk access, reduce misjudgment rate, The effect of improving performance
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0030]This embodiment discloses a method for identifying and classifying hot and cold indexes in a data deduplication system, which divides indexes into cold indexes and hot indexes according to the frequency or probability of index access, and cold indexes can be further divided into fragmented indexes And useless indexes, through the classification management of indexes, the purpose of improving the overall performance of the data deduplication system is achieved.
[0031] Traditional methods do not identify and separate hot and cold indexes, and the data deduplication system needs the following steps to manage indexes:
[0032] 1) The cold index and the hot index are mixedly stored in the memory, and all indexes (cold index and hot index) are mapped to the Bloom filter;
[0033] 2) With the increase of the backup version and the amount of backup data, the number of indexes is also increasing. When the memory is not enough to store all the indexes, a part of the cold indexes...
Embodiment 2
[0052] Such as figure 1 , figure 2 and image 3 As shown, the hot and cold index identification and classification management method in the data deduplication system disclosed in the present invention, in order to prevent the data deduplication system from frequently accessing the index on the disk (the disk index in the figure) during the index search process, through Container utilization (the frequency or probability that a container is accessed during a certain backup process) classifies and manages the index. Remove the cold index from the global index / memory and put it on the disk, so as to free up more memory space for prefetching the hot index, so that the index lookup operation can be hit in the memory as much as possible, avoiding the data deduplication system from accessing the disk The index on the index can improve the performance of the backup data; only the hot index is mapped to the Bloom filter to reduce the false positive rate of the Bloom filter, so as to...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com