科技报告详细信息
Improving Restore Speed for Backup Systems that Use Inline Chunk-Based Deduplication
Lillibridge, Mark ; Eshghi, Kave ; Bhagwat, Deepavali
HP Development Company
关键词: deduplication;    fragmentation;    restore;    caching;    off-line caching;   
RP-ID  :  HPL-2013-41
学科分类:计算机科学(综合)
美国|英语
来源: HP Labs
PDF
【 摘 要 】
Slow restoration due to chunk fragmentation is a serious problem facing inline chunk-based data deduplication systems: restore speeds for the most recent backup can drop orders of magnitude over the lifetime of a system. We study three techniques--increasing cache size, container capping, and using a forward assembly area--for alleviating this problem. Container capping is an ingest-time operation that reduces chunk fragmentation at the cost of forfeiting some deduplication, while using a forward assembly area is a new restore-time caching and prefetching technique that exploits the perfect knowledge of future chunk accesses available when restoring a backup to reduce the amount of RAM required for a given level of caching at restore time. We show that using a larger cache per stream--we see continuing benefits even up to 8 GB--can produce up to a 5-16X improvement, that giving up as little as 8% deduplication with capping can yield a 2-6X improvement, and that using a forward assembly area is strictly superior to LRU, able to yield a 2-4X improvement while holding the RAM budget constant.
【 预 览 】
附件列表
Files Size Format View
RO201804100000508LZ 2252KB PDF download
  文献评价指标  
  下载次数:17次 浏览次数:44次