Technical Report Details
Reliability Analysis of Deduplicated and Erasure-Coded Storage
Li, Xiaozhou ; Lillibridge, Mark ; Uysal, Mustafa
HP Development Company
Keywords: Deduplication; erasure coding
RP-ID  :  HPL-2010-146
Subject classification: Computer Science (General)
United States | English
Source: HP Labs
【 Abstract 】

Space efficiency and data reliability are two primary concerns for modern storage systems. Chunk-based deduplication, which breaks up data objects into single-instance chunks that can be shared across objects, is an effective method for saving storage space. However, deduplication affects data reliability because an object's constituent chunks are often spread across a large number of disks, potentially decreasing the object's reliability. Therefore, an important problem in deduplicated storage is how to achieve space efficiency yet maintain each object's original reliability. In this paper, we present initial results on the reliability analysis of HP-KVS, a deduplicated key-value store that allows each object to specify its own reliability level and that uses software erasure coding for data reliability. The combination of deduplication and erasure coding gives rise to several interesting research problems. We show how to compare the reliability of erasure codes with different parameters and how to analyze the reliability of a big data object given its constituent parts' reliabilities. We also present a method for system designers to determine under what conditions deduplication will save space for erasure-coded data.
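
The reliability questions raised in the abstract can be illustrated with a small sketch. This is not the model from the report: it assumes a textbook setting in which each of the n fragments of a k-of-n erasure-coded chunk is lost independently with probability p, and in which an object survives only if all of its chunks survive. The function names `chunk_survival`, `object_survival`, and `storage_overhead` are hypothetical and introduced only for illustration.

```python
from math import comb, prod


def chunk_survival(n: int, k: int, p: float) -> float:
    """Probability that a k-of-n erasure-coded chunk is recoverable.

    Assumption (not from the report): each of the n fragments is lost
    independently with probability p, and any k surviving fragments
    are enough to rebuild the chunk.
    """
    return sum(
        comb(n, i) * (1 - p) ** i * p ** (n - i)
        for i in range(k, n + 1)
    )


def object_survival(chunk_probs: list[float]) -> float:
    """Probability that every chunk of an object survives.

    Assumption (not from the report): chunk failures are independent,
    so the object's reliability is the product of its chunks' reliabilities.
    """
    return prod(chunk_probs)


def storage_overhead(n: int, k: int) -> float:
    """Raw bytes stored per byte of user data for a k-of-n code."""
    return n / k


if __name__ == "__main__":
    p = 0.1  # illustrative per-fragment loss probability
    for n, k in [(3, 1), (3, 2), (6, 4)]:
        print(n, k, storage_overhead(n, k), chunk_survival(n, k, p))
    # An object made of 100 chunks, each stored with a 2-of-3 code.
    print(object_survival([chunk_survival(3, 2, p)] * 100))
```

Under these assumptions the product in `object_survival` shrinks as an object is split into more chunks, which is one way to see why spreading an object's chunks widely can lower its reliability unless each chunk is stored with a stronger code.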
