科技报告详细信息
Instant Snapshots in a Federated Array of Bricks
Ji, Minwen
HP Development Company
关键词: snapshots;    checkpoints;    storage;    distributed systems;    clusters;   
RP-ID  :  HPL-2005-15
学科分类:计算机科学(综合)
美国|英语
来源: HP Labs
PDF
【 摘 要 】

Snapshot has become a fundamental requirement on mid to high end storage systems. Its applications include archiving, recovery, report generation, decision making tools and remote mirroring. State-of-the-art snapshot techniques on existing storage systems typically work on a single (fault-tolerant) controller, and need to pause the applications or change the operation mode of the file systems or databases when a snapshot is taken. In a federated array of bricks (FAB), a snapshot may involve tens to thousands of independent controllers or processors, and may be taken at a high frequency, e.g., once every 30 seconds for atomic updates in remote mirroring. Therefore, an efficient distributed snapshot algorithm that can make the snapshot operations transparent to applications is needed in FAB. In this paper, we propose such an algorithm, which avoids pausing or aborting write requests by the novel use of a tentative data structure during the two phase commit of a snapshot creation. The snapshot operations are serializable with data operations (i.e., reads and writes), hence ensure consistency of the snapshots. Read-only operations on snapshots are optimized in common cases, only requiring communications to a small subset of the bricks, in particular, a single replica set or three bricks in FAB. The algorithm has been prototyped in FAB and has been tested with trace-based experiments. 21 Pages

【 预 览 】
附件列表
Files Size Format View
RO201804100000959LZ 233KB PDF download
  文献评价指标  
  下载次数:15次 浏览次数:35次