Technical Report Details
Using the Sirocco File System for high-bandwidth checkpoints.
Klundt, Ruth Ann ; Curry, Matthew L. ; Ward, H. Lee
Keywords: DECISION MAKING; FLEXIBILITY; STORAGE; TARGETS; VELOCITY
DOI: 10.2172/1039010
RP-ID: SAND2012-1087
PID: OSTI ID: 1039010
Others: TRN: US201209%%297
Subject classification: Social Sciences, Humanities and Arts (General)
United States | English
Source: SciTech Connect
【 Abstract 】
The Sirocco File System, a file system for exascale under active development, is designed to allow the storage software to maximize quality of service through increased flexibility and local decision-making. By allowing the storage system to manage a range of storage targets that have varying speeds and capacities, the system can increase the speed and surety of storage to the application. We instrument CTH to use a group of RAM-based Sirocco storage servers allocated within the job as a high-performance storage tier to accept checkpoints, allowing computation to potentially continue asynchronously with respect to checkpoint migration to slower, more permanent storage. The result is a 10-60x speedup in constructing and moving checkpoint data from the compute nodes. This demonstration of early Sirocco functionality shows a significant benefit for a real I/O workload, checkpointing, in a real application, CTH. By running Sirocco storage servers within a job as RAM-only stores, CTH was able to store checkpoints 10-60x faster than storing to PanFS, allowing the job to continue computing sooner. While this prototype did not include automatic data migration, the checkpoint was available to be pushed or pulled to disk-based storage as needed after the compute nodes continued computing. Future developments include the ability to dynamically spawn Sirocco nodes to absorb checkpoints, expanding this mechanism to other fast tiers of storage such as flash memory, and sharing of dynamic Sirocco nodes between multiple jobs as needed.
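The two-tier idea in the abstract (a fast RAM tier absorbs the checkpoint, and migration to slower storage happens later while computation continues) can be illustrated with a minimal Python sketch. This is not Sirocco's API and not how CTH was instrumented: the class `TwoTierCheckpointStore`, its methods, and the file naming are all hypothetical, and the RAM tier is simulated with an in-process dictionary rather than separate storage servers.

```python
import os
import tempfile
import threading


class TwoTierCheckpointStore:
    """Toy stand-in for a fast RAM tier in front of slower persistent storage.

    Hypothetical illustration only; not the Sirocco interface.
    """

    def __init__(self, slow_dir):
        self.slow_dir = slow_dir      # assumed directory for the slow tier
        self._ram = {}                # fast tier: step -> bytes held in memory
        self._lock = threading.Lock()

    def save(self, step, payload):
        """Copy a checkpoint into the RAM tier and return immediately."""
        with self._lock:
            self._ram[step] = bytes(payload)

    def migrate(self, step):
        """Push one checkpoint from the RAM tier to the slow tier."""
        with self._lock:
            payload = self._ram[step]
        path = os.path.join(self.slow_dir, f"checkpoint_{step}.bin")
        with open(path, "wb") as f:
            f.write(payload)
        with self._lock:
            self._ram.pop(step, None)  # free the RAM copy once persisted
        return path

    def migrate_async(self, step):
        """Start migration in a background thread so compute can continue."""
        worker = threading.Thread(target=self.migrate, args=(step,))
        worker.start()
        return worker


def run_demo():
    """Save a checkpoint, keep 'computing', and verify the persisted copy."""
    with tempfile.TemporaryDirectory() as slow_dir:
        store = TwoTierCheckpointStore(slow_dir)
        state = bytearray(1024)       # application state, all zero bytes
        store.save(0, state)          # fast path: copy into the RAM tier
        worker = store.migrate_async(0)
        state[0] = 1                  # application mutates state meanwhile
        worker.join()
        with open(os.path.join(slow_dir, "checkpoint_0.bin"), "rb") as f:
            persisted = f.read()
    # The persisted checkpoint reflects the state at save time.
    return persisted == bytes(1024)
```

Migration is started explicitly here, which mirrors the abstract's note that the prototype had no automatic data migration and that checkpoints were pushed or pulled to disk as needed.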