会议论文详细信息
High Performance Computing Symposium 2013
Modeling Remote I/O versus Staging Tradeoff in Multi-Data Center Computing
计算机科学;物理学
Suslu, Ibrahim H.^1
Department of Computer Science, North American College, Houston
TX
77038, United States^1
关键词: Application data;    Cloud applications;    Data intensive;    End-to-end performance;    Generic modeling;    Large datasets;    Network bandwidth;    Synthetic benchmark;   
Others  :  https://iopscience.iop.org/article/10.1088/1742-6596/540/1/012003/pdf
DOI  :  10.1088/1742-6596/540/1/012003
学科分类:计算机科学(综合)
来源: IOP
PDF
【 摘 要 】

In multi-data center computing, data to be processed is not always local to the computation. This is a major challenge especially for data-intensive Cloud computing applications, since large amount of data would need to be either moved the local sites (staging) or accessed remotely over the network (remote I/O). Cloud application developers generally chose between staging and remote I/O intuitively without making any scientific comparison specific to their application data access patterns since there is no generic model available that they can use. In this paper, we propose a generic model for the Cloud application developers which would help them to choose the most appropriate data access mechanism for their specific application workloads. We define the parameters that potentially affect the end-to-end performance of the multi-data center Cloud applications which need to access large datasets over the network. To test and validate our models, we implemented a series of synthetic benchmark applications to simulate the most common data access patterns encountered in Cloud applications. We show that our model provides promising results in different settings with different parameters, such as network bandwidth, server and client capabilities, and data access ratio.

【 预 览 】
附件列表
Files Size Format View
Modeling Remote I/O versus Staging Tradeoff in Multi-Data Center Computing 1337KB PDF download
  文献评价指标  
  下载次数:10次 浏览次数:22次