| Orchestrating Bulk Data Movement in Grid Environments | |
| Vazhkudai, SS | |
| Oak Ridge National Laboratory | |
| 关键词: Availability; Architecture; Forecasting; 99 General And Miscellaneous//Mathematics, Computing, And Information Science; | |
| DOI : 10.2172/885937 RP-ID : ORNL/TM-2004/121 RP-ID : DE-AC05-00OR22725 RP-ID : 885937 |
|
| 美国|英语 | |
| 来源: UNT Digital Library | |
PDF
|
|
【 摘 要 】
Data Grids provide a convenient environment for researchers to manage and access massively distributed bulk data by addressing several system and transfer challenges inherent to these environments. This work addresses issues involved in the efficient selection and access of replicated data in Grid environments in the context of the Globus Toolkit{trademark}, building middleware that (1) selects datasets in highly replicated environments, enabling efficient scheduling of data transfer requests; (2) predicts transfer times of bulk wide-area data transfers using extensive statistical analysis; and (3) co-allocates bulk data transfer requests, enabling parallel downloads from mirrored sites. These efforts have demonstrated a decentralized data scheduling architecture, a set of forecasting tools that predict bandwidth availability within 15% error and co-allocation architecture, and heuristics that expedites data downloads by up to 2 times.
【 预 览 】
| Files | Size | Format | View |
|---|---|---|---|
| 885937.pdf | 675KB |
PDF