| Cooperative fault-tolerant distributed computing U.S. Department of Energy Grant DE-FG02-02ER25537 Final Report | |
| Sunderam, Vaidy S. | |
| 关键词: ARCHITECTURE; KERNELS; PERFORMANCE; PRODUCTIVITY; RESOURCE MANAGEMENT Distributed computing; fault tolerance; high performance computing; | |
| DOI : 10.2172/916972 RP-ID : DOE/ER/25537-1 PID : OSTI ID: 916972 Others : TRN: US201006%%621 |
|
| 学科分类:社会科学、人文和艺术(综合) | |
| 美国|英语 | |
| 来源: SciTech Connect | |
PDF
|
|
【 摘 要 】
The Harness project has developed novel software frameworks for the execution of high-end simulations in a fault-tolerant manner on distributed resources. The H2O subsystem comprises the kernel of the Harness framework, and controls the key functions of resource management across multiple administrative domains, especially issues of access and allocation. It is based on a âpluggableâ architecture that enables the aggregated use of distributed heterogeneous resources for high performance computing. The major contributions of the Harness II project result in significantly enhancing the overall computational productivity of high-end scientific applications by enabling robust, failure-resilient computations on cooperatively pooled resource collections.
【 预 览 】
| Files | Size | Format | View |
|---|---|---|---|
| RO201705190002980LZ | 82KB |
PDF