科技报告详细信息
Algorithmic support for commodity-based parallel computing systems.
Leung, Vitus Joseph ; Bender, Michael A. (State University of New York, Stony Brook, NY) ; Bunde, David P. (University of Illinois, Urbna, IL) ; Phillips, Cynthia Ann
Sandia National Laboratories
关键词: 99 General And Miscellaneous//Mathematics, Computing, And Information Science;    Supercomputers;    Allocations;    Parallel Processing;    Memory Management;   
DOI  :  10.2172/918344
RP-ID  :  SAND2003-3702
RP-ID  :  AC04-94AL85000
RP-ID  :  918344
美国|英语
来源: UNT Digital Library
PDF
【 摘 要 】
The Computational Plant or Cplant is a commodity-based distributed-memory supercomputer under development at Sandia National Laboratories. Distributed-memory supercomputers run many parallel programs simultaneously. Users submit their programs to a job queue. When a job is scheduled to run, it is assigned to a set of available processors. Job runtime depends not only on the number of processors but also on the particular set of processors assigned to it. Jobs should be allocated to localized clusters of processors to minimize communication costs and to avoid bandwidth contention caused by overlapping jobs. This report introduces new allocation strategies and performance metrics based on space-filling curves and one dimensional allocation strategies. These algorithms are general and simple. Preliminary simulations and Cplant experiments indicate that both space-filling curves and one-dimensional packing improve processor locality compared to the sorted free list strategy previously used on Cplant. These new allocation strategies are implemented in Release 2.0 of the Cplant System Software that was phased into the Cplant systems at Sandia by May 2002. Experimental results then demonstrated that the average number of communication hops between the processors allocated to a job strongly correlates with the job's completion time. This report also gives processor-allocation algorithms for minimizing the average number of communication hops between the assigned processors for grid architectures. The associated clustering problem is as follows: Given n points in {Re}d, find k points that minimize their average pairwise L{sub 1} distance. Exact and approximate algorithms are given for these optimization problems. One of these algorithms has been implemented on Cplant and will be included in Cplant System Software, Version 2.1, to be released. In more preliminary work, we suggest improvements to the scheduler separate from the allocator.
【 预 览 】
附件列表
Files Size Format View
918344.pdf 823KB PDF download
  文献评价指标  
  下载次数:2次 浏览次数:27次