科技报告详细信息
Performance Technolgies for Peta-Scale Systems: A White Paper Prepared by the Performance Evaluation Research Center
Bailey, D H ; de Supinski, B R ; Dongarra, J ; Dunigan, T ; Gao, G ; Hoisie, A ; Hovland, J ; Hollangsworth, J ; Jeffferson, D R ; Kamath, C ; Malony, A ; Norris, B ; Quinlan, D ; McKee, S A ; Mendes, C ; Moore, S ; Reed, D ; Snavely, A ; Strohmaier, E ; Vetter, J S ; Worley, P
Lawrence Livermore National Laboratory
关键词: Mining;    Monitoring;    Simulation;    99;    Optimization;   
DOI  :  10.2172/15004540
RP-ID  :  UCRL-ID-153359
RP-ID  :  W-7405-ENG-48
RP-ID  :  15004540
美国|英语
来源: UNT Digital Library
PDF
【 摘 要 】
Future-looking high end computing initiatives will deploy powerful, large-scale computing platforms that leverage novel component technologies for superior node performance in advanced system architectures with tens or even hundreds of thousands of nodes. Recent advances in performance tools and modeling methodologies suggest that it is feasible to acquire such systems intelligently and achieve excellent performance, while also significantly reducing the user time required to attain high performance. These developments are relevant to several aspects of future HEC technology outlined in the recent HECRTF white paper request, in particular items 5.4, 5.5, 5.6, and 5.8. We envision the following specific capabilities: (1) Performance modeling tools, available to researchers and vendors, will extrapolate performance from prototype systems to full-scale systems, and even accurately predict performance behavior before systems are manufactured, thus enabling both improved designs and more intelligent selection of systems in procurements. (2) System simulation facilities, implemented on highly parallel platforms and available to researchers and vendors, will for instance realistically model the performance of a specific interprocessor network design running a specific scientific application code. As with item 1, these facilities can lead both to improved designs and procurement decisions that yield significantly greater sustained performance for targeted scientific applications. (3) A program monitoring and analysis infrastructure, scalable to 100,000 processors and beyond, will provide performance information at every level of system's memory hierarchy and network. This infrastructure will build upon knowledge discovery and data mining techniques to be significantly more scalable and easier to use than the current infrastructure, and a standard version will be incorporated in most high-end systems. (4) Self-tuning software facilities, now available only for a few specialized libraries and requiring separate test runs, will be integrated into a broad range of scientific application codes. Eventually these facilities will make use of the performance monitoring infrastructure mentioned above, and will extend to dynamic optimization at the subroutine level. Each of these capabilities appears to be feasible, based on successes in current research projects. However, while the prototypes of many of these facilities are already in hand, significant additional research and development will be required to realize the full potential described above. In the following, we sketch some of this required research.
【 预 览 】
附件列表
Files Size Format View
15004540.pdf 520KB PDF download
  文献评价指标  
  下载次数:5次 浏览次数:17次