科技报告详细信息
Identifying Performance Bottlenecks on Modern Microarchitectures using an Adaptable Probe.
Griem, G.
Technical Information Center Oak Ridge Tennessee
关键词: Microprocessors;    Benchmarks;    Probes;    Performance;    Optimization;   
RP-ID  :  DE2004828731
学科分类:工程和技术(综合)
美国|英语
来源: National Technical Reports Library
PDF
【 摘 要 】

The gap between peak and delivered performance for scientific applications running on microprocessor-based systems has grown considerably in recent years. The inability to achieve the desired performance even on a single processor is often attributed to an inadequate memory system, but without identification or quantification of a specific bottleneck. In this work, we use an adaptable synthetic benchmark to isolate application characteristics that cause a significant drop in performance, giving application programmers and architects information about possible optimizations. Our adaptable probe, called sqmat, uses only four parameters to capture key characteristics of scientific workloads: working-set size, computational intensity, indirection, and irregularity. This paper describes the implementation of sqmat and uses its tunable parameters to evaluate four leading 64-bit microprocessors that are popular building blocks for current high performance systems: Intel Itanium2, AMD Opteron, IBM Power3, and IBM Power4.

【 预 览 】
附件列表
Files Size Format View
DE2004828731.pdf 279KB PDF download
  文献评价指标  
  下载次数:16次 浏览次数:9次