Technical Report Details
TUNE: Compiler-Directed Automatic Performance Tuning
Hall, Mary [1]
[1] University of Utah
Keywords: autotuning; compiler; SIMD
DOI: 10.2172/1156961
RP-ID: DOE-UTAH-03777
PID: OSTI ID: 1156961
Subject classification: Mathematics (general)
United States | English
Source: SciTech Connect
【 Abstract 】

This project developed compiler-directed performance tuning technology targeting the Cray XT4 Jaguar system at Oak Ridge, which has multi-core Opteron nodes with SSE-3 SIMD extensions, and the Cray XE6 Hopper system at NERSC. To this end, we combined two compiler technologies developed by the PIs over the past several years: model-guided empirical optimization for memory hierarchies and SIMD code generation. We examined DOE Office of Science applications to identify performance bottlenecks and applied our system to computational kernels that operate on dense arrays. Our goal for this performance-tuning technology has been to yield hand-tuned levels of performance on DOE Office of Science computational kernels, while allowing application programmers to specify their computations at a high level without requiring manual optimization. Overall, we aim to make our technology for SIMD code generation and memory hierarchy optimization a crucial component of high-productivity Petaflops computing through close collaboration with scientists at the national laboratories.
