科技报告详细信息
Automatic Thread-Level Parallelization in the Chombo AMR Library
Christen, Matthias ; Keen, Noel ; Ligocki, Terry ; Oliker, Leonid ; Shalf, John ; Van Straalen, Brian ; Williams, Samuel
关键词: auto-tuning;    Chombo;    ChomboFortran;    HPC;    OpenMP;    hybrid;    AMR;   
DOI  :  10.2172/1051285
RP-ID  :  LBNL-5109E
PID  :  OSTI ID: 1051285
学科分类:数学(综合)
美国|英语
来源: SciTech Connect
PDF
【 摘 要 】

The increasing on-chip parallelism has some substantial implications for HPC applications. Currently, hybrid programming models (typically MPI+OpenMP) are employed for mapping software to the hardware in order to leverage the hardware?s architectural features. In this paper, we present an approach that automatically introduces thread level parallelism into Chombo, a parallel adaptive mesh refinement framework for finite difference type PDE solvers. In Chombo, core algorithms are specified in the ChomboFortran, a macro language extension to F77 that is part of the Chombo framework. This domain-specific language forms an already used target language for an automatic migration of the large number of existing algorithms into a hybrid MPI+OpenMP implementation. It also provides access to the auto-tuning methodology that enables tuning certain aspects of an algorithm to hardware characteristics. Performance measurements are presented for a few of the most relevant kernels with respect to a specific application benchmark using this technique as well as benchmark results for the entire application. The kernel benchmarks show that, using auto-tuning, up to a factor of 11 in performance was gained with 4 threads with respect to the serial reference implementation.

【 预 览 】
附件列表
Files Size Format View
RO201704210002222LZ 848KB PDF download
  文献评价指标  
  下载次数:20次 浏览次数:54次