Journal Article Details
JOURNAL OF COMPUTATIONAL PHYSICS, Volume 407
Provably optimal parallel transport sweeps on semi-structured grids
Article
Adams, Michael P. [1]; Adams, Marvin L. [1]; Hawkins, W. Daryl [1]; Smith, Timmie [2]; Rauchwerger, Lawrence [2,3]; Amato, Nancy M. [2,3]; Bailey, Teresa S. [4]; Falgout, Robert D. [4]; Kunen, Adam [4]; Brown, Peter [4]
[1] Texas A&M Univ, Dept Nucl Engn, 3133 TAMU, College Stn, TX 77843 USA
[2] Texas A&M Univ, Dept Comp Sci & Engn, 3112 TAMU, College Stn, TX 77843 USA
[3] Univ Illinois, Dept Comp Sci, Chicago, IL 60680 USA
[4] Lawrence Livermore Natl Lab, Livermore, CA 94550 USA
Keywords: Parallel transport sweeps; Parallel algorithms; STAPL; Performance models; Unstructured mesh; Scheduling algorithms
DOI: 10.1016/j.jcp.2020.109234
Source: Elsevier
【 Abstract 】

We have found provably optimal algorithms for full-domain discrete-ordinate transport sweeps on a class of grids in 2D and 3D Cartesian geometry that are regular at a coarse level but arbitrary within the coarse blocks. We describe these algorithms and show that they always execute the full eight-octant (or four-quadrant if 2D) sweep in the minimum possible number of stages for a given $P_x \times P_y \times P_z$ partitioning. Computational results confirm that our optimal scheduling algorithms execute sweeps in the minimum possible stage count. Observed parallel efficiencies agree well with our performance model. Our PDT transport code has achieved approximately 68% parallel efficiency with > 1.5M parallel threads, relative to 8 threads, on a simple weak-scaling problem with only three energy groups, 10 directions per octant, and 4096 cells/thread. Our ARDRA code has achieved 71% efficiency with > 1.5M cores, relative to 16 cores, with 36 directions per octant and 48 energy groups. We demonstrate similar efficiencies with PDT on a realistic set of nuclear-reactor test problems, with unstructured meshes that resolve fine geometric details. These results demonstrate that discrete-ordinates transport sweeps can be executed with high efficiency using more than $10^6$ parallel processes. © 2020 Published by Elsevier Inc.
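
To make the notion of a sweep "stage" concrete, the following toy Python sketch counts the stages needed for a single sweep direction on a 2D $P_x \times P_y$ processor grid. It is not the paper's optimal scheduling algorithm, which handles all octants (or quadrants) simultaneously. It assumes, purely for illustration, that each processor completes its whole task in one stage and can start only after its upstream neighbors in x and y have finished. The function name `single_direction_stage_count` is hypothetical.

```python
def single_direction_stage_count(px: int, py: int) -> int:
    """Count stages for one sweep direction entering at corner (0, 0).

    Illustrative assumption: one task per processor, one stage per task,
    and each processor depends on its upstream x and y neighbors.
    """
    stage = {}
    for i in range(px):
        for j in range(py):
            # Stages at which the upstream neighbors (if any) finished.
            upstream = [
                stage[(a, b)]
                for a, b in ((i - 1, j), (i, j - 1))
                if a >= 0 and b >= 0
            ]
            # A processor runs one stage after its slowest upstream neighbor.
            stage[(i, j)] = 1 + max(upstream, default=0)
    # The sweep ends when the far-corner processor finishes.
    return max(stage.values())


if __name__ == "__main__":
    print(single_direction_stage_count(4, 3))
```

Under these simplifying assumptions the wavefront reaches the far corner after $P_x + P_y - 1$ stages, which is the length of the longest dependency chain. Processors far from the entry corner sit idle at first, which is why the order in which tasks from different directions are scheduled affects the total stage count.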

【 License 】

Free   
