Technical Report Details
Performance of an MPI-Only Semiconductor Device Simulator on a Quad Socket/Quad Core InfiniBand Platform.
Lin, P. T. ; Shadid, J. N.
Technical Information Center Oak Ridge Tennessee
Keywords: Semiconductor devices; Finite element method; Scaling; Performance; Simulations
RP-ID  :  DE2009948289
Subject classification: Engineering and technology (general)
United States | English
Source: National Technical Reports Library
[Abstract]

This preliminary study considers the scaling and performance of a finite element (FE) semiconductor device simulator on a capacity cluster with 272 compute nodes, built on a homogeneous multicore node architecture with 16 cores per node. The inter-node communication backbone for this Tri-Lab Linux Capacity Cluster (TLCC) machine is an InfiniBand interconnect. The nonuniform memory access (NUMA) nodes consist of 2.2 GHz quad-socket/quad-core AMD Opteron processors. The performance results for this study are obtained with an FE semiconductor device simulation code (Charon) that is based on a fully coupled Newton-Krylov solver with domain decomposition and multilevel preconditioners. Scaling and multicore performance results are presented for large-scale problems of 100+ million unknowns on up to 4096 cores. A parallel scaling comparison is also presented with the Cray XT3/4 Red Storm capability platform. The results indicate that an MPI-only programming model for utilizing the multicore nodes is reasonably efficient on all 16 cores per compute node. However, the results also indicate that the multilevel preconditioner, which is critical for large-scale capability-type simulations, scales better on the Red Storm machine than on the TLCC machine.
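
The node counts in the abstract imply some simple arithmetic, sketched below in Python. The node layout (272 nodes, 4 sockets, 4 cores per socket, 4096 cores in the largest run) comes from the abstract. The function `scaled_efficiency` and its definition as a ratio of run times are assumptions made for illustration only, and are not necessarily the metric used in the report.

```python
# Node layout as stated in the abstract.
NODES = 272
SOCKETS_PER_NODE = 4
CORES_PER_SOCKET = 4
LARGEST_RUN_CORES = 4096

cores_per_node = SOCKETS_PER_NODE * CORES_PER_SOCKET        # 16 cores per node
total_cores = NODES * cores_per_node                        # cores in the whole cluster
nodes_for_largest_run = LARGEST_RUN_CORES // cores_per_node # fully populated nodes needed


def scaled_efficiency(t_base: float, t_scaled: float) -> float:
    """Hypothetical helper: ratio of a baseline run time to the run time
    at larger core count, with the work per core held fixed.
    A value of 1.0 would mean perfect scaling under this definition."""
    if t_base <= 0 or t_scaled <= 0:
        raise ValueError("run times must be positive")
    return t_base / t_scaled


if __name__ == "__main__":
    print(cores_per_node, total_cores, nodes_for_largest_run)
```

Under these figures, the largest reported run would occupy 256 of the 272 nodes if every core on each node is used, which is consistent with the MPI-only, all-16-cores configuration the abstract describes.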

[Preview]
Attachments
Files Size Format View
DE2009948289.pdf 403KB PDF download