Conference paper details
20th International Conference on Computing in High Energy and Nuclear Physics
High energy electromagnetic particle transportation on the GPU
Physics; Computer Science
Canal, P.^1 ; Elvira, D.^1 ; Jun, S.Y.^1 ; Kowalkowski, J.^1 ; Paterno, M.^1 ; Apostolakis, J.^2
Fermilab, MS234, PO Box 500, Batavia, IL 60510, United States^1
CERN, PH Department, Geneva, CH-1211, Switzerland^2
Keywords: Concurrent programming; Electromagnetic particle; Electromagnetic physics; Floating-point accuracies; Graphics Processing Unit (GPU); High performance parallel computing; Nonuniform magnetic fields; Performance evaluations
Others  :  https://iopscience.iop.org/article/10.1088/1742-6596/513/5/052013/pdf
DOI  :  10.1088/1742-6596/513/5/052013
Subject classification: Computer Science (General)
Source: IOP
【 Abstract 】
We present massively parallel transportation of high energy electromagnetic particles through a finely segmented detector on a Graphics Processing Unit (GPU). Simulating events of energetic particle decay in a general-purpose high energy physics (HEP) detector requires intensive computing resources, owing to the complexity of the geometry and of the physics processes applied to the particles copiously produced by primary collisions and secondary interactions. The recent advent of many-core and accelerated processor architectures provides a variety of concurrent programming models applicable not only to high performance parallel computing, but also to conventional compute-intensive applications such as HEP detector simulation. The components of our prototype are a transportation process under a non-uniform magnetic field, geometry navigation with a set of solid shapes and materials, electromagnetic physics processes for electrons and photons, and an interface to a framework that dispatches bundles of tracks in a highly vectorized manner, optimizing for spatial locality and throughput. Core algorithms and methods are excerpted from the Geant4 toolkit and are modified and optimized for the GPU application. Program kernels written in C/C++ are designed to be compatible with CUDA and OpenCL, and to be generic enough for easy porting to future programming models and hardware architectures. Multiple CUDA streams are used to improve throughput by overlapping data transfers with kernel execution. Issues with floating point accuracy, random number generation, data structures, kernel divergence and register spills are also considered. A performance evaluation of the speedup relative to the corresponding sequential execution on a CPU is presented as well.
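To make the "transportation process under a non-uniform magnetic field" concrete, the following is a minimal sketch of one common approach: a classical fourth-order Runge-Kutta step for a charged track, integrated over path length. It is not the authors' kernel and not Geant4 code. All names (`TrackState`, `fieldAt`, `rk4Step`, `transport`) and the linear radial field profile are assumptions made for illustration. The unit convention (momentum in GeV/c, field in tesla, lengths in metres) is likewise a choice of the sketch.

```cpp
#include <array>
#include <cassert>
#include <cmath>

// Hypothetical types and names for illustration only.
using Vec3 = std::array<double, 3>;

struct TrackState {
    Vec3 pos;  // position [m]
    Vec3 dir;  // unit vector along the momentum
};

struct Deriv {
    Vec3 dpos;  // d(pos)/ds
    Vec3 ddir;  // d(dir)/ds
};

// Assumed non-uniform field [T]: Bz decreases linearly with radius.
inline Vec3 fieldAt(const Vec3& r) {
    const double rho = std::sqrt(r[0] * r[0] + r[1] * r[1]);
    return {0.0, 0.0, 1.0 - 0.1 * rho};
}

inline Vec3 cross(const Vec3& a, const Vec3& b) {
    return {a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0]};
}

// Equations of motion in path length s:
//   d(pos)/ds = dir,  d(dir)/ds = kappa * (dir x B),
// where kappa = 0.299792458 * q[e] / p[GeV/c].
inline Deriv derivative(const TrackState& s, double kappa) {
    const Vec3 uxB = cross(s.dir, fieldAt(s.pos));
    return Deriv{s.dir, Vec3{kappa * uxB[0], kappa * uxB[1], kappa * uxB[2]}};
}

// Euler-style displacement used to build the intermediate RK4 stages.
inline TrackState advance(const TrackState& s, const Deriv& d, double h) {
    TrackState out = s;
    for (int i = 0; i < 3; ++i) {
        out.pos[i] += h * d.dpos[i];
        out.dir[i] += h * d.ddir[i];
    }
    return out;
}

// One classical RK4 step of length h [m].
inline TrackState rk4Step(const TrackState& s, double kappa, double h) {
    const Deriv k1 = derivative(s, kappa);
    const Deriv k2 = derivative(advance(s, k1, 0.5 * h), kappa);
    const Deriv k3 = derivative(advance(s, k2, 0.5 * h), kappa);
    const Deriv k4 = derivative(advance(s, k3, h), kappa);
    TrackState out = s;
    for (int i = 0; i < 3; ++i) {
        out.pos[i] += h / 6.0 * (k1.dpos[i] + 2.0 * k2.dpos[i] + 2.0 * k3.dpos[i] + k4.dpos[i]);
        out.dir[i] += h / 6.0 * (k1.ddir[i] + 2.0 * k2.ddir[i] + 2.0 * k3.ddir[i] + k4.ddir[i]);
    }
    return out;
}

// Transport a track of charge chargeE [e] and momentum pGeV [GeV/c]
// through nSteps fixed steps of length h [m].
inline TrackState transport(TrackState s, double chargeE, double pGeV,
                            double h, int nSteps) {
    assert(pGeV > 0.0 && nSteps >= 0);
    const double kappa = 0.299792458 * chargeE / pGeV;
    for (int i = 0; i < nSteps; ++i) {
        s = rk4Step(s, kappa, h);
    }
    return s;
}
```

Each track is integrated independently of the others, which is what makes this kind of step a natural candidate for one-thread-per-track execution on a GPU. A production stepper would also need adaptive step control and geometry boundary checks, which this sketch omits.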