期刊论文详细信息
JOURNAL OF COMPUTATIONAL PHYSICS 卷:229
BCYCLIC: A parallel block tridiagonal matrix cyclic solver
Article
Hirshman, S. P.1  Perumalla, K. S.1  Lynch, V. E.1  Sanchez, R.1 
[1] Oak Ridge Natl Lab, Oak Ridge, TN 37830 USA
关键词: Cyclic reduction;    Block matrix;    Dense blocks;    Tridiagonal matrix;    Thomas algorithm;    Parallel computing;   
DOI  :  10.1016/j.jcp.2010.04.049
来源: Elsevier
PDF
【 摘 要 】

A block tridiagonal matrix is factored with minimal fill-in using a cyclic reduction algorithm that is easily parallelized. Storage of the factored blocks allows the application of the inverse to multiple right-hand sides which may not be known at factorization time. Scalability with the number of block rows is achieved with cyclic reduction, while scalability with the block size is achieved using multithreaded routines (OpenMP, GotoBLAS) for block matrix manipulation. This dual scalability is a noteworthy feature of this new solver, as well as its ability to efficiently handle arbitrary (non-powers-of-2) block row and processor numbers. Comparison with a state-of-the art parallel sparse solver is presented. It is expected that this new solver will allow many physical applications to optimally use the parallel resources on current supercomputers. Example usage of the solver in magneto-hydrodynamic (MHD), three-dimensional equilibrium solvers for high-temperature fusion plasmas is cited. (C) 2010 Elsevier Inc. All rights reserved.

【 授权许可】

Free   

【 预 览 】
附件列表
Files Size Format View
10_1016_j_jcp_2010_04_049.pdf 870KB PDF download
  文献评价指标  
  下载次数:3次 浏览次数:0次