The Journal of Engineering | |
Highly efficient parallel direct solver for solving dense complex matrix equations from method of moments | |
Yan Chen1  Daniel GarcÃa Doñoro1  Zhongchao Lin2  Yu Zhang3  | |
[1] School of Electronic Engineering, Xidian University, No. 2 South Taibai Road, Xi'an, Shaanxi 710071, People's Republic of China | |
关键词: memory accessing time; data rearrangement; CPU cache utilisation; electromagnetic radiation problems; assembler instructions; vectorised optimised kernel; dense complex matrix equations; cache optimised kernel; parallel lower upper decomposition; communication pattern optimisation; method of moments; communication avoiding pivoting scheme; binary tree; parallel direct solver; | |
DOI : 10.1049/joe.2017.0069 | |
学科分类:工程和技术(综合) | |
来源: IET | |
【 摘 要 】
Based on the vectorised and cache optimised kernel, a parallel lower upper decomposition with a novel communication avoiding pivoting scheme is developed to solve dense complex matrix equations generated by the method of moments. The fine-grain data rearrangement and assembler instructions are adopted to reduce memory accessing times and improve CPU cache utilisation, which also facilitate vectorisation of the code. Through grouping processes in a binary tree, a parallel pivoting scheme is designed to optimise the communication pattern and thus reduces the solving time of the proposed solver. Two large electromagnetic radiation problems are solved on two supercomputers, respectively, and the numerical results demonstrate that the proposed method outperforms those in open source and commercial libraries.
【 授权许可】
CC BY
【 预 览 】
Files | Size | Format | View |
---|---|---|---|
RO201902027915182ZK.pdf | 237KB | download |