Journal Article Details
The Journal of Engineering
Highly efficient parallel direct solver for solving dense complex matrix equations from method of moments
Yan Chen [1], Daniel García Doñoro [1], Zhongchao Lin [2], Yu Zhang [3]
[1] School of Electronic Engineering, Xidian University, No. 2 South Taibai Road, Xi'an, Shaanxi 710071, People's Republic of China
Keywords: memory accessing time; data rearrangement; CPU cache utilisation; electromagnetic radiation problems; assembler instructions; vectorised optimised kernel; dense complex matrix equations; cache optimised kernel; parallel lower upper decomposition; communication pattern optimisation; method of moments; communication avoiding pivoting scheme; binary tree; parallel direct solver
DOI: 10.1049/joe.2017.0069
Subject classification: Engineering and Technology (General)
Source: IET
【 Abstract 】

A parallel lower-upper (LU) decomposition with a novel communication-avoiding pivoting scheme, built on a vectorised and cache-optimised kernel, is developed to solve the dense complex matrix equations generated by the method of moments. Fine-grained data rearrangement and assembler instructions are adopted to reduce memory access time and improve CPU cache utilisation, which also facilitates vectorisation of the code. By grouping processes in a binary tree, a parallel pivoting scheme is designed to optimise the communication pattern and thus reduce the solving time of the proposed solver. Two large electromagnetic radiation problems are solved on two supercomputers, and the numerical results demonstrate that the proposed method outperforms the solvers in open-source and commercial libraries.
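
The abstract does not spell out the pivoting scheme, so the following is only a generic sketch of tournament pivoting, the usual binary-tree approach to communication-avoiding LU, and not the authors' implementation. It assumes the panel rows are split into contiguous groups that stand in for processes; the names `select_pivot_rows` and `tournament_pivot` are hypothetical, and everything runs serially in plain Python.

```python
def select_pivot_rows(rows, b):
    """Pick up to b pivot rows from `rows`, a list of (global_index, values),
    by Gaussian elimination with partial pivoting on a working copy."""
    work = [(idx, list(vals)) for idx, vals in rows]
    chosen = []
    for k in range(min(b, len(work))):
        # Row with the largest magnitude in column k (works for complex entries).
        p = max(range(k, len(work)), key=lambda i: abs(work[i][1][k]))
        if abs(work[p][1][k]) == 0:
            break  # no usable pivot left in this block
        work[k], work[p] = work[p], work[k]
        chosen.append(work[k][0])
        pivot = work[k][1]
        # Eliminate column k from the remaining rows of the working copy.
        for i in range(k + 1, len(work)):
            f = work[i][1][k] / pivot[k]
            work[i][1][:] = [a - f * c for a, c in zip(work[i][1], pivot)]
    return chosen


def tournament_pivot(panel, num_groups):
    """Return global indices of pivot rows for a tall panel (list of rows).

    Assumption: each contiguous row group plays the role of one process;
    candidates from pairs of groups are merged level by level, as in a
    binary reduction tree.
    """
    n, b = len(panel), len(panel[0])
    size = -(-n // num_groups)  # ceiling division
    # Leaves of the tree: each group proposes its own b candidate rows.
    candidates = [
        select_pivot_rows([(i, panel[i]) for i in range(g, min(g + size, n))], b)
        for g in range(0, n, size)
    ]
    # Inner levels: merge neighbouring candidate sets until one set remains.
    while len(candidates) > 1:
        merged = []
        for j in range(0, len(candidates), 2):
            pair = candidates[j] + (candidates[j + 1] if j + 1 < len(candidates) else [])
            # Re-select from the ORIGINAL rows of the paired candidates.
            merged.append(select_pivot_rows([(i, panel[i]) for i in pair], b))
        candidates = merged
    return candidates[0]
```

With P groups, the tree has about log2(P) merge levels for the whole panel, whereas conventional partial pivoting needs one global pivot search per column. In a distributed setting each merge would be one message exchange between two processes.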

【 License 】

CC BY   
