| JOURNAL OF COMPUTATIONAL PHYSICS | 卷:444 |
| A hardware accelerated unstructured overset method to simulate turbulent fluid flow | |
| Article | |
| Horne, Wyatt James1  Mahesh, Krishnan1  | |
| [1] Univ Minnesota, Minneapolis, MN 55455 USA | |
| 关键词: GPU acceleration; Unstructured overset method; Real-time raytracing; Incompressible fluid flow; Turbulent flows; DNS/LES; | |
| DOI : 10.1016/j.jcp.2021.110574 | |
| 来源: Elsevier | |
PDF
|
|
【 摘 要 】
Hardware acceleration consists of offloading computational work to devices such as graphics processing units (GPUs) to produce overall speed-up. Algorithms and numerical methods must be constructed to suit the available hardware in order to effectively produce speed-up. In this work a numerical method is presented which can effectively use hardware acceleration to simulate incompressible turbulent fluid flow. The method is an unstructured overset method where unstructured meshes are attached to individual bodies and connected throughout the flow domain to produce a single domain solution through an overset assembly process. The unstructured overset method shown in Home and Mahesh [1] and Home and Mahesh [2] was found capable of scaling to O(10(5)) computational cores for O(10(5)) moving bodies in turbulent flow fields while producing accurate flow results. This highly scalable method is modified and extended to effectively utilize on-node hardware acceleration. Overset assembly algorithms which use hardware acceleration are presented based on successful accelerated algorithms in real-time ray tracing and computational geometry. Timing results for core overset assembly operations are presented showing a maximum 0(100x) speedup when using hardware acceleration. A novel method for turbulent fluid flow is presented which utilizes over-decomposition of the flow domain to produce task-parallelism allowing asynchronous calculation of the different steps of the method while also providing overlap between data transfer and computation. A mixed precision solver is utilized which provides a balance between optimal performance and numerical accuracy. A cost effective and accurate artificial compressibility pressure regularization is used which has minimal memory complexity and minimizes computational cost while maintaining accuracy. A primal-dual Laplacian operator is introduced which produces accurate results on skewed meshes. Results for canonical flow cases with overset meshes are shown illustrating the method's accuracy and numerical properties. Substantial speed-up is demonstrated for the numerical method reaching upwards of 50 times as fast as the non-accelerated method for high cell loadings. (C) 2021 Elsevier Inc. All rights reserved.
【 授权许可】
Free
【 预 览 】
| Files | Size | Format | View |
|---|---|---|---|
| 10_1016_j_jcp_2021_110574.pdf | 2847KB |
PDF