An efficient parallel multigrid pressure correction algorithm is proposed for the solution of the incompressible Navier-Stokes equations on computing architectures with acceleration devices. The pressure correction procedure is based on the numerical solution of a Poisson-type problem, which is discretized using a fourth-order compact finite difference scheme. Since this is the most time-consuming part of the solver, we propose a parallel pressure correction algorithm using an iterative method based on a block cyclic reduction solution method combined with a multigrid technique. The grid points are numbered according to the red-black ordering scheme for the parallel Gauss-Seidel smoother. These parallelization techniques allow the entire simulation to be executed on the acceleration device, minimizing memory communication costs. The implementation is developed using the OpenACC API, and the numerical method is demonstrated on two classical incompressible flow test problems. The first is the two-dimensional lid-driven cavity problem on a uniform mesh, while the second is the Stokes boundary layer, which is a suitable benchmark problem for non-uniform mesh spacing. The effect of several multigrid components on modern and legacy acceleration architectures is examined. Finally, the performance investigation demonstrates that the proposed parallel multigrid solver achieves a speedup of more than 10 over the sequential solver and more than 4 over multi-core CPU-only implementations for all tested accelerators.
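To illustrate why red-black ordering suits a parallel Gauss-Seidel smoother, here is a minimal serial sketch in Python. It is not the authors' implementation: it assumes the standard second-order five-point stencil for -Δu = f on a uniform square grid with fixed Dirichlet boundaries, rather than the fourth-order compact scheme of the abstract, and it uses plain loops instead of OpenACC. The names `red_black_gauss_seidel` and `max_residual` are hypothetical.

```python
def red_black_gauss_seidel(u, f, h, sweeps=1):
    """In-place red-black Gauss-Seidel sweeps for -laplace(u) = f.

    Assumption: second-order five-point stencil on an n x n grid with
    spacing h; boundary entries of u are held fixed (Dirichlet).
    """
    n = len(u)
    for _ in range(sweeps):
        # color 0 = "red" points ((i + j) even), color 1 = "black" points.
        for color in (0, 1):
            for i in range(1, n - 1):
                # First interior column j with (i + j) % 2 == color.
                j_start = 1 + ((i + 1 + color) % 2)
                for j in range(j_start, n - 1, 2):
                    # All four neighbours have the opposite color, so every
                    # update within one color is independent of the others.
                    u[i][j] = 0.25 * (u[i - 1][j] + u[i + 1][j]
                                      + u[i][j - 1] + u[i][j + 1]
                                      + h * h * f[i][j])
    return u


def max_residual(u, f, h):
    """Largest absolute residual of the five-point scheme over interior points."""
    n = len(u)
    return max(
        abs(f[i][j] - (4.0 * u[i][j] - u[i - 1][j] - u[i + 1][j]
                       - u[i][j - 1] - u[i][j + 1]) / (h * h))
        for i in range(1, n - 1) for j in range(1, n - 1)
    )


if __name__ == "__main__":
    n = 9
    h = 1.0 / (n - 1)
    u = [[0.0] * n for _ in range(n)]
    f = [[1.0] * n for _ in range(n)]
    red_black_gauss_seidel(u, f, h, sweeps=200)
    print(max_residual(u, f, h))
```

Because each point of one color depends only on points of the other color, the two inner loops of a half-sweep carry no data dependence. That property is what would let each half-sweep be mapped onto a parallel loop on an accelerator.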
This introduction to the issue discusses challenges facing mobile systems and presents five articles that address challenges in the area of specialized accelerators, application start-up, energy-efficient browsing, security and isolation, and vulnerability detection.
ISBN: 9781479998852 (print)
The purpose of this research was the parallelization of the Wilms' Oncosimulator, an integrated cancer treatment support system modeling the growth of nephroblastoma tumors and their in vivo response to chemotherapeutic modalities. In this context, the Oncosimulator has been optimized to perform efficient computations on the newest heterogeneous parallel architectures, namely CPU- and GPU-based computing architectures. The simulator has been implemented using a novel solution for distributed computing on heterogeneous architectures: the Cactus computational toolkit, with CaKernel as the module for the computations performed on computing accelerators. In this publication, the challenges faced while porting the Oncosimulator onto the aforementioned architectures (within the CaKernel framework) are addressed, and the performance benefits of such approaches are analyzed. The successful parallelization of the Oncosimulator improves its computational efficiency and enhances its reusability as well as its eventual translation into clinical practice. The research was performed in the context of the p-medicine project.