In this paper we describe some parallel algorithms for solving nonlinear systems using CUDA (Compute Unified Device Architecture) over a GPU (Graphics Processing Unit). The proposed algorithms are based on both the Fl...
详细信息
ISBN:
(纸本)9781905088423
In this paper we describe some parallel algorithms for solving nonlinear systems using CUDA (Compute Unified Device Architecture) over a GPU (Graphics Processing Unit). The proposed algorithms are based on both the Fletcher-Reeves version of the nonlinearconjugategradient method and a polynomial preconditioner type based on block two-stage methods. Several strategies of parallelization and different storage formats for sparse matrices are discussed. The reported numerical experiments analyze the behavior of these algorithms working in a fine grain parallel environment compared with a thread-based environment.
In this work we describe some parallel algorithms for solving nonlinear systems using CUDA (Compute Unified Device Architecture) over a CPU (Graphics Processing Unit). The proposed algorithms are based on both the Fle...
详细信息
In this work we describe some parallel algorithms for solving nonlinear systems using CUDA (Compute Unified Device Architecture) over a CPU (Graphics Processing Unit). The proposed algorithms are based on both the Fletcher-Reeves version of the nonlinearconjugategradient method and a polynomial preconditioner type based on block two-stage methods. Several strategies of parallelization and different storage formats for sparse matrices are discussed. The reported numerical experiments analyze the behavior of these algorithms working in a fine grain parallel environment compared with a thread-based environment. (C) 2011 Elsevier Inc. All rights reserved.
Parallel nonlinear preconditioners, for solving mildly nonlinear systems, are proposed. These algorithms are based on both the Fletcher-Reeves version of the nonlinearconjugategradient method and a polynomial precon...
详细信息
Parallel nonlinear preconditioners, for solving mildly nonlinear systems, are proposed. These algorithms are based on both the Fletcher-Reeves version of the nonlinearconjugategradient method and a polynomial preconditioner type based on block two-stage methods. The behavior of these algorithms is analyzed when incomplete LU factorizations are used in order to obtain the inner splittings of the block two-stage method. As our illustrative example we have considered a nonlinear elliptic partial differential equation, known as the Bratu problem. The reported experiments show the performance of the algorithms designed in this work on two multicore architectures.
In this paper we present a parallel library treated as a high-level interface for solving nonlinear systems. This work follows the guidelines of a high-level interface in terms of usability, but in fact it is a librar...
详细信息
ISBN:
(纸本)9781905088416
In this paper we present a parallel library treated as a high-level interface for solving nonlinear systems. This work follows the guidelines of a high-level interface in terms of usability, but in fact it is a library developed using a mixed model in relation to the utilization of different programming languages. In order to create the high-level interfaces, we have chosen the Python language. On the other hand, the developed Fortran routines offer all the performance of the low-level language. The developed library, PyPANCG, consists of two modules, PySParNLCG and PySParNLPCG. The PySparNLCG module parallelizes the conjugategradient method for solving the nonlinear system Ax = phi(x), and the PySParNLPCG module implements the preconditioning technique based on block two-stage methods. Experimental results report the numerical accuracy and the parallel performance of our approach on different parallel computers.
暂无评论