检索结果-内蒙古大学图书馆

VECTOR AND parallelIZATION OF ODE BVP CODES

parallel COMPUTING 1989年第3期12卷 343-350页

作者： GLADWELL, I HAY, RI UNIV LIVERPOOL DEPT STAT & COMP MATHLIVERPOOL L69 3BXENGLAND

We consider the spline collocation code COLSYS and its successor COLNEW for solving boundary value problems (BVPs) in ordinary differential equations (ODEs) paying particular attention to the cost of solving the resulting almost block diagonal systems (ABDS) on scalar and vector computers. Our costings include analyses for extreme cases of large or high order systems of ODE's. These are designed to provide insight as to the asymptotic behaviour of the codes on vector processors. The paper closes with a discussion of parallelisation of the codes and of the conflicts between vectorisation and parallelisation.

关键词： parallel linear algebra boundary value problems ordinary differential equations vectorisation parallelisation

来源：评论

学校读者我要写书评

暂无评论

Massively parallel Poisson and QR factorization solvers

引用

COMPUTERS & MATHEMATICS WITH APPLICATIONS 1996年第4-5期31卷 19-26页

作者： Lucka, M Vajtersic, M Viktorinova, E SLOVAK ACAD SCI INST INFORMATBRATISLAVA 84000SLOVAKIA

The paper brings a massively parallel Poisson solver for rectangle domain and parallel algorithms for computation of QR factorization of a dense matrix A by means of Householder reflections and Givens rotations. The computer model under consideration is a SIMD mesh-connected toroidal n x n processor array. The Dirichlet problem is replaced by its finite-difference analog on an M x N (M + 1, N are powers of two) grid. The algorithm is composed of parallel fast sine transform and cyclic odd-even reduction blocks and runs in a fully parallel fashion. Its computational complexity is O(MN log L/n(2)), where L = max(M + 1, N). A parallel proposal of QR factorization by the Householder method zeros all subdiagonal elements in each column and updates all elements of the given submatrix in parallel. For the second method with Givens rotations, the parallel scheme of the Sameh and Kuck was chosen where the disjoint rotations can be computed simultaneously. The algorithms were coded in MPF and MPL parallel programming languages and results of computational experiments on the MasPar MP-1 system are also presented.

关键词： parallel linear algebra fast sine transform odd-even reduction QR decomposition massively SIMD-type computer arrays

来源：评论

学校读者我要写书评

暂无评论

A case study in scalability: An ADI method for the two-dimensional time-dependent Dirac equation

引用

parallel COMPUTING 1999年第5期25卷 525-533页

作者： Rathe, UW Sanders, P Knight, PL Max Planck Inst Informat D-66123 Saarbrucken Germany Univ London Imperial Coll Sci Technol & Med Blackett Lab Opt Sect London SW7 2BZ England

The dynamics of relativistic atomic wave functions evolving under the influence of intense laser pulses is used as an example of a general class of applications employing the alternating direction implicit method. The method requires the solution of many tridiagonal systems of linear equations. A range of parallel algorithms for this setting are analyzed with respect to their scalability on large parallel machines. (C) 1999 Elsevier Science B.V. All rights reserved.

关键词： parallel linear algebra tridiagonal systems alternating direction implicit method scalable algorithm Dirac equation

来源：评论

学校读者我要写书评

暂无评论

PIPELINED ITERATIVE METHODS FOR SHARED MEMORY MACHINES

引用

parallel COMPUTING 1989年第2期11卷 187-199页

作者： BONOMO, JP DYKSEN, WR Department of Computer Sciences Purdue University Computer Science Building West Lafayette IN 47907 U.S.A.

In this paper we describe a new parallel iterative technique to solve a set of linear equations. The technique can be applied to any serial iterative scheme and involves pipelining successive iterations. We give an example of this technique by modifying the classical successive overrelaxation method (SOR). The algorithm is implemented on a Sequent Symmetry multiprocessor machine and the experimental results are presented.

关键词： parallel linear algebra pipelined iterative techniques shared memory machines solution of linear equations

来源：评论

学校读者我要写书评

暂无评论

State-space truncation methods for parallel model reduction of large-scale systems

引用

parallel COMPUTING 2003年第11-12期29卷 1701-1722页

作者： Benner, P Quintana-Ortí, ES Quintana-Ortí, G Tech Univ Berlin Inst Math D-10623 Berlin Germany

We discuss a parallel library of efficient algorithms for model reduction of large-scale systems with state-space dimension up to O(10(4)). We survey the numerical algorithms underlying the implementation of the chosen model reduction methods. The approach considered here is based on state-space truncation of the system matrices and includes absolute and relative error methods for both stable and unstable systems. In contrast to serial implementations of these methods, we employ Newton-type iterative algorithms for the solution of the major computational tasks. Experimental results report the numerical accuracy and the parallel performance of our approach on a cluster of Intel Pentium II processors. (C) 2003 Published by Elsevier B.V.

关键词： model reduction state-space truncation linear matrix equations algebraic Riccati equations sign function method parallel linear algebra

来源：评论

学校读者我要写书评

暂无评论

parallel SOLUTION OF FREDHOLM INTEGRAL-EQUATIONS

引用

parallel COMPUTING 1989年第1期12卷 95-106页

作者： BABOLIAN, E DELVES, LM UNIV LIVERPOOL CTR MATH SOFTWARE RESLIVERPOOL L69 3BXENGLAND

We consider here parallel variants of the Nyström and Fast Galerkin methods for the solution of Fredholm integral equations of the second kind. Numerical examples, and timings for an Ada implementation on a multi... 详细信息

关键词： Fredholm integral equations MIMD algorithms multiprocessor Sequent Balance multitasking parallel linear algebra timing results

来源：评论

学校读者我要写书评

暂无评论

parallel SOLUTION OF ALMOST BLOCK DIAGONAL SYSTEMS ON THE CRAY Y-MP USING LEVEL-3 BLAS

引用

JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS 1993年第1-2期45卷 181-189页

作者： GLADWELL, I PAPRZYCKI, M SO METHODIST UNIV DEPT MATHDALLASTX 75275

In a recent publication (1992), the authors showed how efficient a new level 3 BLAS algorithm for almost block diagonal systems could be using just one processor of a CRAY Y-MP. Here they compare the corresponding res... 详细信息

关键词： parallel linear algebra BOUNDARY VALUE PROBLEMS LEVEL-3 BLAS

来源：评论

学校读者我要写书评

暂无评论

AN ARCHITECTURAL APPROACH TO BUILDING GRIDS USING LEGACY CODE

引用

parallel PROCESSING LETTERS 2007年第4期17卷 363-378页

作者： Gamess, Eric Cent Univ Venezuela Escuela Computac Caracas Venezuela

In this paper, we present a new architecture to build grids that can execute parallel programs based on legacy code. This architecture is layer based and software component performances are validated with benchmarks. To illustrate the construction of a grid using the proposed architecture, we develop a case study that consists of a grid oriented to efficient execution of Java bytecode for which we validate and integrate legacy code of parallel linear algebra.

关键词： Architecture Grids High Performance Computing Legacy Code Benchmarks parallel linear algebra

来源：评论

学校读者我要写书评

暂无评论

A balanced accumulation scheme for parallel PDE solvers

引用

COMPUTING AND VISUALIZATION IN SCIENCE 2013年第1期16卷 33-40页

作者： Liebmann, Manfred Neic, Aurel Haase, Gundolf Karl Franzens Univ Graz Inst Math & Sci Comp Heinrichstr 36 A-8010 Graz Austria

We present a tailored load balancing technique that addresses specific performance issues in the boundary data accumulation algorithm for non-overlapping domain decompositions. The technique is used to speed up a parallel conjugate gradient algorithm with an algebraic multigrid pre-conditioner to solve a potential problem on an unstructured tetrahedral finite element mesh. The optimized accumulation algorithm significantly improves the performance of the parallel solver and we show up to 50% runtime improvements over the standard approach in benchmark runs with up to 48 MPI processes. The load balancing problem itself is a global optimization problem that is solved approximately by local optimization algorithms in parallel that require no communication during the optimization process.

关键词： parallel linear algebra Balanced MPI communication parallel optimization algorithms

来源：评论

学校读者我要写书评

暂无评论

APPROXIMATE SCHUR COMPLEMENT PRECONDITIONERS ON SERIAL AND parallel COMPUTERS

引用

SIAM JOURNAL ON SCIENTIFIC AND STATISTICAL COMPUTING 1989年第3期10卷 581-605页

作者： ELMAN, HC UNIV MARYLAND INST ADV COMP STUDIESCOLLEGE PKMD 20742

A class of preconditioning techniques for sparse matrices is considered, based on computing an approximation of the Schur complement of a (suitably ordered) matrix. The techniques generalize the reduced system methodology for 2-cyclic matrices to non-2-cyclic matrices, and in addition, they are well suited to parallel architectures. Their effectiveness with numerical experiments on a nine-point finite-difference operator is demonstrated, and an analysis showing that they can be implemented efficiently on multiprocessors is presented.

关键词： 65F10 65N20 15A06 sparse iterative methods preconditioners parallel linear algebra

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：