检索结果-内蒙古大学图书馆

Dimensional analysis applied to a parallel QR algorithm

7th International Conference on parallel Processing and Applied Mathematics

作者： Numrich, Robert W. Univ Minnesota Minnesota Supercomp Inst Minneapolis MN 55455 USA

ISBN: (纸本)9783540681052

We apply dimensional analysis to a formula for execution time for a QR algorithm from a paper by Henry and van de Geijn. We define a single efficiency surface that reduces performance analysis for this algorithm to an exercise in differential geometry. As the problem size and the number of processors change, different machines move along different paths on the surface determined by two computational forces specific to each machine. We show that computational force, also called computational intensity, is a unifying concept for understanding the performance of parallel numerical algorithms.

关键词： scalability performance analysis computational intensity computational force parallel numerical algorithms dimensional analysis

来源：评论

学校读者我要写书评

暂无评论

Computational challenges in nanoscale device modeling

Computational challenges in nanoscale device modeling

引用

Nanotechnology Conference and Trade Show (Nanotech 2004)

作者： Polizzi, E Sameh, A Sun, H Purdue Univ Dept Comp Sci W Lafayette IN 47907 USA

ISBN: (纸本)0972842284

The development of new simulation tools is critical for the exploration of quantum transport in nanoscale devices. Such simulation is commonly performed by solving self-consistently the transport problem using the Non-Equilibrium Green's Functions (NEGF) formalism and the Poisson's equation to account for the space charge e ects. The quest for ever higher levels of detail and realism in such simulations as the modeling of multidimensional devices with detailed band structure calculations with(or without) the inclusion of scattering e ects, requires huge computational e ort. Hence, the need for an active research e ort in developing novel numerical techniques and parallel algorithms that axe ideally suited for high-end computing platforms. In this article, we will identify the identify the challenging numerical problems which arise from the NEGF/Poisson procedure and we will present new efficient parallel schemes for computing the problem.

关键词： nanoscale devices Green's function NEGF-poisson parallel numerical algorithms linear systems generalized eigenvalue problems

来源：评论

学校读者我要写书评

暂无评论

Execution Behavior Analysis of parallel Schemes for Implicit Solution Methods for ODEs 17

Execution Behavior Analysis of Parallel Schemes for Implicit...

引用

17th Annual International Symposium on parallel and Distributed Computing (ISPDC)

作者： Kalinnik, Natalia Rauber, Thomas Univ Bayreuth Dept Comp Sci Bayreuth Germany

ISBN: (纸本)9781538653302

In this article, we consider diagonal-implicitly iterated Runge-Kutta (DIIRK) methods for the numerical solution of stiff ordinary differential equations (ODEs) and investigate their performance behavior on a modern cluster system using MPI. DIIRK methods are implicit methods and require the solution of non-linear equation systems in each iteration step. In particular, we are interested in the parallel execution behavior when using different basis Newton methods for solving the resulting non-linear equation systems of different versions of the DIIRK method. We explore the use of direct solution methods based on LU factorization for the resulting linear equation systems as well as the use of Krylov subspace methods and investigate the resulting performance and accuracy.

关键词： parallel numerical algorithms implicit ODE methods predictor-corrector methods

来源：评论

学校读者我要写书评

暂无评论

parallel simulations for Fractional-Order Systems 18

Parallel simulations for Fractional-Order Systems

引用

18th International Symposium on Symbolic and Numeric algorithms for Scientific Computing (SYNASC)

作者： Baban, Andrada Bonchis, Cosmin Fikl, Alexandru Rosu, Florin West Univ Timisoara Bd V Parvan 4Cam 045B RO-300223 Timisoara Romania eAustria Res Inst Bd V Parvan 4Cam 045B RO-300223 Timisoara Romania Univ Illinois Dept Aerosp Engn Champaign IL USA

ISBN: (纸本)9781509057078

In this paper, we explore how numerical calculations can be accelerated by implementing several numerical methods of fractional-order systems using parallel computing techniques. We investigate the feasibility of parallel computing algorithms and their efficiency in reducing the computational costs over a large time interval. Particularly, we present the case of Adams-Bashforth-Mouhlton predictor-corrector method and measure the speedup of two parallel approaches by using GPU and HPC cluster implementations.

关键词： Fractional-order systems parallel numerical algorithms GPU processing HPC processing

来源：评论

学校读者我要写书评

暂无评论

parallel integration of hydrodynamical approximations of the Boltzmann equation for rarefied gases on a cluster of computers

引用

Journal of Computational Methods in Sciences and Engineering 2004年第1-2期4卷 33-41页

作者： Mantas Ruiz, José Miguel Pareschi, Lorenzo Carrillo, José Antonio Lopera, Julio Ortega Software Engineering Department. University of Granada C/P. Daniel de Saucedo s/n. 18071 Granada Spain Department of Mathematics. University of Ferrara Via Machiavelli 35 I-44100 Italy ICREA Depto. Matemàtiques University Autònoma Barcelona Bellaterra E-08193 Spain Computer Architecture and Technology Department. University of Granada C/P. Daniel de Saucedo s/n 18071 Granada Spain

The relaxed Burnett system, recently introduced in as a hydrodynamical approximation of the Boltzmann equation, is numerically solved. Due to the stiffness of this system and the severe CFL condition for large Mach numbers, a fully implicit Runge-Kutta method has been used. In order to reduce computing time, we apply a parallel stiff ODE solver based on 4-stage Radau IIA IRK. The ODE solver is combined with suitable first order upwind and second order MUSCL relaxation schemes for the spatial derivatives. Speedup results and comparisons to DSMC and Navier-Stokes approximations are reported for a 1D shock profile.

关键词： parallel numerical algorithms boltzmann equation burnett equations parallel stiff ODE solvers relaxation implicit runge-kutta methods

来源：评论

学校读者我要写书评

暂无评论

THE HUARD METHOD ON A SHARED MEMORY MIMD COMPUTER∗∗Research is supported by Greek General Secretary of Reasearch and Technology (EIIET II, No.322)

引用

parallel algorithms and Applications 1997年第3-4期11卷 249-272页

作者： N. M. Missirlis[a] F. I. Tjaferis[a] [a] Department of Informatics University of Athens Panepislimiopolis 157 10 Athens Greece

In this paper we study the implementation of a variant of the classic Gauss-Jordan (GJ) method which was recently introduced by Huard [8] on a shared memoryMIMDcomputer. Two parallel versions are derived by dividing the sequential Huard method into noninterfering tasks. Taking into consideration the computation as well as the communication complexity we present a parallel scheduling algorithm for each task graph. Next, in an attempt to reduce the communication cost we introduce block versions and follow a similar approach for their study.

关键词： parallel numerical algorithms Gaussian Elimination Computation complexity Communication complexity shared memory MIMD computers

来源：评论

学校读者我要写书评

暂无评论

parallel solvers for fractional power diffusion problems

引用

CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE 2017年第24期29卷 e4216.1-e4216..12页

作者： Ciegis, Raimondas Starikovicius, Vadimas Margenov, Svetozar Kriauziene, Rima Vilnius Gediminas Tech Univ Sauletekio Ave 11 Vilnius LT-10223 Vilnius Lithuania Bulgarian Acad Sci Inst Informat & Commun Technol Acad G Bonchev StrBl 25A BU-1113 Sofia Bulgaria Vilnius Univ Inst Math & Informat Akademijos Str 4 LT-08663 Vilnius Lithuania

Mathematical models with fractional-order differential operators are computationally expensive due to the non-local nature of these operators. In this work, we construct and investigate parallel solvers for problems described by fractional powers of elliptic operators, like fractional diffusion. Three state-of-the-art approaches are used to transform the non-local fractional-order differential problem into local partial differential equation problems formulated in a space of higher dimension. numerical schemes and parallel algorithms are developed for all three approaches. The resulting parallel algorithms have very different properties. We investigate the weak and strong scalability of the developed parallel algorithms and compare their parallel performance.

关键词： fractional diffusion fractional Laplacian multigrid parallel efficiency and scalability parallel numerical algorithms

来源：评论

学校读者我要写书评

暂无评论

Scalability analysis of different parallel solvers for 3D fractional power diffusion problems

引用

CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE 2019年第19期31卷

作者： Ciegis, Raimondas Starikovicius, Vadimas Margenov, Svetozar Kriauziene, Rima Vilnius Gediminas Tech Univ Math Modelling Dept Sauletekio Al 11 LT-10223 Vilnius Lithuania Bulgarian Acad Sci Inst Informat & Commun Technol Sofia Bulgaria Vilnius Univ Inst Math & Informat Vilnius Lithuania

In this paper, we develop and investigate the parallel numerical algorithms for three different state-of-the-art numerical methods for solving the non-local problems described by fractional powers of elliptic operators. These methods transform the non-local problem into some local differential problems of elliptic or parabolic type. A two-level parallelization approach is applied to construct the efficient parallel algorithms using the domain decomposition and master-slave methods, to deal with the increase in computational complexity. We show and compare the serial and parallel solution times that are required to achieve similar accuracy of the solution using different algorithms. Results of extensive convergence tests are presented solving a three-dimensional test problem with known decrease of the solution's convergence rate depending on the fractional power coefficient. We analyze and discuss the non-trivial question, which parallel algorithm is recommended to achieve certain accuracy for the given fractional power coefficient.

关键词： convergence fractional diffusion fractional Laplacian multigrid parallel numerical algorithms parallel scalability

来源：评论

学校读者我要写书评

暂无评论

A highly parallel algorithm for approximating all zeros of a polynomial with only real zeros

引用

Communications of the ACM 1972年第11期15卷 952-955页

作者： Patrick, Merrell L. Computer Science Program Duke University Durham NC 27706 United States

An algorithm is described based on Newton's method which simultaneously approximates all zeros of a polynomial with only real zeros. The algorithm, which is conceptually suitable for parallel computation, determines its own starting values so that convergence to the zeros is guaranteed. Multiple zeros and their multiplicity are readily determined. At no point in the method is polynomial deflation used. © 1972, ACM. All rights reserved.

关键词： guaranteed convergence Newton's method parallel numerical algorithms real polynomials real zeros starting values

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：