Search results - Inner Mongolia University Library

Numerical performance of preconditioning techniques for the solution of complex sparse linear systems

COMMUNICATIONS IN NUMERICAL METHODS IN ENGINEERING, 2003, vol. 19, no. 1, pp. 37-48

Authors: Mazzia, A; Pini, G (Univ Padua, Dipartimento Metodi & Modelli Matemat Sci Applica, I-35131 Padua, Italy)

Preconditioning techniques based on ILU decomposition, on Frobenius norm minimization and on factorized sparse approximate inverse are considered. These algorithms are applied with conjugate gradient-type methods, namely Bi-CGSTAB, QMR and TFQMR, for the solution of complex, large, sparse linear systems. The results of numerical experiments in a scalar environment with matrices arising from transport in porous media, quantum chemistry, structural dynamics and electromagnetism are analysed. The preconditioner that appears most significant in a parallel environment (based on factorized sparse approximate inverse) is then employed on a Cray T3E supercomputer. The experimental results show the satisfactory parallel performance of the proposed algorithm. Copyright (C) 2003 John Wiley & Sons, Ltd.

Keywords: complex sparse linear systems; preconditioned iterative methods; incomplete factorizations; approximate inverses; parallel algorithms

Parallel Delaunay triangulation based on circum-circle criterion

Spring Conference on Computer Graphics, SCCG 2003 - Conference Proceedings

Authors: Kohout, Josef; Kolingerová, Ivana (Ctr. Comp. Graphics/Data V., Dept. of Comp. Sci. and Engineering, University of West Bohemia, Pilsen, Czech Republic)

ISBN: (print) 158113861X

This paper describes a newly proposed simple and efficient parallel algorithm for the construction of the Delaunay triangulation (DT) in E^2 by randomized incremental insertion. The construction of the DT is one of the fundamental problems in computer graphics. The proposed algorithm is designed for parallel systems with shared memory and several processors. Such hardware (especially with two processors) became available in the last few years thanks to low prices, yet at present there is still a lack of parallel algorithms that are simple to implement and efficient enough to be an attractive alternative to long-existing serial algorithms. The designed algorithm incorporates a new method for synchronization among PEs based on a simple geometric test (i.e. if no other points lie in the circum-circle of the accessed triangle, this triangle can be modified independently of other PEs). We implemented the algorithm in C++ and tested it on workstations with up to four processors, where we reached relatively good speed-up over our serial implementation. When only two processors were used, we even reached super-linear speed-up.

Keywords: parallel algorithms
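
The geometric test mentioned in the abstract above (a triangle can be modified independently when no other point lies in its circum-circle) can be illustrated with the standard in-circle determinant. This is a minimal Python sketch, not the authors' C++ implementation: all function names, including the helper `triangle_is_independent`, are hypothetical, and floating-point robustness is ignored.

```python
def orient2d(a, b, c):
    """Twice the signed area of triangle abc; positive when a, b, c are counter-clockwise."""
    return (b[0] - a[0]) * (c[1] - a[1]) - (b[1] - a[1]) * (c[0] - a[0])


def in_circumcircle(a, b, c, p):
    """Return True when p lies strictly inside the circle through a, b and c."""
    # Translate so that p is the origin; the test is then a 3x3 determinant.
    rows = []
    for q in (a, b, c):
        dx, dy = q[0] - p[0], q[1] - p[1]
        rows.append((dx, dy, dx * dx + dy * dy))
    (ax, ay, aw), (bx, by, bw), (cx, cy, cw) = rows
    det = (ax * (by * cw - bw * cy)
           - ay * (bx * cw - bw * cx)
           + aw * (bx * cy - by * cx))
    # The sign of det flips with the orientation of the triangle,
    # so multiply by the orientation to accept either vertex order.
    return det * orient2d(a, b, c) > 0


def triangle_is_independent(triangle, other_points):
    """Hypothetical helper: True when no other point lies in the triangle's circum-circle."""
    a, b, c = triangle
    return not any(in_circumcircle(a, b, c, p) for p in other_points)
```

With integer coordinates the determinant is exact; with floating-point input a robust predicate would be needed for points very close to the circle.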

Parallel split-step Fourier methods for the CMKdV equation

Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications

Authors: Taha, Thiab R.; Liu, Ruihua (Department of Computer Science, University of Georgia, Athens, GA 30602, United States)

ISBN: (print) 1892512416

The class of complex modified Korteweg-de Vries (CMKdV) equations has many applications. One form of the CMKdV equation has been used to create models for the nonlinear evolution of plasma waves [5] and for the propagation of transverse waves in a molecular chain [3]. Another form of the CMKdV equation has been used for the traveling-wave and for a double homoclinic orbit [4]. In this paper we introduce sequential and parallel split-step Fourier methods for numerical simulations of the above equation. The parallel methods are implemented on the Origin 2000 multiprocessor computer. Our numerical experiments have shown that these methods give considerable speedup.

Keywords: parallel algorithms

Parallelized three-dimensional unstructured Euler solver for unsteady aerodynamics

JOURNAL OF AIRCRAFT, 2003, vol. 40, no. 2, pp. 348-354

Authors: Oktay, E; Akay, HU; Uzun, A (Indiana Univ Purdue Univ, Dept Mech Engn, Indianapolis, IN 46202, USA; Missiles Ind Inc Roketsan, Aerodynam Dept, TR-06780 Ankara, Turkey)

A parallel algorithm for the solution of unsteady Euler equations on unstructured and moving meshes is developed. A cell-centered finite volume scheme is used. The temporal discretization involves an implicit time-integration scheme based on backward-Euler time differencing. The movement of the computational mesh is accomplished by means of a dynamically deforming mesh algorithm. The parallelization is based on decomposition of the domain into a series of subdomains with overlapped interfaces. The scheme is computationally efficient, time accurate, and stable for large time increments. Detailed descriptions of the solution algorithm are given, and computations for airflow around a NACA0012 airfoil and a missile configuration are presented to demonstrate the applications.

Keywords: parallel algorithms Solvers Scheme Euler equations unsteady aerodynamics time-discrete Euler MOVING MESH computational grids
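
The abstract's claim that backward-Euler time differencing remains stable for large time increments can be illustrated on the scalar model problem y' = -lam * y. This is only a toy sketch under that assumption and has nothing to do with the authors' finite volume solver; the function names and parameters are hypothetical.

```python
def backward_euler_decay(y0, lam, dt, steps):
    """Integrate y' = -lam * y with backward Euler: y_new = y_old / (1 + lam * dt)."""
    y = y0
    for _ in range(steps):
        # The implicit update solves y_new = y_old - dt * lam * y_new for y_new.
        y = y / (1.0 + lam * dt)
    return y


def forward_euler_decay(y0, lam, dt, steps):
    """Explicit counterpart, y_new = y_old * (1 - lam * dt), shown for comparison."""
    y = y0
    for _ in range(steps):
        y = y * (1.0 - lam * dt)
    return y
```

For lam * dt > 2 the explicit update grows in magnitude at every step, while the implicit update still decays for any positive lam and dt.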

Degree of scalability: scalable reconfigurable mesh algorithms for multiple addition and matrix-vector multiplication

PARALLEL COMPUTING, 2003, vol. 29, no. 1, pp. 95-109

Authors: Vaidyanathan, R; Trahan, JL; Lu, CM (Louisiana State Univ, Dept Elect & Comp Engn, Baton Rouge, LA 70803, USA; Certware Technol, Sterling, VA, USA)

The usual concern when scaling an algorithm on a parallel model of computation is preserving efficiency while increasing or decreasing the number of processors. Many algorithms for reconfigurable models, however, attain constant time at the expense of an inefficient algorithm. For these algorithms, scaling down the number of processors while preserving inefficiency is no benefit once constant time execution is lost. In fact, one can often accelerate the efficiency of these algorithms while reducing the number of processors. To quantify this improvement in efficiency, this paper introduces the measure of degree of scalability to complement the insight obtained from efficiency for such algorithms. Demonstrating the utility of this measure, we present new reconfigurable mesh (R-Mesh) algorithms for multiple addition and matrix-vector multiplication, improving both the number of processors and the degree of scalability compared to previous algorithms. We also extend these results to floating point number operands, which have previously received little attention on the R-Mesh.

Keywords: reconfigurable mesh; parallel algorithms; scalability; arithmetic algorithms; reconfigurable models

List-ranking on interconnection networks

INFORMATION AND COMPUTATION, 2003, vol. 181, no. 2, pp. 75-87

Author: Sibeyn, JF (Univ Halle Wittenberg, Inst Informat, D-06120 Halle Saale, Germany)

The list-ranking problem is considered for parallel computers which communicate through an interconnection network. Each PU holds k nodes of a set of linked lists. A novel randomized algorithm gives a considerable improvement over earlier ones: for a large class of networks and sufficiently large k, it takes only twice the number of steps required by a k-k routing. For hypercubes the condition is k = omega(log(2) N). Even better results are achieved for d-dimensional meshes: we show that the ranking time exceeds the routing time only by lower-order terms for all k = omega(d(2)). We also show that list-ranking requires at least the time required for k-k routing. Thus, the results are within a factor of two of optimal; those for meshes even match the lower bound up to lower-order terms. (C) 2002 Elsevier Science (USA). All rights reserved.

Keywords: parallel algorithms; interconnection networks; list-ranking; randomization; meshes; hypercubes
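
To make the list-ranking problem in the abstract above concrete, the following Python sketch computes, for every node, the number of links to the tail of its list. It uses the classic pointer-jumping technique, simulated sequentially, and is not the randomized network algorithm of the paper; the function name and the `None`-for-tail convention are assumptions made for illustration. The input is assumed to contain no cycles.

```python
def list_rank(succ):
    """Return rank[i] = number of links from node i to the tail of its list.

    succ[i] is the index of the successor of node i, or None for a tail node.
    """
    n = len(succ)
    nxt = list(succ)
    rank = [0 if s is None else 1 for s in nxt]
    # Each round doubles the distance covered by every pointer, so about
    # log2(n) synchronous rounds are enough.
    while any(s is not None for s in nxt):
        new_rank = list(rank)
        new_nxt = list(nxt)
        for i in range(n):  # conceptually, all i are processed in parallel
            s = nxt[i]
            if s is not None:
                new_rank[i] = rank[i] + rank[s]
                new_nxt[i] = nxt[s]
        rank, nxt = new_rank, new_nxt
    return rank
```

Copying `rank` and `nxt` at each round mimics the synchronous reads and writes of a parallel machine, where every node sees the values from the previous round.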

Fault-tolerant computations over replicated finite rings

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-FUNDAMENTAL THEORY AND APPLICATIONS, 2003, vol. 50, no. 7, pp. 858-864

Authors: Imbert, L; Dimitrov, VS; Jullien, GA (CNRS, Lab Informat Robot & Microelect, F-34392 Montpellier 5, France; Univ Calgary, Dept Elect & Comp Engn, ATIPS Lab, Calgary, AB T2N 1N4, Canada)

This paper presents a fault-tolerant technique based on the modulus replication residue number system (MRRNS), which allows for modular arithmetic computations over identical channels. In this system, fault tolerance is provided by adding extra computational channels that can be used to redundantly compute the mapped output. An algebraic technique is used to determine the error position in the mapped outputs and provide corrections. We also show that by taking advantage of some elementary polynomial properties we obtain the same level of fault tolerance with about a 30% decrease in the number of channels. This new system is referred to as the symmetric MRRNS (SMRRNS).

Keywords: fault-tolerant computation; modulus replication; parallel algorithms; residue arithmetic; symmetric modulus replication residue number system (SMRRNS)
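
The general idea of gaining fault tolerance from an extra computational channel can be sketched with a conventional redundant residue number system. This is not the MRRNS or SMRRNS of the paper: the moduli, function names and single redundant channel are illustrative assumptions, and the sketch only detects a fault rather than locating and correcting it. Results are assumed to stay below the product of the base moduli (105 here). It needs Python 3.8 or later for `math.prod` and the modular inverse form of `pow`.

```python
from math import prod

BASE_MODULI = (3, 5, 7)      # information channels (illustrative choice)
REDUNDANT_MODULUS = 11       # extra channel, larger than every base modulus


def encode(x):
    """Map an integer to its residues over the base and redundant channels."""
    return [x % m for m in BASE_MODULI + (REDUNDANT_MODULUS,)]


def multiply(xs, ys):
    """Multiply channel by channel; no carries cross between channels."""
    moduli = BASE_MODULI + (REDUNDANT_MODULUS,)
    return [(a * b) % m for a, b, m in zip(xs, ys, moduli)]


def decode(residues):
    """Chinese remainder reconstruction from the base channels only."""
    big_m = prod(BASE_MODULI)
    total = 0
    for r, m in zip(residues, BASE_MODULI):
        partial = big_m // m
        # pow(partial, -1, m) is the modular inverse of partial modulo m.
        total += r * partial * pow(partial, -1, m)
    return total % big_m


def is_consistent(residues):
    """True when the redundant channel agrees with the decoded value."""
    return decode(residues) % REDUNDANT_MODULUS == residues[-1]
```

A usage example: `multiply(encode(8), encode(9))` decodes to 72 and passes `is_consistent`; corrupting any single channel of that result makes the check fail.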

Evolving a model of transaction management with embedded concurrency control for mobile database systems

INFORMATION AND SOFTWARE TECHNOLOGY, 2003, vol. 45, no. 9, pp. 587-596

Author: Bhalla, S (Univ Aizu, Data Syst Lab, Fukushima 9658580, Japan)

Transactions within a mobile database management system face many restrictions. These cannot afford unlimited delays or participate in multiple retry attempts for execution. The proposed embedded concurrency control (ECC) techniques provide support on three counts, namely, to enhance concurrency, to overcome problems due to heterogeneity, and to allocate priority to transactions that originate from mobile hosts. These proposed ECC techniques can be used to enhance the server capabilities within a mobile database management system. Adoption of the techniques can be beneficial in general, and for other special cases of transaction management in distributed real-time database management systems. The proposed model can be applied to other similar problems related to synchronization, such as the generation of a backup copy of an operational database system. (C) 2003 Elsevier Science B.V. All rights reserved.

Keywords: distributed algorithms; mobile database systems; non-blocking protocols; parallel algorithms; serializability

Design and implementation of a distributed evolutionary computing software

IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2003, vol. 33, no. 3, pp. 325-338

Authors: Tan, KC; Tay, A; Cai, J (Natl Univ Singapore, Dept Elect & Comp Engn, Singapore 117576, Singapore)

Although the evolutionary algorithm is a powerful optimization tool, its computational cost in terms of time and hardware resources increases as the size or complexity of the problem increases. One promising approach to overcome this limitation is to exploit the inherent parallelism of evolutionary algorithms by creating the infrastructure necessary to support distributed evolutionary computing using existing Internet and hardware resources. This paper presents a Java-based distributed evolutionary computing software (Paladin-DEC), which enhances the concurrent processing and performance of evolutionary algorithms by allowing inter-communication of subpopulations among various computers over the Internet. Such a distributed system enables individuals to migrate among multiple subpopulations according to some patterns to induce diversity of elite individuals periodically, in a way that simulates how species evolve in a natural environment. The Paladin-DEC software is capable of keeping data integrity throughout the computation, and incorporates the features of robustness, security, fault tolerance, and work balancing. The effectiveness and advantages of Paladin-DEC are illustrated in two case studies: drug scheduling in cancer chemotherapy and searching probe sets of the yeast genome.

Keywords: distributed systems; evolutionary algorithms; parallel algorithms

An improved parallel algorithm for certain Toeplitz cyclic tridiagonal systems on distributed-memory multicomputer

5th International Workshop on Advanced Parallel Processing Technologies

Authors: Zhang, XB; Luo, ZG; Li, XM (Coll Equipment Command & Technol, Beijing 101416, Peoples R China; Natl Lab Parallel & Distributed Proc, Changsha 410073, Peoples R China)

ISBN: (print) 3540200541

Based on Luo's parallel algorithm [4] for certain Toeplitz cyclic tridiagonal systems on distributed-memory multicomputers, we present an improved algorithm. Its communication mechanism is simple and its redundant computation is small when solving very large systems. The numerical experiments show that the parallel efficiency of the improved algorithm is higher than that of Luo's algorithm [4].

Keywords: parallel algorithms
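
For readers unfamiliar with the problem class named in the abstract above, here is a serial Python sketch that solves a Toeplitz cyclic tridiagonal system with constant sub-diagonal `a`, diagonal `b` and super-diagonal `c`. It combines the Thomas algorithm with the Sherman-Morrison formula, a standard serial technique; it is neither Luo's parallel algorithm nor the improved one. The function names are hypothetical, and the sketch assumes `b` is nonzero and the matrix is well conditioned (for example, diagonally dominant), since no pivoting is done.

```python
def solve_tridiagonal(sub, diag, sup, rhs):
    """Thomas algorithm; sub[i] multiplies x[i-1], sup[i] multiplies x[i+1]."""
    n = len(diag)
    cp = [0.0] * n
    dp = [0.0] * n
    cp[0] = sup[0] / diag[0]
    dp[0] = rhs[0] / diag[0]
    for i in range(1, n):
        denom = diag[i] - sub[i] * cp[i - 1]
        cp[i] = sup[i] / denom
        dp[i] = (rhs[i] - sub[i] * dp[i - 1]) / denom
    x = [0.0] * n
    x[-1] = dp[-1]
    for i in range(n - 2, -1, -1):
        x[i] = dp[i] - cp[i] * x[i + 1]
    return x


def solve_cyclic_toeplitz(a, b, c, rhs):
    """Solve a*x[i-1] + b*x[i] + c*x[i+1] = rhs[i] with indices taken modulo n."""
    n = len(rhs)
    if n < 3:
        raise ValueError("need at least three unknowns")
    gamma = -b                 # free parameter of the rank-one splitting
    alpha, beta = c, a         # bottom-left and top-right corner entries
    # Write A = T + u v^T with T tridiagonal, u = (gamma, 0, ..., 0, alpha)
    # and v = (1, 0, ..., 0, beta / gamma).
    diag = [b] * n
    diag[0] = b - gamma
    diag[-1] = b - alpha * beta / gamma
    sub = [a] * n
    sup = [c] * n
    x = solve_tridiagonal(sub, diag, sup, rhs)
    u = [0.0] * n
    u[0] = gamma
    u[-1] = alpha
    z = solve_tridiagonal(sub, diag, sup, u)
    # Sherman-Morrison correction: x - z * (v.x) / (1 + v.z).
    factor = (x[0] + beta * x[-1] / gamma) / (1.0 + z[0] + beta * z[-1] / gamma)
    return [xi - factor * zi for xi, zi in zip(x, z)]
```

The two tridiagonal solves share the same matrix, so an implementation concerned with cost would factor it once and reuse the factorization.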
