Most of the work in scientific computing today is done with parallel algorithms, often via message-passing architectures such as the Message Passing Interface (MPI). Ruby is a newly emerging language that maintains a strict adherence to object-oriented principles and a clean, intuitive syntax. The author created MPI Ruby, a complete binding of MPI to Ruby. In this article, he introduces Ruby and MPI Ruby, describes some applications, and reports on the project's current status and availability.
Kohonen's self-organizing map algorithm provides computational neurobiology with a useful model of the primate cerebral cortex. However, simulations of only modestly sized maps quickly exceed the capacity of even very fast workstations. Here, we report that a parallel implementation of the algorithm on a Beowulf commodity-class computing cluster scales very favorably with the number of available nodes and greatly speeds the computation of medium-to-large-scale cortical maps. (C) 2002 Elsevier Science B.V. All rights reserved.
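As a serial point of reference for what such a simulation computes, a minimal self-organizing map training loop can be sketched in pure Python; the grid size, learning-rate schedule, and Gaussian neighborhood below are illustrative choices, not parameters taken from the paper:

```python
import math
import random

def train_som(data, grid_w, grid_h, dim, iters=300, lr0=0.5, sigma0=None):
    """Minimal Kohonen SOM: a grid_w x grid_h map of dim-dimensional weights."""
    random.seed(0)
    if sigma0 is None:
        sigma0 = max(grid_w, grid_h) / 2.0
    w = [[random.random() for _ in range(dim)] for _ in range(grid_w * grid_h)]
    for t in range(iters):
        x = random.choice(data)
        # best-matching unit: the node whose weight vector is closest to x
        bmu = min(range(len(w)),
                  key=lambda i: sum((w[i][d] - x[d]) ** 2 for d in range(dim)))
        bx, by = bmu % grid_w, bmu // grid_w
        lr = lr0 * math.exp(-t / iters)          # decaying learning rate
        sigma = sigma0 * math.exp(-t / iters)    # shrinking neighborhood radius
        for i in range(len(w)):
            ix, iy = i % grid_w, i // grid_w
            d2 = (ix - bx) ** 2 + (iy - by) ** 2
            h = math.exp(-d2 / (2.0 * sigma * sigma))  # Gaussian neighborhood
            for d in range(dim):
                w[i][d] += lr * h * (x[d] - w[i][d])
    return w
```

A parallel version would partition the map nodes (and the best-matching-unit search) across cluster nodes; the per-node distance computations and weight updates are independent, which is what lets a Beowulf-style implementation scale with the number of nodes.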
In this paper, two problems on the class of k-trees, a subclass of the class of chordal graphs, are considered: the fast reordering problem and the isomorphism problem. An O(log² n) time parallel algorithm for the fast reordering problem is described that uses O(nk(n - k)/log n) processors on a CRCW PRAM, proving membership in the class NC for fixed k. An O(nk(k + 1)!) time sequential algorithm for the isomorphism problem is obtained, improving on the O(n²k(k + 1)!) algorithm of Sekharan (the second author) [10]. A parallel version of this sequential algorithm is presented that runs in O(log² n) time using O(nk((k + 1)! + n - k)/log n) processors, improving on a parallel algorithm of Sekharan for the isomorphism problem [10]. Both the sequential and parallel algorithms use a concept introduced in this paper called the kernel of a k-tree.
The PROUD module placement algorithm mainly uses a hierarchical decomposition technique and the solution of sparse linear systems based on a resistive-network analogy. It has been shown that the PROUD algorithm can achieve placements for very large circuits comparable to those of the best placement algorithms based on simulated annealing, while running several orders of magnitude faster. The modified PROUD algorithm, MPROUD, which perturbs the coefficient matrices, runs much faster than the original PROUD algorithm. Because MPROUD is unstable and its convergence is not guaranteed, we have proposed a new convergent and numerically stable PROUD algorithm, denoted IPROUD, with attractive computational costs; it solves the module placement problem using the SYMMLQ and MINRES methods based on the Lanczos process (Yang, 1997). We subsequently propose parallel versions of the improved PROUD algorithms. The parallel algorithm is derived so that all inner products and matrix-vector multiplications of a single iteration step are independent. The cost of global communication, which is the bottleneck of parallel performance on distributed-memory computers, can therefore be significantly reduced, yielding another order-of-magnitude improvement in runtime without loss of layout quality.
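To make the resistive-network analogy concrete: for a single row of movable modules connected in a chain between two fixed pads (a deliberately tiny example, not PROUD's hierarchical decomposition), the quadratic-placement objective leads to a tridiagonal linear system, solved here with the Thomas algorithm:

```python
def quadratic_placement_1d(n_movable, pad_left=0.0, pad_right=1.0):
    """1-D quadratic placement of a chain of modules between two fixed pads.

    Each net acts as a unit resistor; minimizing total squared wirelength
    gives a tridiagonal system  2*x[i] - x[i-1] - x[i+1] = 0  with the pad
    positions entering the right-hand side at the chain ends.
    """
    n = n_movable
    a = [-1.0] * n   # sub-diagonal
    b = [2.0] * n    # main diagonal
    c = [-1.0] * n   # super-diagonal
    d = [0.0] * n    # right-hand side
    d[0] += pad_left
    d[-1] += pad_right
    # forward elimination (Thomas algorithm)
    for i in range(1, n):
        m = a[i] / b[i - 1]
        b[i] -= m * c[i - 1]
        d[i] -= m * d[i - 1]
    # back substitution
    x = [0.0] * n
    x[-1] = d[-1] / b[-1]
    for i in range(n - 2, -1, -1):
        x[i] = (d[i] - c[i] * x[i + 1]) / b[i]
    return x
```

For three movable modules between pads at 0 and 1 this yields the evenly spaced positions 0.25, 0.5, 0.75. PROUD applies the same idea hierarchically in two dimensions, where the systems are large and sparse and iterative Krylov solvers such as SYMMLQ and MINRES become attractive.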
An outerplanar graph is a planar graph that can be embedded in the plane in such a way that all vertices lie on the exterior face. An outerplanar graph is maximal if no edge can be added to the graph without violating outerplanarity. In this paper, an optimal parallel algorithm is proposed on the EREW PRAM for testing isomorphism of two maximal outerplanar graphs. The proposed algorithm takes O(log n) time using O(n) work. Besides being optimal, it is very simple. Moreover, it can be implemented optimally on the CRCW PRAM in O(1) time. (C) 2002 Elsevier Science (USA).
This paper introduces a new parallel algorithm for computing an N (= n!)-point Lagrange interpolation on an n-star (n > 2). The proposed algorithm exploits several communication techniques on stars in a novel way, which can be adapted for computing similar functions. It is optimal and consists of three phases: initialization, main, and final. While there is no computation in the initialization phase, the main phase is composed of n!/2 steps, each consisting of four multiplications, four subtractions, and one communication operation, plus an additional step involving one division and one multiplication. The final phase is carried out in (n - 1) subphases, each with O(log n) steps, where each step takes three communications and one addition. Results from a cost-performance comparative analysis reveal that for practical network sizes the new algorithm on the star exhibits superior performance over those proposed for common interconnection networks. (C) 2002 Elsevier Science (USA).
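Setting the star-network communication scheme aside, the arithmetic being distributed is ordinary Lagrange interpolation; a direct serial evaluation in Python (for illustration only, not the paper's phased algorithm) is:

```python
def lagrange_eval(xs, ys, x):
    """Evaluate the Lagrange interpolating polynomial through (xs[i], ys[i]) at x."""
    total = 0.0
    n = len(xs)
    for i in range(n):
        # i-th basis polynomial: 1 at xs[i], 0 at every other node
        term = ys[i]
        for j in range(n):
            if j != i:
                term *= (x - xs[j]) / (xs[i] - xs[j])
        total += term
    return total
```

The products and differences inside the double loop are exactly the multiplications and subtractions the abstract counts per step; the parallel algorithm's contribution is scheduling them, and the accompanying communication, across the n! nodes of the star.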
ISBN: (Print) 0769515126
The external selection problem is to select the record with the K-th smallest key from N given records that are distributed and stored evenly on the D disks of a parallel machine with D processors. Each processor has its own primary memory of size M records and one disk, where N/D > M. The processors are connected in a √D × √D mesh architecture. Based on a two-stage approach, this paper presents an efficient parallel external selection algorithm for distributed-memory parallel systems. First, all processors execute local external sorting in parallel, each sorting the N/D records on its own disk. Next, they execute parallel external selection over the D sorted subfiles on the D disks. The algorithm is asymptotically optimal and has a small constant factor in its time complexity.
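The two-stage structure can be simulated serially in Python, with lists standing in for the D disks; here heapq.merge plays the role of the second-stage selection, which the paper of course realizes differently on the mesh:

```python
import heapq

def external_select(partitions, k):
    """Return the k-th smallest (1-based) record across D partitions.

    Stage 1: sort each partition locally, simulating per-processor
             external sorting of the N/D records on its own disk.
    Stage 2: select the k-th smallest from the D sorted runs via a
             D-way merge (a stand-in for the parallel selection stage).
    """
    runs = [sorted(p) for p in partitions]
    for i, v in enumerate(heapq.merge(*runs), start=1):
        if i == k:
            return v
    raise ValueError("k exceeds the total number of records")
```

For example, with three "disks" holding [5, 1, 9], [2, 8, 3], and [7, 4, 6], the 4th smallest key overall is 4.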
ISBN: (Print) 0769517315
In the presented work, the authors compare the results of a parallel FDTD algorithm with computations obtained using the Quick Wave program published by QWED. The authors developed a parallel implementation of the standard FDTD algorithm based on the MPI communication library. The parallel algorithm was examined on a heterogeneous PC cluster.
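For reference, the serial update loop that such a parallel FDTD code decomposes can be sketched in one dimension; this is a normalized pure-Python toy with an illustrative Gaussian source, not the authors' implementation:

```python
import math

def fdtd_1d(steps, n=200, src=100):
    """1-D FDTD (Yee scheme) in normalized units, Courant number 0.5.

    E and H fields are staggered in space and time; each step updates
    H from neighboring E values, then E from neighboring H values.
    """
    ez = [0.0] * n
    hy = [0.0] * n
    for t in range(steps):
        for i in range(n - 1):
            hy[i] += 0.5 * (ez[i + 1] - ez[i])
        for i in range(1, n):
            ez[i] += 0.5 * (hy[i] - hy[i - 1])
        # soft Gaussian source injected at one cell
        ez[src] += math.exp(-((t - 30) / 10.0) ** 2)
    return ez
```

An MPI parallelization splits the grid into slabs, one per process; only the E and H values on slab boundaries must be exchanged each step, which keeps communication small relative to the local updates.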
ISBN: (Print) 0769517315
In this article, the authors describe a parallel implementation of the conjugate gradient method on a heterogeneous PC cluster and on a Hitachi SR-2201 supercomputer. The new version of the implementation differs from the one applied earlier [1] in that it uses a special storage scheme for sparse coefficient matrices: only non-zero elements are stored and used during computations, so the sparsity of the coefficient matrix is fully exploited. The article includes a comparison of the two versions. The speedup of the parallel algorithm has been examined for three different coefficient matrices arising from different physical problems. The authors have also investigated a preconditioning method that uses the inverse of the diagonal of the coefficient matrix as the preconditioning matrix.
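The sparse-storage idea combined with the inverse-diagonal preconditioner can be sketched as Jacobi-preconditioned conjugate gradients over a CSR-style matrix; this serial pure-Python illustration uses my own names and layout, not the authors' code:

```python
import math

def csr_matvec(vals, cols, rowptr, x):
    """Sparse matrix-vector product storing only non-zero entries (CSR layout)."""
    y = [0.0] * (len(rowptr) - 1)
    for i in range(len(y)):
        for k in range(rowptr[i], rowptr[i + 1]):
            y[i] += vals[k] * x[cols[k]]
    return y

def pcg(vals, cols, rowptr, diag, b, tol=1e-10, maxit=200):
    """Conjugate gradients with Jacobi (inverse-diagonal) preconditioning."""
    n = len(b)
    x = [0.0] * n
    r = b[:]                                  # residual b - A*x, x = 0
    z = [r[i] / diag[i] for i in range(n)]    # apply M^-1 = diag(A)^-1
    p = z[:]
    rz = sum(r[i] * z[i] for i in range(n))
    for _ in range(maxit):
        Ap = csr_matvec(vals, cols, rowptr, p)
        alpha = rz / sum(p[i] * Ap[i] for i in range(n))
        for i in range(n):
            x[i] += alpha * p[i]
            r[i] -= alpha * Ap[i]
        if math.sqrt(sum(v * v for v in r)) < tol:
            break
        z = [r[i] / diag[i] for i in range(n)]
        rz_new = sum(r[i] * z[i] for i in range(n))
        beta = rz_new / rz
        rz = rz_new
        p = [z[i] + beta * p[i] for i in range(n)]
    return x
```

The inner products and the matrix-vector product are the only global operations per iteration, which is why their arrangement dominates communication cost in the distributed-memory versions the abstracts discuss.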
ISBN: (Print) 0769515126
This paper presents investigations of the parallel computation of non-ideal 3-D detonation wave propagation on a high-performance computer based on the CC-NUMA architecture. After analyzing and testing the previous serial program, the computations of curvature and of the first-order and second-order differences were identified as the main targets for parallelization. Several techniques were applied to convert the serial program into a parallel one, such as a divide-and-conquer strategy and load balancing. Numerical simulations with the parallel program show a large increase in the computing speed of the non-ideal 3-D detonation wave propagation.