Search results - Inner Mongolia University Library

8th ACM SIGPLAN Symposium on the Principles and Practice of Parallel Computing

Authors: Vetter, JS; McCracken, MO. Lawrence Livermore Natl Lab, Ctr Appl Sci Comp, Livermore, CA 94551, USA

ISBN: (print) 9781581133462

Current trends in high performance computing suggest that users will soon have widespread access to clusters of multiprocessors with hundreds, if not thousands, of processors. This unprecedented degree of parallelism will undoubtedly expose scalability limitations in existing applications, where scalability is the ability of a parallel algorithm on a parallel architecture to effectively utilize an increasing number of processors. Users will need precise and automated techniques for detecting the cause of limited scalability. This paper addresses this dilemma. First, we argue that users face numerous challenges in understanding application scalability: managing substantial amounts of experiment data, extracting useful trends from this data, and reconciling performance information with their application's design. Second, we propose a solution to automate this data analysis problem by applying fundamental statistical techniques to scalability experiment data. Finally, we evaluate our operational prototype on several applications, and show that statistical techniques offer an effective strategy for assessing application scalability. In particular, we find that non-parametric correlation of the number of tasks to the ratio of the time for communication operations to overall communication time provides a reliable measure for identifying communication operations that scale poorly.

Keywords: Scalability Experimental data statistical techniques Consumers Communication Research Automated recording parallel architectures Surgery Multiprocessor High Performance Computing parallel algorithms
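The closing finding of the abstract above, correlating the number of tasks with each operation's share of total communication time, can be sketched as follows. This is a minimal illustration, not the authors' prototype: it assumes Spearman's rank correlation as the non-parametric measure (the abstract does not name one), and every timing value and operation name is invented for the example.

```python
def ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = mean_rank
        i = j + 1
    return result


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sum((a - mx) ** 2 for a in x) ** 0.5
    sy = sum((b - my) ** 2 for b in y) ** 0.5
    return cov / (sx * sy)


def spearman(x, y):
    """Spearman's rho: Pearson correlation computed on the ranks."""
    return pearson(ranks(x), ranks(y))


def scaling_scores(tasks, comm_total, op_time):
    """Correlate task count with each operation's share of communication time."""
    scores = {}
    for name, times in op_time.items():
        ratio = [t / total for t, total in zip(times, comm_total)]
        scores[name] = spearman(tasks, ratio)
    return scores


# Hypothetical measurements: one entry per experiment (task count).
tasks = [2, 4, 8, 16, 32, 64]
comm_total = [10.0, 12.0, 15.0, 20.0, 30.0, 50.0]   # total communication time
op_time = {                                          # time per operation
    "allreduce": [1.0, 1.8, 3.0, 6.0, 12.0, 27.5],
    "send": [4.0, 4.2, 4.5, 5.0, 6.0, 7.5],
}

scores = scaling_scores(tasks, comm_total, op_time)
```

In this made-up data the share of `allreduce` grows with every increase in task count, so its coefficient is close to +1 and it would be flagged as scaling poorly, while the share of `send` shrinks and its coefficient is close to -1.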


3D wavelet compression by message passing on a Myrinet cluster


Canadian Conference on Electrical and Computer Engineering

Authors: Moyano, E; González, P; Orozco-Barbosa, L; Quiles, FJ; García, PJ; Garrido, A. Univ Castilla La Mancha, Dept Informat, Escuela Politecn Super Albacete, Albacete 02071, Spain

ISBN: (print) 0780367154

In this paper, we present a parallel algorithm for lossless compression. The algorithm is based on the 3D Wavelet Transform (3D-WT). The system under study consists of a parallel implementation of wavelet compression software running on a cluster of eight nodes linked by a high-performance local area network, Myrinet. The parallel implementation is based on the standard Message Passing Interface (MPI). We have used three implementations of the MPI standard: MPI-BIP, MPICH, and LAM. Experimental results are reported based on these three implementations. We provide performance results of this parallel system for the compression of video sequences. Some efficiency problems in the TCP implementation are reported and resolved for this system.

Keywords: 3D-WT MPI Myrinet parallel algorithms


Dynamic code management on a Java multicomputer


9th Euromicro Workshop on Parallel and Distributed Processing

Authors: Sage, PP; Milligan, P; Bouridane, A. Queens Univ Belfast, Sch Comp Sci, Belfast BT7 1NN, Antrim, North Ireland

ISBN: (print) 0769509886

It is clear that writing software for parallel architectures is a non-trivial process. This has encouraged much research in an effort to provide tools to assist parallel software development. However, while these tools may cater for architecture-specific problems, they do little for the concept of parallel software engineering, as the end product is usually neither scalable nor portable. The introduction of a level of abstraction in the expression of parallel algorithms can elevate the reasoning process above architectural constraints and assist the production of more flexible code. This paper outlines an object-oriented parallel algorithm development paradigm based on a Task and Channel notation, and examines the utilisation of Java™ technologies in the development of a distributed Java™ Virtual Machine architecture on which algorithms expressed in this notation may be executed dynamically.

Keywords: parallel algorithms


Partial stabilization of large-scale discrete-time linear control systems


30th International Conference on Parallel Processing (ICPP Workshops)

Authors: Benner, P; Castillo, M; Quintana-Ortí, ES. Univ Bremen, Zentrum Technomath, Fachbereich 3 Math & Informat, D-28334 Bremen, Germany

ISBN: (print) 0769512607

We propose a parallel algorithm for stabilizing large discrete-time linear control systems on a Beowulf cluster. Our algorithm first separates the Schur stable part of the linear control system using an inverse-free iteration for the matrix disc function, and then computes a stabilizing feedback matrix for the unstable part. This stage requires the numerical solution of a Stein equation. This linear matrix equation is solved using the sign function method after applying a Cayley transformation to the original equation. The experimental results on a cluster composed of Intel PII processors and a Myrinet interconnection network show the parallelism and scalability of our approach.

Keywords: Clustering algorithms Concurrent computing Control systems Kernel Large-scale systems Multiprocessor interconnection networks parallel algorithms Riccati equations Scalability Symmetric matrices
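The sign function method mentioned in the abstract above rests on computing the matrix sign function. A common way to do this is the Newton iteration X_{k+1} = (X_k + X_k^{-1}) / 2 starting from X_0 = A. The sketch below shows only that basic iteration, restricted to 2x2 matrices to stay self-contained; it is not the authors' Stein equation solver, it omits the Cayley transformation and the inverse-free disc function iteration, and the test matrix is a hypothetical example.

```python
def inv2(m):
    """Invert a 2x2 matrix given as nested lists."""
    (a, b), (c, d) = m
    det = a * d - b * c
    if det == 0:
        raise ValueError("matrix is singular")
    return [[d / det, -b / det], [-c / det, a / det]]


def matrix_sign(a, tol=1e-12, max_iter=100):
    """Newton iteration for the matrix sign function of a 2x2 matrix.

    Assumes the matrix has no eigenvalues on the imaginary axis.
    """
    x = [row[:] for row in a]
    for _ in range(max_iter):
        xi = inv2(x)
        # Average the current iterate with its inverse.
        nxt = [[0.5 * (x[i][j] + xi[i][j]) for j in range(2)] for i in range(2)]
        change = max(abs(nxt[i][j] - x[i][j]) for i in range(2) for j in range(2))
        x = nxt
        if change < tol:
            return x
    raise RuntimeError("Newton iteration did not converge")


# Hypothetical example: eigenvalues 2 and -3, one on each side of the axis.
sign_a = matrix_sign([[2.0, 1.0], [0.0, -3.0]])
```

For this matrix the limit is [[1, 0.4], [0, -1]], which squares to the identity and commutes with the input, as a matrix sign function must.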


Displacement decomposition and parallelization of the PCG method for elasticity problems


30th International Conference on Parallel Processing (ICPP Workshops)

Authors: Blaheta, R; Jakl, O; Stary, J. Acad Sci Czech Republic, Inst Geon, Ostrava 70800, Czech Republic

ISBN: (print) 0769512607

This article describes the displacement decomposition and its benefits for the parallelization of the preconditioned conjugate gradient method for finite element elasticity problems. It deals with both fixed and variable preconditioning based on this decomposition. The numerical efficiency of the parallel algorithms is demonstrated on an academic benchmark and a real-life modelling problem.

Keywords: Anisotropic magnetoresistance Character generation Convergence Elasticity Equations Finite element methods Gold Gradient methods Linear systems parallel algorithms
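For readers unfamiliar with the preconditioned conjugate gradient (PCG) method named in the abstract above, a minimal sequential sketch follows. It is an illustration under stated assumptions: a simple diagonal (Jacobi) preconditioner stands in for the displacement decomposition preconditioner, which the abstract does not specify in detail; the preconditioner is fixed, not variable; and the 3x3 system is a made-up symmetric positive definite example, not an elasticity problem.

```python
def matvec(a, x):
    """Dense matrix-vector product."""
    return [sum(aij * xj for aij, xj in zip(row, x)) for row in a]


def dot(u, v):
    return sum(ui * vi for ui, vi in zip(u, v))


def pcg(a, b, apply_precond, tol=1e-10, max_iter=200):
    """Solve A x = b for symmetric positive definite A with a fixed preconditioner."""
    x = [0.0] * len(b)
    r = b[:]                        # residual b - A x for the zero start vector
    z = apply_precond(r)            # preconditioned residual
    p = z[:]                        # first search direction
    rz = dot(r, z)
    for _ in range(max_iter):
        if dot(r, r) ** 0.5 < tol:  # stop once the residual norm is small
            break
        ap = matvec(a, p)
        alpha = rz / dot(p, ap)
        x = [xi + alpha * pi for xi, pi in zip(x, p)]
        r = [ri - alpha * api for ri, api in zip(r, ap)]
        z = apply_precond(r)
        rz_new = dot(r, z)
        beta = rz_new / rz
        p = [zi + beta * pi for zi, pi in zip(z, p)]
        rz = rz_new
    return x


# Hypothetical SPD test system.
A = [[4.0, 1.0, 0.0],
     [1.0, 3.0, 1.0],
     [0.0, 1.0, 2.0]]
b = [1.0, 2.0, 3.0]


def jacobi(r):
    """Jacobi preconditioner: divide each residual entry by the diagonal of A."""
    return [ri / A[i][i] for i, ri in enumerate(r)]


x = pcg(A, b, jacobi)
```

Swapping `jacobi` for another function with the same signature is all that is needed to try a different fixed preconditioner, which is where a decomposition-based preconditioner would plug in.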


A parallel version of a quasigradient method in stochastic control theory

Optimization, 2001, Vol. 49, No. 1-2, pp. 95-114

Author: Holger Blaar. Fachbereich Mathematik und Informatik, Martin-Luther-Universität Halle-Wittenberg, Halle/Saale, Germany

We solve an optimal control problem for controlled parabolic Ito equations by a stochastic quasigradient method. Because the numerical solution of such problems requires a large amount of computation time, we investigate the parallelization of the algorithm. We distribute the computations of space stages over several processor nodes of a parallel computer. We obtain an efficient algorithm with low communication cost by using a ring topology.

Keywords: Stochastic optimization Control parallel algorithms


EMPLOYING K-ARY n-CUBES FOR PARALLEL LAGRANGE INTERPOLATION

Parallel Algorithms and Applications, 2001, Vol. 16, No. 4, pp. 283-299

Authors: H. Sarbazi-Azad (e-mail: hsa@dcs.gla.ac.uk), M. Ould-Khaoua (e-mail: mohamed@dcs.gla.ac.uk), L. M. Mackenzie (e-mail: ewis@dcs.gla.ac.uk). Department of Computing Science, University of Glasgow, Glasgow, UK

This paper proposes a parallel algorithm for computing an N (= k^n) point Lagrange interpolation on k-ary n-cube networks. The algorithm consists of three phases: initialisation, main and final. There is no computation in the initialisation phase. The main phase is composed of N/2 steps, each consisting of four multiplications and four subtractions, and an additional step including one division and one multiplication. Communication in the main phase is based on an all-to-all broadcast algorithm on a Hamiltonian ring embedded in a k-ary n-cube. The final phase is carried out in n × ⌊k/2⌋ steps, each requiring one addition. A performance evaluation of the proposed algorithm reveals a near-optimum speedup for a typical range of system parameters used in current state-of-the-art implementations. Our study also reveals that when implementation cost is taken into account, low-dimensional k-ary n-cubes achieve better speedup than their higher-dimensional counterparts.

Keywords: Multicomputer Interconnection networks k-ary n-cubes parallel algorithms Lagrange interpolation Speedup Performance analysis
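As background for the abstract above, the quantity being parallelised is the N-point Lagrange interpolation polynomial, f(x) = Σ_i y_i Π_{j≠i} (x - x_j) / (x_i - x_j). The sketch below evaluates it sequentially with exact rational arithmetic. It is not the k-ary n-cube algorithm from the paper: the product terms are evaluated one after another rather than distributed over N nodes, and the sample points (N = 4, for example k = 2 and n = 2) are hypothetical.

```python
from fractions import Fraction


def lagrange_interpolate(xs, ys, x):
    """Evaluate the Lagrange interpolating polynomial through (xs, ys) at x."""
    total = Fraction(0)
    for i, (xi, yi) in enumerate(zip(xs, ys)):
        term = Fraction(yi)
        for j, xj in enumerate(xs):
            if j != i:
                # Multiply by the i-th basis polynomial factor (x - xj) / (xi - xj).
                term *= Fraction(x - xj) / Fraction(xi - xj)
        total += term
    return total


# Hypothetical samples of f(x) = x^3 - 2x + 1 at N = 4 points.
xs = [0, 1, 2, 3]
ys = [1, 0, 5, 22]

value_at_4 = lagrange_interpolate(xs, ys, 4)
value_at_half = lagrange_interpolate(xs, ys, Fraction(1, 2))
```

Because four points determine a cubic uniquely, the interpolant reproduces f exactly here: the value at x = 4 is 57 and the value at x = 1/2 is 1/8.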


Incremental quantitative rule derivation by multidimensional data partitioning


15th International Parallel and Distributed Processing Symposium, IPDPS 2001

Author: Sun, Junping. School of Computer and Information Sciences, Nova Southeastern University, Davie, FL 33314-4416, United States

ISBN: (print) 0769509908

Using cardinality and relevance information about a set of attributes and concept hierarchies, a top-down incremental data partitioning method is proposed for deriving quantitative rules from databases in parallel. Based on a sequential incremental approach, we propose two parallel versions of incremental partitioning algorithms. Both parallel algorithms are multidimensional: they partition the data set into multiple independent subsets for the subsequent rule derivation process. The second version of the parallel algorithm improves on the first in terms of load balance. © 2001 IEEE.

Keywords: parallel algorithms


Parallelizability of some P-complete geometric problems in the EREW-PRAM


7th Annual International Conference on Computing and Combinatorics, COCOON 2001

Authors: Castanho, Carla Denise; Chen, Wei; Wada, Koichi; Fujiwara, Akihiro. Nagoya Institute of Technology, Showa, Nagoya 466-8555, Japan; Nanzan University, Seirei-cho 27, Seto-shi, Aichi-ken 489-0863, Japan; Kyushu Institute of Technology, 680-4 Kawazu, Iizuka, Fukuoka 820-8502, Japan

P-complete problems seem to have no parallel algorithm which runs in polylogarithmic time using a polynomial number of processors. A P-complete problem is in class EP (Efficient and Polynomially fast) if and only if t...

ISBN: (digital) 9783540446798

ISBN: (print) 9783540424949

Keywords: parallel algorithms


A comparison of lookahead and algorithmic blocking techniques for parallel matrix factorization


International Journal of Parallel and Distributed Systems and Networks, 2001, Vol. 4, No. 1, pp. 26-35

Author: Strazdins, P.E. Department of Computer Science, Australian National University, Acton, ACT 0200, Australia

This article analyses and compares the techniques of algorithmic blocking and storage blocking with lookahead for distributed memory LU, LL^T, and QR factorizations. Concepts and some useful properties of a simplified model of lookahead are explored. Issues in the implementation of lookahead are discussed, which are more involved for the case of LL^T and QR factorizations. The article also explains how hybrid algorithmic blocking and lookahead techniques can be implemented. Results, given on the Fujitsu AP1000 and AP+ multicomputers, indicate that both methods are superior to storage blocking, and that the hybrid method is optimal for smaller matrices, due to savings in communication startups. For larger matrices, algorithmic blocking gave the best performance (excepting LL^T for the AP+), due to its better load-balancing properties. Performance models, predicting the minimum matrix size where lookahead becomes effective, indicate this trend can be expected for machines with lower communication-to-computation speeds, but that the range for where lookahead is superior is extended.

Keywords: parallel algorithms

