检索结果-内蒙古大学图书馆

A note on constructing binary heaps with periodic networks

INFORMATION PROCESSING LETTERS 2002年第3期83卷 129-134页

作者： Piotrów, M Univ Wroclaw Inst Comp Sci PL-51151 Wroclaw Poland

We consider the problem of constructing binary heaps on constant degree networks performing compare-exchange operations only. The heap data structure, introduced by William and Williams [Comm. ACM 7 (6) (1964) 347-348], has many applications and, therefore, has been intensively studied in sequential and parallel context. In particular, Brodal and Pinotti [Theoret. Comput. Sci. 250 (2001) 235-245] have recently presented two families of comparator networks: the first of depth 4 log N and the second of size O(N log log N) for constructing binary heaps of size N. In this note, we give an new construction of such a network with the running time improved to 3 log N. Moreover, the network has a novel property of being 3-periodic, that is, for each unit of time i the same sets of operations are performed in units i and i + 3. Then we argue that our construction is optimal with respect to the length of the period, that is, we prove that there is no 2-periodic network that is able to build a binary heap in sublinear time. Finally, we show that our construction can be used to decrease also the depth of the networks with O(N log log N) size. (C) 2001 Elsevier Science B.V. All rights reserved.

关键词： parallel algorithms data structures binary heaps periodic networks comparators

来源：评论

学校读者我要写书评

暂无评论

Quantitative performance analysis of the improved quasi-minimal residual method on massively distributed memory computers

引用

ADVANCES IN ENGINEERING SOFTWARE 2002年第3期33卷 169-177页

作者： Yang, LT Brent, RP St Francis Xavier Univ Dept Comp Sci Antigonish NS B2G 2W5 Canada Univ Oxford Comp Lab Oxford OX1 3QD England

For the solutions of linear systems of equations with unsymmetric coefficient matrices, we have proposed an improved version of the quasi-minimal residual (IQMR) method [Proceedings of The International Conference on High Performance Computing and Networking (HPCN-97) (1997);IEICE Trans Inform Syst E80-D (9) (1997) 919] by using the Lanczos process as a major component combining elements of numerical stability and parallel algorithm design. For the Lanczos process, stability is obtained by a coupled two-term procedure that generates Lanczos vectors scaled to unit length. The algorithm is derived so that all inner products and matrix-vector multiplications of a single iteration step are independent and the communication time required for inner product can be overlapped efficiently with computation time. In this paper, a theoretical model of computation and communications phases is presented to allow us to give a quantitative analysis of the parallel performance with a two-dimensional grid topology. The efficiency, speed-up, and runtime are expressed as functions of the number of processors scaled by the number of processors that gives the minimal runtime for the given problem size. The model not only evaluates effectively the improvements in performance due to communication reduction by overlapping, but also provides useful insight into the scalability of the IQMR method. The theoretical results on the performance are demonstrated by experimental timing results carried out on a massively parallel distributed memory Parsytec system. (C) 2002 Published by Elsevier Science Ltd.

关键词： LINEAR systems HIGH performance computing parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

Sublogarithmic deterministic selection on arrays with a reconfigurable optical bus

引用

IEEE TRANSACTIONS ON COMPUTERS 2002年第6期51卷 702-707页

作者： Han, YJ Pan, Y Shen, H Elect Data Syst Inc Troy MI 48098 USA Georgia State Univ Dept Comp Sci Atlanta GA 30303 USA Japan Adv Inst Sci & Technol Grad Sch Informat Sci Tatsunokuchi Ishikawa 9231292 Japan

The Linear Array with a Reconfigurable Pipelined Bus System (LARPBS) is a newly introduced parallel computational model, where processors are connected by a reconfigurable optical bus. In this paper, we show that the selection problem can be solved on the LARPBS model deterministically in O((log log N)(2)/log log jog N) time. To our best knowledge, this is the best deterministic selection algorithm on any model with a reconfigurable optical bus.

关键词： analysis of algorithms massive parallelism optical bus parallel algorithms selection

来源：评论

学校读者我要写书评

暂无评论

Scaling multiple addition and prefix sums on the reconfigurable mesh

引用

INFORMATION PROCESSING LETTERS 2002年第6期82卷 277-282页

作者： Trahan, JL Vaidyanathan, R Louisiana State Univ Dept Elect & Comp Engn Baton Rouge LA 70803 USA

Multiple addition is the problem of adding N b-bit integers. Prefix sums and multiple addition play fundamental roles in many algorithms, particularly on the reconfigurable mesh (R-Mesh). Scaling algorithms on the R-Mesh to run with the same or increased efficiency on fewer processors is a challenging and important proposition. In this paper. we present algorithms that scale with increasing efficiency for multiple addition, prefix sums. and matrix-vector multiplication. Along the way. we obtain an improved multiple addition algorithm. (C) 2001 Elsevier Science B.V. All rights reserved.

关键词： reconfigurable mesh parallel algorithms scalability arithmetic algorithms reconfigurable models

来源：评论

学校读者我要写书评

暂无评论

Near optimal Cholesky factorization on orthogonal multiprocessors

引用

INFORMATION PROCESSING LETTERS 2002年第1期84卷 23-30页

作者： Bansal, SS Vishal, B Gupta, P Indian Inst Technol Dept Comp Sci & Engn Kanpur 208016 Uttar Pradesh India

The effect of data allocation strategies on the running time of parallel Cholesky factorization algorithms on orthogonal multiprocessors has been studied. Four new strategies which give better running time are proposed and their time complexities are analyzed. Finally it is shown that near optimal performance can be obtained using two of our strategies. (C) 2002 Elsevier Science B.V. All rights reserved.

关键词： parallel algorithms Cholesky factorization orthogonal multiprocessors

来源：评论

学校读者我要写书评

暂无评论

A taxonomy of hybrid metaheuristics

引用

JOURNAL OF HEURISTICS 2002年第5期8卷 541-564页

作者： Talbi, EG Lab Informat Fondamentale Lille CNRS URA 369 F-59655 Villeneuve Dascq France

Hybrid metaheuristics have received considerable interest these recent years in the field of combinatorial optimization. A wide variety of hybrid approaches have been proposed in the literature. In this paper, a taxonomy of hybrid metaheuristics is presented in an attempt to provide a common terminology and classification mechanisms. The taxonomy, while presented in terms of metaheuristics, is also applicable to most types of heuristics and exact optimization algorithms. As an illustration of the usefulness of the taxonomy an annoted bibliography is given which classifies a large number of hybrid approaches according to the taxonomy.

关键词： taxonomy combinatorial optimization metaheuristics hybrid algorithms parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

Fast gossiping in square meshes/tori with bounded-size packets

引用

IEEE TRANSACTIONS ON parallel AND DISTRIBUTED SYSTEMS 2002年第4期13卷 349-358页

作者： Lau, FCM Zhang, SH Univ Hong Kong Dept Comp Sci & Informat Syst Hong Kong Hong Kong Peoples R China

Gossiping is the communication problem in which each node has a unique message (token) to be transmitted to every other node. The nodes exchange their tokens by packets. A solution to the problem is judged by how many rounds of packet sending it requires. In this paper, we consider the version of the problem in which small-size packets (each carrying exactly one token) are used, the links (edges) of the network are half-duplex (only one packet can flow through a link at a time), and the nodes are all-port (a node's incident edges can all be active at the same time). This is also known as the H* model. We study the 2D square mesh and the 2D square torus. An improved, asymptotically optimal algorithm for the mesh and an optimal algorithm for the torus are presented.

关键词： gossiping all-to-all broadcast total exchange collective communication parallel algorithms interconnection networks communication optimization scheduling

来源：评论

学校读者我要写书评

暂无评论

An algorithm visualization tool on the reconfigurable mesh

引用

VLSI DESIGN 2002年第3期14卷 239-248页

作者： Bordim, JL Hayashi, T Nakano, K Nagoya Inst Technol Dept Elect & Comp Engn Showa Ku Nagoya Aichi 4668555 Japan

Many parallel algorithms on the reconfigurable mesh have been developed so far. However, it is hard to understand the behavior of these parallel algorithms, mainly because the bus topology dynamically changes during the execution of an algorithm. In this work, we present the visual mesh system (VMesh), a tool for visualizing algorithms on the reconfigurable mesh. The main objective of the VMesh is to provide a comprehensive environment for algorithm visualization and development. The VMesh has shown to be a valuable tool for studying and understanding the behavior of parallel algorithms on the reconfigurable mesh.

关键词： algorithm visualization reconfigurable mesh parallel algorithms field programmable gate arrays

来源：评论

学校读者我要写书评

暂无评论

parallel dynamic programming for solving the string editing problem on a CGM/BSP

Parallel dynamic programming for solving the string editing ...

引用

Fourteenth Annual ACM Symposium on parallel algorithms and Architectures

作者： Alves, C.E.R. Cáceres, E.N. Dehne, F. FTCE Universidade São Judas Tadeu São Paulo SP Brazil Universidade Federal de Mato Grosso do Sul Campo Grande MS Brazil School of Computer Science Carleton University Ottawa Ont. K1S 5B6 Canada

In this paper we present a coarse-grained parallel algorithm for solving the string edit distance problem for a string A and all substrings of a string C. Our method is based on a novel CGM/BSP parallel dynamic programming technique for computing all highest scoring paths in a weighted grid graph. The algorithm requires log p rounds/supersteps and O(p/n2 log m) local computation, where p is the number of processors, p2 ≤ m ≤ n. To our knowledge, this is the first efficient CGM/BSP algorithm for the alignment of all substrings of C with A. Furthermore, the CGM/BSP parallel dynamic programming technique presented is of interest in its own right and we expect it to lead to other parallel dynamic programming methods for the CGM/BSP.

关键词： parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

PRO:a model for parallel resource-optimal computation 16

PRO:a model for parallel resource-optimal computation

引用

16th Annual International Symposium on High Performance Computing Systems and Applications, HPCS 2002

作者： Gebremedhin, Assefaw Hadish Lassous, Isabelle Guérin Gustedt, Jens Telle, Jan Arne Department of Informatics University of Bergen Norway LIP and INRIA Rhone-Alpes France LORIA and INRIA Lorraine France

We present a new parallel computation model that enables the design of resource-optimal scalable parallel algorithms and simplifies their analysis. The model rests on the novel idea of incorporating relative optimalit... 详细信息

ISBN: (纸本)0769516262

关键词： parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：