检索结果-内蒙古大学图书馆

A note on parallel selection on coarse-grained multicomputers

algorithmICA 1999年第3-4期24卷 371-380页

作者： Saukas, ELG Song, SW Univ Sao Paulo Dept Comp Sci Inst Math & Stat BR-05508900 Sao Paulo SP Brazil

Consider the selection problem of determining the kth smallest element of a set of n elements. Under the CGM (coarse-grained multicomputer) model with p processors and O(n/p) local memory, we present a deterministic parallel algorithm for the selection problem that requires O(log p) communication rounds. Besides requiring a low number of communication rounds, the algorithm also attempts to minimize the total amount of data transmitted in each round (only O(p) except in the last round). In addition to showing theoretical complexities, we present very promising experimental results obtained on a parallel machine that show almost linear speedup, indicating the efficiency and scalability of the proposed algorithm.

关键词： coarse-grained multicomputer parallel algorithm selection problem

来源：评论

学校读者我要写书评

暂无评论

Optimal parallel clustering algorithms on a reconfigurable array of processors with wider bus networks

引用

IMAGE AND VISION COMPUTING 1999年第13期17卷 925-936页

作者： Tsai, HR Horng, SJ Natl Taiwan Univ Sci & Technol Dept Elect Engn Taipei 10764 Taiwan Overseas Chinese Coll Commerce Dept Informat Management Taichung Taiwan

Clustering techniques are usually used in pattern recognition, image segmentation and object detection. For N patterns and k centers each with M features, in this paper, we first design an O(kM) time optimal parallel algorithm for one pass process of clustering with the k-means method on a linear array of processors with a wider bus network using N1+1/epsilon processors with one bus network, where c is any constant and c greater than or equal to 1. Then, based on the proposed algorithm, two O(k) and O(1) time optimal parallel clustering algorithms are also derived using MN1+1/epsilon and kMN(1+1/epsilon) processors with M row and MN row bus networks, respectively. These results improve the best known bounds and achieve cost optimal in their time and processor complexities. (C) 1999 Elsevier Science B.V. All rights reserved.

关键词： k-means method cluster analysis pattern cluster image processing pattern recognition parallel algorithm array of processors with a wider bus network (RAPWBN)

来源：评论

学校读者我要写书评

暂无评论

parallel decomposition of generalized series-parallel graphs

引用

JOURNAL OF INFORMATION SCIENCE AND ENGINEERING 1999年第3期15卷 407-417页

作者： Ho, CW Hsieh, SY Chen, GH Natl Cent Univ Dept Comp Sci & Informat Engn Chungli 320 Taiwan Natl Taiwan Univ Dept Comp Sci & Informat Engn Taipei 106 Taiwan

Generalized series-parallel (GSP) graphs belong to the class of decomposable graphs which can be represented by their decomposition trees. Given a decomposition tree of a GSP graph, there are many graph-theoretic problems which can be solved efficiently. An efficient parallel algorithm for constructing a decomposition tree of a given GSP graph is presented. It takes O(log n) time with C(m, n) processors on a CRCW PRAM, where C(m, n) is the number of processors required to find connected components of a graph with m edges and n vertices in logarithmic time. Based on our algorithmic results, we also derive some properties for GSP graphs, which may be of interest in and of themselves.

关键词： parallel algorithm generalized series-parallel graph CRCW PRAM decomposable graph decomposition tree

来源：评论

学校读者我要写书评

暂无评论

Load balancing for the parallel map overlay-operation in the geographic information system

引用

JOURNAL OF INFORMATION SCIENCE AND ENGINEERING 1999年第3期15卷 441-449页

作者： Ching, YT Natl Tsing Hua Univ Dept Comp & Informat Sci Hsinchu 300 Taiwan

The map overlay-operation is one of the most important and most time consuming operations in the Geographic Information System. In general, a map consists of a huge number of line segments. The major cost for overlaying two maps is that of computing the intersection points between the line segments. parallel processing is one of the ways to reduce the computing time. If one can partition the line segments into independent subsets, then these subsets can be processed in different processors simultaneously;thus, the computing time can be reduced. In this paper, we consider the problem of partitioning line segments into independent sets such that the load is balanced among the processors. An easy yet effective strategy is proposed to balance the load for a multi-processor computer which does not have many processors. The proposed algorithm can achieve good load balance when the average length of the line segments is short compared to the width of a map.

关键词： geographic information system parallel algorithm load balancing computational geometry line segments intersection

来源：评论

学校读者我要写书评

暂无评论

A parallel Gauss-Seidel method using NR data flow ordering

引用

APPLIED MATHEMATICS AND COMPUTATION 1999年第2-3期99卷 209-220页

作者： Kim, T Lee, CO Inha Univ Dept Math Nam Gu Inchon 402751 South Korea

A parallel variant of the block Gauss-Seidel method is presented to solve the Poisson equation with Dirichlet boundary condition. This method uses two-dimensional logically connected parallel processors. Furthermore, natural rowwise (NR) data flow block ordering is used so that its convergence rate is the same as that of the standard block Gauss-Seidel method. Spectral radius is determined by the formula for general k x I block iterative methods. Numerical computations on a parallel computer are included. (C) 1999 Published by Elsevier Science Inc. All rights reserved.

关键词： Gauss-Seidel method NR data flow ordering parallel algorithm

来源：评论

学校读者我要写书评

暂无评论

An efficient general in-place parallel sorting scheme

引用

JOURNAL OF SUPERCOMPUTING 1999年第1期14卷 5-17页

作者： Zheng, SQ Calidas, B Zhang, YJ Univ Texas Dept Comp Sci Richardson TX 75083 USA Louisiana State Univ Dept Elect & Comp Engn Baton Rouge LA 70803 USA So Methodist Univ Dept Comp Sci & Engn Dallas TX 75275 USA

We present a simple and general parallel sorting scheme, ZZ-sort, which can be used to derive a class of efficient in-place sorting algorithms on realistic parallel machine models. We prove a tight bound for the worst case performance of ZZ-sort. We also demonstrate the average performance of ZZ-sort by experimental results obtained on a MasPar parallel computer. Our experiments indicate that ZZ-sort can be incorporated into a distributed memory parallel computer system as a standard routine, and this routine is useful for space critical situations. Finally, we show that ZZ-sort can be used to convert a non-adaptive parallel sorting algorithm into an in-place and adaptive one by considering the problem of sorting an arbitrarily large input on fixed-size reconfigurable meshes.

关键词： parallel algorithm parallel architecture performance evaluation scalability sorting supercomputing

来源：评论

学校读者我要写书评

暂无评论

Implicit residual smoothing in a parallel 2D explicit Euler solver

引用

INTERNATIONAL JOURNAL OF COMPUTER MATHEMATICS 1999年第3期72卷 313-324页

作者： Gasparo, MG Pieraccini, S Univ Florence Dipartimento Energet S Stecco I-50134 Florence Italy Univ Milan Dipartimento Matemat F Enriques I-20133 Milan Italy

This paper deals with the parallel implementation on distributed memory architectures of the implicit residual smoothing procedure in the context of a explicit method for two dimensional inviscid flows. The governing equations are discretized by a cell centered finite volume method and the time integration is performed by a explicit Runge Kutta method. Artificial dissipation and implicit residual smoothing are used in order to stabilize and speed up the method. The parallelism is introduced by grid partitioning. The parallel implementation of the residual smoothing, a inherently implicit procedure, is crucial for the efficiency of the method. Here, two different parallel residual smoothing strategies are discussed and some experimental results are given to illustrate parallel performances of the proposed strategies.

关键词： computational fluid dynamics implicit residual smoothing parallel algorithm

来源：评论

学校读者我要写书评

暂无评论

Optimal computing the chessboard distance transform on parallel processing systems

引用

COMPUTER VISION AND IMAGE UNDERSTANDING 1999年第3期73卷 374-390页

作者： Lee, YH Horng, SJ Natl Taiwan Univ Sci & Technol Dept Elect Engn Taipei Taiwan

The distance transform (DT) is an image computation tool which can be used to extract the information about the shape and the position of the foreground pixels relative to each other. It converts a binary image into a grey-level image, where each pixel has a value corresponding to the distance to the nearest foreground pixel. The time complexity for computing the distance transform is fully dependent on the different distance metrics. Especially, the more exact the distance transform is, the worse execution time reached will be, Nowadays, quite often thousands of images are processed in a limited time. It seems quite impossible for a sequential computer to do such a computation for the distance transform in real time. In order to provide efficient distance transform computation, it is considerably desirable to develop a parallel algorithm for this operation. In this paper, based on the diagonal propagation approach, we first provide an O(N-2) time sequential algorithm to compute the chessboard distance transform (CDT) of an N x N image, which is a DT using the chessboard distance metrics. Based on the proposed sequential algorithm, the CDT of a 2D binary image array of size N x N can be computed in O(log N) time on the EREW PRAM model using O(N-2/log N) processors, O(log log N) time on the CRCW PRAM model using O(N-2/log log N) processors, and O(log N) time on the hypercube computer using O(N-2/log N) processors. Following the mapping as proposed by Lee and Horng, the algorithm for the medial axis transform is also efficiently derived. The medial axis transform of a 2D binary image array of size N x N can be computed in O(log N) time on the EREW PRAM model using O(N-2/log N) processors, O(log log N) time on the CRCW PRAM model using O(N-2/log log N) processors, and O(log N) time on the hypercube computer using O(N-2/log N) processors. The proposed parallel algorithms are composed of a set of prefix operations. In each prefix operation phase, only increase (add-one)

关键词： chessboard distance computer vision CRCW PRAM model distance transform EREW PRAM model hypercube computer image processing medial axis transform parallel algorithm

来源：评论

学校读者我要写书评

暂无评论

New parallel randomized algorithms for the traveling salesman problem

引用

COMPUTERS & OPERATIONS RESEARCH 1999年第4期26卷 371-394页

作者： Shi, LY Olafsson, S Sun, N Univ Wisconsin Dept Ind Engn Madison WI 53706 USA

We recently developed a new randomized optimization framework, the Nested Partitions (NP) method. This approach uses partitioning, global random sampling, and local search heuristics to create a Markov chain that has global optima as its absorbing states. This new method combines global and local search in a natural way and it is highly matched to emerging massively parallel processing capabilities. In this paper, we apply the NP method to the Travelling Salesman Problem. Preliminary numerical results show that the NP method generates high-quality solutions compared to well-known heuristic methods, and that it can be a very promising alternative for finding a solution to the TSP. (C) 1999 Elsevier Science Ltd. All rights reserved.

关键词： optimization randomized algorithm parallel algorithm traveling salesman problem

来源：评论

学校读者我要写书评

暂无评论

Monte Carlo simulations of water clusters on a parallel computer using an ab initio potential

引用

INTERNATIONAL JOURNAL OF QUANTUM CHEMISTRY 1999年第6期74卷 709-719页

作者： Akhmatskaya, EV Cooper, MD Burton, NA Masters, AJ Hillier, IH Univ Manchester Dept Chem Manchester M13 9PL Lancs England

We performed a simulation of a cluster of five water molecules at 300 K using an ab initio potential. In our first study, the interactions were calculated at the Hartree-Fock level using a 6-31G* basis set. A parallel big move (hybrid) Monte Carlo algorithm was used to conduct this modeling. We compared the results obtained for this "quantum" system with those obtained from a conventional simulation employing an effective pair potential. We also estimated properties of the quantum system in terms,of the configurations generated by the conventional simulation by employing the appropriate Boltzmann weighting. These estimates are in good agreement with those obtained from the full quantum simulation. We then repeated the Boltzmann weighting method, but this time using the BLYP density functional in our ab initio calculations, so as to include correlation effects. (C) 1999 John Wiley & Sons, Inc.

关键词： water cluster hybrid Monte Carlo parallel algorithm ab initio

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：