检索结果-内蒙古大学图书馆

A parallel GRASP heuristic for the 2-path network design problem 8th

8th international Euro-Par conference on parallel processing, Euro-Par 2002

作者： Ribeiro, Celso C. Rosseti, Isabel Department of Computer Science Catholic University of Rio de Janeiro Rua Marquês de São Vicente 225 Rio de Janeiro RJ22453-900 Brazil

ISBN: (纸本)3540440496

We propose a parallel GRASP heuristic with path-relinking for the 2-path network design problem. A parallel strategy for its implementation is described. Computational results illustrating the effectiveness of the new heuristic are reported. the parallel implementation obtains linear speedups on a cluster with 32 machines. © Springer-Verlag Berlin Heidelberg 2002.

关键词： Artificial intelligence

来源：评论

学校读者我要写书评

暂无评论

Compiler-controlled parallelism-independent scheduling for parallel and distributed system 6th

Compiler-controlled parallelism-independent scheduling for p...

引用

6th international conference on Applied parallel Computing, PARA 2002

作者： Nikolova, Kirilka You, Sou Pei Sowa, Masahiro University of Electro-Communications Chofugaoka-1-5-1 Tokyo182-8585 Japan

ISBN: (纸本)354043786X

the objective of the parallelism-independent (PI) scheduling is minimization of the completion time of a parallel application for any number of processing elements in the computing system. We propose several parallelism-independent algorithms which are either applicable for distributed computing systems, i.e. systems of autonomous processors connected via communication links (in this case we provide explicit message communication scheduling) or for tightly coupled multiprocessor systems or architectures exploiting instruction level parallelism as well. the algorithms are hybrid but predominantly done at compile time in order to reduce the dynamic overhead and scheduling hardware. All the traditional static scheduling algorithms produce machine codes with fixed degree of parallelism which cannot be executed efficiently on computer systems with different degrees of parallelism. Our algorithms eliminate this problem closely related to the distribution of parallel programs. © Springer-Verlag Berlin Heidelberg 2002.

关键词： Scheduling

来源：评论

学校读者我要写书评

暂无评论

Efficient power-sum systolic architectures for public-key cryptosystems in GF(2m) 8th

Efficient power-sum systolic architectures for public-key cr...

引用

8th Annual international conference on Computing and Combinatorics, COCOON 2002

作者： Kim, Nam-Yeun Lee, Won-Ho Yoo, Kee-Young Department of Computer Engineering Kyungpook National University Deagu702-701 Korea Republic of

ISBN: (纸本)354043996X

the current paper presents a new algorithm and two architectures for the power-sum operation (AB2 + C) over GF(2m) using a standard basis. the proposed algorithm is based on the MSB-first scheme and the proposed architectures have a low hardware complexity and small latency compared to conventional approaches. In particular, the hardware complexity and latency of the proposed parallel-in parallel-out array are about 19.8% and 25% lower, respectively, than Wei’s. In addition, since the proposed architectures incorporate simplicity, regularity, modularity, and pipelinability, they are well suited to VLSI implementation and can be easily applied to inverse/division architecture. © Springer-Verlag Berlin Heidelberg 2002.

关键词： Public key cryptography

来源：评论

学校读者我要写书评

暂无评论

Sources of parallel inefficiency for incompressible CFD simulations 8th

引用

8th international Euro-Par conference on parallel processing, Euro-Par 2002

作者： Buijssen, Sven H. M. Turek, Stefan INF 294 Heidelberg69120 Germany University of Dortmund Institute for Applied Mathematics and Numerics Vogelpothsweg 87 Dortmund44227 Germany

ISBN: (纸本)3540440496

parallel multigrid methods are very prominent tools for solving huge systems of (non-)linear equations arising from the discretisation of PDEs, as for instance in Computational Fluid Dynamics (CFD). the superiority of multigrid methods in regard of numerical complexity mainly stands and falls with the smoothing algorithms (‘smoother’) used. Since the inherent highly recursive character of many global smoothers (SOR, ILU) often impedes a direct parallelisation, the application of block smoothers is an alternative. However, due to the weakened recursive character, the resulting parallel efficiency may decrease in comparison to the sequential performance, due to a weaker total numerical efficiency. Within this paper, we show the consequences of such a strategy for the resulting total efficiency if incorporated into a parallel CFD solver for 3D incompressible flow. Moreover, we compare this parallel version with the related optimised sequential code in FeatFlow and we analyse the numerical losses of parallel efficiency due to communication costs, numerical efficiency and finally the choice of programming language (C++ vs. F77). Altogether, we obtain quite surprising, but more realistic estimates for the total efficiency of such a parallel CFD tool in comparison to the related ‘optimal’ sequential version. © Springer-Verlag Berlin Heidelberg 2002.

关键词： Computational fluid dynamics

来源：评论

学校读者我要写书评

暂无评论

A parallel solution in texture analysis employing a massively parallel processor 8th

引用

8th international Euro-Par conference on parallel processing, Euro-Par 2002

作者： Svolos, Andreas I. Konstantopoulos, Charalambos Kaklamanis, Christos Computer Technology Institute and Computer Engineering and Informatics Dept Univ. of Patras PatrasGR 265 00 Greece

ISBN: (纸本)3540440496

Texture is a fundamental feature for image analysis, classification, and segmentation. therefore, the reduction of the time needed for its description in a real application environment is an important objective. In this paper, a texture description algorithm running over a hypercube massively parallel processor, is presented and evaluated through its application in real texture analysis. It is also shown that its hardware requirements can be tolerated by modern VLSI technology. © Springer-Verlag Berlin Heidelberg 2002.

关键词： Image segmentation

来源：评论

学校读者我要写书评

暂无评论

A duality theorem for two connectivity-preserving parallel shrinking transformations

引用

FUTURE GENERATION COMPUTER SYSTEMS 2002年第7期18卷 931-937页

作者： Umeo, H Mauri, G Osaka Electrocommun Univ Grad Sch Engn Fac Informat Sci & Technol Dept Informat Engn Neyagawa Osaka 5728530 Japan Univ Milano Bicocca Dipartimento Informat Sistemat & Comun I-20126 Milan Italy

We study two classical connectivity-preserving parallel shrinking algorithms proposed to recognize and label two-dimensional connected components of binary images. the algorithms we consider were developed by Beyer [Recognition of topological invariants by iterative arrays, Ph.D. thesis, MIT, 1969, p. 144] and Levialdi [Commun. ACM 15 (1) (1972) 7] independently for the purpose of shrinking 4-connected and 8-connected components of binary images in linear time, respectively. It is shown that those two independently developed algorithms are closely related and in a sense they are in a dual relation such that, for any initially given binary image and its inverted one, one algorithm produces, simultaneously, an image which is dual of the one produced by the other, step-by-step. (C) 2002 Elsevier Science B.V. All rights reserved.

关键词： binary image processing connectivity-preserving parallel shrinking algorithm connectivity recognition labeling problem cellular automaton

来源：评论

学校读者我要写书评

暂无评论

parallel iterative methods for navier-stokes equations and application to stability assessment 8th

引用

8th international Euro-Par conference on parallel processing, Euro-Par 2002

作者： Graham, Ivan G. Spence, Alastair Vainikko, Eero Department of Mathematical Sciences University of Bath BathBA2 7AY United Kingdom

ISBN: (纸本)3540440496

We describe the construction of parallel iterative solvers for finite element approximations of the Navier-Stokes equations on unstructured grids using domain decomposition methods. the iterative method used is FGMRES, preconditioned by a parallel adaptation of a recent block preconditioner proposed by Kay, Loghin and Wathen. the parallelisation is achieved by adapting the technology of our domain decomposition solver DOUG (previously used for scalar problems) to block-systems. An application of the resultant linear solver to the stability assessment of flows is briefly indicated. © Springer-Verlag Berlin Heidelberg 2002.

关键词： Navier Stokes equations

来源：评论

学校读者我要写书评

暂无评论

Instant-access cycle-stealing for parallel applications requiring interactive response 8th

引用

8th international Euro-Par conference on parallel processing

作者： Kelly, PHJ Pelagatti, S Rossiter, M Univ London Imperial Coll Sci Technol & Med Dept Comp London SW7 2BZ England

ISBN: (纸本)3540440496

In this paper we study the use of idle cycles in. a network of desktop workstations under unfavourable conditions: we aim to use idle cycles to improve the responsiveness of interactive applications through parallelism. Unlike much prior work in the area, our focus is on response time, not throughput, and short jobs - of the order of a few seconds. We therefore assume a high level of primary activity by the desktop workstations' users, and aim to keep interference with their work within reasonable limits. We present a fault-tolerant, low-administration service for identifying idle machines, which can usually assign a group of processors to task in less than 200ms. Unusually, the system has no job queue: each job is started immediately with the resources which are predicted to be available. Using trace-driven simulation we study allocation policy for a stream of parallel-jobs. Results show that even under heavy load it is possible to accommodate multiple concurrent guest jobs and obtain good speedup with very small disruption of host applications.

关键词： parallel computing cycle stealing performance prediction

来源：评论

学校读者我要写书评

暂无评论

Designing scalable object oriented parallel applications 8th

引用

8th international Euro-Par conference on parallel processing, Euro-Par 2002

作者： Sobral, João Luís Proença, Alberto José Departamento de Informática - Universidade do Minho Braga4710 - 057 Portugal

ISBN: (纸本)3540440496

the SCOOPP (Scalable Object Oriented parallel Programming) system efficiently adapts, at run-time, an object oriented parallel application to any distributed memory system. It extracts as much parallelism as possible at compile time, and it removes excess of parallel tasks and messages through run-time packing. these object and call aggregation techniques are briefly presented. A design methodology was developed for three main types of scalable applications: pipeline, divide & conquer and farming. this paper reviews how the method can help programmers to design portable and efficient parallel applications. It details its application to a farming case study (image threshold) with measured performance data, and compares with programmer’s tuned versions in a Pentium cluster. © Springer-Verlag Berlin Heidelberg 2002.

关键词： parallel programming

来源：评论

学校读者我要写书评

暂无评论

parallelizing the data cube 1

引用

8th international conference on Database theory (ICDT 2001)

作者： Dehne, F Eavis, T Hambrusch, S Rau-Chaplin, A Carleton Univ Sch Comp Sci Ottawa ON K1S 5B6 Canada Dalhousie Univ Fac Comp Sci Halifax NS B3H 1W5 Canada Purdue Univ Dept Comp Sci W Lafayette IN 47907 USA

ISBN: (数字)9783540445036

ISBN: (纸本)9783540414568

this paper presents a general methodology for the efficient parallelization of existing data cube construction algorithms. We describe two different partitioning strategies, one for top-down and one for bottom-up cube algorithms. Both partitioning strategies assign subcubes to individual processors in such a way that the loads assigned to the processors are balanced. Our methods reduce inter processor communication overhead by partitioning the load in advance instead of computing each individual group-by in parallel. Our partitioning strategies create a small number of coarse tasks. this allows for sharing of prefixes and sort orders between different group-by computations. Our methods enable code reuse by permitting the use of existing sequential (external memory) data cube algorithms for the subcube computations on each processor. this supports the transfer of optimized sequential data cube code to a parallel setting. the bottom-up partitioning strategy balances the number of single attribute external memory sorts made by each processor. the top-down strategy partitions a weighted tree in which weights reflect algorithm specific cost measures like estimated group-by sizes. Both partitioning approaches can be implemented on any shared disk type parallel machine composed of p processors connected via an interconnection fabric and with access to a shared parallel disk array. We have implemented our parallel top-down data cube construction method in C++ with the MPI message passing library for communication and the LEDA library for the required graph algorithms. We tested our code on an eight processor cluster, using a variety of different data sets with a range of sizes, dimensions, density, and skew. Comparison tests were performed on a SunFire 6800. the tests show that our partitioning strategies generate a close to optimal load balance between processors. the actual run times observed show an optimal speedup of p.

关键词： OLAP data cube parallel processing partitioning load balancing

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：