检索结果-内蒙古大学图书馆

International Conference on parallel Computing in Electrical Engineering (PAR ELEC 2002)

作者： Yang, LT St Francis Xavier Univ Dept Comp Sci Antigonish NS B2G 2W5 Canada

ISBN: (纸本)0769517307;0769517315

The PROUD module placement algorithm mainly uses a hierarchical decomposition technique and the solution of sparse linear systems based on a resistive network analogy. It has been shown that the PROUD algorithm can achieve a comparable design of the placement problems for very large circuits with the best placement algorithm based on simulated annealing, but with several order of magnitude faster. The modified PROUD, namely MPROUD algorithm by perturbing the coefficient matrices performs much faster that the original PROUD algorithm. Due to the instability and unguaranteed convergence of MPROUD algorithm, we have proposed a new convergent and numerically stable PROUD, namely Improved PROUD algorithm, denoted as IPROUD with attractive computational costs to solve the module placement problems by making use of the SYMMLQ and MINRES methods based on Lanczos process in [11]. In this paper, we subsequently propose parallel versions of the improved PROUD algorithms. The parallel algorithm is derived such that all inner products and matrix-vector multiplications of a single iteration step am independent. Therefore, the cost of global communication which represents the bottleneck of the parallel performance on parallel distributed memory computers can be significantly reduced, therefore, to obtain another order of magnitude improvement in the runtime without loss of the quality of the layout.

关键词： Algorithm design and analysis Circuit simulation Computational efficiency Convergence of numerical methods Costs Global communication Linear systems Memory architecture parallel algorithms Simulated annealing

来源：评论

学校读者我要写书评

暂无评论

parallel hybrid adventures with simulated annealing and genetic algorithms

Parallel hybrid adventures with simulated annealing and gene...

引用

6th International Symposium on parallel Architectures, algorithms and Networks (I-SPAN 02)

作者： Calaor, AE Hermosilla, AY Corpus, BO Univ Philippines Dept Math Quezon City 1101 Philippines

ISBN: (纸本)0769515797

In this study, a solution to the school timetabling problem using parallel genetic algorithm with simulated annealing is presented. The hybridization of simulated annealing and parallel genetic algorithm is explained. Also, how these algorithms are run in parallel on a local network of workstations are discussed. Some comparative results among the different parallel models are exhibited. The implementation of the parallel algorithms are used to construct conflict-free and satisfying timetables for the Department of Mathematics of the University of the Philippines Diliman. The program output of this study can be easily modified to be used as a helpful and efficient guide to the decision-making process of the scheduler.

关键词： Cities and towns Data systems Educational institutions Electronics packaging Genetic algorithms Mathematics parallel algorithms parallel processing Simulated annealing Workstations

来源：评论

学校读者我要写书评

暂无评论

Calculational design of special purpose parallel algorithms

Calculational design of special purpose parallel algorithms

引用

7th IEEE International Conference on Electronics, Circuits and Systems, ICECS 2000

作者： Abdallah, Ali E. Hawkins, John South Bank University Borough Road London United Kingdom

ISBN: (纸本)0780365429

This paper adopts a transformational programming approach for deriving massively parallel algorithms from functional specific ations. It gives a brief description of a framework for relating key higher order functions such as map, reduce, and scan with communicating processes with differ entconfigurations. The parallelisation of many interesting functional algorithms can then be systematically synthesized by combining "off the shelf" parallel implementations of instances of these higher order functions. Efficiency in the final message- passing algorithms is achieved by exploiting data parallelism, for generating the intermediate results in jarallel;and functional parallelism, for processing intermediate results in stages such that the output of one stage is simultaneously input to the next one. This approach is illustrated through a case study for testing whether all the elements of a given list are distinct. Bird-Meertens Formalism is used to concisely carry out algebraic transformations. © 2000 IEEE.

关键词： parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

Coarse grained parallel algorithms for detecting convex bipartite graphs 26th

Coarse grained parallel algorithms for detecting convex bipa...

引用

26th International Workshop on Graph-Theoretic Concepts in Computer Science, WG 2000

作者： Cáceres, Edson Chan, Albert Dehne, Frank Prencipe, Giuseppe Departamento de Computação e Estatística Universidade Federal de Mato Grosso do Sul Campo Grande Brazil School of Computer Science Carleton University OttawaK1S 5B6 Canada Dipartimento di Informatica Corso Italia 40 Pisa56125 Italy

ISBN: (纸本)3540411836

In this paper, we present parallel algorithms for the coarse grained multicomputer (CGM) and the bulk synchronous parallel computer (BSP) for solving two well known graph problems: (1) determining whether a graph G is bipartite, and (2) determining whether a bipartite graph G is convex. Our algorithms require O(log p) and O(log2 p) communication rounds, respectively, and linear sequential work per round on a CGM with p processors and N/p local memory per processor, N=|G|. The algorithms assume that N/ p ≥ p€ for some fixed€ > 0, which is true for all commercially available multiprocessors. Our results imply BSP algorithms with O(log p) and O(log2 p) supersteps, respectively, O(g log(p) N p) communication time, and O(log(p) N p) local computation time. Our algorithm for determining whether a bipartite graph is convex includes a novel, coarse grained parallel, version of the PQ tree data structure introduced by Booth and Lueker. Hence, our algorithm also solves, with the same time complexity as indicated above, the problem of testing the consecutive-ones property for (0, 1) matrices as well as the chordal graph recognition problem. These, in turn, have numerous applications in graph theory, DNA sequence assembly, database theory, and other areas. © Springer-Verlag Berlin Heidelberg 2000.

关键词： parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

A class of parallel algorithms for solving large sparse linear systems on multiprocessors 4

A class of parallel algorithms for solving large sparse line...

引用

4th International Conference/Exhibition on High Performance Computing in the Asia-Pacific Region, HPC-Asia 2000

作者： Wang, Xiaoge Chen, R.M.M. Wu, Xue An, Xinghua Department of Computer Science and Technology Tsinghua University Beijing China Department of Electronic Engineering City University of Hong Kong Hong Kong Hong Kong

ISBN: (纸本)0769505902

We present a class of new parallel algorithms for solving large sparse linear systems with special structure on distributed memory multiprocessor systems such as PC clusters. The objective of these algorithms is to reduce the communication between processors so that they could be efficiently implemented. These algorithms are implemented on a cluster of PCs. The experiment results are presented and discussed. © 2000 IEEE.

关键词： parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

Linear expressing based approach for optimizing locality using non-singular loop transformations

引用

Jisuanji Xuebao/Chinese Journal of Computers 2003年第12期26卷 1609-1620页

作者： Xia, Jun Dai, Hua-Dong Yang, Xue-Jun Inst. of Comp. Natl. Univ. of Defense Technol. Changsha 410073 China

Exploiting programs' locality is one of the most important problems in parallel compiling optimization and the program transformations are one of the most important approaches in exploiting programs' temporal locality and spatial locality. The paper presents a new locality optimization approach using non-singular loop transformations to optimize programs' locality, namely linear expressing based loop transformations. This approach uses a group of the least linearly independent vectors to express array accesses' subscripts, and then constructs a non-singular loop transformation matrix to optimize array accesses' temporal locality and spatial locality. The approach can fully exploit array accesses' temporal locality, and easily determine whether array accesses' temporal locality or spatial locality can be exploited, it can also simultaneously optimize the given loop nest's temporal locality and spatial locality. The experimental results show that the linear expressing based approach for optimizing locality using non-singular loop transformations presented in this paper is effective.

关键词： parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

parallel distance-k coloring algorithms for numerical optimization 8th

引用

8th International Euro-Par Conference on parallel Processing, Euro-Par 2002

作者： Gebremedhin, Assefaw Hadish Manne, Fredrik Pothen, Alex Department of Informatics University of Bergen BergenN-5020 Norway Computer Science Department Old Dominion University NorfolkVA23529 United States CSRI Sandia National Labs AlbuquerqueNM87185 United States ICASE NASA Langley Research Center HamptonVA23681-2199 United States

ISBN: (纸本)3540440496

Matrix partitioning problems that arise in the efficient estimation of sparse Jacobians andHessians can be modeledusing variants of graph coloring problems. In a previous work [6], we argue that distance-2 and distance-(formula presented) graph coloring are robust andflexible formulations of the respective matrix estimation problems. The problem size in large-scale optimization contexts makes the matrix estimation phase an expensive part of the entire computation both in terms of execution time andmemory space. Hence, there is a needfor both sharedand distributed-memory parallel algorithms for the stated graph coloring problems. In the current work, we present the first practical shared address space parallel algorithms for these problems. The main idea in our algorithms is to randomly partition the vertex set equally among the available processors, let each processor speculatively color its vertices using information about already colored vertices, detect eventual conflicts in parallel, andfinally re-color conflicting vertices sequentially. Randomization is also usedin the coloring phases to further reduce conflicts. Our PRAM-analysis shows that the algorithms shouldgiv e almost linear speedup for sparse graphs that are large relative to the number of processors. Experimental results from our OpenMP implementations on a Cray Origin2000 using various large graphs show that the algorithms indeed yield reasonable speedup for modest numbers of processors. © Springer-Verlag Berlin Heidelberg 2002.

关键词： parallel algorithms

来源：评论

学校读者我要写书评

暂无评论

parallel algorithms for detecting hazards in combinational logic circuits

Parallel algorithms for detecting hazards in combinational l...

引用

IEEE Region 10 International Conference TENCON

作者： E.C. Tan School of Computer Engineering Nanyang Technological University Singapore

Data and control parallelism algorithms are described for a matrix method which detects and locates the presence of logic hazards in combinational logic circuits. Examples are given for illustration.

关键词： parallel algorithms Hazards Combinational circuits Propagation delay parallel processing Pulse circuits Input variables Logic Circuit analysis Circuit testing

来源：评论

学校读者我要写书评

暂无评论

parallel algorithms for Radiation Transport on Unstructured Grids

Parallel Algorithms for Radiation Transport on Unstructured ...

引用

Supercomputing Conference

作者： S. Plimpton B. Hendrickson S. Burns W. McLendon Sandia National Laboratories Albuquerque NM USA Texas A&M University College Station TX USA

The method of discrete ordinates is commonly used to solve the Boltzmann radiation transport equation for applications ranging from simulations of fires to weapons effects. The equations are most efficiently solved by sweeping the radiation flux across the computational grid. For unstructured grids this poses several interesting challenges, particularly when implemented on distributed-memory parallel machines where the grid geometry is spread across processors. We describe a asynchronous, parallel, message-passing algorithm that performs sweeps simultaneously from many directions across unstructured grids. We identify key factors that limit the algorithm’s parallel scalability and discuss two enhancements we have made to the basic algorithm: one to prioritize the work within a processor’s subdomain and the other to better decompose the unstructured grid across processors. Performance results are give for the basic and enhanced algorithms implemented withi a radiation solver running on hundreds of processors of Sandia’s Intel Tflops machine and DEC-Alpha CPlant cluster.

关键词： parallel algorithms Equations Clustering algorithms Computational modeling Fires Weapons Grid computing parallel machines Geometry Scalability

来源：评论

学校读者我要写书评

暂无评论

UNIFIED parallel algorithms FOR GAUSSIAN ELIMINATION WITH BACKWARD SUBSTITUTION ON PRODUCT NETWORKS

引用

parallel algorithms and Applications 2000年第4期14卷 253-269页

作者： Abdel-Elah Al-Ayyoub - Tel.: (330) 972-8004. Fax: (330) 374-8630. E-mail: ayyoub@cs.uakron.edu[a] Khaled Day - E-mail: kday@***.[b] [a] Department of Mathematics and Computer Science The University of Akron Akron Ohio USA [b] Department of Computer Science Sultan Qaboos University Al-Khod Muscat Sultanate of Oman

The increasing interest in product networks (PNs) as a method of combining desirable properties of component networks, has prompted a need for the general study of the algorithmic issues related to this important class of interconnection networks. In this paper we present unified parallel algorithms for Gaussian elimination, with partial and complete pivoting, on product networks. A parallel algorithm for backward substitution is also presented. The proposed algorithms are network independent and are also independent of the matrix distribution methods employed. These algorithms can be used on a wide range of PNs including hypercube, mesh, and k-ary n-cube. Unified models for estimating computation time and interprocessor communication time are also presented. These models are then used to measure the performance of the proposed algorithms on several product networks

关键词： Product networks Interconnection networks Gaussian elimination Backward substitution parallel algorithms Performance evaluation

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：