Floorplanning is a critical phase in physical design of VLSI circuits and has been acknowledged as a compute-intensive process. As a result, several research efforts have been undertaken to parallelize the algorithm. ...
详细信息
ISBN:
(纸本)0780386477
Floorplanning is a critical phase in physical design of VLSI circuits and has been acknowledged as a compute-intensive process. As a result, several research efforts have been undertaken to parallelize the algorithm. While previous work has been focused on slicing floorplan, we present a parallel algorithm for non-slicing floorplan using Corner Block List (CBL) topological representation. In this paper, a parallel interconnection cost calculation algorithm with load balancing strategy is initiated in order to speed up the especially time consuming wire length calculation in floorplanning. Multiple Markov chains strategy is also embedded in our algorithm. The experiment results obtained from the tests on MCNC benchmarks indicate considerable speedup and preserved floorplanning quality.
A parallel algorithm for Hough transform on a linear array with reconfigurable pipeline bus system (LARPBS) is presented. Suppose the number of theta-values to be considered is m, for an image with n x n pixels, the a...
详细信息
A parallel algorithm for Hough transform on a linear array with reconfigurable pipeline bus system (LARPBS) is presented. Suppose the number of theta-values to be considered is m, for an image with n x n pixels, the algorithm can complete Hough transform in O(1) time using mn(2) processors and achieve optimal speed and efficiency. We also illustrate how to partition data and perform the algorithm on a LARPBS with fewer than mn(2) processors, and hence show that the algorithm is highly scalable.
This paper discusses the possibility of parallel images processing. It proposes several parallel algorithms for primary processing of grey and colour images by method of scanning mask. An evaluation of proposed algori...
详细信息
ISBN:
(纸本)9789549641387
This paper discusses the possibility of parallel images processing. It proposes several parallel algorithms for primary processing of grey and colour images by method of scanning mask. An evaluation of proposed algorithms has been made here.
Rosenbrock methods are frequently used for the numerical solution of stiff initial value problems. Such linearly implicit methods are characterized by a relatively easy implementation together with excellent linear st...
详细信息
Rosenbrock methods are frequently used for the numerical solution of stiff initial value problems. Such linearly implicit methods are characterized by a relatively easy implementation together with excellent linear stability properties. In this paper, we consider modified Rosenbrock methods with s external linearly implicit stages each of which contains p additional linearly implicit internal stages. The internal stages are already parallel so that they can be solved for independently of each other and, consequently, the processors need to exchange their-results only after the completion, of each of the s external stages. We focus on the design of fourth-order methods with three external stages. Using embedded third-order methods, a variable step size implementation is compared with well-known Rosenbrock codes for performance on the Robertson problem. (C) 2005 Elsevier Ltd. All rights reserved.
This paper describes the design of the Abstract Library for parallel Search (ALPS), a framework for implementing scalable, parallel algorithms based on tree search. ALPS is specifically designed to support data-intens...
详细信息
This paper describes the design of the Abstract Library for parallel Search (ALPS), a framework for implementing scalable, parallel algorithms based on tree search. ALPS is specifically designed to support data-intensive algorithms, in which large amounts of data are required to describe each node in the search tree. Implementing such algorithms in a scalable manner is challenging both because of data storage requirements and communication overhead. ALPS incorporates a number of new ideas to address this challenge. The paper also describes the design of two other libraries forming a hierarchy built on top of ALPS. The first is the Branch, Constrain, and Price Software (BiCePS) library, a framework that supports the implementation of parallel branch and bound algorithms in which the bounds are obtained by solving some sort of relaxation, usually Lagrangian. In this layer, the notion of global data objects associated with the variables and constraints is introduced. These global objects provide a connection between the various subproblems in the search tree, but they pose further difficulties for designing scalable algorithms. The other library is the BiCePS linear integer solver (BLIS), a concretization of BiCePS, in which linear programming is used to obtain bounds in each search tree node.
The partial differential equations are often used in science and engineering *** solutions cannot be got analytical,so the numerical methods are often used to get approximate *** achieve high precision,more computing ...
详细信息
The partial differential equations are often used in science and engineering *** solutions cannot be got analytical,so the numerical methods are often used to get approximate *** achieve high precision,more computing time is *** the long time in practical applications is not often allowed,so the precision of final results has to be *** on cluster system,this paper studies the parallel multi-grid method and its application in the numerical analysis of heat *** results from the sample show that the method not only can expand the size of solved problems efficiently,but also can gain excellent parallel efficiency;therefore it is a method suitable for network parallel environment based on cluster system.
This paper presents a novel parallel algorithm for multispectral image fusion called the s-PCT. The algorithm is designed based on a manager-worker threading concept and goes further to use redundancy to achieve fault...
详细信息
This paper presents a novel parallel algorithm for multispectral image fusion called the s-PCT. The algorithm is designed based on a manager-worker threading concept and goes further to use redundancy to achieve fault tolerance. The paper emphasizes asymptotic analysis using analytical models and empirical studies that allow performance to be assessed for a wide variety of typical variations in use. For example, changes to the number of spectra, image resolution, number of processors, degree of redundancy, and reliability. Software reliability concept is studied and applied to the algorithm analysis. The performance and reliability relationship is also defined. The motivation in deriving the performance-reliability model is to assess the impact of changes in technology and problem size associated with different applications, allowing performance-reliability tradeoffs to be assessed. In other words, given a problem size and a reliability constraint, the model can be used to estimate the achievable performance. (C) 2004 Elsevier Ltd. All rights reserved.
In this paper, we propose a multi-dimensional category model (MDCM) for classifying multi-dimensional text collection. We can parallel and distribute the process of text classification in separately on each dimension....
详细信息
ISBN:
(纸本)3540240136
In this paper, we propose a multi-dimensional category model (MDCM) for classifying multi-dimensional text collection. We can parallel and distribute the process of text classification in separately on each dimension. With this model, performance of classifiers improves in both accuracy and time complexity. For classification accuracy, some benefits can be obtained. Classifiers learn from larger training documents with a small number of classes on each dimension. We can select the best classifier for each dimension and combine the results from them. For time complexity, the learning and classifying phases can be in parallel and distributed manner. The efficiency of MDCM is investigated on drug information data set which assigns topics in monographs in the first dimension and primary therapeutic classes in the second dimension. The experimental results show that parallel text classification on MDCM performs better than flat model in both accuracy and time complexity.
In this paper, a modified signal energy method called the envelope method will first be proposed to estimate the transient stability limit, and then a parallel envelope method is developed to speed up the limit search...
详细信息
ISBN:
(纸本)0780386108
In this paper, a modified signal energy method called the envelope method will first be proposed to estimate the transient stability limit, and then a parallel envelope method is developed to speed up the limit search for online applications. A new weighting limit estimation formula is used for improving the estimation accuracy and minimizing of the effects of observing nodes of the original signal energy method. The effectiveness and the computation speed of the proposed method have been fully evaluated on a 20 machine 100 bus UK system using a 4 CPU SW workstation. Simulation results show that the envelope method could achieve better accuracy than the original signal energy method and is very fast. Timing measurements show that even the serial envelope method could have faster computation speed than the parallel time-domain-simulation method, and the parallel envelope method is the fastest and is suitable for online security analysis and control applications.
In this paper, the parallel resolution of the Langford problem is studied. Two different approaches are developed. First, an explicit construction of all the solutions is done using it shared memory. The application a...
详细信息
ISBN:
(纸本)0769522106
In this paper, the parallel resolution of the Langford problem is studied. Two different approaches are developed. First, an explicit construction of all the solutions is done using it shared memory. The application associated to this approach is written in C using the standard OpenMP library. Second, a parallelization of the algebraic method introduced by Godfrey is proposed. The application is taking advantage of MPI and has revealed efficient up to 128 processors. This solution opens up some new perspectives such as solving the already resolved instances of the problem more quickly and solving the next two open instances of the problem in a near future.
暂无评论