In this paper, first, a fault diagnosis approach of large system based on iteration of function space in dynamic programming is proposed, and it new concept about the minimum of coefficient product of routes from sour...
详细信息
ISBN:
(纸本)0780342534
In this paper, first, a fault diagnosis approach of large system based on iteration of function space in dynamic programming is proposed, and it new concept about the minimum of coefficient product of routes from source node to target node is put forward;an algorithm to figure out this minimum and to track down this correspondent route is given. Second, the algorithm is changed into a tabular iteration method, which is simpler, more regular and helpful for programming. Third, the algorithm's operation time is discussed. In order to reduce the operation time, a concept of bipartite state space is brought up. According to this concept, a large net can be divided into two or more subnets which are independent of each other. Fourth, a hind of parallel algorithm based on tabular iteration for fault diagnosis is also offered;the steps of applying this algorithm are detailed. Finally, this approach is shown to be more effective and simpler by an example of fault diagnosis.
In this article, we study the effects of network topology and load balancing on the performance of a new parallel algorithm for solving triangular systems of linear equations on distributed-memory message-passing mult...
详细信息
In this article, we study the effects of network topology and load balancing on the performance of a new parallel algorithm for solving triangular systems of linear equations on distributed-memory message-passing multiprocessors. The proposed algorithm employs novel runtime data mapping and workload redistribution methods on a communication network which is configured as a toroidal mesh. A fully parameterized theoretical model is used to predict communication behaviors of the proposed algorithm relevant to load balancing, and the analytical performance results correctly determine the optimal dimensions of the toroidal mesh, which vary with the problem size, the number of available processors, and the hardware parameters of the machine. Further enhancement to the proposed algorithm is then achieved through redistributing the arithmetic workload at runtime. Our FORTRAN implementation of the proposed algorithm as well as its enhanced version has been tested on an Intel iPSC/2 hypercube, and the same code is also suitable for executing the algorithm on the iPSC/860 hypercube and the Intel Paragon mesh multiprocessor. The actual timing results support our theoretical findings, and they both confirm the very significant impact a network topology chosen at runtime can have on the computational load distribution, the communication behaviors and the overall performance of parallel algorithms.
In this paper, a BSP (bulk synchronous parallel) Bareiss algorithm for Toeplitz system is described. We investigate various data distribution and scheduling strategies for mapping a typical class of systolic array alg...
详细信息
In this paper, a BSP (bulk synchronous parallel) Bareiss algorithm for Toeplitz system is described. We investigate various data distribution and scheduling strategies for mapping a typical class of systolic array algorithms onto BSP machines. Load balance, both in communication and computation, as well as linear speedup have been achieved for the Toeplitz system solver and at the same time the minimum memory requirement is achieved. An implementation has been tested on Sun workstations, an SGI Power Challenge, and an IBM SP?, using the Oxford BSPlib (Hill et al., 1997. (C) 1999 Academic Press.
A hyper-bus broadcast network (HBBN) consists of processors only sharing by some global buses, and there are no local links between processors. Based on such an architecture, we will exploit several efficient time par...
详细信息
A hyper-bus broadcast network (HBBN) consists of processors only sharing by some global buses, and there are no local links between processors. Based on such an architecture, we will exploit several efficient time parallel algorithms for solving the well-known fundamental data movement problems which had been extensively studied by researchers and widely applied to the field of image processing, digitized geometry and computer graphics. These include the leftmost one problem, the prefix maxima/minima problem, the m-contour problem, the all nearest neighbor problem and the all nearest smaller values problem, respectively. Note that the proposed algorithms not only can be implemented on the HBBN but also can be easily modified to run on other broadcast-based networks with the same time and processor complexities. (C) 1999 Elsevier Science B.V. All rights reserved.
The design of array processors for solving linear systems using two-step division-free Gaussian elimination method is considered. The two-step method can be used to improve the systems based on the one-step method in ...
详细信息
The design of array processors for solving linear systems using two-step division-free Gaussian elimination method is considered. The two-step method can be used to improve the systems based on the one-step method in terms of numerical stability as well as the requirements for high-precision. In spite of the rather complicated computations needed at each iteration of the two-step method, we develop an innovative parallel algorithm whose data dependency graph meets the requirements for regularity and locality. Then we derive two-dimensional array processors by adopting a systematic approach to investigate the set of all admissible solutions and obtain the optimal array processors under linear time-space scheduling. The array processors is optimal in terms of the number of processing elements used.
New topology-based hybrid method for electrical networks with magnetic couplings analysis is presented. Due to difficulties of general nodal formulation for the electrical circuits with magnetic couplings the network ...
详细信息
New topology-based hybrid method for electrical networks with magnetic couplings analysis is presented. Due to difficulties of general nodal formulation for the electrical circuits with magnetic couplings the network is split into two parts. First part contains all branches with magnetic couplings and its equations are formulated in mesh coordinates. The second part contains the rest of the network and its equations are formulated in nodal coordinates. Combination of two approaches for a single network with magnetic couplings yields new general mesh-nodal network analysis formulation with natural network decomposition into subsystems, suitable for parallel computational algorithm application. The example of proposed approach application is presented.
A new two-dimensional systolic algorithm is proposed in this paper for parallel implementation of the multi-layered neural network. To reduce communication overhead, the input data flow is passed along the horizontal ...
详细信息
A new two-dimensional systolic algorithm is proposed in this paper for parallel implementation of the multi-layered neural network. To reduce communication overhead, the input data flow is passed along the horizontal and vertical directions of the systolic array alternately, over different layers of the neural network. This new algorithm accelerates learning process of the neural network. Transputer implementation of the proposed algorithm and experimental results are presented to show efficiency of the new algorithm. (C) 1999 Elsevier Science B.V. All rights reserved.
In this paper, we solve three geometric problems, including the ranking, convex hull and closest pair problems, under the broadcast communication model. To solve these problems, we propose a general scheme, the p-divi...
详细信息
In this paper, we solve three geometric problems, including the ranking, convex hull and closest pair problems, under the broadcast communication model. To solve these problems, we propose a general scheme, the p-division approach, which is based upon the divide-and-conquer strategy. In the 2-dimensional space, the time complexities of our algorithms for solving these problems are all O(n + n/p log n/p), where n is the number of input points and p is the number of processors used. Furthermore, our algorithms are all conflict-free and optimal. In the k-dimensional space, k greater than or equal to 3, our ranking algorithm requires O(n/p log(k-1) n/p n/p + n log(k-2) n/p) time.
We survey results on the sequential and parallel complexity of hamiltonian path and cycle problems in various classes of digraphs which generalize tournaments. We give detailed informations on the difference in diffic...
详细信息
We survey results on the sequential and parallel complexity of hamiltonian path and cycle problems in various classes of digraphs which generalize tournaments. We give detailed informations on the difference in difficulties for these problems for the various classes as well as prove new results on hamiltonian paths starting in a specified vertex for a quite general class of digraphs. (C) 1999 Elsevier Science B.V. All rights reserved.
The problem of rearranging scattered information arises in many applications of data/information management systems. A simpler case of this problem is the so-called k-compaction problem, which requires Omega(root log ...
详细信息
The problem of rearranging scattered information arises in many applications of data/information management systems. A simpler case of this problem is the so-called k-compaction problem, which requires Omega(root log n) EREW PRAM time, Omega (log log n) CREW PRAM time, or Theta (log k/ log log n) CRCW PRAM time using ii processors. In this paper, we give constant time parallel algorithms using n processors on BSR, another PRAM model, for the problem of rearranging scattered information and the k-compaction problem. (C) 1999 Published by Elsevier Science B.V. All rights reserved.
暂无评论