OTIS (optical transpose interconnection system) is a popular model of optoelectronic parallel computers that has gained enormous attention in the recent years. Several parallel algorithms have been published for many ...
详细信息
OTIS (optical transpose interconnection system) is a popular model of optoelectronic parallel computers that has gained enormous attention in the recent years. Several parallel algorithms have been published for many fundamental problems on this architecture. In this paper, first we propose a parallel algorithm for sparse enumeration sort on OTISMesh of Trees (OTIS-MOT). For N (= n(2)) data elements, our sorting algorithm requires 4 log N ($) electronic moves + 3 OTIS moves. We next present a shortest path routing algorithm that runs also in logarithmic time.
For more than three decades, the very well known and famous two-list Horowitz and Sahni algorithm [3] remains the serial upper-bound for the 0-1 Knapsack problem with n items (KP01) in a time bounded by O(2(n/2)). Rec...
详细信息
For more than three decades, the very well known and famous two-list Horowitz and Sahni algorithm [3] remains the serial upper-bound for the 0-1 Knapsack problem with n items (KP01) in a time bounded by O(2(n/2)). Recently, Chedid [2] Suggested an optimal parallelization for that algorithm to a KP01 variation - the subset-sum problem - in a PRAM CREW with p = 2(n/8) processors. It is presented here that, in addition to be incomplete, the Chedid result is a particular case given by Sanches et al. [6]. (c) 2009 Elsevier B.V. All rights reserved.
This article analyzes mathematical and technological problems that arise in performing computational experiments on modern high-performance computers (supercomputers). As a means of overcoming the difficulties associa...
详细信息
This article analyzes mathematical and technological problems that arise in performing computational experiments on modern high-performance computers (supercomputers). As a means of overcoming the difficulties associated with the analysis and solution of computer model problems under conditions of approximate initial data on computers with parallel architectures, intelligent technologies are proposed that are based on intelligent software supported by architectural decisions of an intelligent computer and predictive system software.
Markov decision process (MDP) provides the foundations for a number of problems, such as artificial intelligence Studying, automated planning and reinforcement learning. MDP can be solved efficiently in theory. Howeve...
详细信息
ISBN:
(纸本)9783642030949
Markov decision process (MDP) provides the foundations for a number of problems, such as artificial intelligence Studying, automated planning and reinforcement learning. MDP can be solved efficiently in theory. However, for large scenarios, more investigations are needed to reveal practical algorithms. algorithms for solving MDP have a natural concurrency. In this paper, we present parallel algorithms based on dynamic programming Meanwhile, the cost of computation and communication complexity of this method is analyzed. Moreover, experimental results demonstrate excellent speedups and scalability.
A parallel algorithm for translating regular expression into its Follow automata is proposed in the paper. Firstly, we construct the Thompson automata of a regular expression. Then, the Glushkov automata are achieved ...
详细信息
ISBN:
(纸本)9780769536040
A parallel algorithm for translating regular expression into its Follow automata is proposed in the paper. Firstly, we construct the Thompson automata of a regular expression. Then, the Glushkov automata are achieved by removing the xi path and the equivalent states which have equivalent relations are merged into one. So we can get a smaller finite automata, named Follow automata. Finally the parallel processing of algorithm is described in detail with an example.
For a large-scale adaptive array, the heavy computational load and the high-rate data transmission are two challenges in the implementation of an adaptive digital beamforming system. An efficient parallel digital beam...
详细信息
For a large-scale adaptive array, the heavy computational load and the high-rate data transmission are two challenges in the implementation of an adaptive digital beamforming system. An efficient parallel digital beamforming (DBF) algorithm based on the least mean square algorithm (PLMS) is proposed. An appropriate method is found to partition the least mean square (LMS) algorithm into a number of operational modules, which can be easily executed in a distributed-parallel-processing fashion. As a result, the proposed PLMS algorithm provides an effective solution that can alleviate the bottleneck of high-rate data transmission and reduce the computational cost. PLMS requires less computational load than that of the conventional parallel algorithms based on the recursive least square (RLS) algorithm, as well as it is easier to be implemented to do real time adaptive array processing. Moreover, low sidelobe of the beam pattern is obtained by constraining the static steering vector with Tschebyscheff coefficients. Finally, a scheme of the PLMS algorithm using distributed-parallel-processing system is also proposed. The simulation results demonstrate that the PLMS algorithm has the same interference cancellation performance as that of the conventional LMS algorithm. Moreover, the PLMS algorithm can obtain the same good beamforming performance, regardless how the algorithm is partitioned. It is expected that the proposed algorithm will be used in a large-scale adaptive array system to deal with real time adaptive digital beamforming processing.
In order to improve the speed of surface reconstruction from densely scattered points, and reduce the application cost, this paper describes a new and fast surface reconstruction method based on grid computing. The pr...
详细信息
ISBN:
(纸本)9783642118418
In order to improve the speed of surface reconstruction from densely scattered points, and reduce the application cost, this paper describes a new and fast surface reconstruction method based on grid computing. The proposed method converts large-scale unorganized 3D scanned datasets into layered datasets firstly. Then based on data parallel mechanism, a loosely coupled parallel reconstruction algorithm is designed;the algorithm has less inter-node communication, so that it is more suitable for grid computing. In order to realize load balance in grid, the priority preemptive scheduling strategy is designed based on two-level scheduling model. Finally, the grid environment is built by Globus Toolkit, and the parallel reconstruction and visualization are achieved based on mpich-G2 and the Visualization Toolkit (VTK), this experiment shows that the reconstruction time is reduced significantly.
In the last decade, several algorithms, usually based on information filtering techniques, have been proposed to address multi-robot mapping problem. Less interest has been devoted to investigate a parallel or distrib...
详细信息
ISBN:
(纸本)9781424466757
In the last decade, several algorithms, usually based on information filtering techniques, have been proposed to address multi-robot mapping problem. Less interest has been devoted to investigate a parallel or distributed organization of such algorithms in the perspective of multi-robot exploration. In this paper, we propose a distributed algorithm for map estimation based on Gauss-Seidel relaxation. The complete map is shared among independent tasks running on each robot, which integrate the independent robot measurements in local submaps, and a server, which stores contour nodes separating the submaps. Each task updates its local submap and periodically checks for inter-robot data associations. Gauss-Seidel relaxation is performed independently on each robot and afterwards on the contour nodes set on the server. Results illustrate the potential and flexibility of the new approach.
Kriging is one of the important interpolation methods in geostatistics, which has been widely applied in engineering project. In this paper, we present an efficient method for the parallelization of universal Kriging ...
详细信息
ISBN:
(纸本)9780769541105
Kriging is one of the important interpolation methods in geostatistics, which has been widely applied in engineering project. In this paper, we present an efficient method for the parallelization of universal Kriging interpolation on shared memory multiprocessors. By using OpenMP directives, we implement a portable parallel algorithm, which enables an incremental approach to add parallelism, without modifying the rest part of sequential code. To achieve optimal performance, the parallel grain size has been considered and analyzed. Numerical experiments have been carried out on two multicore windows workstations, the results of which demonstrate this method could enhance the overall performance significantly.
A new approach is described to the evaluation of the quantum scattering S-matrix in 3D atom-diatom reactive collision. The theory is developed in terms of natural collision coordinates where the coordinate reaction fu...
详细信息
A new approach is described to the evaluation of the quantum scattering S-matrix in 3D atom-diatom reactive collision. The theory is developed in terms of natural collision coordinates where the coordinate reaction fulfills the same role as a time in a time-dependent scattering formulation. Having written the full wavefunction of the particles system in the coupled-channel representation we have proved that the 3D multi-channel scattering problem can be reduced to the inelastic single-arrangement problem which is described by system of ordinary differential equations (ODE) of second order. The system of coupled-channel second order ODEs exactly is reduced to the system of integro-differential equations (IDE) of first order which is solved with the initial conditions. The problem of Koshi for the system of IDEs is proposed to be solved by the method of Runge-Kutta of fourth order. The detailed algorithm for parallel simulation of initial 3D scattering problem is proposed. In result of simulation of IDEs the full wavefunction and all S-matrix elements of reactive transitions and state-to-state cross section are obtained simultaneously without other extra calculations. (C) 2010 Published by Elsevier Ltd.
暂无评论