In this paper we propose a novel computational technique to solve the Eikonal equation efficiently on parallel architectures. The proposed method manages the list of active nodes and iteratively updates the solutions ...
详细信息
In this paper we propose a novel computational technique to solve the Eikonal equation efficiently on parallel architectures. The proposed method manages the list of active nodes and iteratively updates the solutions on those nodes until they converge. Nodes are added to or removed from the list based on a convergence measure, but the management of this list does not entail an extra burden of expensive ordered data structures or special updating sequences. The proposed method has suboptimal worst-case performance but, in practice, on real and synthetic datasets, runs faster than guaranteed-optimal alternatives. Furthermore, the proposed method uses only local, synchronous updates and therefore has better cache coherency, is simple to implement, and scales efficiently on parallel architectures. This paper describes the method, proves its consistency, gives a performance analysis that compares the proposed method against the state-of-the-art Eikonal solvers, and describes the implementation on a single instruction multiple datastream (SIMD) parallel architecture.
In order to improve the speed of convergence of ant colony optimization, a parallel algorithm based on coarse-grained model is proposed in the paper, search tasks are assigned to q ant subgroups, and parallel searchin...
详细信息
In order to improve the speed of convergence of ant colony optimization, a parallel algorithm based on coarse-grained model is proposed in the paper, search tasks are assigned to q ant subgroups, and parallel searching are finished by q subgroups. Results of experiment show that the algorithm described in this paper makes the searching speed hundreds of times faster than the latest improved algorithm.
A method of specifying agent interaction protocols in multiagent systems in the PRALU language (parallel logic control algorithms) is considered. A methodology for programming agents in the PRALU language based on a t...
详细信息
A method of specifying agent interaction protocols in multiagent systems in the PRALU language (parallel logic control algorithms) is considered. A methodology for programming agents in the PRALU language based on a two-block architecture, a synchronization block and a functional block, is proposed. It is shown that through use of a description of the behavior of agents in the PRALU language it becomes possible to simulate the behavior of a multiagent system as a whole.
Cellular automaton (CA) are considered to be a non-linear model of complex systems in which an infinite one-dimensional array of finite state machines (cells) updates itself in a synchronous manner according to a unif...
详细信息
ISBN:
(纸本)9784907764302
Cellular automaton (CA) are considered to be a non-linear model of complex systems in which an infinite one-dimensional array of finite state machines (cells) updates itself in a synchronous manner according to a uniform local rule. It is studied in many fields such as complex systems. We study a sequence generation problem on the CA. Arisawa, Fischer and Korec studied generation of a class of natural numbers on CA. In this paper, we study the sequence generation power of CA with 2 internal states.
This extended abstract presents a synthesis of our work concerning a parallel algorithm for Vehicle Routing Problem with Time Windows (VRPTW). Two colonies work in parallel and cooperate to solve the problem, each one...
详细信息
ISBN:
(纸本)9781934272350
This extended abstract presents a synthesis of our work concerning a parallel algorithm for Vehicle Routing Problem with Time Windows (VRPTW). Two colonies work in parallel and cooperate to solve the problem, each one being charged with an objective of the optimization. In addition, the importance of these objectives can vary, depending on some environmental changes. This approach introduces a dinamicity level, modeled by managing some new parameters. Experiments on various aspects of the algorithm and computational results are developed on benchmark problems, but also on the distribution net of Maryland Trading Company in Bacau, Romania, distributor of gluten-free dietary products.
Traditionally, the block-based medial axis transform (BB-MAT) and the chessboard distance transform (CDT) were usually viewed as two completely different image computation problems, especially for three dimensional (3...
详细信息
ISBN:
(纸本)9783540695004
Traditionally, the block-based medial axis transform (BB-MAT) and the chessboard distance transform (CDT) were usually viewed as two completely different image computation problems, especially for three dimensional (3D) space. We achieve the computation of the 3D CDT problem by implementing the 3D BB-MAT algorithm first. For a 3D binary image of size N-3, our parallel algorithm can be run in O(logN) time using N-3 processors on the concurrent read exclusive write (CREW) parallel random access machine (PRAM) model to solve both 3D BB-MAT and 3D CDT problems, respectively. In addition, we have implemented a message passing interface (MPI) program on an AMD Opteron Model 270 cluster system to verify the proposed parallel algorithm, since the PRAM model is not available in the real world. The experimental results show that the speedup is saturated when the number of processors used is more than four, regardless of the problem size.
Dynamic information flow tracking (DIFT) is an important tool for detecting common security attacks and memory bugs. A DIFT tool tracks the flow of information through a monitored program's registers and memory lo...
详细信息
ISBN:
(纸本)9781595939739
Dynamic information flow tracking (DIFT) is an important tool for detecting common security attacks and memory bugs. A DIFT tool tracks the flow of information through a monitored program's registers and memory locations as the program executes, detecting and containing/fixing problems on-the-fly. Unfortunately, sequential DIFT tools are quite slow, and DIFT is quite challenging to parallelize. In this paper, we present a new approach to parallelizing DIFT-like functionality. Extending our recent work on accelerating sequential DIFT, we consider a variant of DIFT that tracks the information flow only through unary operations (relaxed DIFT), and yet makes sense for detecting security attacks and memory bugs. We present a parallel algorithm for relaxed DIFT, based on symbolic inheritance tracking, which achieves linear speed-up asymptotically. Moreover, we describe techniques for reducing the constant factors, so that speed-ups can be obtained even with just a few processors. We implemented the algorithm in the context of a Log-Based Architectures (LBA) system, which provides hardware support for logging a program trace and delivering it to other (monitoring) processors. Our simulation results on SPEC benchmarks and a video player show that our parallel relaxed DIFT reduces the overhead to as low as 1.2X using 9 monitoring cores on a 16-core chip multiprocessor.
For conventional eigenstructure assignment technique. the strict constraint of complete difference between open-loop poles and closed-loop ones is required and the high cost of the resulting Sylvester matrix equations...
详细信息
For conventional eigenstructure assignment technique. the strict constraint of complete difference between open-loop poles and closed-loop ones is required and the high cost of the resulting Sylvester matrix equations computation under the condition of higher dimensions is encountered. One parallel algorithm of eigenstructure assignment is presented in this paper. The combination of orthogonal transformation with partition block technique is employed to transform one high order matrix equations into two lower order ones. parallel computing is used to solve two reduced-order matrix equations in order to decrease computing cost. Moreover. partial pole assignment is implemented due to the fact that the sufficient condition in which there exists solution to transformed matrix equations is relaxed in comparison with the sufficient condition of initiative matrix equations solution existence. Parameterized eigenvector is determined through the error 2-norm between desired eigenvector and objective one is minimized while the exactly assigned eigenvalues are guaranteed. Two numerical examples indicate that the presented algorithm for eigenstructure assignment is effective.
A nonsmooth PGD scheme for minimizing a nonsmooth convex function is presented. In the parallelization step of the algorithm, a method due to Pang, Han and Pangaraj (1991), [7], is employed to solve a subproblem for c...
详细信息
A nonsmooth PGD scheme for minimizing a nonsmooth convex function is presented. In the parallelization step of the algorithm, a method due to Pang, Han and Pangaraj (1991), [7], is employed to solve a subproblem for constructing search directions. The convergence analysis is given as well.
Fuzzy C-Means clustering is one of the most perfective and widely, used algorithms based on objective function for unsupervised classification. Considering the spatial relationship of pixels when it is used in remote ...
详细信息
ISBN:
(纸本)9781424410651
Fuzzy C-Means clustering is one of the most perfective and widely, used algorithms based on objective function for unsupervised classification. Considering the spatial relationship of pixels when it is used in remote sensing imagery, Neighbor-based FCM algorithm is put forward with the method of modifying the value of fuzzy membership degrees with the neighbor information during the clustering iterations. We use dominant class, if it can be determined in a fixed neighbor region, or the weighted parameters based 017 the distance of neighbours to perfect the membership degrees of central pixel. Then parallel implement for the algorithim is also proposed by, taking account into the communication complexity and the spatial relationship for image partition. 117 the end, the experimental data indicate the efficiency of the algorithim in decreasing the amount of clustering iterations and increasing the classfied precision;the parallel algorithm also achieves the satisfied linear speedup.
暂无评论