In classical combinatorial string matching repetitions and other regularities play a central role. Besides their theoretical importance, repetitions in strings have been found relevant to coding and automata theory, f...
详细信息
In classical combinatorial string matching repetitions and other regularities play a central role. Besides their theoretical importance, repetitions in strings have been found relevant to coding and automata theory, formal languages, data compression, and molecular biology. An important motivation for developing a 2D pattern matching theory is seen in its relation with pattern recognition, image processing, computer vision and multimedia. Repetitions in 2D arrays have been defined and classified recently.(5) In this paper we present an optimally fast CRCW-PRAM algorithm for testing whether a given n x n array contains repetitions of certain type. The algorithm takes optimal O (log log n) time with n(2)log(2)n/log log n processors.
The design is described of a parallel version of Tarjan's algorithm for the determination of equivalence classes in graphs that represent images. Distribution of the vertices of the graph over a number of processe...
详细信息
The design is described of a parallel version of Tarjan's algorithm for the determination of equivalence classes in graphs that represent images. Distribution of the vertices of the graph over a number of processes leads to a message passing algorithm. The algorithm is mapped to a shared-memory architecture by means of POSIX threads. It is applied to the determination of connected components in image processing. Experiments show a satisfactory speedup for sufficiently large images. (C) 2001 Elsevier Science B.V. All rights reserved.
This paper deals with the evaluation of acyclic Boolean conjunctive queries in relational databases. By well-known results of Yannakakis [1981], this problem is solvable in polynomial time, its precise complexity, how...
详细信息
This paper deals with the evaluation of acyclic Boolean conjunctive queries in relational databases. By well-known results of Yannakakis [1981], this problem is solvable in polynomial time, its precise complexity, however, has not been pinpointed so far, We show that the problem of evaluating acyclic Boolean conjunctive queries is complete for LOGCFL, the class of decision problems that are logspace-reducible to a context-free language. Since LOGCFL is contained in AC(1) and NC(2). the evaluation problem of acyclic Boolean conjunctive queries is highly parallelizable, We present a parallel database algorithm solving this problem with a logarithmic number of parallel join operations. The algorithm is generalized to computing the output of relevant classes of non-Boolean queries. We also show that the acyclic versions of the following well-known database and Al problems are all LOGCFL-complete: The Query Output Tuple problem for conjunctive queries. Conjunctive Query Containment, Clause Subsumption, and Constraint Satisfaction. The LOGCFL-completeness result is extended to the class of queries of bounded treewidth and to other relevant query classes which are more general than the acyclic queries.
An adaptive algorithm is presented for converting the quadtree representation of a binary image to its chain code representation. Our algorithm has the advantage of constructing the chain codes of the resulting quadtr...
详细信息
An adaptive algorithm is presented for converting the quadtree representation of a binary image to its chain code representation. Our algorithm has the advantage of constructing the chain codes of the resulting quadtree of the Boolean operation of two quadtrees by re-using the original chain codes. This algorithm is adaptive because it can adjust the total number of internal nodes to be stored and retrieve it later in the reconstruction stage. The algorithm possesses parallelism and is suited for pyramid architecture. Our algorithm requires time O(H + L) in sequential and time O(N) in parallel, where H is the height of the quadtree, L is the length of the chain code sequence generated, and N x N is the size of the input image. (C) 2001 Pattern Recognition Society. Published by Elsevier Science Ltd. All rights reserved.
We consider the mathematical modelling and solution of robust and cost-optimizing structural (topology) design problems. The setting is the optimal design of a linear-elastic structure, for example a truss topology, u...
详细信息
We consider the mathematical modelling and solution of robust and cost-optimizing structural (topology) design problems. The setting is the optimal design of a linear-elastic structure, for example a truss topology, under unilateral frictionless contact, and under uncertainty in the data describing the load conditions, the material properties, and the rigid foundation. The resulting stochastic bilevel optimization model finds a structural design that responds the best to the given probability distribution in the data. This model is of special interest when a structural failure will lead to a reconstruction cost, rather than loss of life. For the mathematical model, we provide results on the existence of optimal solutions which allow for zero lower design bounds. We establish that the optimal solution is continuous in the lower design bounds, a result which validates the use of small but positive values of them, and for such bounds we also establish the locally Lipschitz continuity and directional differentiability of the implicit upper-level objective function. We also provide a heuristic algorithm for the solution of the problem, which makes use of its differentiability properties and parallelization strategies across the scenarios. A small set of numerical experiments illustrates the behaviour of the stochastic solution compared to all average-case deterministic one, establishing ail increased robustness.
Consider an n-dimensional SIMD hypercube H-n with [3n/2] - 1 faulty nodes. Given 2(n) operands, this paper presents an efficient algorithm for prefix computation on the faulty H-n. Employing the newly proposed delay-u...
详细信息
Consider an n-dimensional SIMD hypercube H-n with [3n/2] - 1 faulty nodes. Given 2(n) operands, this paper presents an efficient algorithm for prefix computation on the faulty H-n. Employing the newly proposed delay-update technique and the subcube partition scheme, the proposed algorithm takes n+5logn+7 steps, and it tolerates [n/2] more Faulty nodes than does Raghavendra and Sridhar's algorithm [4] although 11 extra steps are needed.
In spite of their good filtering characteristics for vector-valued image processing, the usability of vector median filters is limited by their high computational complexity. Given an N x N image and a W x W window, t...
详细信息
In spite of their good filtering characteristics for vector-valued image processing, the usability of vector median filters is limited by their high computational complexity. Given an N x N image and a W x W window, the computational complexity of vector median filter is O((WN2)-N-4). In this paper, we design three fast and efficient parallel algorithms for vector median filtering based on the 2-norm (L-2) on the arrays with reconfigurable optical buses (AROB). For 1 less than or equal to p less than or equal to W less than or equal to q less than or equal to N, our algorithms run in O(W-4 log W/p(4)), O((WN2)-N-4/p(4)p(2) log W) and W(1) times using p(4)N(2)/log W, p(4)q(2)/log W, and W(4)N(2)log N processors, respectively. In the sense of the product of time and the number of processors used, the first two results are cost optimal and the last one is time optimal.
We present a BSP (Bulk Synchronous parallel) algorithm for solving the All Nearest Smaller Values Problem (ANSVP), a fundamental problem in both graph theory and computational geometry. Our algorithm achieves optimal ...
详细信息
We present a BSP (Bulk Synchronous parallel) algorithm for solving the All Nearest Smaller Values Problem (ANSVP), a fundamental problem in both graph theory and computational geometry. Our algorithm achieves optimal sequential computation time and uses only three communication supersteps. In the worst case, each communication phase takes no more than an (n/p + p)-relation, where p is the number of the processors. In addition, our average-case analysis shows that, on random inputs, the expected communication requirements for all three steps are bounded above by a p-relation, which is independent of the problem size n. Experiments have been carried out on an SGI Origin 2000 with 32 R10000 processors and a SUN Enterprise 4000 multiprocessing server supporting 8 UltraSPARC processors, using the MPI libraries. The results clearly demonstrate the communication efficiency and load balancing for computation. (C) 2001 Academic Press.
In this paper, tao new system architectures, overlap-state sequential and split-and-merge parallel, are proposed based on a novel boundary postprocessing technique for the computation of the discrete wavelet transform...
详细信息
In this paper, tao new system architectures, overlap-state sequential and split-and-merge parallel, are proposed based on a novel boundary postprocessing technique for the computation of the discrete wavelet transform (DWT). The basic idea is to introduce multilevel partial computations for samples near data boundaries based on a finite state machine model of the DWT derived from the lifting scheme. The key observation is that these partially computed (lifted) results can also be stored back to their original locations and the transform can be continued anytime later as long as these partial computed results are preserved. It is shown that such an extension of the in-place calculation feature of the original lifting algorithm greatly helps to reduce the extra buffer and communication overheads, in sequential and parallel system implementations, respectively. Performance analysis and experimental results show that, for the Daubechies (9,7) wavelet filters, using the proposed boundary postprocessing technique, the minimal required buffer size in the line-based sequential DWT algorithm [1] is 40% less than the best available approach. In the parallel DWT algorithm me show 30% faster performance than existing approaches.
A parallel algorithm for efficient calculation of the second derivatives (Hessian) of the conformational energy in internal coordinates is proposed. This parallel algorithm is based on the master/slave model. A master...
详细信息
暂无评论