In this paper, we address the construction of a Hamiltonian path for conforming tetrahedral meshes on distributed memory machines. The path is constrained to pass from one element to the next one through a vertex. For...
详细信息
In this paper, we address the construction of a Hamiltonian path for conforming tetrahedral meshes on distributed memory machines. The path is constrained to pass from one element to the next one through a vertex. For a conforming tetrahedral mesh whose dual graph is connected, if it can be split into many submeshes and the dual graphs of these submeshes are connected, then we can construct partial Hamiltonian paths for all submeshes independently and a Hamiltonian path for the mesh can be obtained by connecting these partial Hamiltonian paths.
parallel multilevel fast multipole algorithm for scattering analysis of conducting targets situated in a half-space environment is presented. Due to the introducing of image sources to deal with the effects of the far...
详细信息
ISBN:
(纸本)9781629931340
parallel multilevel fast multipole algorithm for scattering analysis of conducting targets situated in a half-space environment is presented. Due to the introducing of image sources to deal with the effects of the far interface interactions, the communication traffic among processes is nearly doubled compared with the free space case, which is a great challenge for designing the parallel algorithm. To achieve high parallel efficiency when using hundreds of CPU cores, the adaptive direction partition strategy is improved in this paper. The correctness is demonstrated by using a conducting sphere above ground. In addition, a real-life vehicle above ground is simulated using the proposed method and the parallel efficiency is tested.
Based on a fully overlapping domain decomposition technique and finite element discretization, a parallel subgrid stabilized method for the incompressible Navier Stokes equations is proposed and analyzed. In this meth...
详细信息
Based on a fully overlapping domain decomposition technique and finite element discretization, a parallel subgrid stabilized method for the incompressible Navier Stokes equations is proposed and analyzed. In this method, each processor computes a local stabilized solution in its own subdomain by solving a global problem on a mesh that is fine around its own subdomain and coarse elsewhere, where the stabilization term is based on an elliptic operator defined on the same mesh. This method has low communication complexity. It only requires the application of an existing sequential solver on the global meshes associated with each subdomain, and hence can reuse the existing sequential software. Convergence theory of the method is developed. algorithmic parameter scalings are derived. Numerical results are also given to verify the theoretical predictions and demonstrate the effectiveness of the method. (C) 2013 Elsevier Inc. All rights reserved.
Local mesh refinement is one of the key steps in the implementations of adaptive finite element methods. This paper presents a parallel algorithm for distributed memory parallel computers for adaptive local refinement...
详细信息
Local mesh refinement is one of the key steps in the implementations of adaptive finite element methods. This paper presents a parallel algorithm for distributed memory parallel computers for adaptive local refinement of tetrahedral meshes using bisection. This algorithm is used in PHG, parallel Hierarchical Grid Chttp://lsec. cc. ac. cn/phg/), a toolbox under active development for parallel adaptive finite element solutions of partial differential equations. The algorithm proposed is characterized by allowing simukaneous refinement of submeshes to arbitrary levels before synchronization between submeshes and without the need of a central coordinator process for managing new vertices. Using the concept of canonical refinement, a simple proof of the independence of the resulting mesh on the mesh partitioning is given, which is useful in better understanding the behaviour of the biseetioning refinement procedure.
In this paper, the problem of A-order of binary tree is studied with the PRAM (parallel Random Access Machine) model of parallel computation and a parallel algorithm for A-order of binary tree is proposed. The process...
详细信息
ISBN:
(纸本)9780769535579
In this paper, the problem of A-order of binary tree is studied with the PRAM (parallel Random Access Machine) model of parallel computation and a parallel algorithm for A-order of binary tree is proposed. The process of the parallel algorithm is proceed with detailed description and verified analysis with an application instance. The parallel algorithm of A-order of binary tree provides using and reference for applying it to binary tree traverse sequence and solving the parallelism problem of application program.
The comparison of red-black ordering and natural ordering in solving linear system in parallel is presented in this *** is proved that red-black ordering can keep the convergence of linear system and improve the paral...
详细信息
The comparison of red-black ordering and natural ordering in solving linear system in parallel is presented in this *** is proved that red-black ordering can keep the convergence of linear system and improve the parallelism through theoretical *** reach the load balancing and reduce communication, the data is stored by blocks and the communication is proceeded between the adjacent *** this way, it is avoided that the time increases too much with the number of processors increasing,which improve the computation efficiency.
Voronoi diagrams are among the most important data structures in geometric modeling. Among many efficient algorithms for computing 2D Voronoi diagrams, Fortune's sweepline algorithm (Fortune, 1986 [5]) is popular ...
详细信息
Voronoi diagrams are among the most important data structures in geometric modeling. Among many efficient algorithms for computing 2D Voronoi diagrams, Fortune's sweepline algorithm (Fortune, 1986 [5]) is popular due to its elegance and simplicity. Dehne and Klein (1987) [8] extended sweepline to sweepcircle and suggested computing a type of transformed Voronoi diagram, which is parallel in nature. However, there is no practical implementation of the sweepcircle algorithm due to the difficulty in representing the transformed edges. This paper presents a new algorithm, called untransformed sweepcircle, for constructing Voronoi diagram in R-2. Starting with a degenerate circle (of zero radius) centered at an arbitrary location, as the name suggests, our algorithm sweeps the circle by increasing its radius across the plane. At any time during the sweeping process, each site inside the sweep circle defines an ellipse composing of points equidistant from that point and from the sweep circle. The union of all ellipses forms the beach curve-a star shape inside the sweep circle which divides the portion of the plane within which the Voronoi diagram can be completely determined, regardless of what other points might be outside of the sweep circle. As the sweep circle progresses, the intersection of expanding ellipses defines the Voronoi edges. We show that the sweep line algorithm is the degenerate form of the proposed sweep circle algorithm when the circle center is at infinity, and our algorithm has the same time and space complexity as the sweep line algorithm. Our untransformed sweepcircle algorithm is flexible in allowing multiple circles at arbitrary locations to sweep the domain simultaneously. The parallelized implementation is pretty easy without complicated numerical computation;the most complicated case is nothing but an arc-cosine operation. Furthermore, our algorithm supports the additively weighted Voronoi diagrams of which the Voronoi edges are hyperbolic and
The analysis of data streams is of great value in many fields such as network monitoring and sensor *** a common operation,top-k query over data stream is the basis and core of other problems in data stream *** this p...
详细信息
The analysis of data streams is of great value in many fields such as network monitoring and sensor *** a common operation,top-k query over data stream is the basis and core of other problems in data stream *** this paper,we introduce a parallel algorithm based on Frequent algorithm and implement it by utilizing Apache ***,we evaluate the algorithm by estimated error under various situations and show that the algorithm can effectively improve the precision of top-k query by adjusting the parallel *** parallelized implementation is of significance in network traffic monitoring.
After a deep investigation on the maximum terms space of the clause set,the concept of the partial maximum terms space of the clause set,which the maximum terms of the clause set decomposed,is brought *** investigatin...
详细信息
After a deep investigation on the maximum terms space of the clause set,the concept of the partial maximum terms space of the clause set,which the maximum terms of the clause set decomposed,is brought *** investigating the extension rule,this paper introduces the concept of the satisfiability and the unsatisfiability of the partial maximum terms space,and gives an algorithm determining the satisfiability of a partial space of the maximum terms—algorithm PSER(Partial Semi-Extension Rule).Then,the TP problem is decomposed into several sub-problems independent of each other,which can be solved by the given parallel computing method PPSER(parallel Partial Semi-Extension Rule).
In the paper, the preconditioner of linear systems is directly extented to the solutions of block-tridiagonal equations. And by corresponding parallel processing, a parallel algorithm of two stage iterative method for...
详细信息
ISBN:
(纸本)9781846260612
In the paper, the preconditioner of linear systems is directly extented to the solutions of block-tridiagonal equations. And by corresponding parallel processing, a parallel algorithm of two stage iterative method for solving block-tridiagonal linear equations is presented. According to theoretical analysis, a sufficient condition of the new algorithm convergence is given. Finally, the results of numerical experiments on HP rx2600 cluster indicate that the algorithm is feasible and effective.
暂无评论