Kirchhoff pre-stack depth migration (KPSDM) algorithm, as one of the most widely used migration algorithms, plays an important part in getting the real image of the earth. However, this program takes considerable time...
详细信息
ISBN:
(数字)9783319111940
ISBN:
(纸本)9783319111940;9783319111933
Kirchhoff pre-stack depth migration (KPSDM) algorithm, as one of the most widely used migration algorithms, plays an important part in getting the real image of the earth. However, this program takes considerable time due to its high computational cost;hence the working efficiency of the oil industry is affected. The general purpose Graphic Processing Unit (GPU) and the Compute Unified Device Architecture (CUDA) developed by NVIDIA have provided a new solution to this problem. In this study, we have proposed a parallel algorithm of the Kirchhoff pre-stack depth migration and an optimization strategy based on the CUDA technology. Our experiments indicate that for large data computations, the accelerated algorithm achieves a speedup of 8 similar to 15 times compared with NVIDIA GPU.
The MCDPar (parallel algorithm for multi-scale simulations based on Mesh and BCF Decomposition) algorithm significantly reduced the execution time and improved the parallel scalability for the multi-scale fluid simula...
详细信息
ISBN:
(纸本)9781450362955
The MCDPar (parallel algorithm for multi-scale simulations based on Mesh and BCF Decomposition) algorithm significantly reduced the execution time and improved the parallel scalability for the multi-scale fluid simulations. However, the performance bottleneck still exists for extremely large-scale parallel simulations. In this paper, we designed a communication-overlapped hybrid decomposition parallel algorithm to improve the performance of the original MCDPar on large-scale clusters. Through non-blocking communication and code scheduling, the communication overhead between the master and slave groups have been overlapped with the computation of more microscopic configuration fields for the master process. Thus the parallel efficiency and scalability of the multi-scale solver could be improved on large-scale parallel simulations. In the test case with the number of configuration fields N-BCF = 1000 and mesh cells N-cell = 64000, the communication percentage between the corresponding master and slave processes is reduced by 39.71%. In the test case with N-BCF = 3000 and N-cell = 64000, the time cost of the fastest execution is reduced by 31.13% using the communication-overlapped algorithm, which offers a better parallel scaling on 256 cores compared to original 128 cores.
In this paper, we propose a parallel algorithm for H.264/AVC deblocking filter which is scalable to the number of processors. Unlike the conventional approach, which is limited by the independent data units, the desig...
详细信息
ISBN:
(纸本)9781424410163
In this paper, we propose a parallel algorithm for H.264/AVC deblocking filter which is scalable to the number of processors. Unlike the conventional approach, which is limited by the independent data units, the designed algorithm allows issuing dependent data units concurrently to decrease the penalty from synchronization of data units. For the general-purpose dual-core processors, experimental results show that our method speeds up 1.72 and 1.39 times as compared with optimized sequential method and the well-known wavefront parallelizing method, respectively.
In a disaster field, to obtain the optimal path in unknown environment,a rescue robot needs to build an environment map. Sensors mounted on the robots cooperate to monitor the environment, the information of the disas...
详细信息
ISBN:
(纸本)9783037851579
In a disaster field, to obtain the optimal path in unknown environment,a rescue robot needs to build an environment map. Sensors mounted on the robots cooperate to monitor the environment, the information of the disaster field is collected by the sonsors of different robots, all signal from sensors (mounted on all robots and signal form GPS) are sent to the bakeside parllel processors with wireless network. A grid computing environment serves as the backside parallel processors with Globus Toolkit, the grid computing processor process all the signals and construct the global map to help robot for navigation path *** rescue robot get control signal from the grid computing processor with wireless network,thus, the robot is not necessary to be sophisticated. New computing methods are given for parallel algorithm on grid *** experiments show that the method is more practical and helps the path planning problem to be solved more efficiently, the advantages of large seale computing on grid are shown.
The paper deals with the problem of analyzing fault, susceptibility of a parallel algorithm designed for multiprocessor array (MIMD structure). This algorithm realizes quite complex communication protocol in the syste...
详细信息
ISBN:
(纸本)0769517307;0769517315
The paper deals with the problem of analyzing fault, susceptibility of a parallel algorithm designed for multiprocessor array (MIMD structure). This algorithm realizes quite complex communication protocol in the system. We present an original methodology of the analysis based on the use of software implemented fault injector. The considered algorithm is modeled as a multithreaded application. The experiment set up an I results are presented and commented The performed experiments proved relatively high natural robustness of the analyzed algorithm and showed further possibilities of its improvement.
This paper research on how to select a subtree with exactly k leaves and a diameter of at most 1, which minimizes the distance from the farthest vertex to the subtree. We call such a subtree (k, l)-center of a tree ne...
详细信息
ISBN:
(纸本)9780889866386
This paper research on how to select a subtree with exactly k leaves and a diameter of at most 1, which minimizes the distance from the farthest vertex to the subtree. We call such a subtree (k, l)-center of a tree network. In this paper, an efficient parallel algorithm is proposed for finding a (k, l)-center of a tree network. This algorithm performs on the EREW PRAM in O(log n) time using O(n) work.
A lot of mathematical approaches are used in importance analysis, which permits to investigate influence of system component state changes on the system reliability or availability. One of these approaches is Logical ...
详细信息
ISBN:
(纸本)9783319192161;9783319192154
A lot of mathematical approaches are used in importance analysis, which permits to investigate influence of system component state changes on the system reliability or availability. One of these approaches is Logical Differential Calculus, in particular Direct Partial Boolean Derivatives. A new algorithm for the calculation of Importance Measures with application of Direct Partial Boolean Derivatives is proposed in this paper. This algorithm is developed based on parallel procedures.
A workload distribution problem is quite topical nowadays. A large number of applications, which function in distributed environments, uses various techniques of a workload relocation. Yet very few studies consider th...
详细信息
ISBN:
(数字)9783030303297
ISBN:
(纸本)9783030303297;9783030303280
A workload distribution problem is quite topical nowadays. A large number of applications, which function in distributed environments, uses various techniques of a workload relocation. Yet very few studies consider the workload relocation in fog-computing environment emphasizing the increase of a search space and the distances between the computational nodes. In this paper a new workload distribution technique, based on ontological analysis of algorithm structures and the available resources is presented. The aim is to limit and reduce the search space of the workload distribution problem. Such strategy decreases the time of workload location and so decreases the time needed to solve the general computational task of application.
The pickup and delivery problem with time windows (PDPTW) is an NP-hard optimization problem of serving transportation requests using a limited number of vehicles. Its main objective is to minimize the number of deliv...
详细信息
ISBN:
(纸本)9781467394734
The pickup and delivery problem with time windows (PDPTW) is an NP-hard optimization problem of serving transportation requests using a limited number of vehicles. Its main objective is to minimize the number of delivering trucks, whereas the secondary objective is to decrease the distance traveled during the service. A feasible routing schedule must satisfy the time window, capacity and precedence constraints. In this paper, we propose to partition the search space in our parallel guided ejection search algorithm (P-GES) to minimize the fleet size in the PDPTW. The introduced techniques help decrease the convergence time of the algorithm without affecting the quality of results. An extensive experimental study (comprising nearly 52, 000 CPU hours on an SMP cluster) performed on the Li and Lim's benchmark set shows that the parallel algorithm is effective, and is able to retrieve very high-quality results. We report 10 new world's best solutions obtained using P-GES enhanced with the proposed search space partition approaches.
Solving large-scale sparse linear systems is a critical problem in scientific and engineering computing. Partial differential equations can solve problems in many fields. They can be transformed into large-scale linea...
详细信息
ISBN:
(数字)9781510651890
ISBN:
(纸本)9781510651890;9781510651883
Solving large-scale sparse linear systems is a critical problem in scientific and engineering computing. Partial differential equations can solve problems in many fields. They can be transformed into large-scale linear systems with a series of methods, and the parallel solution of tridiagonal linear systems is one of them. The solution of linear systems is very time-consuming in most of the problems, accounting for more than half of the total time. Load balancing can reduce process time for waiting and improves computational efficiency, and it is the focus of many algorithms. The article is based on Stone's proposed recursive doubling algorithm, an improved algorithm for solving tridiagonal linear systems using the full-recursive-doubling communication model and the Mobiu transform. The improved algorithm can calculate the million-dimensional linear systems. Numerical experiments show that compared with ordinary parallel algorithms, the improved algorithm shows up to 2x improvement than the original version, and some results even show up to 3x. In addition, the load-balancing performance has been greatly improved, and the time difference of the processes is 1/7 of the original version. The improved algorithm has a good load balancing, and the running time of each process is not much different, avoiding process waiting and resource wastage.
暂无评论