This article presents results of experiments, which were carried out to discover the impact of memory effect on computing time. A series of runs were conducted to execute sequential and parallel algorithm of solving s...
详细信息
ISBN:
(纸本)9781509040704
This article presents results of experiments, which were carried out to discover the impact of memory effect on computing time. A series of runs were conducted to execute sequential and parallel algorithm of solving system of linear equations (SLE) with the steepest-descent method. The quantitative evaluation of parallel algorithm realization exponents in different ranks SLE solving is given. Dependences and factors, which have impact on computing time, are detected. parallel computing was implemented in the Distributed computing system with programmable structure of Novosibirsk State Technical University at Computer Engineering Department
The IDES project at Sandia National Laboratories is developing a large scale portable parallel simulator for use in stockpile stewardship. IDES will use the Breathing-Time-Buckets synchronization protocol; to support ...
详细信息
ISBN:
(纸本)9780818679650
The IDES project at Sandia National Laboratories is developing a large scale portable parallel simulator for use in stockpile stewardship. IDES will use the Breathing-Time-Buckets synchronization protocol; to support IDES development, this paper studies a performance model and describes performance experiments on expected work load and architectural parameters. A new parallel algorithm for terminating the window quickly is also described and analyzed.
Hybrid Scheme is a important formulation in solving Vavier-Stokes equations(incompressible flow problems),and SRM algorithm keeps the benefits of the penalty method,that is,velocity and pressure can be obtained separa...
详细信息
Hybrid Scheme is a important formulation in solving Vavier-Stokes equations(incompressible flow problems),and SRM algorithm keeps the benefits of the penalty method,that is,velocity and pressure can be obtained separately and no pressure-Poisson equation is involved,unlike the penalty method the SRM is more stable,less *** then case that a large number of time steps are needed,we introduce Domain-Decomposition based parallel techniques,and apply Sequential Approximation Virtual Boundary to compute the internal boundaries of sub-domains,less iterative is needed., computational results shows that the result is very well,and the speedup ratio of our method is larger.
Matrix inversion is a fundamental operation to solve linear equations for many computational applications. However, it is a challenging task to invert large-scale matrices of extremely high order (several thousands), ...
详细信息
ISBN:
(纸本)9781467399562
Matrix inversion is a fundamental operation to solve linear equations for many computational applications. However, it is a challenging task to invert large-scale matrices of extremely high order (several thousands), which are common in most of web-scale systems like social networks and recommendation systems. In this paper, we present a LU decomposition based block-recursive algorithm for large-scale matrix inversion, and its well-designed implementation with optimized data structure, reduction of space complexity and effective matrix multiplication on the Spark parallel computing platform. The experimental evaluation results show that the proposed algorithm is efficient to invert large-scale matrices on a cluster composed of commodity servers and scalable to invert even larger matrices. The proposed algorithm and implementation will be a solid base to build a high-performance linear algebra library on Spark for big data processing.
The existing directed graph clustering algorithms are born with some problems such as high latency, resource depletion and poor performance of iterative data processing. A distributed parallel algorithm of structure s...
详细信息
The existing directed graph clustering algorithms are born with some problems such as high latency, resource depletion and poor performance of iterative data processing. A distributed parallel algorithm of structure similarity clustering on Spark(Spark SCAN) is proposed to solve these problems: considering the interaction between nodes in the network, the similar structure of nodes are clustered together;Aiming at the large-scale characteristics of directed graphs, a data structure suitable for distributed graph computing is designed, and a distributed parallel clustering algorithm is proposed based on Spark framework, which improves the processing performance on the premise of the accuracy of clustering results. The experimental results show that the Spark SCAN have a good performance, and can effectively deal with the problem of clustering algorithm for large-scale directed graph.
Artificial bee colony is a meta-heuristic optimization algorithm based on the behavior of honey bee swarm. These bees work largely independently of other bees, making the algorithm suitable for parallel implementation...
详细信息
ISBN:
(纸本)9781479938414
Artificial bee colony is a meta-heuristic optimization algorithm based on the behavior of honey bee swarm. These bees work largely independently of other bees, making the algorithm suitable for parallel implementation. Within this paper, we introduce the algorithm itself and its subsequent parallelization utilizing the CUDA platform. The runtime speedup is demonstrated on several commonly used test functions for optimization. The algorithm is subsequently applied to the problem of clustering real data.
This paper presents a new dynamic method of subpopulation in solving multi-modal search problems with evolutionary algorithms. The new method identify the modes found at each generation and equalises the subpopulation...
详细信息
This paper presents a new dynamic method of subpopulation in solving multi-modal search problems with evolutionary algorithms. The new method identify the modes found at each generation and equalises the subpopulation sizes assigned to each mode. Modes are identified sequentially starting with the highest fitness mode. Mode membership is determined by successive grouping of fitness dominated convex bounding neighbours, starting from the fittest individual. This new dynamic modal subpopulation approach is able to fmd a representative sample of optima for multi-modal landscape with infinite number of global and local optima with uneven heights and non-uniform distribution. The algorithm also facilitates parallel implementation.
In order to solve parallel algorithm of Petri net system with concurrent functions and implement parallel control and execution of Petri net, parallel programming model of Petri net based on multi-core clusters is put...
详细信息
In order to solve parallel algorithm of Petri net system with concurrent functions and implement parallel control and execution of Petri net, parallel programming model of Petri net based on multi-core clusters is put forward. First, P-invariant technology is used to do the functional division of Petri net system and the parallel analysis of Petri net process. Next, based on architecture of multi-core cluster and combined with parallelism of Petri net process, Petri net system model is ***, parallel algorithm of Petri net system is raised. Finally, under the environment of multi-core cluster,the simulation experiment of parallel algorithm is done. The experimental results show that parallel algorithm based on Petri net system model is correct and feasible. The model can effectively simulate the actual operation of the system and is an effective model of Petri net parallelization system.
In order to solve parallel algorithm of Petri net system with concurrent functions and implement parallel control and execution of Petri net,parallel programming model of Petri net based on multi-core clusters is put ...
详细信息
ISBN:
(纸本)9781479941681
In order to solve parallel algorithm of Petri net system with concurrent functions and implement parallel control and execution of Petri net,parallel programming model of Petri net based on multi-core clusters is put ***,P-invariant technology is used to do the functional division of Petri net system and the parallel analysis of Petri net ***,based on architecture of multi-core cluster and combined with parallelism of Petri net process,Petri net system model is ***,parallel algorithm of Petri net system is ***,under the environment of multi-core cluster,the simulation experiment of parallel algorithm is *** experimental results show that parallel algorithm based on Petri net system model is correct and *** model can effectively simulate the actual operation of the system and is an effective model of Petri net parallelization system.
It becomes more and more important to design high-speed parallel cryptographic algorithms due to a growing need for information security. Conic curves cryptography is a new developing direction in the field of informa...
详细信息
It becomes more and more important to design high-speed parallel cryptographic algorithms due to a growing need for information security. Conic curves cryptography is a new developing direction in the field of information security in recent years and there are less works focused on the parallel encryption algorithms for conic curves crypto system. This paper proposes four parallel algorithms for conic curves cryptosystem over finite field GF(2(n)). One parallel algorithm of modular-multiplication is designed by analyzing its data dependency and making some modifications of several steps. In order to figure out the average runtime, we consider the probability distributions of different cases to compute the mathematical expectation. The operations of point-addition, point-double and pointmultiplication, three fundamental point operations in conic curves cryptosystem over finite field GF(2(n)), are paralleled based on this parallel algorithm of modular-multiplication and two parallel algorithms we proposed before. Time complexities and speedup ratios of the parallel algorithms and the sequential algorithms are calculated to make the quantitative comparison. The performance evaluation shows better efficiencies of the proposed parallel algorithms compared to the traditional algorithms.
暂无评论