A new algorithm for nonlinear eigenvalue problems is proposed. The numerical technique is based on a perturbation of the coefficients of differential equation combined with the Adomian decomposition method for the non...
详细信息
A new algorithm for nonlinear eigenvalue problems is proposed. The numerical technique is based on a perturbation of the coefficients of differential equation combined with the Adomian decomposition method for the nonlinear part. The approach provides an exponential convergence rate with a base which is inversely proportional to the index of the eigenvalue under consideration. The eigenpairs can be computed in parallel. Numerical examples are presented to support the theory. They are in good agreement with the spectral asymptotics obtained by other authors.
This paper evaluates features of graph coloring algorithms implemented on graphics processing units (GPUs), comparing coloring heuristics and thread decompositions. As compared to prior work on graph coloring for othe...
详细信息
ISBN:
(纸本)9781450301190
This paper evaluates features of graph coloring algorithms implemented on graphics processing units (GPUs), comparing coloring heuristics and thread decompositions. As compared to prior work on graph coloring for other parallel architectures, we find that the large number of cores and relatively high global memory bandwidth of a GPU lead to different strategies for the parallel implementation. Specifically, we find that a simple uniform block partitioning is very effective on GPUs and our parallel coloring heuristics lead to the same or fewer colors than prior approaches for distributed-memory cluster architecture. Our algorithm resolves many coloring conflicts across partitioned blocks on the GPU by iterating through the coloring process, before returning to the CPU to resolve remaining conflicts. With this approach we get as few color (if not fewer) than the best sequential graph coloring algorithm and performance is close to the fastest sequential graph coloring algorithms which have poor color quality.
A new approach for Sensitivity Analysis (SA) in the field of air pollution modelling is proposed and applied to the Unified Danish Eulerian Model (UNI-DEM), a large-scale air pollution model. The SA requires numerous ...
详细信息
A new approach for Sensitivity Analysis (SA) in the field of air pollution modelling is proposed and applied to the Unified Danish Eulerian Model (UNI-DEM), a large-scale air pollution model. The SA requires numerous model experiments with different values of the studied parameters. By simultaneous variation of these parameters we produce a set of multidimensional discrete functions. These huge computational tasks require extensive resources of storage and CPU time. A highly parallel implementation of UNI-DEM has been created for this purpose and implemented on two powerful supercomputers. Some details of this implementation and numerical results on these supercomputers are presented.
To obtain the optimal path in a unknown disaster field,a rescue robot needs to build an environment map. The information of the disaster field is collected by the sonsors of different robots, all signal from sensors (...
详细信息
ISBN:
(纸本)9783037851517
To obtain the optimal path in a unknown disaster field,a rescue robot needs to build an environment map. The information of the disaster field is collected by the sonsors of different robots, all signal from sensors (mounted on all robots and signal form GPS) are sent to the bakeside parllel processors with wireless network. A grid computing environment serves as the backside parallel processors with Globus Toolkit, the grid computing processor process all the signals and construct the global map to help robot for navigation path planning. The rescue robot get control signal from the grid computing processor with wireless network,thus, the robot is not necessary to be sophisticated. New computing methods are given for parallel algorithm on grid environment. The navigation control is implemented with the cooperation among heterogeneous agents, the advantages of large seale computing on grid are shown.
This paper presents a practical approach to parallelize the test data generation algorithm by which computing resources can be fully used. The test data generation approach that we are using is based on the dynamic sy...
详细信息
This paper presents a practical approach to parallelize the test data generation algorithm by which computing resources can be fully used. The test data generation approach that we are using is based on the dynamic symbolic execution (concolic testing). The basic idea of parallelizing the algorithm is to distribute analysis processes of different paths to different computing units. Although a centralized scheduler with several sub processes can directly achieve the goal of parallelism, it may cause global idle time when parallel processes frequently end at same time. In our approach, a runtime deterministic scheduler is introduced to reduce the potential global idle time. Our experiments show some notable results when using a proper scheduling function. Compared with the sequential concolic testing, our approach can save nearly 70% computing time in some cases on a system with eight CPU cores from our experiments.
The aim of this paper is to show how to determine the neighborhood of the complex discrete optimization problem and how to search it in the parallel environment, this being illustrated by an example of the hybrid sche...
详细信息
The aim of this paper is to show how to determine the neighborhood of the complex discrete optimization problem and how to search it in the parallel environment, this being illustrated by an example of the hybrid scheduling, more precisely a flexible job shop problem. We present a parallel single-walk approach in this respect. A theoretical analysis based on PRAM model of parallel computing has been made. We propose a cost-optimal method of neighborhood generation parallelization.
Large scale nonlinear programming (NLP) has proven to be an effective framework for obtaining profit gains through optimal process design and operations in chemical engineering. While the classical SQP and Interior Po...
详细信息
Large scale nonlinear programming (NLP) has proven to be an effective framework for obtaining profit gains through optimal process design and operations in chemical engineering. While the classical SQP and Interior Point methods have been successfully applied to solve many optimization problems, the focus of both academia and industry on larger and more complicated problems requires further development of numerical algorithms which can provide improved computational efficiency. The primary purpose of this dissertation is to develop effective problem formulations and an advanced numerical algorithms for efficient solution of these challenging problems. As problem sizes increase, there is a need for tailored algorithms that can exploit problem specific structure. Furthermore, computer chip manufacturers are no longer focusing on increased clock-speeds, but rather on hyperthreading and multi-core architectures. Therefore, to see continued performance improvement, we must focus on algorithms that can exploit emerging parallel computing architectures. In this dissertation, we develop an advanced parallel solution strategy for nonlinear programming problems with block-angular structure. The effectiveness of this and modern off-the-shelf tools are demonstrated on a wide range of problem classes. Here, we treat optimal design, optimal operation, dynamic optimization, and parameter estimation. Two case studies (air separation units and heat-integrated columns) are investigated to deal with design under uncertainty with rigorous models. For optimal operation, this dissertation takes cryogenic air separation units as a primary case study and focuses on formulations for handling uncertain product demands, contractual constraints on customer satisfaction levels, and variable power pricing. Multiperiod formulations provide operating plans that consider inventory to meet customer demands and improve profits. In the area of dynamic optimization, optimal reference trajectories are d
Average (mean) voter is one of the commonest voting methods suitable for decision making in highly-available and long-missions applications where the availability and the speed of the system are *** this paper,a new...
详细信息
Average (mean) voter is one of the commonest voting methods suitable for decision making in highly-available and long-missions applications where the availability and the speed of the system are *** this paper,a new generation of average voter based on parallel algorithms and parallel random access machine(PRAM) structure are *** analysis shows that this algorithm is optimal due to its improved time complexity,speed-up,and efficiency and is especially appropriate for applications where the size of input space is large.
Through the research of MPI's theory and features, the G-MPI parallel program design and running framework have been constructed. Afterwards the design and communication cost of GMRES (m) algorithm has been studie...
详细信息
ISBN:
(数字)9783642161674
ISBN:
(纸本)9783642161667
Through the research of MPI's theory and features, the G-MPI parallel program design and running framework have been constructed. Afterwards the design and communication cost of GMRES (m) algorithm has been studied, so one parallel numerical algorithm, with coarse granularity and low communication cost which is applied to solving the large elastic problems by using boundary element method, has been presented. Through the comparison with the result of the traditional parallel GMRES (m) in MPI, the new parallel algorithm in G-MPI has comparatively higher calculation accuracy and calculation efficiency.
We propose here a parallel implementation of multidimensional scaling (MDS) method which can be used for visualization of large datasets of multidimensional data.. Unlike in traditional approaches, which employ classi...
详细信息
ISBN:
(纸本)9783642143892
We propose here a parallel implementation of multidimensional scaling (MDS) method which can be used for visualization of large datasets of multidimensional data.. Unlike in traditional approaches, which employ classical minimization methods for finding the global optimum of the "stress function", we use a heuristic based on particle dynamics. This method allows avoiding local minima and is convergent to the global one. However, due to its O(N-2) complexity, the application of this method in data mining problems involving large datasets requires efficient parallel codes. We show that employing both optimized Taylor's algorithm and hybridized model of parallel computations, our solver is efficient enough to visualize multidimensional data sets consisting of 10(4) feature vectors in time of minutes.
暂无评论