Big datasets have become commonplace in large-scale data analysis; they arise in social and information networks and in scientific mesh computations. These datasets are commonly stored and processed across multiple machines because of the limited capabilities (such as memory and CPU) of a single machine. However, many available analysis tools still cannot fully utilize existing distributed-memory architectures. Because these datasets are usually processed and analyzed in the form of graphs or meshes, in this dissertation we propose scalable and efficient approaches for graph and mesh computations on distributed-memory systems. Although graph and mesh computations are closely related in their parallelization approaches, some of their unique characteristics still need to be addressed separately. We therefore organize the dissertation into two parts: the first covers distributed graph computations, and the second covers distributed mesh computations. In the first part of the dissertation, we focus on graph computations. First, we study the Single-Source Shortest Path (SSSP) problem by analyzing and evaluating three well-known SSSP algorithms, i.e., Dijkstra's, Bellman-Ford, and $\Delta$-stepping. We implement these algorithms on distributed-memory systems based on a bulk synchronous parallel model, and we evaluate and compare their performance. Next, we propose our own SSSP algorithm, which combines the advantages of these algorithms and utilizes a two-dimensional (2D) graph layout for our graph data structures. We then extend our study of the 2D graph data structures and optimization approaches to other well-known graph algorithms, including breadth-first search, approximate diameter, connected components, and PageRank, on various real-world graphs. Our objective is to implement an efficient graph framework for distributed-memory systems that works well for many graph algorithms on various graph types. Finally, ...
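As a concrete illustration of one of the algorithms compared above, the following is a minimal, single-process Python sketch of the $\Delta$-stepping bucket structure: light edges (weight at most delta) are relaxed repeatedly within a bucket, heavy edges once per bucket phase. The graph encoding and the delta parameter here are our illustrative choices, not the dissertation's distributed 2D implementation.

    def delta_stepping(graph, source, delta):
        """graph: {u: [(v, w), ...]}; returns shortest distances from source."""
        dist = {source: 0.0}
        buckets = {0: {source}}                  # bucket index -> vertex set

        def relax(v, d):
            old = dist.get(v, float("inf"))
            if d < old:
                if old != float("inf"):          # move v out of its old bucket
                    buckets.get(int(old // delta), set()).discard(v)
                dist[v] = d
                buckets.setdefault(int(d // delta), set()).add(v)

        while buckets:
            i = min(buckets)                     # smallest non-empty bucket
            settled = set()
            while buckets.get(i):                # light edges may refill bucket i
                frontier = buckets.pop(i)
                settled |= frontier
                for u in frontier:
                    for v, w in graph.get(u, ()):
                        if w <= delta:
                            relax(v, dist[u] + w)
            buckets.pop(i, None)                 # drop an empty leftover bucket
            for u in settled:                    # heavy edges: once per phase
                for v, w in graph.get(u, ()):
                    if w > delta:
                        relax(v, dist[u] + w)
        return dist

    g = {0: [(1, 1.0), (2, 4.0)], 1: [(2, 1.5)], 2: []}
    print(delta_stepping(g, 0, delta=2.0))       # {0: 0.0, 1: 1.0, 2: 2.5}

With delta at or below the minimum edge weight this degenerates toward Dijkstra's algorithm; with delta at or above the maximum weight it behaves like Bellman-Ford, which is why the parameter trades work efficiency against parallelism.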
Linear Temporal Logic (LTL) model checking is a very important and popular technique for the automatic verification of safety-critical hardware and software systems, aiming at ensuring their quality. However, it is well known that LTL model checking suffers from the state explosion problem, often leading to insurmountable scalability problems when applying it to real-world systems. While there has been work on distributed algorithms for explicit on-the-fly LTL model checking, these are neither sufficiently scalable nor capable of tolerating faults during computation, significantly limiting their usefulness in huge cluster environments. Moreover, implementing these algorithms is generally viewed as a very challenging, error-prone task. In this paper, we instead rely on Pregel, a simple yet powerful model for distributed computation on large graphs. Pregel has from the start been designed for efficient, scalable, and fault-tolerant operation on clusters of thousands of computers, including large cloud setups. To harness Pregel's power, we propose a new vertex-centric distributed algorithm for explicit LTL model checking of concurrent systems. Experimental results illustrate the feasibility and scalability of the proposed algorithm. Compared with other distributed algorithms, ours is more scalable, reliable, and efficient.
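To make the vertex-centric computation model concrete, here is a minimal sequential emulation of Pregel's superstep loop in Python, using reachability flooding from an initial state as a toy stand-in for state-space exploration. The Vertex class, compute function, and driver are our illustrative names under that assumption, not the paper's algorithm or Pregel's actual API.

    class Vertex:
        def __init__(self, vid, edges):
            self.id, self.edges = vid, edges
            self.value = False                   # here: "reachable from init?"

    def compute(v, messages, outbox):
        """One superstep for one vertex: absorb messages, maybe send."""
        if messages and not v.value:
            v.value = True
            for dst in v.edges:                  # flood to successor states
                outbox.setdefault(dst, []).append(True)
        # a vertex that sends nothing implicitly votes to halt

    def run(vertices, init_id, max_supersteps=100):
        inbox = {init_id: [True]}                # seed the initial state
        for _ in range(max_supersteps):
            if not inbox:                        # all halted, no messages left
                break
            outbox = {}
            for vid, msgs in inbox.items():      # only messaged vertices wake up
                compute(vertices[vid], msgs, outbox)
            inbox = outbox
        return {vid: v.value for vid, v in vertices.items()}

    vs = {i: Vertex(i, e) for i, e in {0: [1], 1: [2], 2: [0], 3: []}.items()}
    print(run(vs, init_id=0))                    # {0: True, 1: True, 2: True, 3: False}

In a real Pregel deployment each superstep runs the compute calls in parallel across workers, with the framework handling message routing, barriers between supersteps, and checkpoint-based fault tolerance.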
The reactor physics (neutronics) method of the Coarse Mesh Radiation Transport (COMET) code has been used to solve whole-core reactor eigenvalue and power distribution problems. COMET solutions are computed to Monte Carlo accuracy on a single processor with several orders of magnitude faster computational speed. However, to extend the method to include on-the-fly depletion and incident-flux response expansion function calculations via Monte Carlo, a parallel implementation of the deterministic COMET calculations has been developed. COMET involves inner and outer iterations; the inner iterations contain local (i.e., response data) calculations that can be carried out independently, making the algorithm amenable to parallelization. Taking advantage of this fact, a distributed-memory algorithm featuring domain decomposition was developed. To allow for an efficient parallel implementation of a distributed algorithm, changes to response data access and sweep order were made, along with considerations for communication between processors. These changes make the approach generalizable to many different problem types. A software implementation called COMET-MPI was developed and applied to several benchmark problems. Analysis of the computational performance of COMET-MPI yielded an estimated parallel fraction of 0.98 for the code, implying a high level of parallelism. In addition, wall-clock times on the order of minutes are achieved when the code is used to solve whole-core benchmark problems, showing vastly improved computational efficiency with the distributed-memory algorithm.
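The inner/outer structure described above maps naturally onto a message-passing program: inner response calculations are rank-local, and only the outer eigenvalue update needs communication. The following mpi4py sketch shows that shape under our own assumptions; local_response(), the cell decomposition, and the toy eigenvalue update are stand-ins, not COMET-MPI's actual code.

    from mpi4py import MPI

    comm = MPI.COMM_WORLD
    rank, size = comm.Get_rank(), comm.Get_size()
    N_CELLS = 100

    def local_response(cell, k):
        # placeholder for an independent per-cell response calculation
        return 1.0 / (cell + k)

    my_cells = range(rank, N_CELLS, size)    # cyclic domain decomposition
    k = 1.0                                  # initial eigenvalue guess
    for outer in range(50):                  # outer iterations couple domains
        # inner iterations: purely local, no communication needed
        local_sum = sum(local_response(c, k) for c in my_cells)
        total = comm.allreduce(local_sum, op=MPI.SUM)
        k_new = total / N_CELLS              # toy update standing in for the
        done = abs(k_new - k) < 1e-10        # real eigenvalue iteration
        k = k_new
        if done:
            break
    if rank == 0:
        print("converged k =", k, "after", outer + 1, "outer iterations")

Run with, for example, mpiexec -n 4 python sketch.py. Because only a scalar reduction crosses ranks per outer iteration while all inner work stays local, the communication fraction stays small, which is the property behind a parallel fraction as high as the 0.98 reported above.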
ISBN (print): 9781509036820
A recently proposed scenario decomposition algorithm for stochastic 0-1 programs finds an optimal solution by evaluating and removing individual solutions that are discovered by solving scenario subproblems. In this work, we develop an asynchronous, distributed implementation of the algorithm that has computational advantages over existing synchronous implementations. Improvements to both the synchronous and asynchronous algorithms are proposed. We test the results on well-known stochastic 0-1 programs from the SIPLIB test library and are able to solve one previously unsolved instance from the test set.
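The asynchronous pattern described above, evaluating each candidate solution as soon as its scenario subproblem returns rather than waiting for a synchronous round, can be sketched with a worker pool. Here solve_scenario() and evaluate() are toy stand-ins for the subproblem solver and the candidate evaluation; this is our illustration of the pattern, not the paper's implementation.

    from concurrent.futures import ProcessPoolExecutor, as_completed

    def solve_scenario(s):
        # stand-in: returns (candidate 0-1 solution, its subproblem bound)
        return (s % 3, float(s))

    def evaluate(candidate):
        # stand-in: true objective of a candidate across all scenarios
        return 10.0 - candidate

    def main():
        best, removed = float("inf"), set()
        with ProcessPoolExecutor() as pool:
            futures = [pool.submit(solve_scenario, s) for s in range(8)]
            for fut in as_completed(futures):    # asynchronous: no barrier
                candidate, _bound = fut.result()
                if candidate in removed:
                    continue                     # already evaluated and removed
                best = min(best, evaluate(candidate))
                removed.add(candidate)           # exclude it from later rounds
        print("incumbent objective:", best)

    if __name__ == "__main__":
        main()

The advantage over a synchronous design is that a fast subproblem never waits on the slowest one before its candidate is evaluated and removed, which matters when scenario solve times vary widely.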
ISBN (print): 9781467365987
The Support Vector Machine (SVM) is a supervised Machine Learning and Data Mining (MLDM) algorithm that has become ubiquitous largely due to its high accuracy and obliviousness to dimensionality. The objective of SVM is to find an optimal boundary, also known as a hyperplane, which separates the samples (examples in a dataset) of different classes by a maximum margin. Usually, very few samples contribute to the definition of the boundary. However, existing parallel algorithms use the entire dataset for finding the boundary, which is sub-optimal for performance reasons. In this paper, we propose a novel distributed-memory algorithm to eliminate the samples which do not contribute to the boundary definition in SVM. We propose several heuristics, ranging from early (aggressive) to late (conservative) elimination of samples, such that the overall time for generating the boundary is reduced considerably. In a few cases, a sample may be eliminated (shrunk) pre-emptively, potentially resulting in an incorrect boundary. We propose a scalable approach to synchronize the necessary data structures so that the proposed algorithm maintains its accuracy. We consider the necessary trade-offs of single versus multiple synchronization using an in-depth time-space complexity analysis. We implement the proposed algorithm using MPI and compare it with libsvm, the de facto sequential SVM software, which we enhance with OpenMP for multi-core/many-core parallelism. Our proposed approach shows excellent efficiency using up to 4096 processes on several large datasets, such as the UCI HIGGS Boson dataset and the Offending URL dataset.
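The elimination idea above can be illustrated on a simple linear-SVM trainer: samples whose margin is comfortably beyond 1 are periodically dropped from the working set, with a threshold controlling how aggressive or conservative the elimination is. The sub-gradient trainer and the slack threshold below are our stand-ins, not the paper's distributed heuristics.

    import numpy as np

    def train_with_shrinking(X, y, lam=0.01, epochs=50, slack=1.5):
        """X: (n, d) samples; y: (n,) labels in {-1, +1}."""
        n, d = X.shape
        w = np.zeros(d)
        active = np.arange(n)                    # current working set
        for epoch in range(1, epochs + 1):
            eta = 1.0 / (lam * epoch)
            margins = y[active] * (X[active] @ w)
            viol = active[margins < 1.0]         # margin violators drive updates
            grad = lam * w - (y[viol][:, None] * X[viol]).sum(axis=0) / n
            w -= eta * grad
            # shrink: drop samples comfortably outside the margin; a smaller
            # slack is more aggressive, a larger one more conservative
            active = active[margins < slack]
        return w

    rng = np.random.default_rng(0)
    X = rng.normal(size=(200, 2))
    y = np.where(X[:, 0] + X[:, 1] > 0, 1.0, -1.0)
    w = train_with_shrinking(X, y)
    print("training accuracy:", (np.sign(X @ w) == y).mean())

An aggressive (small) threshold shrinks the working set quickly but risks discarding a sample that later becomes a support vector, which is the failure mode the paper's synchronization scheme is designed to correct.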