Search Results - Inner Mongolia University Library

Toward performance portability of the Albany finite element analysis code using the Kokkos library

INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2019, Vol. 33, No. 2, pp. 332-352

Authors: Demeshko, Irina; Watkins, Jerry; Tezaur, Irina K.; Guba, Oksana; Spotz, William F.; Salinger, Andrew G.; Pawlowski, Roger P.; Heroux, Michael A. Affiliations: Los Alamos Natl Lab, Porting Albany Code Kokkos, Los Alamos, NM, USA; Sandia Natl Labs, Extreme Scale Data Sci & Analyt Dept, Livermore, CA, USA; Sandia Natl Labs, Ctr Comp Res, POB 5800, Albuquerque, NM 87185, USA; Sandia Natl Labs, Multiphys Applicat Dept, POB 5800, Albuquerque, NM 87185, USA; Sandia Natl Labs, POB 5800, Albuquerque, NM 87185, USA

Performance portability on heterogeneous high-performance computing (HPC) systems is a major challenge faced today by code developers: parallel code needs to be executed correctly as well as with high performance on machines with different architectures, operating systems, and software libraries. The finite element method (FEM) is a popular and flexible method for discretizing partial differential equations arising in a wide variety of scientific, engineering, and industrial applications that require HPC. This article presents some preliminary results pertaining to our development of a performance portable implementation of the FEM-based Albany code. Performance portability is achieved using the Kokkos library. We present performance results for the Aeras global atmosphere dynamical core module in Albany. Numerical experiments show that our single code implementation gives reasonable performance across three multicore/many-core architectures: NVIDIA graphics processing units (GPUs), Intel Xeon Phis, and multicore CPUs.

Keywords: Performance portability; many-core programming; finite element code; climate simulations; Kokkos library

The Missing Link! A New Skeleton for Evolutionary Multi-agent Systems in Erlang

INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2018, Vol. 46, No. 1, pp. 4-22

Authors: Stypka, Jan; Turek, Wojciech; Byrski, Aleksander; Kisiel-Dorohinicki, Marek; Barwell, Adam D.; Brown, Christopher; Hammond, Kevin; Janjic, Vladimir. Affiliations: AGH Univ Sci & Technol, Al Mickiewicza 30, PL-30059 Krakow, Poland; Univ St Andrews, Sch Comp Sci, St Andrews, Fife, Scotland

Evolutionary multi-agent systems (EMAS) play a critical role in many artificial intelligence applications that are in use today. In this paper, we present a new generic skeleton in Erlang for parallel EMAS computations. The skeleton enables us to capture a wide variety of concrete evolutionary computations that can exploit the same underlying parallel implementation. We demonstrate the use of our skeleton on two different evolutionary computing applications: (1) computing the minimum of the Rastrigin function; and (2) solving an urban traffic optimisation problem. We show that we can obtain very good speedups (up to 142.44 times the sequential performance) on a variety of different parallel hardware, while requiring very little parallelisation effort.

Keywords: Multi-core programming; Erlang; Agent-based computing; Metaheuristics; many-core programming; Algorithmic skeletons

A Performance Optimization Framework for the Simultaneous Heterogeneous Computing Platforms

ACM Workshop on Software Engineering Methods for Parallel and High Performance Applications (SEM4HPC)

Authors: Li, Shuo. Affiliations: Intel Corp, 2111 NE 25th Ave, Hillsboro, OR 97124, USA

ISBN: (print) 9781450343510

Heterogeneous computing platforms with a multicore host system and many-core accelerator devices have taken a major step forward in the mainstream HPC computing market this year with the announcement that the HP Apollo 6000 System's ProLiant XL250a server features Intel(R) Xeon Phi(TM) coprocessors. Although many application developers attempt to use it in the same way as GPGPU acceleration platforms, doing so forfeits the processing capability of multicore host processors and introduces power inefficiency in business operations. In this paper, we propose an application optimization framework to turn sequential legacy applications into highly parallel applications that make use of the hardware resources both on the host CPU and on the accelerator devices to enable simultaneous heterogeneous computing. As a case study, we look at how to apply this framework and adopt a structured methodology to develop option pricing applications that take advantage of a heterogeneous computing environment.

Keywords: Programming Model; Performance Measurement; many-core programming; vector programming; Development Tools; application acceleration; Power consumption; multithreaded parallelism; Heterogeneous programming

Reliable and Efficient Execution of Multiple Streaming Applications on Intel's SCC Processor

19th Workshop on Parallel Processing (Euro-Par)

Authors: Schor, Lars; Rai, Devendra; Yang, Hoeseok; Bacivarov, Iuliana; Thiele, Lothar. Affiliations: ETH, Comp Engn & Networks Lab, CH-8092 Zurich, Switzerland

ISBN: (print) 9783642544194; 9783642544200

Intel's Single-chip Cloud Computer (SCC) is a prototype architecture for on-chip many-core systems. By incorporating 48 cores into a single die, it provides unique opportunities to gain insights into many-core software development. Earlier results have shown that programming efficient and reliable software for many-core processors is difficult due to a lack of appropriate programming tools. In this paper, we present a programming framework to execute multiple applications specified as Kahn process networks on the SCC. These applications might be started or stopped at runtime based on requests of the user. The proposed application programming interface (API) abstracts low-level implementation details from the application designer enabling high-level performance analysis and automated mapping optimization. To efficiently execute workload specified by the proposed API, a lightweight runtime-system and an automated program synthesis backend are presented. Extensive experiments are carried out to characterize the performance of the proposed framework.

Keywords: many-core programming; Single-chip Cloud Computer (SCC); Runtime-System; Mapping; Distributed Application Layer (DAL)

Bamboo: A Data-Centric, Object-Oriented Approach to Many-core Software

ACM SIGPLAN Conference on Programming Language Design and Implementation

Authors: Zhou, Jin; Demsky, Brian. Affiliations: Univ Calif Irvine, Dept Elect Engn & Comp Sci, Irvine, CA 92697, USA

ISBN: (print) 9781450300193

Traditional data-oriented programming languages such as dataflow languages and stream languages provide a natural abstraction for parallel programming. In these languages, a developer focuses on the flow of data through the computation and these systems free the developer from the complexities of low-level, thread-oriented concurrency primitives. This simplification comes at a cost: traditional data-oriented approaches restrict the mutation of state and, in practice, the types of data structures a program can effectively use. Bamboo borrows from work in typestate and software transactions to relax the traditional restrictions of data-oriented programming models to support mutation of arbitrary data structures. We have implemented a compiler for Bamboo which generates code for the TILEPro64 many-core processor. We have evaluated this implementation on six benchmarks: Tracking, a feature tracking algorithm from computer vision; KMeans, a K-means clustering algorithm; Monte Carlo, a Monte Carlo simulation; Filter Bank, a multi-channel filter bank; Fractal, a Mandelbrot set computation; and Series, a Fourier series computation. We found that our compiler generated implementations that obtained speedups ranging from 26.2x to 61.6x when executed on 62 cores.

Keywords: many-core programming; Data-Centric Languages
