ISBN (print): 9781479953134
Research on high-level parallel programming approaches systematically evaluates the performance of applications written using these approaches and informally argues that high-level parallel programming languages or libraries increase programmer productivity. In this paper we present a methodology for evaluating the trade-off between programming effort and performance of applications developed using different programming models. We apply this methodology to several implementations of a function solving the all nearest smaller values problem. The high-level implementation is based on a new version of the BSP homomorphism algorithmic skeleton.
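For reference, the all nearest smaller values (ANSV) problem the implementations above solve can be stated concretely. Below is a minimal sequential, stack-based sketch (function name and output convention are ours, not the paper's); the paper's high-level version parallelizes this computation with a BSP homomorphism skeleton.

```python
def all_nearest_smaller_values(xs):
    """For each element of xs, return the nearest value to its left
    that is strictly smaller, or None if no such value exists."""
    stack = []    # candidate smaller values, kept strictly increasing
    result = []
    for x in xs:
        # Discard candidates that are not smaller than the current element.
        while stack and stack[-1] >= x:
            stack.pop()
        result.append(stack[-1] if stack else None)
        stack.append(x)
    return result

print(all_nearest_smaller_values([0, 8, 4, 12, 2, 10, 6, 14]))
# [None, 0, 0, 4, 0, 2, 2, 6]
```

Each element is pushed and popped at most once, so the sequential version runs in linear time; the parallel challenge is distributing the stack discipline across processors.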
Parallel programming is commonly done through a library approach, as in the Message Passing Interface (MPI), directives, as in OpenMP, language extensions, as in High Performance Fortran (HPF), or whole new languages,...
ISBN (print): 9781450332170
The increased presence of parallel computing platforms brings concerns to the general purpose domain that were previously prevalent only in the specific niche of high-performance computing. As parallel programming technologies become more prevalent in the form of new emerging programming languages and extensions of existing languages, additional safety concerns arise as part of the paradigm shift from sequential to parallel behaviour. In this paper, we propose various syntax extensions to the Ada language, which provide mechanisms whereby the compiler is given the necessary semantic information to enable the implicit and explicit parallelization of code. The model is based on earlier work, which separates parallelism specification from concurrency implementation, but proposes an updated syntax with additional mechanisms to facilitate the development of safer parallel programs. Copyright 2014 ACM.
The advantages of the Monte Carlo method for reactor analysis are well known, but full-core reactor analysis poses challenges in computation time and computer memory. Meanwhile, the exponential growth of computer pow...
Python has become the de facto language for scientific computing. Programming in Python is highly productive, mainly due to its rich science-oriented software ecosystem built around the NumPy module. As a result, the demand for Python support in High-Performance Computing (HPC) has skyrocketed. However, the Python language itself does not necessarily offer high performance. This work presents a workflow that retains Python's high productivity while achieving portable performance across different architectures. The workflow's key features are HPC-oriented language extensions and a set of automatic optimizations powered by a data-centric intermediate representation. We show performance results and scaling across CPU, GPU, FPGA, and the Piz Daint supercomputer (up to 23,328 cores), with 2.47x and 3.75x speedups over previous-best solutions, first-ever Xilinx and Intel FPGA results of annotated Python, and up to 93.16% scaling efficiency on 512 nodes. Our benchmarks were reproduced in the Student Cluster Competition (SCC) during the Supercomputing Conference (SC) 2022. We present and discuss the student teams' results.
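The abstract above does not show its annotation syntax, so as an illustration only: the kind of NumPy kernel such HPC-oriented Python workflows typically take as input is a type-annotated array function like the one below (the function and its shapes are hypothetical examples, not the paper's API).

```python
import numpy as np

def jacobi_step(a: np.ndarray) -> np.ndarray:
    """One step of a 1-D three-point Jacobi stencil, written in plain
    NumPy slicing so a data-centric compiler can lower it to fast code."""
    out = a.copy()
    out[1:-1] = (a[:-2] + a[1:-1] + a[2:]) / 3.0
    return out

print(jacobi_step(np.array([0.0, 0.0, 9.0, 0.0])))
# [0. 3. 3. 0.]
```

Keeping the kernel in idiomatic slice notation, rather than Python loops, is what lets such toolchains recover the data movement structure needed for CPU, GPU, and FPGA code generation.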
PARRAY (or parallelizing ARRAYs) is an extension of C language that supports system-level succinct programming for heterogeneous parallel systems. Parray extends mainstream C programming with novel array types. This l...
Spectral clustering algorithms have been used in various research domains to discover structure and patterns in data. However, high computational and space complexity hinders their usage for large-scale datasets in machine learning and bioinformatics. Various approximate spectral clustering methods were proposed in the open literature to solve those problems. In this paper, we describe our GPU-based, parallel implementation of an approximate spectral algorithm based on the Nystrom method and column sampling and its memory-efficient variant. We evaluate our solution using several annotated datasets, such as USPS, MNIST, and MNIST8, as well as bioinformatics data, especially from the domain of single-cell and spatial transcriptomics. We obtain speedups of up to 31.8x depending on the dataset used and demonstrate the scalability of the solution for the datasets with up to four million samples.
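The core of the Nyström approach described above can be sketched on the CPU with NumPy: sample m landmark points, eigendecompose the small landmark affinity block, and extend the eigenvectors to all n points. This is an illustrative sketch, not the paper's GPU implementation; the function name, the RBF affinity choice, and the parameters are our assumptions.

```python
import numpy as np

def nystrom_embedding(X, m, k, gamma=1.0, seed=0):
    """Approximate the top-k eigenvectors of an n x n RBF affinity
    matrix using only m sampled landmark columns (Nystrom method)."""
    rng = np.random.default_rng(seed)
    n = X.shape[0]
    idx = rng.choice(n, size=m, replace=False)        # landmark sample
    # Affinities between all points and the landmarks: C is n x m.
    d = ((X[:, None, :] - X[idx][None, :, :]) ** 2).sum(-1)
    C = np.exp(-gamma * d)
    W = C[idx]                                         # m x m landmark block
    # Eigendecompose the small block, then extend to all n points.
    vals, vecs = np.linalg.eigh(W)
    top = np.argsort(vals)[::-1][:k]
    U = C @ vecs[:, top] / vals[top]                   # approximate eigenvectors
    # Row-normalize, as in standard spectral clustering; then run
    # k-means on the rows of U to obtain cluster labels.
    U /= np.linalg.norm(U, axis=1, keepdims=True) + 1e-12
    return U
```

The point of the approximation is that only the n x m block C is ever materialized, which is what makes memory-efficient GPU variants for multi-million-sample datasets feasible.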
Multi-core architectures have increased the power of parallelism by coupling many cores in a single chip. This makes it even more complex for developers to exploit the available parallelism in order to provide high per...
Several algorithms applied to the solution of specific problems in physics require high-performance computing. This is the case, for example, in the field of digital image processing, where the required performance in terms of speed, and sometimes running in a real-time environment, leads to the use of parallel programming tools. To meet this demand it is important to understand these tools, highlighting their differences and possible applications. Moreover, research centers around the world have clusters of computers, or multi-core platforms, with strong potential for using parallel programming techniques. This study aims to characterize the thread and fork parallel programming techniques. Both techniques allow the development of parallel codes, each with its own restrictions on inter-process communication and programming format. This Technical Note aims to highlight the use of each of these techniques, and to present an application in the area of image processing in which they were used. The application part of this work was developed in international collaboration with the JET Laboratory (Joint European Torus of the European Atomic Energy Community / EURATOM). The JET Laboratory investigates the process of forming the plasma and its instability, which appears as a toroidal ring of increased radiation, known as MARFE (Multifaceted Asymmetric Radiation From The Edge). The activities have explored the techniques of parallel programming algorithms in digital image processing. The presented algorithms achieve a processing rate higher than 10 000 images per second and use threads and shared-memory communication between independent processes, which is equivalent to fork.
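The thread-versus-fork distinction above can be sketched in a few lines: threads share the process address space and can mutate a plain buffer directly, while a forked child process needs an explicitly shared-memory object to publish its result. This is an illustrative sketch (the toy `brighten` kernel and function names are ours, not from the note), using Python's threading and multiprocessing modules as stand-ins for the POSIX primitives.

```python
import threading
import multiprocessing as mp

def brighten(pixels, offset):
    """Toy stand-in for an image-processing kernel: add offset to each
    pixel value, clamped to 255."""
    for i in range(len(pixels)):
        pixels[i] = min(255, pixels[i] + offset)

def run_with_thread(data, offset):
    # Threads share the address space: the list is mutated in place.
    t = threading.Thread(target=brighten, args=(data, offset))
    t.start()
    t.join()
    return data

def run_with_process(data, offset):
    # A child process gets its own address space; a multiprocessing.Array
    # backed by shared memory lets the child publish its result, mirroring
    # fork plus shared-memory communication.
    shared = mp.Array('i', data)
    p = mp.Process(target=brighten, args=(shared, offset))
    p.start()
    p.join()
    return list(shared)

if __name__ == "__main__":
    print(run_with_thread([10, 20, 30, 250], 10))   # [20, 30, 40, 255]
    print(run_with_process([10, 20, 30, 250], 10))  # [20, 30, 40, 255]
```

Both paths produce the same result; the difference is that the fork path pays for explicit shared-memory setup, while the thread path pays for the care needed when multiple threads touch the same buffer concurrently.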
The evolution of Graphics Processing Units (GPUs) has allowed the industry to overcome long-lasting problems and challenges. Many belong to the stream processing domain, whose central aspect is continuously receiving and processing data from streaming data producers such as cameras and sensors. Nonetheless, programming GPUs is challenging because it requires deep knowledge of many-core programming, mechanisms, and optimizations for GPUs. Current GPU programming standards do not target stream processing and present programmability and code portability limitations. Among our main scientific contributions is GSParLib, a C++ multi-level programming interface unifying CUDA and OpenCL for GPU processing of stream and data parallelism with negligible performance losses compared to manual implementations. GSParLib is organized in two layers: one for general-purpose computing and another for high-level structured programming based on parallel patterns. Further contributions include a methodology to provide unified and driver-agnostic interfaces while minimizing performance losses; a set of parallelism strategies and optimizations for GPU processing targeting stream and data parallelism; and new experiments covering GPU performance on applications exposing stream and data parallelism.