We introduce concurrent Kleene algebra with tests (CKAT) as a combination of the Kleene algebra with tests (KAT) of Kozen and Smith with the concurrent Kleene algebras (CKA) introduced by Hoare, Moller, Struth and Wehrman. CKAT provides a relatively simple algebraic model for reasoning about the semantics of concurrent programs. We generalize guarded strings to guarded series-parallel strings, or gsp-strings, to give a concrete language model for CKAT. Combining the nondeterministic guarded automata of Kozen with the branching automata of Lodaya and Weil, one obtains a model for processing gsp-strings in parallel. To ensure that the model satisfies the weak exchange law (x ∥ y)(z ∥ w) ≤ (xz) ∥ (yw) of CKA, we make use of Gischer's subsumption order on gsp-strings. We also define deterministic branching automata and investigate their relation to (nondeterministic) branching automata. To express basic concurrent algorithms, we define concurrent deterministic flowchart schemas and relate them to branching automata and to concurrent Kleene algebras with tests. © 2016 Elsevier Inc. All rights reserved.
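As a reading aid, the weak exchange law can be typeset in display form, together with a one-line intuition for why the subsumption order validates it (the gloss is ours, not part of the abstract):

```latex
% Weak exchange law of CKA: sequencing two parallel compositions refines
% the parallel composition of the two sequential compositions.
\[
  (x \parallel y)\,(z \parallel w) \;\le\; (xz) \parallel (yw)
\]
% Intuition via Gischer's subsumption order: the left-hand side imposes
% every ordering present on the right (x before z, y before w) plus the
% extra constraints x before w and y before z, so each gsp-string of the
% left side is subsumed by a gsp-string of the right side.
```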
This paper describes a safe and efficient combination of the object-based message-driven execution and shared array parallel programming models. In particular, we demonstrate how this combination engenders the composition of loosely coupled parallel modules safely accessing a common shared array. That loose coupling enables both better flexibility in parallel execution and greater ease of implementing multi-physics simulations. As a case study, we describe how the parallelization of a new method for molecular dynamics simulation benefits from both of these advantages. We also describe a system of typed handle objects that embed some of the determinacy constraints of the Multiphase Shared Array programming model in the C++ type system, to catch some violations at compile time. The combined programming model communicates in terms of these handles as a natural means of detecting and preventing errors. © 2011 Elsevier B.V. All rights reserved.
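The typed-handle idea is concrete enough to sketch. The paper's implementation lives in the C++ type system; below is a minimal Python analogue (all class and method names are hypothetical, not the paper's API) showing how distinct handle types can expose only the operations legal in the current access phase, so a static type checker rejects phase violations:

```python
# Minimal sketch of phase-typed handles for a shared array. All names are
# hypothetical illustrations of the idea, not the paper's actual API.
from __future__ import annotations


class ReadHandle:
    """Exposes only reads; no mutating operation exists on this type."""

    def __init__(self, data: list[float]) -> None:
        self._data = data

    def get(self, i: int) -> float:
        return self._data[i]

    def sync_to_accumulate(self) -> AccumulateHandle:
        # Phase change: trade this handle for one permitting accumulation.
        return AccumulateHandle(self._data)


class AccumulateHandle:
    """Exposes only commutative accumulation; reads are unavailable."""

    def __init__(self, data: list[float]) -> None:
        self._data = data

    def accumulate(self, i: int, value: float) -> None:
        self._data[i] += value

    def sync_to_read(self) -> ReadHandle:
        return ReadHandle(self._data)


h = AccumulateHandle([0.0] * 4)
h.accumulate(0, 2.5)
r = h.sync_to_read()
print(r.get(0))        # OK: 2.5
# r.accumulate(0, 1.0)  # rejected statically: no such method on ReadHandle
```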
In the age of the Internet of Things and social media platforms, huge amounts of digital data are generated by and collected from many sources, including sensors, mobile devices, wearable trackers and security cameras. This data, commonly referred to as Big Data, is challenging current storage, processing, and analysis capabilities. New models, languages, systems and algorithms continue to be developed to effectively collect, store, analyze and learn from Big Data. Most recent surveys provide a global analysis of the tools used in the main phases of Big Data management (generation, acquisition, storage, querying and visualization of data). In contrast, this work analyzes and reviews the parallel and distributed paradigms, languages and systems used today to analyze and learn from Big Data on scalable computers. In particular, we provide an in-depth analysis of the properties of the main parallel programming paradigms (MapReduce, workflow, BSP, message passing, and SQL-like) and, through programming examples, we describe the most widely used systems for Big Data analysis (e.g., Hadoop, Spark, and Storm). Furthermore, we discuss and compare the different systems by highlighting the main features of each, their diffusion (community of developers and users), and the main advantages and disadvantages of using them to implement Big Data analysis applications. The final goal of this work is to help designers and developers identify and select the most appropriate programming solution based on their skills, hardware availability, application domains and purposes, taking into account the support provided by the developer community.
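To make the kind of "programming examples" the survey discusses concrete, here is the canonical word count expressed in Spark's Python API (PySpark); the input path is a placeholder:

```python
# Classic MapReduce-style word count in PySpark.
# "input.txt" is a placeholder path; any text file will do.
from pyspark import SparkContext

sc = SparkContext(appName="WordCount")

counts = (
    sc.textFile("input.txt")                  # one record per line
      .flatMap(lambda line: line.split())     # map: line -> words
      .map(lambda word: (word, 1))            # emit (word, 1) pairs
      .reduceByKey(lambda a, b: a + b)        # reduce: sum counts per word
)

for word, count in counts.take(10):
    print(word, count)

sc.stop()
```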
This paper presents a framework to easily build and execute parallel applications in container-based distributed computing platforms in a user-transparent way. The proposed framework is a combination of the COMP Superscalar (COMPSs) programming model and runtime, which provides a straightforward way to develop task-based parallel applications from sequential code, with container management platforms that ease the deployment of applications in computing environments (such as Docker, Mesos or Singularity). This framework provides scientists and developers with an easy way to implement parallel distributed applications and deploy them in a one-click fashion. We have built a prototype that integrates COMPSs with different container engines in different scenarios: i) a Docker cluster, ii) a Mesos cluster, and iii) Singularity in an HPC cluster. We have evaluated the overhead in the building, deployment and execution phases of two benchmark applications, compared to a Cloud testbed based on KVM and OpenStack and to the use of bare-metal nodes. We observed a significant gain in comparison to cloud environments during the building and deployment phases, which enables better adaptation of resources to the computational load. In contrast, we detected extra overhead during execution, mainly due to multi-host Docker networking.
Many of the world's leading supercomputer architectures are a hybrid of shared memory and network-distributed memory. Such an architecture lends itself to a hybrid MPI-thread programming model. We first present an implementation of inter-thread message passing based on the MPI and pthread libraries. In addition, we present an efficient implementation of termination detection for communication rounds. We use the term phased message passing to denote the communication interface based on this termination detection. This interface is then used to implement parallel operations for adaptive unstructured meshes, and the performance of the resulting applications is compared to pure MPI operation. We also present new workflows enabled by the ability to vary the number of threads at run time. © 2016 Elsevier B.V. All rights reserved.
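The abstract does not spell out its termination-detection algorithm, but one well-known way to detect the end of a communication round with dynamic message counts is the nonblocking-consensus (NBX) pattern of Hoefler et al., sketched here with mpi4py (our illustration of the general technique, not the paper's pthread-based implementation):

```python
# NBX-style termination detection for one communication round, via mpi4py.
# Synchronous sends guarantee that a completed send was actually received,
# so once all sends complete and everyone passes the nonblocking barrier,
# no messages remain in flight.
from mpi4py import MPI

comm = MPI.COMM_WORLD
rank = comm.Get_rank()
size = comm.Get_size()
TAG = 7

# Each rank sends one message to its right neighbour with a synchronous send.
send_req = comm.issend({"from": rank}, dest=(rank + 1) % size, tag=TAG)

received = []
barrier = None
while True:
    status = MPI.Status()
    if comm.iprobe(source=MPI.ANY_SOURCE, tag=TAG, status=status):
        received.append(comm.recv(source=status.Get_source(), tag=TAG))
    if barrier is None:
        if send_req.Test():            # our send has been matched
            barrier = comm.Ibarrier()  # join the nonblocking barrier
    elif barrier.Test():               # everyone joined: round is over
        break

print(f"rank {rank} finished the round with {len(received)} message(s)")
```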
Writing a correct parallel program is difficult; writing a highly modular parallel program that performs well in a multiprogrammed environment is even more so. Intel Threading Building Blocks (Intel TBB), a key component of Intel Parallel Building Blocks, is a widely used C++ template library that helps developers achieve this goal. The Intel TBB task scheduler uses a process-wide thread pool to establish a composable execution environment that balances load while quickly adapting to changes in resource availability. Building on top of the task scheduler, the library implements prepackaged, highly tuned algorithms for frequently used parallel idioms. It also provides several concurrent containers and useful low-level synchronization constructs to help developers safely and efficiently manage their parallel application's data.
We present TProf, an energy profiling tool for OpenMP-like task-parallel programs. To compute the energy consumed by each task in a parallel application, TProf dynamically traces the parallel execution and uses a novel technique to estimate the per-task energy consumption. To achieve this estimation, TProf apportions the total processor energy among cores, overcoming a limitation of existing approaches that would otherwise make such parallel accounting impossible. We demonstrate the value of TProf by characterizing a set of task-parallel programs, where we find that data locality, memory access patterns and task working sets are responsible for significant variance in energy consumption between seemingly homogeneous tasks. In addition, we identify opportunities for fine-grain energy optimization by applying per-task Dynamic Voltage and Frequency Scaling (DVFS). © 2014 Published by Elsevier Inc.
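The per-task accounting problem is easy to illustrate: on Linux machines with Intel RAPL, the package-level energy counter can be read around a task's execution and apportioned among cores. The sketch below uses the standard powercap sysfs path; the even-split apportioning rule is a deliberately naive stand-in, not the paper's estimation technique:

```python
# Naive per-task energy attribution via the Linux powercap/RAPL counter.
# Reading the counter may require elevated permissions. The even split
# among cores is a placeholder for a real apportioning technique.
import os

RAPL = "/sys/class/powercap/intel-rapl:0/energy_uj"  # package-0 counter

def read_energy_uj() -> int:
    with open(RAPL) as f:
        return int(f.read())

def profile_task(task, *args):
    """Run a task and return (result, estimated joules attributed per core)."""
    ncores = os.cpu_count() or 1
    before = read_energy_uj()
    result = task(*args)
    after = read_energy_uj()
    # The counter wraps around; wrap handling is omitted in this sketch.
    package_joules = (after - before) / 1e6
    return result, package_joules / ncores  # naive even split among cores

def busy_work(n: int) -> int:
    return sum(i * i for i in range(n))

if __name__ == "__main__":
    result, joules = profile_task(busy_work, 10_000_000)
    print(f"task result={result}, ~{joules:.3f} J attributed to this core")
```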
Author: Merigot, A. (Univ Paris 11, CNRS URA 22, Integrated Circuits & Systems Architecture Group, Fundamental Electronics Institute, Orsay, France)
This paper presents a new parallel computing model called Associative Nets. The model relies on basic primitives called associations, which apply an associative operator over the connected components of a subgraph of the physical interprocessor connection graph. Associations can be implemented very efficiently (in terms of hardware cost and processing time) thanks to asynchronous computation. The model is quite effective for image analysis and several other fields; as an example, graph processing algorithms are presented. While relying on a much simpler architecture, these algorithms have, in general, a complexity equivalent to that obtained with more expensive computing models, such as the PRAM model.
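The association primitive is easy to emulate in software: folding an associative, commutative operator (min, below) over each connected component of a subgraph amounts to connected-component labelling by iterated local reduction. A compact Python sketch (ours, not the paper's asynchronous hardware realization):

```python
# Software emulation of an "association": fold an associative operator
# (min) over each connected component of an undirected subgraph. The
# Associative Nets hardware converges asynchronously; here we iterate
# synchronously until a fixed point is reached.

def associate_min(values, edges):
    """values: dict node -> int; edges: iterable of (u, v) pairs.
    Returns dict node -> min over that node's connected component."""
    result = dict(values)
    changed = True
    while changed:
        changed = False
        for u, v in edges:
            m = min(result[u], result[v])
            if result[u] != m or result[v] != m:
                result[u] = result[v] = m
                changed = True
    return result

# Two components: {0, 1, 2} and {3, 4}. Using node ids as values, every
# node learns the smallest id in its component, i.e. a component label.
values = {n: n for n in range(5)}
edges = [(0, 1), (1, 2), (3, 4)]
print(associate_min(values, edges))  # {0: 0, 1: 0, 2: 0, 3: 3, 4: 3}
```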
We present ***, a lightweight, asynchronous task execution framework targeting the Python programming language and using the Message Passing Interface (MPI) for interprocess communication. *** follows the interface of the *** package from the Python standard library and can be used as its drop-in replacement, while allowing applications to scale over multiple compute nodes. We discuss the design, implementation, and feature set of *** and compare its performance to other solutions on both shared and distributed memory architectures. On a shared-memory system, we show *** to consistently outperform Python's ***, with speedup ratios between 1.4X and 3.7X in throughput (tasks per second) and between 1.9X and 2.9X in bandwidth. On a Cray XC40 system, we compare *** to Dask, a well-known Python parallel computing package. Although we note more varied results, we show *** to outperform Dask in most scenarios.
The use of the Python programming language for scientific computing has been gaining momentum in recent years. Its compact, readable syntax and its comprehensive set of scientific libraries are two important characteristics that favour its adoption. Nevertheless, Python still lacks a solution for easily parallelizing generic scripts on distributed infrastructures, since the current alternatives mostly require the use of APIs for message passing or are restricted to embarrassingly parallel computations. In that sense, this paper presents PyCOMPSs, a framework that facilitates the development of parallel computational workflows in Python. In this approach, the user programs her script in a sequential fashion and decorates the functions to be run as asynchronous parallel tasks. A runtime system is in charge of exploiting the inherent concurrency of the script, detecting the data dependencies between tasks and spawning them on the available resources. Furthermore, we show how this programming model can be built on top of a Big Data storage architecture, where the data stored in the backend is abstracted and accessed from the application in the form of persistent objects.
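The decorate-and-run style is best seen in a tiny example. The sketch below uses the PyCOMPSs task decorator and synchronization call as commonly documented; treat the exact imports as an assumption to verify against the installed PyCOMPSs version:

```python
# Minimal PyCOMPSs-style script: sequential-looking code whose decorated
# functions run as asynchronous parallel tasks. Imports follow the commonly
# documented PyCOMPSs API; verify them against your installed version.
from pycompss.api.task import task
from pycompss.api.api import compss_wait_on

@task(returns=int)
def square(x):
    return x * x

@task(returns=int)
def add(a, b):
    return a + b

def main():
    # Each call returns immediately with a future; the runtime builds the
    # task graph from data dependencies and schedules tasks on resources.
    partials = [square(i) for i in range(8)]
    total = partials[0]
    for p in partials[1:]:
        total = add(total, p)
    total = compss_wait_on(total)  # synchronize on the final result
    print("sum of squares:", total)

if __name__ == "__main__":
    main()
```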