检索结果-内蒙古大学图书馆

proceedings - 2022 IEEE 36th international parallel and distributed Processing symposium Workshops, IPDPSW 2022 2022年 1127-1128页

作者： Korah, John Santos, Eunice E. California State Polytechnic University Pomona United States University of Illinois Urbana-Champaign United States

来源：评论

学校读者我要写书评

暂无评论

Nonblocking execution in GraphBLAS

Nonblocking execution in GraphBLAS

引用

IEEE international symposium on parallel and distributed Processing Workshops and Phd Forum (IPDPSW)

作者： Aristeidis Mastoras Sotiris Anagnostidis Albert– Jan N. Yzelman Computing Systems Laboratory Zurich Research Center Huawei Technologies Switzerland Data Analytics Laboratory Department of Computer Science ETH Zurich Switzerland

ISBN: (数字)9781665497473

ISBN: (纸本)9781665497480

GraphBLAS is a recent standard that allows the expression of graph algorithms in the language of linear algebra and enables automatic code parallelization and optimization. GraphBLAS operations are executed either in blocking or in non-blocking mode. Although there exist multiple implementations of GraphBLAS for efficient blocking execution on both shared-and distributed-memory systems, none of these implementations supports full nonblocking execution to improve data locality. In this paper, we present a preliminary evaluation for two algorithms, Pagerank and Conjugate Gradient, that confirms the importance of nonblocking execution, by showing promising speedups over the corresponding blocking execution.

关键词： Analytical models distributed processing Heuristic algorithms Instruction sets distributed databases Manuals Linear algebra

来源：评论

学校读者我要写书评

暂无评论

parallel Simulation of Stochastic Reward Nets using Theatre 25

Parallel Simulation of Stochastic Reward Nets using Theatre

引用

25th IEEE/ACM international symposium on distributed Simulation and Real Time Applications (DS-RT)

作者： Cicirelli, Franco Nigro, Libero CNR Inst High Performance Comp & Networking ICAR Natl Res Council Italy I-87036 Arcavacata Di Rende CS Italy Univ Calabria DIMES Dept Informat Modelling Elect & Syst Sci I-87036 Arcavacata Di Rende CS Italy

ISBN: (纸本)9781665433266

This work aims at the development of tools for supporting modelling and analysis of timed systems by Stochastic Reward Nets (SRN). In a first approach it was proposed and experimented a formal reduction of SRN over Timed Automata (TA) in the context of the Uppaal popular toolbox. The reduction has the merit to allow both exhaustive model checking of an SRN model, useful for the assessment of qualitative properties (e.g., absence of deadlocks, occurrence of particular event sequences etc.), and quantitative analysis through the statistical model checker, which is based on simulations. However, although Uppaal enabled formal reasoning on the semantics of SRN, its practical usage suffers of scalability problems, that is it can introduce severe limitations in time and space when studying complex models. To cope with this problem, this paper describes a Java implementation of the SRN operational core engine, using the lock-free and efficient Theatre actor system which permits the parallel simulation of large models. The realization can be used for functional property checking on an untimed version of a source SRN model, and quantitative estimation of measurables through simulations. The paper discusses the design and implementation of the core engine of SRN on top of Theatre, together with supported intuitive configuration process of an SRN model, and reports some experimental results using a scalable grid computing model. The experiments confirm Theatre/SRN are capable of exploiting the potential of modern multi-core machines and can deliver good execution performances on large models.

关键词： Stochastic Reward Nets performability analysis actors high-performance computing Theatre Java

来源：评论

学校读者我要写书评

暂无评论

Productive Programming of distributed systems with the SHAD C++ Library 21

Productive Programming of Distributed Systems with the SHAD ...

引用

30th international symposium on High-Performance parallel and distributed Computing, HPDC 2021

作者： Castellana, Vito Giovanni Minutoli, Marco Pacific Northwest National Laboratory RichlandWA United States

ISBN: (纸本)9781450382175

High-performance computing (HPC) is often perceived as a matter of making large-scale systems (e.g., clusters) run as fast as possible, regardless the required programming effort. However, the idea of "bringing HPC to the masses"has recently emerged. Inspired by this vision, we have designed SHAD, the Scalable High-performance Algorithms and Data-structures library [1][6]. SHAD is open source software, written in C++, for C++ developers. Unlike other HPC libraries for distributed systems, which rely on SPMD models, SHAD adopts a shared-memory programming abstraction, to make C++ programmers feel at home. Underneath, SHAD manages tasking and data-movements, moving the computation where data resides and taking advantage of asynchrony to tolerate network latency. At the bottom of his stack, SHAD can interface with multiple runtime systems: this not only improves developer's productivity, by hiding the complexity of such software and of the underlying hardware, but also greatly enhance code portability. Thanks to its abstraction layers, SHAD can indeed target different systems, ranging from laptops to HPC clusters, without any need for modifying the user-level *** have prototyped and open-sourced the implementation of (a subset of) the C++ standard library (STL) targeting multi-node HPC clusters. Our work allows plain STL-based C++ code to scale on HPC systems, with no need for rewriting the code to exploit the complex hardware. SHAD is available under Apache v2 License at https://***/pnnl/SHAD. In this paper we overview the design of the SHAD library, depicting its main components: runtime systems abstractions for tasking;parallel and distributed data-structures;STL-compliant interfaces and algorithms. © 2020 Copyright is held by the owner/author(s).

关键词： parallel programming

来源：评论

学校读者我要写书评

暂无评论

A PORTABLE APPROACH TO INTEGRATING DIVERSE GEO-SCIENCE DATA USING STARE-AWARE databases AND TRANSITIONING TO CLOUD

A PORTABLE APPROACH TO INTEGRATING DIVERSE GEO-SCIENCE DATA ...

引用

IEEE international Geoscience and Remote Sensing symposium (IGARSS)

作者： Rilee, Michael L. Kuo, Kwo-Sen Griessbaum, Niklas Frew, James Gallagher, James NASA Goddard Space Flight Ctr Greenbelt MD 20771 USA Rilee Syst Technol LLC Derwood MD USA Bayesics LLC Bowie MD USA Univ Calif Santa Barbara Santa Barbara CA USA OPeNDAP Inc Narragansett RI USA

ISBN: (纸本)9781665403696

Big Data technologies such as Cloud and parallel distributed computing and storage are necessary to treat Earth Science data volume. Yet the great diversity of Earth Science data renders it nearly impossible to organize that data on scalable platforms without costly data movement or undesired interpolation that straitjackets scientific research. The SpatioTemporal Adaptive Resolution Encoding (STARE) is an alternative geolocation and indexing scheme for harmonizing data for integrative analysis on scalable systems. STARE uses a hierarchical, recursive partitioning of space and time in which the index or coordinates of each node are integers from the same index space, usually allowing quick comparison without floating-point calculation. STARE is well suited to provide a unifying geo-semantics for arranging data in databases. In this work, we outline the technical principles underlying STARE and its application to SQLite as an example. The STARELite STARE-aware lightweight geo-database can be used to catalogue diverse data for geographical querying and integration on local resources and Cloud.

关键词： STARE SQLite Geodatabase Cloud

来源：评论

学校读者我要写书评

暂无评论

ESSA 2022 Invited Speaker: The Curious Incident of the Data in the Scientific Workflow

ESSA 2022 Invited Speaker: The Curious Incident of the Data ...

引用

IEEE international symposium on parallel and distributed Processing Workshops and Phd Forum (IPDPSW)

作者： Lavanya Ramakrishnan Lawrence Berkeley National Laboratory

ISBN: (数字)9781665497473

ISBN: (纸本)9781665497480

The volume, veracity, and velocity of data generated by the accelerators, colliders, supercomputers, light sources and neutron sources have grown exponentially in the last decade. Data has fundamentally changed the scientific workflow running on high performance computing (HPC) systems. It is necessary that we develop appropriate capabilities and tools to understand, analyze, preserve, share, and make optimal use of data. Intertwined with data are complex human processes, policies and decisions that need to be accounted for when building software tools. In this talk, I will outline our work addressing data lifecycle challenges on HPC systems including effective use of storage hierarchy, managing complex scientific data processing, and enabling search on large-scale scientific data.

关键词： Software tools distributed processing distributed databases Conferences Buildings Supercomputers Neutrons

来源：评论

学校读者我要写书评

暂无评论

The First international Workshop on COmputing using EmeRging EXotic AI-Inspired systems (CORtEX'22)

Proceedings - 2022 IEEE 36th International Parallel and Dist...

引用

proceedings - 2022 IEEE 36th international parallel and distributed Processing symposium Workshops, IPDPSW 2022 2022年 1235-1236页

作者： Podobas, Artur Drozd, Aleksandr Drozd, Aleksandr Devereux, Barry KTH Royal Institute of Technology Sweden Riken Center for Computational Science Japan University of Tennessee United States Queen's University Belfast United Kingdom

来源：评论

学校读者我要写书评

暂无评论

DLion: Decentralized distributed Deep Learning in Micro-Clouds 21

DLion: Decentralized Distributed Deep Learning in Micro-Clou...

引用

30th international symposium on High-Performance parallel and distributed Computing (HPDC)

作者： Hong, Rankyung Chandra, Abhishek Univ Minnesota Minneapolis MN 55455 USA

ISBN: (纸本)9781450382175

Deep learning (DL) is a popular technique for building models from large quantities of data such as pictures, videos, messages generated from edges devices at rapid pace all over the world. It is often infeasible to migrate large quantities of data from the edges to centralized data center(s) over WANs for training due to privacy, cost, and performance reasons. At the same time, training large DL models on edge devices is infeasible due to their limited resources. An attractive alternative for DL training distributed data is to use micro-clouds-small-scale clouds deployed near edge devices in multiple locations. However, micro-clouds present the challenges of both computation and network resource heterogeneity as well as dynamism. In this paper, we introduce DLion, a new and generic decentralized distributed DL system designed to address the key challenges in micro-cloud environments, in order to reduce overall training time and improve model accuracy. We present three key techniques in DLion: (1) Weighted dynamic batching to maximize data parallelism for dealing with heterogeneous and dynamic compute capacity, (2) Per-link prioritized gradient exchange to reduce communication overhead for model updates based on available network capacity, and (3) Direct knowledge transfer to improve model accuracy by merging the best performing model parameters. We build a prototype of DLion on top of TensorFlow and show that DLion achieves up to 4.2x speedup in an Amazon GPU cluster, and up to 2x speed up and 26% higher model accuracy in a CPU cluster over four state-of-the-art distributed DL systems.

关键词： Edge computing Deep learning Micro-clouds Resource allocation

来源：评论

学校读者我要写书评

暂无评论

WeTune: Automatic Discovery and Verification of Query Rewrite Rules 22

WeTune: Automatic Discovery and Verification of Query Rewrit...

引用

international Conference on Management of Data (SIGMOD)

作者： Wang, Zhaoguo Zhou, Zhou Yang, Yicun Ding, Haoran Hu, Gansen Ding, Ding Tang, Chuzhe Chen, Haibo Li, Jinyang Shanghai Jiao Tong Univ Inst Parallel & Distributed Syst Shanghai Peoples R China Minist Educ Engn Res Ctr Domain Specif Operating Syst Beijing Peoples R China NYU Dept Comp Sci New York NY 10003 USA

ISBN: (纸本)9781450392495

Query rewriting transforms a relational database query into an equivalent but more efficient one, which is crucial for the performance of database-backed applications. Such rewriting relies on pre-specified rewrite rules. In existing systems, these rewrite rules are discovered through manual insights and accumulate slowly over the years. In this paper, we present WETUNE, a rule generator that automatically discovers new rewrite rules. Inspired by compiler super-optimization, WETUNE enumerates all valid logical query plans up to a certain size and tries to discover equivalent plans that could potentially lead to more efficient rewrites. The core challenge is to determine which set of conditions (aka constraints) allows one to prove the equivalence between a pair of query plans. We address this challenge by enumerating combinations of "interesting" constraints that relate tables and their attributes between each pair of queries. We also propose a new SMT-based verifier to verify the equivalence of a query pair under different enumerated constraints. To evaluate the usefulness of rewrite rules discovered by WETUNE, we apply them on the SQL queries collected from the 20 most popular open-source web applications on GitHub. WETUNE successfully optimizes 247 queries that existing databases cannot optimize, resulting in substantial performance improvements.

关键词： Query Rewriting Rewrite Rule Discovery SQL Solver

来源：评论

学校读者我要写书评

暂无评论

File System Semantics Requirements of HPC Applications 21

File System Semantics Requirements of HPC Applications

引用

30th international symposium on High-Performance parallel and distributed Computing (HPDC)

作者： Wang, Chen Mohror, Kathryn Snir, Marc Univ Illinois Champaign IL 61820 USA Lawrence Livermore Natl Lab Livermore CA 94550 USA

ISBN: (纸本)9781450382175

Most widely-deployed parallel file systems (PFSs) implement POSIX semantics, which implies sequential consistency for reads and writes. Strict adherence to POSIX semantics is known to impede performance and thus several new PFSs with relaxed consistency semantics and better performance have been introduced. Such PFSs are useful provided that applications can run correctly on a PFS with weaker semantics. While it is widely assumed that HPC applications do not require strict POSIX semantics, to our knowledge there has not been systematic work to support this assumption. In this paper, we address this gap with a categorization of the consistency semantics guarantees of PFSs and develop an algorithm to determine the consistency semantics requirements of a variety of HPC applications. We captured the I/O activity of 17 representative HPC applications and benchmarks as they performed I/O through POSIX or I/O libraries and examined the metadata operations used and their file access patterns. From this analysis, we find that 16 of the 17 applications can utilize PFSs with weaker semantics.

关键词： consistency semantics parallel file system scientific applications

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：