The proceedings contain 13 papers. The topics discussed include: quantum algorithms and simulation for parallel and distributed quantum computing;tensor network circuit simulation at exascale;Illinois express quantum ...
ISBN:
(纸本)9781728186740
The proceedings contain 13 papers. The topics discussed include: quantum algorithms and simulation for parallel and distributed quantum computing;tensor network circuit simulation at exascale;Illinois express quantum network for distributing and controlling entanglement on metro-scale;exploring affine abstractions for qubit mapping;scalable programming workflows for validation of quantum computers;and mapping constraint problems onto quantum gate and annealing devices.
The proceedings contain 4 papers. The topics discussed include: distributing higher-dimensional simulations across compute systems: a widely distributed combination technique;benchmarking and extending SYCL hierarchic...
ISBN:
(纸本)9781665411325
The proceedings contain 4 papers. The topics discussed include: distributing higher-dimensional simulations across compute systems: a widely distributed combination technique;benchmarking and extending SYCL hierarchical parallelism;did the GPU obfuscate the load imbalance in my MPI simulation?;and PPIR: parallel pattern intermediate representation.
The proceedings contain 12 papers. The topics discussed include: performance evaluation of distributed networks for Internet of things;connectivity pattern analysis for virtual simulation design, based on high-perform...
The proceedings contain 12 papers. The topics discussed include: performance evaluation of distributed networks for Internet of things;connectivity pattern analysis for virtual simulation design, based on high-performance game analysis;computational study for improvement of aerodynamic performance of airfoil by changing various aerodynamic properties;an experimental design approach to IoT enabled smart parallel irrigation system using embedded microcontrollers;to develop, test and record a 3 lead EMG electrode and flex sensor on a 3D prosthetic limb with different gait patterns using Arduino microcontroller;and low-cost system for the management of hospital services, applied to hospitalized patients through the use of IoT technology.
Peachy parallel Assignments are high-quality assignments for teaching parallel and distributed computing. They are selected competitively for presentation at the Edu* workshops. All of the assignments have been succes...
详细信息
ISBN:
(纸本)9780738143057
Peachy parallel Assignments are high-quality assignments for teaching parallel and distributed computing. They are selected competitively for presentation at the Edu* workshops. All of the assignments have been successfully used in class and they are selected based on the their ease of adoption by other instructors and for being cool and inspirational to students. This paper presents a paper-and-pencil assignment asking students to analyze the performance of different system configurations and an assignment in which students parallelize a simulation of the evolution of simple living organisms.
This paper describes the design of a simulation model of the infrastructure for data processing from the Super Charm-Tau factory class "megasience" electron-positron collider. The model simulates the behavio...
详细信息
parallel computing systems based on reconfigurable accelerators are becoming (1) increasingly heterogeneous, (2) difficult to design and (3) complex to model. Such modeling of a parallel computing system helps to eval...
详细信息
ISBN:
(纸本)9781728184661
parallel computing systems based on reconfigurable accelerators are becoming (1) increasingly heterogeneous, (2) difficult to design and (3) complex to model. Such modeling of a parallel computing system helps to evaluate its performance and to improve its architecture before prototyping. This paper presents a simulation tool aiming to study the integration of reconfigurable accelerators in scalable distributed systems and runtimes, such as S-DSM systems, where S-DSM (software-distributed shared memory) is a paradigm to ease data management among distributed nodes. This tool allows us to simulate the execution of irregular compute kernels accessing distributed data. To deal with the complexity of modeling (3) the complete system we used a hybrid methodology. We integrated the simulation engine into the S-DSM. The distributed data management part is executed on the physical architecture allowing to generate precise and faithful latencies, and the accelerator simulation is cycle accurate. We used general sparse matrix-matrix multiplication (SpGEMM) as a case study. We show that the use of this tool makes it possible to analyze the behavior of an heterogeneous system (1) with rapid prototyping and simulation. The analysis of the results allowed to determine the correct sizing of the architecture (2) to obtain the best performance. The tool allowed to identify the bottleneck of our architecture and confirmed the possibility of hiding data access latencies. Our simulation platform allows to emulate a heterogeneous distributed system by introducing a slowdown between 1.2 and 3.7 times compared to the compute kernel simulation alone.
distributed agent-based modeling (ABM) on high-performance computing resources provides the promise of capturing unprecedented details of large-scale complex systems. However, the specialized knowledge required for de...
详细信息
ISBN:
(纸本)9780738110868
distributed agent-based modeling (ABM) on high-performance computing resources provides the promise of capturing unprecedented details of large-scale complex systems. However, the specialized knowledge required for developing such ABMs creates barriers to wider adoption and utilization. Here we present our experiences in developing an initial implementation of Repast4Py, a Python-based distributed ABM toolkit. We build on our experiences in developing ABM toolkits, including Repast for High Performance Computing (Repast HPC), to identify the key elements of a useful distributed ABM toolkit. We leverage the Numba, NumPy, and PyTorch packages and the Python C-API to create a scalable modeling system that can exploit the largest HPC resources and emerging computing architectures.
A new motif that corresponds to the communication operations of the distributed LOBPCG eigensolver used in the Many-Fermion Dynamics-nuclear, or MFDn, code is constructed. The impact of communication strategy and proc...
详细信息
ISBN:
(纸本)9780738110486
A new motif that corresponds to the communication operations of the distributed LOBPCG eigensolver used in the Many-Fermion Dynamics-nuclear, or MFDn, code is constructed. The impact of communication strategy and process placement are evaluated on current and future architectures using the SST network simulation tool. simulation of the communication motif is validated against production runs on the Cori system at NERSC. We identify the strengths and shortcomings of SST in doing so.
We consider the problem of joint resource reservation in the backhaul and Radio Access Network (RAN) based on the user demand statistics and network availability. The goal is to maximize the sum of expected traffic fl...
详细信息
ISBN:
(纸本)9781728154787
We consider the problem of joint resource reservation in the backhaul and Radio Access Network (RAN) based on the user demand statistics and network availability. The goal is to maximize the sum of expected traffic flow rates, subject to link and radio node budget constraints, while minimizing the expected outage of wireless channels. The formulated problem turns out to be non-convex and difficult to solve to global optimality. We propose an efficient Block Coordinate Descent (BCD) algorithm to approximately solve the problem. To optimize with respect to each block of variables, a distributed decomposition approach is proposed. simulation results verify the efficiency and the efficacy of our proposed approach against two heuristic algorithms.
The proceedings contain 23 papers. The topics discussed include: GeoDRIVE, an HPC flexible platform for seismic applications;one-way wave equation migration of common-offset vector gathers: parallel multi CPU/GPU impl...
The proceedings contain 23 papers. The topics discussed include: GeoDRIVE, an HPC flexible platform for seismic applications;one-way wave equation migration of common-offset vector gathers: parallel multi CPU/GPU implementation;a checkpoint of research on the implementation of geophysical stencils on multicore platforms;saving FLOPs in geophysics with optimal p-adaptivity;alleviating the pressure on memory for seismic modeling;automated distributed-memory parallelism from symbolic specification in Devito;total takes the deep dive into GPU for seismic imaging;a GPGPU pipeline for fast synthesis of 3D seismic;incorporating lossless compression in parallel reservoir simulation;and digital twin of multiscale geological media: faults, fracture corridors, caves. Seismic simulation and imaging.
暂无评论