ISBN: 9798331506735 (print)
The proceedings contain 8 papers. The topics discussed include: user-level network programmability: a scalability study for data center infrastructure;a systematic literature review of I/O optimization in HPC and cloud computing environments;an instruction-set extension to support approximate multicore processors;the tracer files: cracking the case of performance impact in tracing Linux file I/O for I/O-intensive applications;Spinner: enhancing HPC experimentation with a streamlined parameter sweep tool;a flexible operational framework for energy profiling of programs;experimental study of power consumption of basic parallel programs;and leveraging cloud computing for stock market forecasting with reinforcement learning.
ISBN: 9781509048441 (print)
The proceedings contain 16 papers. The topics discussed include: outline of a thick control flow architecture;a dynamic load balance algorithm for the S4 parallel stream processing engine;a processor workload distribution algorithm for massively parallel applications;parallelism and scalability: a solution focused on the cloud computing processing service billing;task scheduling in sucuri dataflow library;synchronization-free automatic parallelization for arbitrarily nested affine loops;thread footprint analysis for the design of multithreaded applications and multicore systems;a hybrid parallel algorithm for the auction algorithm in multicore systems;and dataflow to hardware synthesis framework on FPGAs.
ISBN: 9798350381603 (print)
The proceedings contain 18 papers. The topics discussed include: exploring federated learning to trace depression in social media with language models;computing seismic attributes with deep-learning models;DASS: dynamic adaptive sub-target specialization;optimizing microservices performance and resource utilization through containerized grouping: an experimental study;assessing the performance of an architecture-aware optimization tool for neural networks;an exploratory study of deep learning for predicting computational tasks behavior in HPC systems;and energy consumption analysis of instruction cache prefetching methods.
ISBN: 9781665451574 (print)
The proceedings contain 9 papers. The topics discussed include: compiling files in parallel: a study with GCC;I/O performance of multiscale finite element simulations on HPC environments;an OpenMP-only linear algebra library for distributed architectures;implementing the broadcast operation in a distributed task-based runtime;homomorphic evaluation of large look-up tables for inference on human genome data in the cloud;towards a federated learning framework on a multi-cloud environment;energy-efficient online resource provisioning for cloud-edge platforms via multi-armed bandits;edge computing versus cloud computing: impact on retinal image pre-processing;and standalone data-center sizing combating the over-provisioning of the IT and electrical parts.
ISBN: 9781665417303 (print)
The proceedings contain 8 papers. The topics discussed include: a memory affinity analysis of scientific applications on NUMA platforms;an evaluation of Cassandra NoSQL database on a low-power cluster;offloading the training of an I/O access pattern detector to the cloud;selecting efficient VM types to train deep learning models on Amazon SageMaker;CLAP-BOT: a framework for automatic optimization of high-performance elastic applications on the clouds;towards optimizing computational costs of federated learning in clouds;quantifying and detecting HPC resource wastage in cloud environments;and a cloud-based batch processing system for loosely-coupled applications.
ISBN: 9781538648193 (print)
The proceedings contain 19 papers. The topics discussed include: energy consumption improvement of shared-cache multicore clusters based on explicit simultaneous multithreading;performance and energy analysis of OpenMP runtime systems with dense linear algebra algorithms;a case study of performance optimization in a heterogeneous environment;tuning up TVD HOPMOC method on Intel MIC Xeon Phi architectures with Intel parallel studio tools;comparing performance of C compilers optimizations on different multicore architectures;HPSM: a programming framework for multi-CPU and multi-GPU systems;assessing sparse triangular linear system solvers on GPUs;automatic partitioning of stencil computations on heterogeneous systems;strategies to improve the performance of a geophysics model for different Manycore systems;parallel algorithm for dynamic community detection;efficient in-situ quantum computing simulation of Shor's and Grover's algorithms;a parallel algorithm for minimum spanning tree on GPU;acceleration of cellular automata through parallel computing with OpenCL;a dataflow implementation of region growing method for cracks segmentation;automatic scan parallelization in OpenMP;impact of version management for transactional memories on phase-change memories;efficient Pathfinding co-processors for FPGAs;and a communication protocol for fog computing based on network coding applied to wireless sensors.
ISBN: 9780769542768 (print)
The proceedings contain 12 papers. The topics discussed include: ring pipelined algorithm for the algebraic path problem on the CELL broadband engine;performance evaluation of optimized implementations of finite difference method for wave propagation problems on GPU architecture;exploring data streaming to improve 3D FFT implementation on multiple GPUs;effective dynamic scheduling on heterogeneous multi/manycore desktop platforms;towards a power-aware application level scheduler for a multithreaded runtime environment;I/O performance evaluation on multicore clusters with atmospheric model environment;OpenMP-based parallel algorithms for solving Kronecker descriptors;parallel implementations of an immune network model using POSIX threads and OpenMP;and parallel implementation of a computational model of the HIS using OpenMP and MPI.
ISBN: 9798331506476 (print)
The proceedings contain 118 papers. The topics discussed include: ChameleonEC: exploiting tunability of erasure coding for low-interference repair;DPUaudit: DPU-assisted pull-based architecture for near-zero cost system auditing;delinquent loop pre-execution using predicated helper threads;architecting value prediction around in-order execution;efficient optimization with encoded Ising models;LegoZK: a dynamically reconfigurable accelerator for zero-knowledge proof;reuse-aware compilation for zoned quantum architectures based on neutral atoms;HATT: Hamiltonian adaptive ternary tree for optimizing fermion-to-qubit mapping;QuCLEAR: Clifford extraction and absorption for quantum circuit optimization;and gaze into the pattern: characterizing spatial patterns with internal temporal correlations for hardware prefetching.