The proceedings contain 267 papers. The topics discussed include: network delay-aware load balancing in selfish and cooperative distributed systems;an analysis framework for investigating the trade-offs between system performance and energy consumption in a heterogeneous computing environment;scheduling tightly-coupled applications on heterogeneous desktop grids;an on-chip heterogeneous implementation of a general sparse linear solver;seeds for a heterogeneous interconnect;architecture exploration of high-performance floating-point fused multiply-add units and their automatic use in high-level synthesis;a flexible memory controller supporting deep belief networks with fixed-point arithmetic;hardware supported adaptive data collection for networks on chip;automated partitioning for partial reconfiguration design of adaptive systems;and cross-architectural study of custom reconfigurable devices using crowdsourcing.
ISBN: 9781538655559 (print)
The proceedings contain 145 papers. The topics discussed include: user-transparent translation of machine instructions to programmable hardware;approximation algorithm for scheduling applications on hybrid multi-core machines with communications delays;large scale data centers simulation based on baseline test model;application performance on a cluster-booster system;transport-triggered soft cores;robustness of surface EMG classifiers with fixed-point decomposition on reconfigurable architecture;streaming architecture for large-scale quantized neural networks on an FPGA-based dataflow platform;high-level reliability evaluation of reconfiguration-based fault tolerance techniques;dynamic reconfiguration for real-time automotive embedded systems in fail-operational context;and rerooting trees increases opportunities for concurrent computation and results in markedly improved performance for phylogenetic inference.
ISBN: 9798350329223 (print)
The proceedings contain 153 papers. The topics discussed include: transaction data management optimization based on multi-partitioning in blockchain systems;semi-asynchronous federated learning optimized for non-IID data communication based on tensor decomposition;HKTGNN: hierarchical knowledge transferable graph neural network-based supply chain risk assessment;DQR-TTS: semi-supervised text-to-speech synthesis with dynamic quantized representation;deep reinforcement learning-based network moving target defense in DPDK;iNUMAlloc: towards intelligent memory allocation for AI accelerators with NUMA;and predictive queue-based low latency congestion detection in data center networks.
ISBN: 0769523129 (print)
The proceedings contain 436 papers. The topics discussed include: exploiting barriers to optimize power consumption of CMPs;PDM sorting algorithms that take a small number of passes;a highly parallel algorithm for the numerical simulation of unsteady diffusion processes;functionality distribution for parallel rendering;effective instruction prefetching via fetch prestaging;enhanced parallel processing in wide registers;asynchronous complete distributed garbage collection;scheduling algorithms for effective thread pairing on hybrid multiprocessors;practical divisible load scheduling on grid platforms with APST-DV;parallelizing a defect detection and categorization application;data redistribution and remote method invocation in parallel component architectures;and runtime empirical selection of loop schedulers on hyperthreaded SMPs.
ISBN: 9780769546766 (print)
The proceedings contain 323 papers. The topics discussed include: experiences with the sparse matrix-vector multiplication on a many-core processor;performance benefits of heterogeneous computing in HPC workloads;thermal-aware performance optimization in power constrained heterogenous data centers;experiences with target-platform heterogeneity in clouds, grids, and on-premises resources;blor: bandwidth and latency sensitive overlay routing for flash data dissemination;scheduling batch and heterogeneous jobs with runtime elasticity in a parallel processing environment;task scheduling in large-scale distributed systems utilizing partial reconfigurable processing elements;mixed data-parallel scheduling for distributed continuous integration;a Monte-Carlo approach for full-ahead stochastic DAG scheduling;a block-asynchronous relaxation method for graphics processing units;and partitioning for parallel matrix-matrix multiplication with heterogeneous processors: the optimal solution.
ISBN: 9783031226977 (print)
The proceedings contain 12 papers. The special focus in this conference is on Job Scheduling Strategies for Parallel Processing. The topics include: Optimization of Execution Parameters of Moldable Ultrasound Workflows Under Incomplete Performance Data;Scheduling of Elastic Message Passing Applications on HPC Systems;preface;on the Feasibility of Simulation-Driven Portfolio Scheduling for Cyberinfrastructure Runtime Systems;Improving Accuracy of Walltime Estimates in PBS Professional Using Soft Walltimes;re-making the Movie-Making Machine;using Kubernetes in Academic Environment: Problems and Approaches;AI-Job Scheduling on Systems with Renewable Power Sources;Toward Building a Digital Twin of Job Scheduling and Power Management on an HPC System;encoding for Reinforcement Learning Driven Scheduling.
ISBN: 9781538634080 (print)
The proceedings contain 173 papers. The topics discussed include: portable implementation of advanced driver-assistance algorithms on heterogeneous architectures;improving CPU performance through dynamic GPU access throttling in CPU-GPU heterogeneous processors;alternative processor within threshold: flexible scheduling on heterogeneous systems;preemptive resource management for dynamically arriving tasks in an oversubscribed heterogeneous computing system;modeling of applications and hardware to explore task mapping and scheduling strategies on a heterogeneous micro-server system;consumer-and-provider-oriented efficient IaaS resource allocation;a pipelined and scalable dataflow implementation of convolutional neural networks on FPGA;on-chip memory based binarized convolutional deep neural network applying batch normalization free technique on an FPGA;automatic flow selection and quality-of-result estimation for FPGA placement;exploiting decoupled OpenCL work-items with data dependencies on FPGAs: a case study;ReEP: a toolset for generation and programming of reconfigurable datapaths for event processing;a generic approach to the development of coprocessors for elliptic curve cryptosystems;a hardware acceleration for surface EMG non-negative matrix factorization;and on-FPGA real-time processing of biological signals from high-density MEAs: a design space exploration.
ISBN: 354067442X (print)
The proceedings contain 179 papers. The special focus in this conference is on Personal Computer Based Networks of Workstations, Advances in Parallel, Distributed Computational Models and Video Processing. The topics include: MPI collective operations over IP multicast;an open market-based architecture for distributed computing;the multicluster model to the integrated use of multiple workstation clusters;parallel information retrieval on an SCI-based PC-NOW;a PC-NOW based parallel extension for a sequential DBMS;the heterogeneous bulk synchronous parallel model;a new computation of shape moments via quadtree decomposition;a Java applet to visualize algorithms on reconfigurable mesh;a hardware implementation of PRAM and its performance evaluation;a non-binary parallel arithmetic architecture;multithreaded parallel computer model with performance evaluation;a high performance microprocessor for multimedia computing;a novel superscalar architecture for fast DCT implementation;computing distance maps efficiently using an optical bus;advanced data layout optimization for multimedia applications;parallel parsing of MPEG video in a multi-threaded multiprocessor environment;parallelization techniques for spatial-temporal occupancy maps from multiple video streams;heuristic solutions for a mapping problem in a TV-anytime server network;a programming environment for real-time parallel vision;parallel low-level image processing on a distributed memory system;congestion-free routing of streaming multimedia content in BMIN-based parallel systems;performance of on-chip multiprocessors for vision tasks;specification techniques for automatic performance analysis tools and controlling distributed shared memory consistency from high level programming languages.
ISBN: 9781728112466 (print)
The proceedings contain 105 papers. The topics discussed include: coding the continuum;stochastic gradient descent on modern hardware: multi-core CPU or GPU? synchronous or asynchronous?;two elementary instructions make compare-and-swap;improving strong-scaling of CNN training by exploiting finer-grained parallelism;a scalable clustering-based task scheduler for homogeneous processors using DAG partitioning;an approach for parallel loading and pre-processing of unstructured meshes stored in spatially scattered fashion;computation of matrix chain products on parallel machines;optimal placement of in-memory checkpoints under heterogeneous failure likelihoods;and understanding the impact of dynamic power capping on application progress.
ISBN: 9781538639146 (print)
The proceedings contain 116 papers. The topics discussed include: parallel construction of suffix trees and the all-nearest-smaller-values problem;SWhybrid: a hybrid-parallel framework for large-scale protein sequence database search;PUNAS: a parallel ungapped-alignment-featured seed verification algorithm for next-generation sequencing read alignment;eliminating irregularities of protein sequence search on multicore architectures;communication optimization on GPU: a case study of sequence alignment algorithms;elastic-cache: GPU cache architecture for efficient fine- and coarse-grained cache-line management;content-aware non-volatile cache replacement;and adaptive software caching for efficient NVRAM data persistence.