The proceedings contain 118 papers. The topics discussed include: ChameleonEC: exploiting tunability of erasure coding for low-interference repair;DPUaudit: DPU-assisted pull-based architecture for near-zero cost syst...
ISBN:
(纸本)9798331506476
The proceedings contain 118 papers. The topics discussed include: ChameleonEC: exploiting tunability of erasure coding for low-interference repair;DPUaudit: DPU-assisted pull-based architecture for near-zero cost system auditing;delinquent loop pre-execution using predicated helper threads;architecting value prediction around in-order execution;efficient optimization with encoded Ising models;LegoZK: a dynamically reconfigurable accelerator for zero-knowledge proof;reuse-aware compilation for zoned quantum architectures based on neutral atoms;HATT: Hamiltonian adaptive ternary tree for optimizing fermion-to-qubit mapping;QuCLEAR: Clifford extraction and absorption for quantum circuit optimization;and gaze into the pattern: characterizing spatial patterns with internal temporal correlations for hardware prefetching.
The proceedings contain 54 papers. The topics discussed include: communication lower bound in convolution accelerators;delay and bypass: ready and criticality aware instruction scheduling in out-of-order processors;IR...
ISBN:
(纸本)9781728161495
The proceedings contain 54 papers. The topics discussed include: communication lower bound in convolution accelerators;delay and bypass: ready and criticality aware instruction scheduling in out-of-order processors;IRONHIDE: a secure multicore that efficiently mitigates microarchitecture state attacks for interactive applications;ResiRCA: a resilient energy harvesting ReRAM crossbar-based accelerator for intelligent embedded processors;EFLOPS: algorithm and system co-design for a highperformance distributed training platform;and QuickNN: memory and performance optimization of k-d tree based nearest neighbor search for 3D point clouds.
The proceedings contain 55 papers. The topics discussed include: power struggles: revisiting the RISC v CISC debate on contemporary ARM and x86 architectures;skinflint DRAM system: minimizing DRAM chip writes for low ...
ISBN:
(纸本)9781467355858
The proceedings contain 55 papers. The topics discussed include: power struggles: revisiting the RISC v CISC debate on contemporary ARM and x86 architectures;skinflint DRAM system: minimizing DRAM chip writes for low power;enabling distributed generation powered sustainable high-performance data center;store-load-branch (SLB) predictor: a compiler assisted branch prediction for data dependent branches;ReCaP: a region-based cure for the common cold (cache);application-to-core mapping policies to reduce memory system interference in multi-core systems;improving multi-core performance using mixed-cell cache architecture;ECM : effective capacity maximizer for high-performance compressed caching;a novel system architecture for web scale applications using lightweight CPUs and virtualized I/O;Runnemede: an architecture for ubiquitous high-performance computing;and architecture support for guest-transparent VM protection from untrusted hypervisor and physical attacks.
The proceedings contain 38 papers. The topics discussed include: MACAU: a Markov model for reliability evaluations of caches under single-bit and multi-bit upsets;booster: reactive core acceleration for mitigating the...
ISBN:
(纸本)9781467308243
The proceedings contain 38 papers. The topics discussed include: MACAU: a Markov model for reliability evaluations of caches under single-bit and multi-bit upsets;booster: reactive core acceleration for mitigating the effects of process variation and application imbalance in low-voltage chips;MORSE: multi-objective reconfigurable self-optimizing memory scheduler;TAP: a TLP-aware cache management policy for a CPU-GPU heterogeneous architecture;SCD: a scalable coherence directory with flexible sharer set encoding;quasi-nonvolatile SSD: trading flash memory nonvolatility to improve storage system performance for enterprise applications;decoupled dynamic cache segmentation;AgileRegulator: a hybrid voltage regulator scheme redeeming dark silicon for power efficiency in a multicore architecture;dynamically heterogeneous cores through 3D resource pooling;and Pacman: tolerating asymmetric data races with unintrusive hardware.
The proceedings contain 58 papers. The topics discussed include: locality-aware data replication in the last-level cache;FADE: a programmable filtering accelerator for instruction-grain monitoring;dynamically detectin...
ISBN:
(纸本)9781479930975
The proceedings contain 58 papers. The topics discussed include: locality-aware data replication in the last-level cache;FADE: a programmable filtering accelerator for instruction-grain monitoring;dynamically detecting and tolerating if-condition data races;exploiting thermal energy storage to reduce data center capital and operating expenses*;implications of high energy proportional servers on cluster-wide energy proportionality;strategies for anticipating risk in heterogeneous system design;stash directory: a scalable directory for many-core coherence;a non-inclusive memory permissions architecture for protection against cross-layer attacks;suppressing the oblivious ram timing channel while making information leakage and program efficiency trade-offs;and timing channel protection for a shared memory controller.
The proceedings contain 57 papers. The topics discussed include: NoMap: speeding-up JavaScript using hardware transactional memory;Darwin-WGA: a co-processor provides increased sensitivity in whole genome alignments w...
ISBN:
(纸本)9781728114446
The proceedings contain 57 papers. The topics discussed include: NoMap: speeding-up JavaScript using hardware transactional memory;Darwin-WGA: a co-processor provides increased sensitivity in whole genome alignments with high speedup;bingo spatial data prefetcher;POWERT channels: a novel class of covert communication exploiting power management vulnerabilities;stretch: balancing QoS and throughput for colocated server workloads on SMT cores;understanding the future of energy efficiency in multi-module GPUs;pliant: leveraging approximation to improve datacenter resource efficiency;early visibility resolution for removing ineffectual computations in the graphics pipeline;understanding the impact of socket density in density optimized servers;and eQASM: an executable quantum instruction set architecture.
The proceedings contain 85 papers. The topics discussed include: direct spatial implementation of sparse matrix multipliers for reservoir computing;CAMA: energy and memory efficient automata processing in content-addr...
ISBN:
(纸本)9781665420273
The proceedings contain 85 papers. The topics discussed include: direct spatial implementation of sparse matrix multipliers for reservoir computing;CAMA: energy and memory efficient automata processing in content-addressable memories;leaky frontends: security vulnerabilities in processor frontends;abusing cache line dirty states to leak information in commercial processors;cottage: coordinated time budget assignment for latency, quality and power optimization in web search;enabling efficient large-scale deep learning training with cache coherent disaggregated memory systems;Hercules: heterogeneity-aware inference serving for at-scale personalized recommendation;ANNA: specialized architecture for approximate nearest neighbor search;and hardware-accelerated hypergraph processing with chain-driven scheduling.
The proceedings contain 69 papers. The topics discussed include: mix and match: a novel FPGA-centric deep neural network quantization framework;BRIM: bistable resistively-coupled Ising machine;systematic approaches fo...
ISBN:
(纸本)9780738123370
The proceedings contain 69 papers. The topics discussed include: mix and match: a novel FPGA-centric deep neural network quantization framework;BRIM: bistable resistively-coupled Ising machine;systematic approaches for precise and approximate quantum state runtime assertion;DepGraph: a dependency-driven accelerator for efficient iterative graph processing;Chopin: scalable graphics rendering in multi-GPU systems via parallel image composition;new models for understanding and reasoning about speculative execution attacks;ultra-elastic CGRAs for irregular loop specialization;analyzing and leveraging decoupled L1 caches in GPUs;and operating liquid-cooled large-scale systems: long-term monitoring, reliability analysis, and efficiency measures.
The proceedings contain 78 papers. The topics discussed include: exploitation of security vulnerability on retirement;GadgetSpinner: a new transient execution primitive using the loop stream detector;uncovering and ex...
ISBN:
(纸本)9798350393132
The proceedings contain 78 papers. The topics discussed include: exploitation of security vulnerability on retirement;GadgetSpinner: a new transient execution primitive using the loop stream detector;uncovering and exploiting AMD speculative memory access predictors for fun and profit;Revet: a language and compiler for dataflow threads;an optimizing framework on MLIR for efficient FPGA-based accelerator generation;Celeritas: out-of-core based unsupervised graph neural network via cross-layer computing 2024;MEGA: a memory-efficient GNN accelerator exploiting degree-aware mixed-precision quantization;Gemini: mapping and architecture co-exploration for large-scale DNN Chiplet accelerators;STELLAR: energy-efficient and low-latency SNN algorithm and hardware co-design with spatiotemporal computation;and MIMDRAM: an end-to-end processing-using-DRAM system for high-throughput, energy-efficient and programmer-transparent multiple-instruction multiple-data computing.
暂无评论