The proceedings contain 49 papers. The topics discussed include: Semi-StructMG: a fast and scalable semi-structured algebraic multigrid;LibRTS: a spatial indexing library by ray tracing;high-performance visual semanti...
ISBN:
(纸本)9798400714436
The proceedings contain 49 papers. The topics discussed include: Semi-StructMG: a fast and scalable semi-structured algebraic multigrid;LibRTS: a spatial indexing library by ray tracing;high-performance visual semantics compression for AI-driven science;COMPSO: optimizing gradient compression for distributed training with second-order optimizers;TurboFFT: co-designed high-performance and fault-tolerant fast Fourier transform on GPUs;Helios: efficient distributed dynamic graph sampling for online GNN inference;triangle counting on tensor cores;AC-Cache: a memory-efficient caching system for small objects via exploiting access correlations;magneto: accelerating parallel structures in DNNsvia co-optimization of operators;and FlashSparse: minimizing computation redundancy for fast sparse matrix multiplications on tensor cores.
The proceedings contains 21 papers from the Fifth acmsigplansymposium on principles & practice of parallelprogramming PPOPP. Topics discussed include data parallel programs;data libraries;data caches;data acces...
详细信息
The proceedings contains 21 papers from the Fifth acmsigplansymposium on principles & practice of parallelprogramming PPOPP. Topics discussed include data parallel programs;data libraries;data caches;data access;distributed and shared memory multiprocessors;dataflow analysis;scheduling;optimization;and synchronization.
The proceedings contains 20 papers from the Conference on Proceedings of the 2003 acm sigplan symposium on principles and practice of parallel programming (PPOPP'03). The topics discussed include: using thread-lev...
详细信息
The proceedings contains 20 papers from the Conference on Proceedings of the 2003 acm sigplan symposium on principles and practice of parallel programming (PPOPP'03). The topics discussed include: using thread-level speculation to simplify manual parallelization;toward efficient and robust software speculative parallelization on multiprocessors;improving server software support for simultaneous multithreaded processors;programming the FlexRAM parallel intelligent memory system and automated application-level checkpointing of MPI programs.
The proceedings contain 21 papers. The topics discussed include: an empirical comparison of monitoring algorithms for access anomaly detection;modeling concurrency in parallel debugging;analysis of event synchronizati...
ISBN:
(纸本)0897913507
The proceedings contain 21 papers. The topics discussed include: an empirical comparison of monitoring algorithms for access anomaly detection;modeling concurrency in parallel debugging;analysis of event synchronization in a parallelprogramming tool;programming techniques for efficiently exploiting parallelism in logic programming languages;design and distributed implementation of the parallel logic language shared prolog;multi-model parallelprogramming in psyche;and real-time, concurrent checkpoint for parallel programs.
The proceedings contains 25 papers. Topics discussed include data and task parallelism, irregular applications, coherence protocols, shared memory, compilers and performances issue.
The proceedings contains 25 papers. Topics discussed include data and task parallelism, irregular applications, coherence protocols, shared memory, compilers and performances issue.
The proceedings contains 14 papers from the conference on the Proceedings of the acm sigplan symposium on principles and practice of parallel programming, PPOPP. Topics discussed include: reference idempotency analysi...
详细信息
The proceedings contains 14 papers from the conference on the Proceedings of the acm sigplan symposium on principles and practice of parallel programming, PPOPP. Topics discussed include: reference idempotency analysis: a framework for optimizing speculative execution;pointer and escape analysis for multithread programs;language support for motion-order matrices;efficient load balancing for wide-area divide-and-conquer applications;scalable queue-based spin locks with timeout;contention ellimination by replication of sequential sections in distributed shared memory programs;and accurate data redistribution cost estimation in software distributes shared memory systems.
The proceedings contain 26 papers. The topics discussed include: LogP: towards a realistic model of parallel computation;exploiting task and data parallelism on a multicomputer;ActorSpace: an open distributed programm...
ISBN:
(纸本)0897915895
The proceedings contain 26 papers. The topics discussed include: LogP: towards a realistic model of parallel computation;exploiting task and data parallelism on a multicomputer;ActorSpace: an open distributed programming paradigm;experiences using the ParaScope editor: an interactive parallelprogramming tool;perturbation analysis of high level instrumentation for SPMD programs;integrating message-passing and shared-memory: early experience;using scheduler information to achieve optimal barrier synchronization performance;and a concurrent copying garbage collector for languages that distinguish (im)mutable data.
The proceedings contain 21 papers. The topics discussed include: optimal schedules for parallel prefix computation with bounded resources;parallel-program transformation using a metalanguage;mapping concurrent program...
ISBN:
(纸本)0897913906
The proceedings contain 21 papers. The topics discussed include: optimal schedules for parallel prefix computation with bounded resources;parallel-program transformation using a metalanguage;mapping concurrent programs to VLIW Processors;a unified framework for systematic loop transformations;scanning polyhedra with DO loops;removal of redundant dependence in DOACROSS constant dependence;exploitation of APL data parallelism on a shared-memory MIMD machine;Andorra-I: a parallel prolog system that transparently exploits both and- and or-parallelism;and scalable reader- writer synchronization for shared-memory multiprocessors.
The symposium materials contain 26 papers covering the spectrum from models of parallel computing to implementation techniques, and from compilation algorithms to application development tools and case studies, thus s...
详细信息
ISBN:
(纸本)0897915895
The symposium materials contain 26 papers covering the spectrum from models of parallel computing to implementation techniques, and from compilation algorithms to application development tools and case studies, thus satisfying the goal of broadly covering the active areas of parallelprogramming research.
The proceedings contain 28 records. The topics discussed include: compiler techniques for high performance sequentially consistent Java programs;effective communication coalescing for data-parallel applications;a line...
详细信息
The proceedings contain 28 records. The topics discussed include: compiler techniques for high performance sequentially consistent Java programs;effective communication coalescing for data-parallel applications;a linear-time algorithm for optimal barrier placement;composable memory transactions;static analysis of atomicity for programs with non-blocking synchronization;revocable locks for non-blocking programming;automated type-based analysis of data races and atomicity;scaling model checking of dataraces using dynamic information;a novel approach for partitioning iteration spaces with variable densities;applications of synchronization coverage;and fault tolerant high performance computing by a coding approach.
暂无评论