The proceedings contain 6 papers. The topics discussed include: project 38: innovative architectures for high-performance computing systems;implementing performance portable graph algorithms using task-based execution...
ISBN:
(纸本)9781665411264
The proceedings contain 6 papers. The topics discussed include: project 38: innovative architectures for high-performance computing systems;implementing performance portable graph algorithms using task-based execution;greatly accelerated scaling of streaming problems with a migrating thread architecture;sparse exact factorization update;no more leaky PageRank;and towards scalable data processing in python with CLIPPy.
The proceedings contain 6 papers. The topics discussed include: page-address coalescing of vector gather instructions for efficient address translation;the evolution of a new model of computation;blocking sparse matri...
ISBN:
(纸本)9781665475068
The proceedings contain 6 papers. The topics discussed include: page-address coalescing of vector gather instructions for efficient address translation;the evolution of a new model of computation;blocking sparse matrices to leverage dense-specific multiplication;SparseLU, a novel algorithm and math library for Sparse LU factorization;compressed in-memory graphs for accelerating GPU-based analytics;and accelerating datalog applications with cuDF.
The proceedings contain 8 papers. The topics discussed include: accelerating domain propagation: an efficient GPU-parallel algorithm over sparse matrices;parallelizing irregular computations for molecular docking;redu...
ISBN:
(纸本)9780738110905
The proceedings contain 8 papers. The topics discussed include: accelerating domain propagation: an efficient GPU-parallel algorithm over sparse matrices;parallelizing irregular computations for molecular docking;reducing queuing impact in irregular data streaming applications;supporting irregularity in throughput-oriented computing by SIMT-SIMD integration;DistDGL: distributed graph neural network training for billion-scale graphs;labeled triangle indexing for efficiency gains in distributed interactive subgraph search;distributed memory graph coloring algorithms for multiple GPU;and performance evaluation of the vectorizable binary search algorithms on an FPGA platform.
Conference proceedings front matter may contain various advertisements, welcome messages, committee or program information, and other miscellaneous conference information. This may in some cases also include the cover...
Conference proceedings front matter may contain various advertisements, welcome messages, committee or program information, and other miscellaneous conference information. This may in some cases also include the cover art, table of contents, copyright statements, title-page or half title-pages, blank pages, venue maps or other general information relating to the conference that was part of the original conference proceedings.
The proceedings contain 11 papers. The topics discussed include: conveyors for streaming many-to-many communication;extending a work-stealing framework with priorities and weights;RDMA vs. RPC for implementing distrib...
ISBN:
(纸本)9781728159874
The proceedings contain 11 papers. The topics discussed include: conveyors for streaming many-to-many communication;extending a work-stealing framework with priorities and weights;RDMA vs. RPC for implementing distributed data structures;mixed-precision tomographic reconstructor computations on hardware accelerators;iPregel: strategies to deal with an extreme form of irregularity in vertex-centric graph processing;stretching jacobi: two-stage pivoting in block-based factorization;a hardware prefetching mechanism for vector gather instructions;and performance impact of memory channels on sparse and irregularalgorithms.
The proceedings contain 8 papers. The topics discussed include: a block-oriented, parallel and collective approach to sparse indefinite preconditioning on GPUs;software prefetching for unstructured mesh applications;t...
ISBN:
(纸本)9781728101866
The proceedings contain 8 papers. The topics discussed include: a block-oriented, parallel and collective approach to sparse indefinite preconditioning on GPUs;software prefetching for unstructured mesh applications;there are trillions of little forks in the road. choose wisely! - estimating the cost and likelihood of success of constrained walks to optimize a graph pruning pipeline;scale-free graph processing on a NUMA machine;a fast and simple approach to merge and merge sort using wide vector instructions;impact of traditional sparse optimizations on a migratory thread architecture;mix-and-match: a model-driven runtime optimization strategy for BFS on GPUs;and high-performance GPU implementation of PageRank with reduced precision based on mantissa segmentation.
暂无评论