We present ParGeo, a multicore library for computational geometry algorithms. We describe two of the algorithms from ParGeo, convex hull and the smallest enclosing ball, and present a short evaluation of all implement...
ISBN:
(纸本)9781450392044
We present ParGeo, a multicore library for computational geometry algorithms. We describe two of the algorithms from ParGeo, convex hull and the smallest enclosing ball, and present a short evaluation of all implementations currently in ParGeo.
This paper presents mathematical foundations for the design of a memory controller subcomponent that helps to bridge the processor/memory performance gap for applications with strided access patterns. The parallel Vec...
ISBN:
(纸本)9781581131857
This paper presents mathematical foundations for the design of a memory controller subcomponent that helps to bridge the processor/memory performance gap for applications with strided access patterns. The parallel Vector Access (PVA) unit exploits the regularity of vectors or streams to access them efficiently in parallel on a multi-bank SDRAM memory system. The PVA unit performs scatter/gather operations so that only the elements accessed by the application are transmitted across the system bus. Vector operations are broadcast in parallel to all memory banks, each of which implements an efficient algorithm to determine which vector elements it holds. Earlier performance evaluations have demonstrated that our PVA implementation loads elements up to 32.8 times faster than a conventional memory system and 3.3 times faster than a pipelined vector unit, without hurting the performance of normal cache-line fills. Here we present the underlying PVA algorithms for both word interleaved and cache-line inter-leaved memory systems.
The proceedings contain 65 papers. The topics discussed include: FireSim: FPGA-accelerated cycle-exact scale-out system simulation in the public cloud;PROMISE: an end-to-end design of a programmable mixed-signal accel...
ISBN:
(纸本)9781538659847
The proceedings contain 65 papers. The topics discussed include: FireSim: FPGA-accelerated cycle-exact scale-out system simulation in the public cloud;PROMISE: an end-to-end design of a programmable mixed-signal accelerator for machine-learning algorithms;computation reuse in DNNs by exploiting input similarity;criticality aware tiered cache hierarchy: a fundamental relook at multi-level cache hierarchies;constructing a weak memory model;a hardware accelerator for tracing garbage collection;get out of the valley: power-efficient address mapping for GPUs;scheduling page table walks for irregular GPU applications;a case for richer cross-layer abstractions: bridging the semantic gap with expressive memory;non-speculative store coalescing in total store order;ProtoGen: automatically generating directory cache coherence protocols from atomic specifications;Spandex: a flexible interface for efficient heterogeneous coherence;Flexon: a flexible digital neuron for efficient spiking neural network simulations;space-time algebra: a model for neocortical computation;density tradeoffs of non-volatile memory as a replacement for SRAM based last level cache;ACCORD: enabling associativity for gigascale DRAM caches by coordinating way-install and way-prediction;scaling datacenter accelerators with compute-reuse architectures;FLIN: enabling fairness and enhancing performance in modern NVMe solid state drives;and lazy persistency: a high-performing and write-efficient software persistency technique.
The following topics were dealt with: multiple processor architectures; networks and grids; non-numerical algorithms including sorting and graph algorithms; computation models; numerical parallelalgorithms; schedulin...
The following topics were dealt with: multiple processor architectures; networks and grids; non-numerical algorithms including sorting and graph algorithms; computation models; numerical parallelalgorithms; scheduling and performance evaluation including compiling, thread migration and meta computing; and high performance computing applications including computational chemistry, command and control, and finance.
暂无评论