The proceedings contain 210 papers. The topics discussed include: towards a green, QoS-enabled heterogeneous cloud infrastructure;predicting job completion time in heterogeneous MapReduce environments;minimizing renta...
ISBN:
(纸本)9781509021406
The proceedings contain 210 papers. The topics discussed include: towards a green, QoS-enabled heterogeneous cloud infrastructure;predicting job completion time in heterogeneous MapReduce environments;minimizing rental cost for multiple recipe applications in the cloud;providing fairness in heterogeneous multicores with a predictive, adaptive scheduler;dynamic resource management for parallel tasks in an oversubscribed energy-constrained heterogeneous environment;evaluation of emerging energy-efficient heterogeneous computing platforms for biomolecular and cellular simulation workloads;latency, power, and security optimization in distributed reconfigurable embedded systems;and a reconfigurable fixed-point architecture for adaptive beamforming.
The proceedings contain 109 papers. The topics discussed include: balanced coloring for parallel computing applications;high-performance graph analytics on manycore processors;scalable community detection with the Lou...
ISBN:
(纸本)9781479986484
The proceedings contain 109 papers. The topics discussed include: balanced coloring for parallel computing applications;high-performance graph analytics on manycore processors;scalable community detection with the Louvain algorithm;cooperative computing for autonomous data centers;divide and conquer symmetric tridiagonal eigensolver for multicore architectures;contention-based nonminimal adaptive routing in high-radix networks;identifying the culprits behind network congestion;embedding nonblocking multicast virtual networks in fat-tree data centers;cashmere: heterogeneous many-core computing;a scheduling and runtime framework for a cluster of heterogeneous machines with multiple accelerators;hierarchical DAG scheduling for hybrid distributed systems;pushing the performance envelope of modular exponentiation across multiple generations of GPUs;and addressing fairness in SMT multicores with a progress-aware scheduler.
The proceedings contain 113 papers. The topics discussed include: optimizing parallel graph connectivity computation via subgraph sampling;a parallel algorithm for Bayesian network inference using arithmetic circuits;...
ISBN:
(纸本)9781538643686
The proceedings contain 113 papers. The topics discussed include: optimizing parallel graph connectivity computation via subgraph sampling;a parallel algorithm for Bayesian network inference using arithmetic circuits;cataloging the visible universe through Bayesian inference at petascale;efficient, parallel at-scale correlation analysis for atom probe tomography on hybrid architectures;a fast and massively-parallel inverse solver for multiple-scattering tomographic image reconstruction;real-time massively distributed multi-object adaptive optics simulations for the European extremely large telescope;performance isolation of data-intensive scale-out applications in a multi-tenant cloud;scalable data resilience for in-memory data staging;and performance and scalability of lightweight multi-kernel based operating systems.
The proceedings contain 112 papers. The topics discussed include: an accurate tool for modeling, fingerprinting, comparison, and clustering of parallelapplications based on performance counters;SmarTmem: intelligent ...
ISBN:
(纸本)9781728135106
The proceedings contain 112 papers. The topics discussed include: an accurate tool for modeling, fingerprinting, comparison, and clustering of parallelapplications based on performance counters;SmarTmem: intelligent management of transcendent memory in a virtualized server;data reliability and redundancy optimization of a secure multi-cloud storage under uncertainty of errors and falsifications;a portable GPU framework for SNP comparisons;towards a methodology for benchmarking edge processing frameworks;a fast local algorithm for track reconstruction on parallel architectures;towards native execution of deep learning on a leadership-class hpc system;improving robustness of heterogeneous serverless computing systems via probabilistic task pruning;and influence of tasks duration variability on task-based runtime schedulers.
The proceedings contain 110 papers. The topics discussed include: SSDKeeper: self-adapting channel allocation to improve the performance of SSD devices;a study of graph analytics for massive datasets on distributed mu...
ISBN:
(纸本)9781728168760
The proceedings contain 110 papers. The topics discussed include: SSDKeeper: self-adapting channel allocation to improve the performance of SSD devices;a study of graph analytics for massive datasets on distributed multi-GPUs;DPF-ECC: accelerating elliptic curve cryptography with floating-point computing power of GPUs;inter-job scheduling of high-throughput material screening applications;learning an effective charging scheme for mobile devices;improving transactional code generation via variable annotation and barrier elision;solving the container explosion problem for distributed high throughput computing;CycLedger: a scalable and secure parallel protocol for distributed ledger via sharding;DAG-aware joint task scheduling and cache management in spark clusters;and understanding the interplay between hardware errors and user job characteristics on the Titan supercomputer.
The proceedings contain 112 papers. The topics discussed include: power-aware replica placement and update strategies in tree networks;minimum cost resource allocation for meeting job requirements;power and performanc...
ISBN:
(纸本)9780769543857
The proceedings contain 112 papers. The topics discussed include: power-aware replica placement and update strategies in tree networks;minimum cost resource allocation for meeting job requirements;power and performance management in priority-type cluster computing systems;communication-avoiding QR decomposition for GPU;overlapping computation and communication for advection on hybrid parallel computers;VisIO: enabling interactive visualization of ultra-scale, time series data via high-bandwidth distributed I/O systems;a novel power management for CMP systems in data-intensive environment;characterization of system services and their performance impact in multi-core nodes;automatic recognition of performance idioms in scientific applications;exploiting data similarity to reduce memory footprints;the evaluation of an effective out-of-core run-time system in the context of parallel mesh generation;and a lightweight method for automated design of convergence.
Rapid object detection is crucial for safety-critical applications, such as post-event analysis of surveillance videos in crime investigations and the operation of autonomous vehicles. The objective of this paper is t...
详细信息
ISBN:
(数字)9798331521165
ISBN:
(纸本)9798331521172
Rapid object detection is crucial for safety-critical applications, such as post-event analysis of surveillance videos in crime investigations and the operation of autonomous vehicles. The objective of this paper is to improve the speed of object detection through efficient partitioning and frame reduction using parallelprocessing. Techniques to improve Spark's default partitioning by using entropy-based algorithms to create more evenly distributed partitions based on the estimated workload of the frames are considered. Redundant frames are removed to efficiently reduce the workload. Algorithms for removing redundant frames are introduced and evaluated for their effectiveness in comparison to random and fixed frame removals. The results from this paper demonstrate that partitioning algorithms that estimate workload using entropy provide faster processing time when compared to those that use random partitioning. The results also indicate that frame removal algorithms that use frame-specific information, like entropy difference between frames, further improve performance without notably reducing detection accuracy when compared to those that do not utilize frame-specific information.
The proceedings contain 114 papers. The topics discussed include: distributed-memory algorithms for maximum cardinality matching in bipartite graphs;ARCHER: effectively spotting data races in large OpenMP applications...
ISBN:
(纸本)9781509021406
The proceedings contain 114 papers. The topics discussed include: distributed-memory algorithms for maximum cardinality matching in bipartite graphs;ARCHER: effectively spotting data races in large OpenMP applications;algorithm and architecture independent benchmarking with SEAK;design and implementation of a parallel research kernel for assessing dynamic load-balancing capabilities;VNRE: flexible and efficient acceleration for network redundancy elimination;analyzing network health and congestion in dragonfly-based supercomputers;random regular graph and generalized de Bruijn graph with k-shortest path routing;deflection containment for bufferless network-on-chips;RUPS: fixing relative distances among urban vehicles with context-aware trajectories;hybrid dynamic trees for extreme-resolution 3D sparse data modeling;and optimization of an electromagnetics code with multicore wavefront diamond blocking and multi-dimensional intra-tile parallelization.
The proceedings contain 88 papers. The topics discussed include: HINT: designing cache-efficient MPI_Alltoall using hybrid memory copy ordering and non-temporal instructions;graph analytics on jellyfish topology;QSync...
ISBN:
(纸本)9798350337662
The proceedings contain 88 papers. The topics discussed include: HINT: designing cache-efficient MPI_Alltoall using hybrid memory copy ordering and non-temporal instructions;graph analytics on jellyfish topology;QSync: quantization-minimized synchronous distributed training across hybrid devices;two-stage block orthogonalization to improve performance of s-step GMRES;CloverLeaf on intel multi-core CPUs: a case study in write-allocate evasion;the self-adaptive and topology-aware MPI Bcast leveraging collective offload on Tianhe express interconnect;Picasso: memory-efficient graph coloring using palettes with applications in quantum computing;exploiting long vectors with a CFD code: a co-design show case;and to store or not to store: a graph theoretical approach for dataset versioning.
The proceedings contain 118 papers. The topics discussed include: a predictive model for solving small linear algebra problems in GPU registers;a parallel tiled solver for dense symmetric indefinite systems on multico...
ISBN:
(纸本)9780769546759
The proceedings contain 118 papers. The topics discussed include: a predictive model for solving small linear algebra problems in GPU registers;a parallel tiled solver for dense symmetric indefinite systems on multicore architectures;a comprehensive study of task coalescing for selecting parallelism granularity in a two-stage bidiagonal reduction;improving the performance of dynamical simulations via multiple right-hand sides;high-performance interaction-based simulation of gut immunopathologies with enteric immunity simulator (ENISI);a parallel algorithm for spectrum-based short read error correction;enhancing the scalability of consistency-based progressive multiple sequences alignment applications;an accurate GPU performance model for effective control flow divergence optimization;power-aware Manhattan routing on chip multiprocessors;and efficient resource oblivious algorithms for multicores with false sharing.
暂无评论