The proceedings contain 36 papers. The topics discussed include: the network adapter: the missing link between MPI applications and network performance; on the efficiency of register file versus broadcast interconnect for collective communications in data-parallel hardware accelerators; network endpoints for clusters of SMPs; assessing energy efficiency of fault tolerance protocols for HPC systems; using heterogeneous networks to improve energy efficiency in direct coherence protocols for many-core CMPs; energy savings via dead sub-block prediction; scalable thread scheduling in asymmetric multicores for power efficiency; divergence analysis with affine constraints; exploiting concurrent GPU operations for efficient work stealing on multi-GPUs; sparse fast Fourier transform on GPUs and multi-core CPUs; cloud workload analysis with SWAT; and scalable algorithms for distributed-memory adaptive mesh refinement.
ISBN: 0818691948 (print)
The proceedings contain 61 papers. The topics discussed include: new number representation and conversion techniques on reconfigurable mesh; precise control of instruction caches; more on arbitrary boundary packed arithmetic; PERL - a registerless architecture; design alternatives for shared memory multiprocessors; a simple optimal list ranking algorithm; a parallel skeletonization algorithm and its VLSI architecture; improving error bounds for multipole-based treecodes; computation of penetration measures for convex polygons and polyhedra for graphics applications; extrapolation in distributed adaptive integration; and Java data parallel extensions with runtime system support.
Significant health disparities persist across U.S. counties due to socioeconomic inequalities, environmental challenges, and unequal access to healthcare. Leveraging large health datasets, this study employs machine l...
ISBN: 9781479969043 (print)
The proceedings contain 43 papers. The topics discussed include: a case study of hybrid dataflow and shared-memory programming models: dependency-based parallel game engine; cloud-based OpenMP parallelization using a MapReduce runtime; cross-layer self-adaptive/self-aware system software for exascale systems; SLURM support for remote GPU virtualization: implementation and performance study; modeling the impact of workload on cloud resource scaling; analyzing real cluster data for formulating allocation algorithms in cloud platforms; wide area BonjourGrid as a data desktop grid: modeling and implementation on top of Redis; reducing compiler-inserted instrumentation in unified-parallel-c code generation; and evaluation of a feature tracking vision application on a heterogeneous chip.
ISBN: 9781467386210 (print)
The proceedings contain 20 papers. The topics discussed include: using hardware transactional memory to enable speculative trace optimization; energy consumption and scalability evaluation for software transactional memory on a real computing environment; replicating the performance evaluation of an n-body application on a manycore accelerator; characterizing anomalies of a multicore ARMv7 cluster with parallel N-body simulations; MDACCER: modified distributed assessment of the closeness CEntrality ranking in complex networks for massively parallel environments; intra-clustering: accelerating on-chip communication for data parallel architectures; Kanga: a skeleton-based generic interface for parallel programming; painless parallelism on heterogeneous hardware leveraging the functional paradigm; CHAOS-MCAPI: an optimized mechanism to support multicore parallel programming; and exploiting parallelism in linear algebra kernels through dataflow execution.
The ADEPT framework integrates Ambient Intelligence (AmI) technologies into Ambient Assisted Living (AAL) and ubiquitous computing to improve the quality of life for the elderly and those needing special care, particu...
This paper provides practical and experimental results from a real-life research project that focuses on Big Data Analytics over Big Healthcare Data about the chronic pain of patients in real-life hospitals and...
ISBN: 9781479970148 (print)
The proceedings contain 24 papers. The topics discussed include: high-speed restoration of atomic force microscopy images using Tikhonov regularization in GPGPU; video processing on GPU: analysis of data transfer overhead; efficient virtual channel organization and congestion avoidance in multicore NoC systems; impact of serial scaling of multi-threaded programs in many-core era; evaluating performance of deterministic algorithms on a multicore processor of a public cloud; on the evaluation of multi-core systems with SIMD engines for public-key cryptography; energy evaluation for applications with different thread affinities on the Intel Xeon Phi; an introduction to DF-threads and their execution model; high-level dataflow programming for reconfigurable computing; a Hadoop extension to process mail folders and its application to a spam dataset; exploratory analysis of raw data files through dataflows; GBFs: efficient data-sharing on hybrid platforms: towards adding WAN-wide elasticity to DFSEs; and BIGHYBRID - a toolkit for simulating MapReduce in hybrid infrastructures.
ISBN: 9781509061082 (print)
The proceedings contain 27 papers. The topics discussed include: using balanced data placement to address I/O contention in production environments; dynamic inter-thread vectorization architecture: extracting DLP from TLP; HYPPO: a hybrid, piecewise polynomial modeling technique for non-smooth surfaces; automatic insertion of copy annotation in data-parallel programs; partitioning GPUs for improved scalability; designing high-performance heterogeneous broadcast for streaming applications on GPU clusters; performance-aware device driver architecture for signal processing; speeding up stencil computations with kernel convolution; a study of power-performance modeling using a domain-specific language; STOMP: Statistical Techniques for Optimizing and modeling performance of blocked sparse matrix vector multiplication; empirical, analytical study of hardware-based page swap in hybrid main memory system; building a low latency, highly associative DRAM cache with the buffered way predictor; breadth-first search on heterogeneous platforms: a case of study on social networks; a parallelization of a simulated annealing approach for 0-1 multidimensional knapsack problem using GPGPU; parallel pairwise correlation computation on Intel Xeon Phi clusters; and on the dark silicon automatic evaluation on multicore processors.
ISBN: 9781509012336 (print)
The proceedings contain 23 papers. The topics discussed include: extending OmpSs for OpenCL kernel co-execution in heterogeneous systems; data coherence analysis and optimization for heterogeneous computing; exploring heterogeneous mobile architectures with a high-level programming model; scalability of CPU and GPU solutions of the prime elliptic curve discrete logarithm problem; overcoming memory-capacity constraints in the use of ILUPACK on graphics processors; exploiting data compression to mitigate aging in GPU register files; SEDEA: a sensible approach to account DRAM energy in multicore systems; a user-level scheduling framework for BoT applications on private clouds; GC-CR: a decentralized garbage collector component for checkpointing in clouds; towards a deterministic fine-grained task ordering using multi-versioned memory; FGSCM: a fine-grained approach to transactional lock elision; a machine learning approach for performance prediction and scheduling on heterogeneous CPUs; object placement for high bandwidth memory augmented with high capacity memory; accelerating graph analytics on CPU-FPGA heterogeneous platform; online multimedia similarity search with response time-aware parallelism and task granularity auto-tuning; a publish/subscribe system using causal broadcast over dynamically built spanning trees; global snapshot of a distributed system running on virtual machines; and resource-management study in HPC runtime-stacking context.