The proceedings contain 179 papers. The special focus in this conference is on Personal Computer Based Networks of Workstations, Advances in parallel, distributed Computational Models and Video processing. The topics ...
ISBN:
(纸本)354067442X
The proceedings contain 179 papers. The special focus in this conference is on Personal Computer Based Networks of Workstations, Advances in parallel, distributed Computational Models and Video processing. The topics include: MPI collective operations over IP multicast;an open market-based architecture for distributed computing;the multicluster model to the integrated use of multiple workstation clusters;parallel information retrieval on an SCI-based pc-now;a pc-now based parallel extension for a sequential DBMS;the heterogeneous bulk synchronous parallel model;a new computation of shape moments via quadtree decomposition;a java applet to visualize algorithms on reconfigurable mesh;a hardware implementation of pram and its performance evaluation;a non-binary parallel arithmetic architecture;multithreaded parallel computer model with performance evaluation;a high performance microprocessor for multimedia computing;a novel superscalar architecture for fast DCT implementation;computing distance maps efficiently using an optical bus;advanced data layout optimization for multimedia applications;parallel parsing of mpeg video in a multi-threaded multiprocessor environment;parallelization techniques for spatial-temporal occupancy maps from multiple video streams;heuristic solutions for a mapping problem in a TV-anytime server network;a programming environment for real-time parallel vision;parallel low-level image processing on a distributed memory system;congestion-free routing of streaming multimedia content in BMIN-based parallel systems;performance of on-chip multiprocessors for vision tasks;specification techniques for automatic performance analysis tools and controlling distributed shared memory consistency from high level programming languages.
The proceedings contains 436 papers. The topics discussed include: exploiting barriers to optimize power consumption of CMPs;PDM sorting algorithms that take a small number of passes;a highly parallel algorithm for th...
详细信息
ISBN:
(纸本)0769523129
The proceedings contains 436 papers. The topics discussed include: exploiting barriers to optimize power consumption of CMPs;PDM sorting algorithms that take a small number of passes;a highly parallel algorithm for the numerical simulation of unsteady diffusion processes;functionality distribution for parallel rendering;effective instruction prefetching via fetch prestaging;enhanced parallelprocessing in wide registers;asynchronous complete distributed garbage collection;scheduling algorithms for effective thread pairing on hybrid multiprocessors;practical divisible load scheduling on grid platforms with APST-DV;parallelizing a defect detection and categorization application;data redistribution and remote method invocation in parallel component architectures;and runtime empirical selection of loop schedulers on hyperthreaded SMPs.
The proceedings contain 362 papers. The topics discussed include: uniform scattering of autonomous mobile robots in a grid;performance study of interference on sharing GPU and CPU resources with multiple applications;...
ISBN:
(纸本)9781424437504
The proceedings contain 362 papers. The topics discussed include: uniform scattering of autonomous mobile robots in a grid;performance study of interference on sharing GPU and CPU resources with multiple applications;resource allocation strategies for constructive in-network stream processing;deciding model of population size in time-constrained task scheduling;improving accuracy of host load predictions on computational grids by artificial neural networks;combining multiple heuristics on discrete resources;predictive analysis and optimization of pipelined wavefront computations;RSA encryption and decryption using the redundant number system on the FPGA;computation with a constant number of steps in membrane computing;analytical model of inter-node communication under multi-versioned coherence mechanisms;and a distributed approach for the problem of routing and wavelength assignment in WDM networks.
The proceedings contain 461 papers. The topics discussed include: how to make discretionary access control secure against trojan horses;random number generation for serial, parallel, distributed, and grid-based financ...
详细信息
ISBN:
(纸本)9781424416943
The proceedings contain 461 papers. The topics discussed include: how to make discretionary access control secure against trojan horses;random number generation for serial, parallel, distributed, and grid-based financial computations;mobility control schemes with quick convergence in wireless sensor networks;design and implementation of a tool for modeling and programming deadlock free meta-pipeline applications;analytic performance models for bounded queuing systems;on the construction of paired many-to-many disjoint path covers in hypercube-like interconnection networks with faulty elements;a scalable configurable architecture for the massively parallel GCA model;state management for distributed python applications;a fault-tolerant system for Java/CORBA objects;and improving data availability for a cluster file system through replication.
The proceedings contain 117 papers. The topics discussed include: resource elasticity at task-level;evaluation of vertex reordering for graph applications;on the predictability of quantum circuit fidelity using machin...
ISBN:
(纸本)9781665435772
The proceedings contain 117 papers. The topics discussed include: resource elasticity at task-level;evaluation of vertex reordering for graph applications;on the predictability of quantum circuit fidelity using machine learning;improving the operational capability of automated empirical performance modeling;development of a middleware to create an efficient unified programming model for heterogeneous computing;task-level checkpointing for nested fork-join programs;verifiable coded computing: towards fast and secure distributed computing;hierarchical cost analysis for distributed deep learning;pattern-aware vectorization for sparse matrix computations;and heterogeneity-aware deep learning workload deployments on the computing continuum.
The proceedings contain 112 papers. The topics discussed include: power-aware replica placement and update strategies in tree networks;minimum cost resource allocation for meeting job requirements;power and performanc...
ISBN:
(纸本)9780769543857
The proceedings contain 112 papers. The topics discussed include: power-aware replica placement and update strategies in tree networks;minimum cost resource allocation for meeting job requirements;power and performance management in priority-type cluster computing systems;communication-avoiding QR decomposition for GPU;overlapping computation and communication for advection on hybrid parallel computers;VisIO: enabling interactive visualization of ultra-scale, time series data via high-bandwidth distributed I/O systems;a novel power management for CMP systems in data-intensive environment;characterization of system services and their performance impact in multi-core nodes;automatic recognition of performance idioms in scientific applications;exploiting data similarity to reduce memory footprints;the evaluation of an effective out-of-core run-time system in the context of parallel mesh generation;and a lightweight method for automated design of convergence.
The proceedings contain 165 papers. The topics discussed include: understanding multi-dimensional efficiency of fine-tuning large language models using SpeedUp, MemoryUp, and EnergyUp;shared-memory parallel Edmonds bl...
ISBN:
(纸本)9798350364606
The proceedings contain 165 papers. The topics discussed include: understanding multi-dimensional efficiency of fine-tuning large language models using SpeedUp, MemoryUp, and EnergyUp;shared-memory parallel Edmonds blossom algorithm for maximum cardinality matching in general graphs;a reconfigurable architecture of a scalable, ultrafast, ultrasound, delay-and-sum beamformer;scheduling and allocation of disaggregated memory resources in HPC systems;GIM (ghost in the machine): a coarse-grained reconfigurable compute-in-memory platform for exploring machine-learning architectures;further optimizations and analysis of smith-waterman with vector extensions;measurement-based quantum approximate optimization;optimizing forward wavefield storage leveraging high-speed storage media;teaching performance metrics in parallel computing courses;and compiler-driven Swar parallelism for high-performance bitboard algorithms.
The proceedings contain 88 papers. The topics discussed include: HINT: designing cache-efficient MPI_Alltoall using hybrid memory copy ordering and non-temporal instructions;graph analytics on jellyfish topology;QSync...
ISBN:
(纸本)9798350337662
The proceedings contain 88 papers. The topics discussed include: HINT: designing cache-efficient MPI_Alltoall using hybrid memory copy ordering and non-temporal instructions;graph analytics on jellyfish topology;QSync: quantization-minimized synchronous distributed training across hybrid devices;two-stage block orthogonalization to improve performance of s-step GMRES;CloverLeaf on intel multi-core CPUs: a case study in write-allocate evasion;the self-adaptive and topology-aware MPI Bcast leveraging collective offload on Tianhe express interconnect;Picasso: memory-efficient graph coloring using palettes with applications in quantum computing;exploiting long vectors with a CFD code: a co-design show case;and to store or not to store: a graph theoretical approach for dataset versioning.
The proceedings contain 114 papers. The topics discussed include: a task based approach for co-scheduling ensemble workloads on heterogeneous nodes;power-aware computing with Optane persistent memory modules;cloud ser...
ISBN:
(纸本)9798350311990
The proceedings contain 114 papers. The topics discussed include: a task based approach for co-scheduling ensemble workloads on heterogeneous nodes;power-aware computing with Optane persistent memory modules;cloud services enable efficient ai-guided simulation workflows across heterogeneous resources;enabling efficient regular expression matching at the edge through domain-specific architectures;is your FPGA transmitting secrets: covert antennas from interconnect;hardware accelerator for transformer based end-to-end automatic speech recognition system;near-storage accelerator for bulk graph ingestion;application-specific FPGAs: cryptographic agility through customized reconfigurable architectures;parallel inference of phylogenetic stands with Gentrius;and using hyperdimensional computing to extract features for the detection of type 2 diabetes.
The proceedings contain 95 papers. The topics discussed include: distributed sparse random projection trees for constructing K-nearest neighbor graphs;fast deterministic gathering with detection on arbitrary graphs: t...
ISBN:
(纸本)9798350337662
The proceedings contain 95 papers. The topics discussed include: distributed sparse random projection trees for constructing K-nearest neighbor graphs;fast deterministic gathering with detection on arbitrary graphs: the power of many robots;accurate and efficient distributed covid-19 spread prediction based on a large-scale time-varying people mobility graph;accelerating packet processing in container overlay networks via packet-level parallelism;efficient hardware primitives for immediate memory reclamation in optimistic data structures;efficient hardware primitives for immediate memory reclamation in optimistic data structures;accelerating distributed deep learning training with compression assisted Allgather and reduce-scatter communication;accelerating CNN inference on long vector architectures via co-design;exploiting input tensor dynamics in activation checkpointing for efficient training on GPU;drill: log-based anomaly detection for large-scale storage systems using source code analysis;dynasparse: accelerating GNN inference through dynamic sparsity exploitation;exploiting sparsity in pruned neural networks to optimize large model training;SRC: mitigate I/O throughput degradation in network congestion control of disaggregated storage systems;boosting multi-block repair in cloud storage systems with wide-stripe erasure coding;on doorway egress by autonomous robots;and on the arithmetic intensity of distributed-memory dense matrix multiplication involving a symmetric input matrix (SYMM).
暂无评论