The proceedings contains 436 papers. The topics discussed include: exploiting barriers to optimize power consumption of CMPs;PDM sorting algorithms that take a small number of passes;a highly parallel algorithm for th...
详细信息
ISBN:
(纸本)0769523129
The proceedings contains 436 papers. The topics discussed include: exploiting barriers to optimize power consumption of CMPs;PDM sorting algorithms that take a small number of passes;a highly parallel algorithm for the numerical simulation of unsteady diffusion processes;functionality distribution for parallel rendering;effective instruction prefetching via fetch prestaging;enhanced parallelprocessing in wide registers;asynchronous complete distributed garbage collection;scheduling algorithms for effective thread pairing on hybrid multiprocessors;practical divisible load scheduling on grid platforms with APST-DV;parallelizing a defect detection and categorization application;data redistribution and remote method invocation in parallel component architectures;and runtime empirical selection of loop schedulers on hyperthreaded SMPs.
The proceedings contain 323 papers. The topics discussed include: experiences with the sparse matrix-vector multiplication on a many-core processor;performance benefits of heterogeneous computing in HPC workloads;ther...
ISBN:
(纸本)9780769546766
The proceedings contain 323 papers. The topics discussed include: experiences with the sparse matrix-vector multiplication on a many-core processor;performance benefits of heterogeneous computing in HPC workloads;thermal-aware performance optimization in power constrained heterogenous data centers;experiences with target-platform heterogeneity in clouds, grids, and on-premises resources;blor: bandwidth and latency sensitive overlay routing for flash data dissemination;scheduling batch and heterogeneous jobs with runtime elasticity in a parallelprocessing environment;task scheduling in large-scale distributed systems utilizing partial reconfigurable processing elements;mixed data-parallel scheduling for distributed continuous integration;a monte-carlo approach for full-ahead stochastic DAG scheduling;a block-asynchronous relaxation method for graphics processing units;and partitioning for parallel matrix-matrix multiplication with heterogeneous processors: the optimal solution.
The proceedings contain 12 papers. The special focus in this conference is on Job Scheduling Strategies for parallelprocessing. The topics include: Optimization of Execution Parameters of Moldable Ultrasoun...
ISBN:
(纸本)9783031226977
The proceedings contain 12 papers. The special focus in this conference is on Job Scheduling Strategies for parallelprocessing. The topics include: Optimization of Execution Parameters of Moldable Ultrasound Workflows Under Incomplete Performance Data;Scheduling of Elastic Message Passing applications on HPC Systems;preface;on the Feasibility of Simulation-Driven Portfolio Scheduling for Cyberinfrastructure Runtime Systems;Improving Accuracy of Walltime Estimates in PBS Professional Using Soft Walltimes;re-making the Movie-Making Machine;using Kubernetes in Academic Environment: Problems and Approaches;AI-Job Scheduling on Systems with Renewable Power Sources;Toward Building a Digital Twin of Job Scheduling and Power Management on an HPC System;encoding for Reinforcement Learning Driven Scheduling.
The proceedings contain 173 papers. The topics discussed include: portable implementation of advanced driver-assistance algorithms on heterogeneous architectures;improving CPU performance through dynamic GPU access th...
ISBN:
(纸本)9781538634080
The proceedings contain 173 papers. The topics discussed include: portable implementation of advanced driver-assistance algorithms on heterogeneous architectures;improving CPU performance through dynamic GPU access throttling in CPU-GPU heterogeneous processors;alternative processor within threshold: flexible scheduling on heterogeneous systems;preemptive resource management for dynamically arriving tasks in an oversubscribed heterogeneous computing system;modeling of applications and hardware to explore task mapping and scheduling strategies on a heterogeneous micro-server system;consumer-and-provider-oriented efficient IaaS resource allocation;a pipelined and scalable dataflow implementation of convolutional neural networks on FPGA;on-chip memory based binarized convolutional deep neural network applying batch normalization free technique on an FPGA;automatic flow selection and quality-of-result estimation for FPGA placement;exploiting decoupled OpenCL work-items with data dependencies on FPGAs: a case study;ReEP: a toolset for generation and programming of reconfigurable datapaths for event processing;a generic approach to the development of coprocessors for elliptic curve cryptosystems;a hardware acceleration for surface EMG non-negative matrix factorization;and on-FPGA real-time processing of biological signals from high-density MEAs: a design space exploration.
The proceedings contain 179 papers. The special focus in this conference is on Personal Computer Based Networks of Workstations, Advances in parallel, distributed Computational Models and Video processing. The topics ...
ISBN:
(纸本)354067442X
The proceedings contain 179 papers. The special focus in this conference is on Personal Computer Based Networks of Workstations, Advances in parallel, distributed Computational Models and Video processing. The topics include: MPI collective operations over IP multicast;an open market-based architecture for distributed computing;the multicluster model to the integrated use of multiple workstation clusters;parallel information retrieval on an SCI-based pc-now;a pc-now based parallel extension for a sequential DBMS;the heterogeneous bulk synchronous parallel model;a new computation of shape moments via quadtree decomposition;a java applet to visualize algorithms on reconfigurable mesh;a hardware implementation of pram and its performance evaluation;a non-binary parallel arithmetic architecture;multithreaded parallel computer model with performance evaluation;a high performance microprocessor for multimedia computing;a novel superscalar architecture for fast DCT implementation;computing distance maps efficiently using an optical bus;advanced data layout optimization for multimedia applications;parallel parsing of mpeg video in a multi-threaded multiprocessor environment;parallelization techniques for spatial-temporal occupancy maps from multiple video streams;heuristic solutions for a mapping problem in a TV-anytime server network;a programming environment for real-time parallel vision;parallel low-level image processing on a distributed memory system;congestion-free routing of streaming multimedia content in BMIN-based parallel systems;performance of on-chip multiprocessors for vision tasks;specification techniques for automatic performance analysis tools and controlling distributed shared memory consistency from high level programming languages.
The proceedings contain 105 papers. The topics discussed include: coding the continuum;stochastic gradient descent on modern hardware: multi-core CPU or GPU? synchronous or asynchronous?;two elementary instructions ma...
ISBN:
(纸本)9781728112466
The proceedings contain 105 papers. The topics discussed include: coding the continuum;stochastic gradient descent on modern hardware: multi-core CPU or GPU? synchronous or asynchronous?;two elementary instructions make compare-and-swap;improving strong-scaling of CNN training by exploiting finer-grained parallelism;a scalable clustering-based task scheduler for homogeneous processors using DAG partitioning;an approach for parallel loading and pre-processing of unstructured meshes stored in spatially scattered fashion;computation of matrix chain products on parallel machines;optimal placement of in-memory checkpoints under heterogeneous failure likelihoods;and understanding the impact of dynamic power capping on application progress.
The proceedings contain 116 papers. The topics discussed include: parallel construction of suffix trees and the all-nearest-smaller-values problem;SWhybrid: a hybrid-parallel framework for large-scale protein sequence...
ISBN:
(纸本)9781538639146
The proceedings contain 116 papers. The topics discussed include: parallel construction of suffix trees and the all-nearest-smaller-values problem;SWhybrid: a hybrid-parallel framework for large-scale protein sequence database search;PUNAS: a parallel ungapped-alignment-featured seed verification algorithm for next-generation sequencing read alignment;eliminating irregularities of protein sequence search on multicore architectures;communication optimization on GPU: a case study of sequence alignment algorithms;elastic-cache: GPU cache architecture for efficient fine- and coarse-grained cache-line management;content-aware non-volatile cache replacement;and adaptive software caching for efficient NVRAM data persistence.
The proceedings contain 114 papers. The topics discussed include: cost-optimal execution of Boolean query trees with shared streams;exploiting geometric partitioning in task mapping for parallel computers;communicatio...
ISBN:
(纸本)9780769552071
The proceedings contain 114 papers. The topics discussed include: cost-optimal execution of Boolean query trees with shared streams;exploiting geometric partitioning in task mapping for parallel computers;communication-efficient distributed variance monitoring and outlier detection for multivariate time series;pythia: faster big data in motion through predictive software-defined network optimization at runtime;power and performance characterization and modeling of GPU-accelerated systems;scibox: online sharing of scientific data via the cloud;active measurement of the impact of network switch utilization on application performance;multi-resource real-time reader/writer locks for multiprocessors;remote invalidation: optimizing the critical path of memory transactions;and revisiting asynchronous linear solvers: provable convergence rate through randomization.
The proceedings contain 119 papers. The special focus in this conference is on parallel and distributedprocessing and applications. The topics include: Present and future supercomputer architectures;challenges in P2P...
ISBN:
(纸本)9783540241287
The proceedings contain 119 papers. The special focus in this conference is on parallel and distributedprocessing and applications. The topics include: Present and future supercomputer architectures;challenges in P2P computing;multihop wireless Ad Hoc networking: current challenges and future opportunities;an inspector-executor algorithm for irregular assignment;multi-grain parallelprocessing of data-clustering on programmable graphics hardware;a parallel reed-solomon decoder on the imagine stream processor;asynchronous document dissemination in dynamic Ad Hoc networks;location-dependent query results retrieval in a multi-cell wireless;an efficient mobile data mining model;towards correct distributed simulation of high-level petri nets with fine-grained partitioning;m-guard: a new distributed deadlock detection algorithm based on mobile agent technology;meta-based distributed computing framework;locality optimizations for jacobi iteration on distributedparallel;fault-tolerant cycle embedding in the WK-recursive network;RAIDb: redundant array of inexpensive databases;a fault-tolerant multi-agent development framework;a fault tolerance protocol for uploads: design and evaluation;topological adaptability for the distributed token circulation paradigm in faulty environment;adaptive data dissemination in wireless sensor networks;design and analysis of a k-connected topology control algorithm for Ad Hoc networks;on using temporal consistency for parallel execution of real-time queries in wireless sensor systems;cluster-based parallel simulation for large scale molecular dynamics in microscale thermophysics;parallel checkpoint/recovery on cluster of IA-64 computers;an enhanced message exchange mechanism in cluster-based mobile;a scalable low discrepancy point generator for parallel computing;generalized trellis stereo matching with systolic array.
Minimizing the Gaussian Curvature of triangular meshes can have important applications in 3D computer vision and graphics. However, traditional explicit methods require solving high-order partial differential equation...
详细信息
ISBN:
(数字)9798331520526
ISBN:
(纸本)9798331520533
Minimizing the Gaussian Curvature of triangular meshes can have important applications in 3D computer vision and graphics. However, traditional explicit methods require solving high-order partial differential equations which makes them computationally demanding and impractical in many applications. This paper presents a very fast and efficient adaptive filtering technique termed Gaussian Curvature Filtering (GCF) which optimizes the Gaussian curvature of the triangular meshes through exploiting the properties of developable surfaces. By moving a vertex along its normal direction such that one of its 1-ring neighbors falls onto the vertex's tangent plane, GCF minimizes Gaussian curvature without explicitly computing the Gaussian curvature. A novel multi tangent plane projection strategy is developed to adaptively determine a vertex's moving distance which enables the GCF to achieve Gaussian curvature minimization while preserving important geometric features. We present extensive experiments to demonstrate that GCF outperforms state of the art methods in Gaussian curvature minimization and shape-preserving model smoothing, and that it is $7\sim 50$ times faster than previous explicit optimization methods.
暂无评论