the proceedings contain 32 papers from the 16thsymposium on computerarchitecture and highperformancecomputing. the topics discussed include: self-monitored adaptive cache warm up for microprocessor simulation;the ...
详细信息
the proceedings contain 32 papers from the 16thsymposium on computerarchitecture and highperformancecomputing. the topics discussed include: self-monitored adaptive cache warm up for microprocessor simulation;the eDRAM based L3-Chache of the BlueGene/L supercomputer processor node;multi-profile instruction based compression;a study of errant pipeline flushes caused by value misspeculation;design space exploration using T&D-bench;value predictors for reuse through speculation on traces;optimizations for compiled simulation using instruction type information;and highperformance communication system based on generic programming.
the proceedings contain 16 papers. the topics discussed include: outline of a thick control flow architecture;a dynamic load balance algorithm for the S4 parallel stream processing engine;a processor workload distribu...
ISBN:
(纸本)9781509048441
the proceedings contain 16 papers. the topics discussed include: outline of a thick control flow architecture;a dynamic load balance algorithm for the S4 parallel stream processing engine;a processor workload distribution algorithm for massively parallel applications;parallelism and scalability: a solution focused on the cloud computing processing service billing;task scheduling in sucuri dataflow library;synchronization-free automatic parallelization for arbitrarily nested affine loops;thread footprint analysis for the design of multithreaded applications and multicore systems;a hybrid parallel algorithm for the auction algorithm in multicore systems;and dataflow to hardware synthesis framework on FPGAs.
the proceedings contain 31 papers. the topics discussed include: highperformancecomputing in science and engineering;towards grid implementations of metaheuristics for hard combinatorial optimization problems;a new ...
详细信息
ISBN:
(纸本)076952446X
the proceedings contain 31 papers. the topics discussed include: highperformancecomputing in science and engineering;towards grid implementations of metaheuristics for hard combinatorial optimization problems;a new multi-processor architecture for parallel lazy cyclic reference counting;reconfigurable optical interconnection system supporting concurrent application-specific parallel computing;managing the execution of large scale MPI applications on computational grids;function outlining and partial inlining;reusing traces in a dynamic conditional execution architecture;cooperation of neighboring PEs in clustered architectures;a new parallel environment for interactive simulations implementing safe multithreading with MPI;a time petri-net-based approach for software synthesis considering overheads;and analyzing and improving clustering based sampling for microprocessor simulation.
the proceedings contain 22 papers. the topics discussed include: towards production code effective portability among vector machines and microprocessors-based architectures;data segmentation management infrastructure ...
详细信息
ISBN:
(纸本)0769527043
the proceedings contain 22 papers. the topics discussed include: towards production code effective portability among vector machines and microprocessors-based architectures;data segmentation management infrastructure in a database grid;detecting malicious manipulation in grid environments;policy-based resource allocation in hierarchical virtual organizations for global grids;a speculative trace reuse architecture with reduced hardware requirements;controlling the power and area of neural branch predictors for practical implementation in high-performance processors;a run-time system for efficient execution of scientific workflows on distributed environments;dual-thread speculation: two threads in the machine are worth eight in the bush;characterizing the performance of data management systems on hyper-threaded architectures;and ultra-fast CPU performance prediction: extending the monte carlo approach.
the proceedings contain 18 papers. the topics discussed include: exploring federated learning to trace depression in social media with language models;computing seismic attributes with deep-learning models;DASS: dynam...
ISBN:
(纸本)9798350381603
the proceedings contain 18 papers. the topics discussed include: exploring federated learning to trace depression in social media with language models;computing seismic attributes with deep-learning models;DASS: dynamic adaptive sub-target specialization;optimizing microservices performance and resource utilization through containerized grouping: an experimental study;assessing the performance of an architecture-aware optimization tool for neural networks;an exploratory study of deep learning for predicting computational tasks behavior in HPC systems;exploring federated learning to trace depression in social media with language models;computing seismic attributes with deep-learning models;and energy consumption analysis of instruction cache prefetching methods.
the proceedings contain 23 papers. the topics discussed include: performance modeling and estimation of a configurable output stationary neural network accelerator;NeurOPar, a neural network-driven EDP optimization st...
ISBN:
(纸本)9798350305487
the proceedings contain 23 papers. the topics discussed include: performance modeling and estimation of a configurable output stationary neural network accelerator;NeurOPar, a neural network-driven EDP optimization strategy for parallel workloads;exploiting the potential of flexible processing units;reverse time migration with lossy and lossless wavefield compression;performance tuning for GPU-embedded systems: machine-learning-based and analytical model-driven tuning methodologies;WCSim: a cloud computing simulator with support for bag of tasks workflows;performance modeling of MARE2DEM’s adaptive mesh refinement for makespan estimation;and comparing performance and portability between CUDA and SYCL for protein database search on NVIDIA, AMD, and Intel GPUs.
the proceedings contain 63 papers. the topics discussed include: automated GPU grid geometry selection for OpenMP kernels;effect of network topology on the performance of ADMM-based SVMs;exploring the potential of nex...
ISBN:
(纸本)9781538677698
the proceedings contain 63 papers. the topics discussed include: automated GPU grid geometry selection for OpenMP kernels;effect of network topology on the performance of ADMM-based SVMs;exploring the potential of next generation software-defined in-memory frameworks;a fault-tolerant agent-based architecture for transient servers in fog computing;deep learning on large-scale muticore clusters;accelerating deep neural network training for action recognition on a cluster of GPUs;balancing load of GPU subsystems to accelerate image reconstruction in parallel beam tomography;high-performance ensembles of online sequential extreme learning machine for regression and time series forecasting;a machine learning approach for parameter screening in earthquake simulation;adaptive partitioning for iterated sequences of irregular OpenCL kernels;highly scalable stencil-based matrix-free stochastic estimator for the diagonal of the inverse;and adaptive scheduling of collocated applications using a task-based runtime system.
the proceedings contain 32 papers. the topics discussed include: multi-level parallelism in the computational modeling of the heart;computational characteristics of production seismic migration and its performance on ...
详细信息
ISBN:
(纸本)9780769530147
the proceedings contain 32 papers. the topics discussed include: multi-level parallelism in the computational modeling of the heart;computational characteristics of production seismic migration and its performance on novel processor architectures;exploring novel parallelization technologies for 3-D imaging applications;low-cost techniques for reducing branch context pollution in a soft realtime embedded multithreaded processor;predicting loop termination to boost speculative thread-level parallelism in embedded applications;performance improvement of the parallel lattice Boltzmann method through blocked data distributions;a scalable parallel deduplication algorithm;impacts of multiprocessor configurations on workloads in bioinformatics;efficient hardware for modular exponentiation using the sliding-window method with variable-length partitioning;and optimized math functions for a fixed-point DSP architecture.
the proceedings contain 22 papers. the topics discussed include: accurate and low-overhead dynamic detection and prediction of program phases using branch signatures;aggressive scheduling and speculation in multithrea...
the proceedings contain 22 papers. the topics discussed include: accurate and low-overhead dynamic detection and prediction of program phases using branch signatures;aggressive scheduling and speculation in multithreaded architectures: is it worth its salt?;an optimization mechanism intended for two-level cache hierarchy to improve energy and performance using the NSGAII algorithm;on simulated annealing for the scheduling of parallel applications;controlling processes reassignment in BSP applications;a highperformance massively parallel approach for real time deformable body physics simulation;a methodology for developing high fidelity communication models for large-scale applications targeted on multicore systems;and ORBIT: effective issue queue soft-error vulnerability mitigation on simultaneous multithreaded architectures using operand readiness-based instruction dispatch.
暂无评论