the proceedings contain 101 papers. the special focus in this conference is on Grid Architectures, Load Balancing, Performance Analysis, Prediction, parallel Non-numerical Algorithms and parallel Programming. the topi...
ISBN:
(纸本)9783540437925
the proceedings contain 101 papers. the special focus in this conference is on Grid Architectures, Load Balancing, Performance Analysis, Prediction, parallel Non-numerical Algorithms and parallel Programming. the topics include: Interrupt and cancellation as synchronization methods;applications of virtual data in the LIGO experiment;a parallel system architecture based on dynamically configurable shared memory clusters;simultaneous allocation and scheduling with exclusion and precedence relations algorithm;a greedy approach for a time-dependent scheduling problem;dedicated scheduling of biprocessor tasks to minimize mean flow time;heterogeneous dynamic load balancing with a scheme based on the laplacian polynomial;task scheduling for dynamically configurable multiple SMP clusters based on extended DSC approach;processing time and memory requirements for multi-instalment divisible job processing;estimating execution time of distributed applications;evaluation of parallel programs by measurement of its granularity;the performance of different communication mechanisms and algorithms used for parallelization of molecular dynamics code;benchmarking tertiary storage systems with file fragmentation;fem computations on clusters using different models of parallel programming;parallel skeletons for tabu search method based on search strategies and neighborhood partition;a new parallel approach for multi-dimensional packing problems;three parallel algorithms for simulated annealing;solving the flow shop problem by parallel simulated annealing;automated verification of infinite state concurrent systems;criteria of satisfiability for homogeneous systems of linear Diophantine constraints and irregular and out-of-core parallel computing on clusters.
the proceedings contain 91 papers. the special focus in this conference is on parallelprocessing and appliedmathematics. the topics include: Performance/Energy Aware Optimization of parallel Applications on GPUs Und...
ISBN:
(纸本)9783030432218
the proceedings contain 91 papers. the special focus in this conference is on parallelprocessing and appliedmathematics. the topics include: Performance/Energy Aware Optimization of parallel Applications on GPUs Under Power Capping;improving Energy Consumption in Iterative Problems Using Machine Learning;automatic Software Tuning of parallel Programs for Energy-Aware Executions;overview of Application Instrumentation for Performance Analysis and Tuning;Energy-Efficiency Tuning of a Lattice Boltzmann Simulation Using MERIC;Evaluating the Advantage of Reactive MPI-aware Power Control Policies;application-Aware Power Capping Using Nornir;A New Hardware Counters Based thread Migration Strategy for NUMA Systems;alea – Complex Job Scheduling Simulator;studying the Performance of Vector-Based Quicksort Algorithm;makespan Minimization in Data Gathering Networks with Dataset Release Times;overlapping Schwarz Preconditioner for Fourth Order Multiscale Elliptic Problems;MATLAB Implementation of C1 Finite Elements: Bogner-Fox-Schmit Rectangle;simple Preconditioner for a thin Membrane Diffusion Problem;a Numerical Scheme for Evacuation Dynamics;Additive Average Schwarz with Adaptive Coarse Space for Morley FE;application of Multiscale Computational Techniques to the Study of Magnetic Nanoparticle Systems;clique: A parallel Tool for the Molecular Nanomagnets Simulation and Modelling;Modelling of Limitations of BHJ Architecture in Organic Solar Cells;monte Carlo Study of Spherical and Cylindrical Micelles in Multiblock Copolymer Solutions;parallel Tiled Cache and Energy Efficient Code for Zukers RNA Folding;electronic and Optical Properties of Carbon Nanotubes Directed to their Applications in Solar Cells;the MPFI Library: Towards IEEE 17882015 Compliance;softmax and McFaddens Discrete Choice Under Interval (and Other) Uncertainty;experiments with Heterogenous Automata-Based Multi-agent Systems.
the proceedings contain 91 papers. the special focus in this conference is on parallelprocessing and appliedmathematics. the topics include: A high-performance implementation of a robust preconditioner for heterogen...
ISBN:
(纸本)9783030432287
the proceedings contain 91 papers. the special focus in this conference is on parallelprocessing and appliedmathematics. the topics include: A high-performance implementation of a robust preconditioner for heterogeneous problems;hybrid solver for quasi block diagonal linear systems;parallel adaptive cross approximation for the multi-trace formulation of scattering problems;Implementation of parallel 3-D Real FFT with 2-D decomposition on intel xeon phi clusters;Exploiting symmetries of small prime-sized DFTs;parallel computations for various scalarization schemes in multicriteria optimization problems;early performance assessment of the thunderX2 processor for lattice based simulations;An area efficient and reusable HEVC 1D-DCT hardware accelerator;improving locality-aware scheduling with acyclic directed graph partitioning;structure-aware calculation of many-electron wave function overlaps on multicore processors;isoefficiency maps for divisible computations in hierarchical memory systems;OpenMP target device offloading for the SX-Aurora TSUBASA vector engine;On the road to DiPOSH: Adventures in high-performance openSHMEM;click-Fraud detection for online advertising;parallel graph partitioning optimization under PEGASUS DA application global state monitoring;cloud infrastructure automation for scientific workflows;Posit NPB: Assessing the precision improvement in HPC scientific applications;a high-order discontinuous galerkin solver with dynamic adaptive mesh refinement to simulate cloud formation processes;Performance and portability of state-of-art molecular dynamics software on modern GPUs;Exploiting parallelism on shared memory in the QED particle-in-cell code PICADOR with greedy load balancing;lazy stencil integration in multigrid algorithms;parallelized construction of extension velocities for the level-set method;Relative expression classification tree. A preliminary GPU-Based implementation;SIMD-node transformations for non-blocking data structures.
the proceedings contain 44 papers. the discussed topics include: performance analysis of interconnection networks under bursty and batch arrival traffic;a lazy EDF interrupt scheduling algorithm for multiprocessor in ...
详细信息
ISBN:
(纸本)9783540729044
the proceedings contain 44 papers. the discussed topics include: performance analysis of interconnection networks under bursty and batch arrival traffic;a lazy EDF interrupt scheduling algorithm for multiprocessor in parallel computing environment;a time and interaction model for open distributed timing computation;efficient linkable ring signatures and threshold signatures from linear feedback shift register;an improved algorithm for Alhusaini's algorithm in heterogeneous distributed systems;a framework of software component adaptation;data interoperation between ChinaGrid ad SRB;method for computational grids resources allocate based on auction and utility analyses;automatic conceptual indexing of web services and its application to service retrieval;on-demand capacity framework;implementing digital right management in P2P content sharing system;and a generalized critical task anticipation technique for DAG scheduling.
the proceedings contain 149 papers. the special focus in this conference is on parallel, Distributed Architectures, Scheduling and Load Balancing. the topics include: Session guarantees to achieve pram consistency of ...
ISBN:
(纸本)3540219463
the proceedings contain 149 papers. the special focus in this conference is on parallel, Distributed Architectures, Scheduling and Load Balancing. the topics include: Session guarantees to achieve pram consistency of replicated shared objects;an extended atomic consistency protocol for recoverable DSM systems;hyper-threading technology speeds clusters;configurable microprocessor array for DSP applications;on generalized moore digraphs;RDMA communication based on rotating buffers for efficient parallel fine-grain computations;communication on the fly in dynamic SMP clusters;accelerated diffusion algorithms on general dynamic networks;suitability of load scheduling algorithms to workload characteristics;minimizing time-dependent total completion time on parallel identical machines;diffusion based scheduling in the agent-oriented computing system;approximation algorithms for scheduling jobs with chain precedence constraints;combining vector quantization and ant-colony algorithm for mesh-partitioning;wavelet-neuronal resource load prediction for multiprocessor environment;fault-tolerant scheduling in distributed real-time systems;online scheduling of multiprocessor jobs with idle regulation;predicting the response time of a new task on a beowulf cluster;space decomposition solvers and their performance in pc-based parallel computing environments;evaluation of execution time of mathematical library functions based on historical performance information;empirical modelling of parallel linear algebra routines;efficiency of divisible load processing;gray box based data access time estimation for tertiary storage in grid environment;performance modeling of parallel fem computations on clusters;asymptotical behaviour of the communication complexity of one parallel algorithm and analytical modeling of optimized sparse linear code.
the proceedings contain 56 papers. the special focus in this conference is on parallel Architectures and Resilience, Numerical Algorithms and parallel Scientific Computing. the topics include: Exploring memory error v...
ISBN:
(纸本)9783319321486
the proceedings contain 56 papers. the special focus in this conference is on parallel Architectures and Resilience, Numerical Algorithms and parallel Scientific Computing. the topics include: Exploring memory error vulnerability for parallel programming models;an approach for ensuring reliable functioning of a supercomputer based on a formal model;sparse matrix multiplication on dataflow engines;energy efficient calculations of text similarity measure on FPGA-accelerated computing platforms;a bucket sort algorithm for the particle-in-cell method on manycore architectures;experience on vectorizing lattice boltzmann kernels for multi- and many-core architectures;performance analysis of the Kahan-enhanced scalar product on current multicore processors;performance analysis of the Chebyshev basis conjugate gradient method on the K computer;dense symmetric indefinite factorization on GPU accelerated architectures;a parallel multi-threaded solver for symmetric positive definite bordered-band linear systems;parallel algorithm for quasi-band matrix-matrix multiplication;comparative performance analysis of coarse solvers for algebraic multigrid on multicore and manycore architectures;LU preconditioning for overdetermined sparse least squares problems;experimental optimization of parallel 3D overlapping domain decomposition schemes;massively parallel approach to sensitivity analysis on HPC architectures by using scalarm platform;GPU implementation of Krylov solvers for block-tridiagonal eigenvalue problems;comparison of large graphs using distance information;fast incremental community detection on dynamic graphs;a diffusion process for graph partitioning;a parallel algorithm for LZW decompression, with GPU implementation;parallel FDFM approach for computing GCDs using the FPGA and parallel induction of nondeterministic finite automata.
the proceedings contain 56 papers. the special focus in this conference is on Models, Algorithms, Energy Aspects of Computation, Scheduling for parallel Computing and Language-Based parallel Programming Models. the to...
ISBN:
(纸本)9783319321516
the proceedings contain 56 papers. the special focus in this conference is on Models, Algorithms, Energy Aspects of Computation, Scheduling for parallel Computing and Language-Based parallel Programming Models. the topics include: Virtualizing CUDA enabled GPGPUS on arm clusters;a distributed hash table for shared memory;mathematical approach to the performance evaluation of matrix multiply algorithm;a scalable numerical algorithm for solving tikhonov regularization problems;energy performance modeling with TIA and EML;considerations of computational efficiency in volunteer and cluster computing;parallel programs scheduling with architecturally supported regions;adaptive multi-level workflow scheduling with uncertain task estimates;divisible loads scheduling in hierarchical memory systems with time and energy constraints;extending Gustafson-barsis’s law for dual-architecture computing;free scheduling of tiles based on the transitive closure of dependence graphs;multi-threaded construction of neighbour lists for particle systems in openMP;high productivity and high performance;parallel ant brood graph partitioning in Julia;scalability model based on the concept of granularity;performance and power-aware modeling of MPI applications for cluster computing;running time prediction for web search queries;performance analysis of a parallel, multi-node pipeline for DNA sequencing;parallelising the computation of minimal absent words;modeling and simulations of edge-emitting broad-area semiconductor lasers and amplifiers;application of the parallel inmost platform to subsurface flow and transport modelling;genetic algorithm and exact diagonalization approach for molecular nanomagnets modelling and parallel Monte Carlo simulations for spin models with distributed lattice.
the article explores character recognition using convolutional neural networks (CNNs) optimized withthe CUDA platform to enhance computational efficiency. It outlines the CNN architecture, methods for leveraging GPU-...
详细信息
ISBN:
(数字)9798331531836
ISBN:
(纸本)9798331531843
the article explores character recognition using convolutional neural networks (CNNs) optimized withthe CUDA platform to enhance computational efficiency. It outlines the CNN architecture, methods for leveraging GPU-based parallel data processing, and presents experimental results derived from the MNIST dataset. the study highlights that implementing CUDA drastically reduces processing time while maintaining a high level of predictive accuracy. the findings emphasize the potential of GPU acceleration in handling intensive computational tasks, making it a promising approach for real-time applications in image recognition and machine learning.
the paper describes and analyzes an Average Schwarz Method with spectrally enriched coarse space for a reduced Hsieh-Clough-Tocher (RHCT) finite element discretization of a 4th-order elliptic multiscale problem. the d...
详细信息
Head detection is a challenging and widely applied object detection task. Although previous CNN-based head detectors have made good progress, the inherent locality of CNN restricts the extraction of global contextual ...
详细信息
ISBN:
(纸本)9789819788576;9789819788583
Head detection is a challenging and widely applied object detection task. Although previous CNN-based head detectors have made good progress, the inherent locality of CNN restricts the extraction of global contextual information, which leads to low precision and recall rates in head detection. In this article, we propose an end-to-end high-quality head detector based on Transformer, which effectively models the contextual relationships between heads, other objects and the background. To extract and generate discriminative feature maps suitable for detecting small head targets, we incorporate specific CNN-based auxiliary detector heads for joint training. the GIoU-aware classification loss function is improved to generate bounding boxes with high localization quality and high classification confidence, and a feature fusion module is introduced to enhance the feature representation capabilities of the model. We conduct experiments on COCO 2017 dataset and Brainwash head dataset, and the results demonstrate that our method outperforms in both COCO generalized object detection and Brainwash head detection tasks compared to previous CNN-based detectors as well as other current mainstream Transformer-based object detection models.
暂无评论