the proceedings contain 56 papers. the special focus in this conference is on Models, Algorithms, Energy Aspects of Computation, Scheduling for parallel Computing and Language-Based parallel Programming Models. the to...
ISBN:
(纸本)9783319321516
the proceedings contain 56 papers. the special focus in this conference is on Models, Algorithms, Energy Aspects of Computation, Scheduling for parallel Computing and Language-Based parallel Programming Models. the topics include: Virtualizing CUDA enabled GPGPUS on arm clusters;a distributed hash table for shared memory;mathematical approach to the performance evaluation of matrix multiply algorithm;a scalable numerical algorithm for solving tikhonov regularization problems;energy performance modeling with TIA and EML;considerations of computational efficiency in volunteer and cluster computing;parallel programs scheduling with architecturally supported regions;adaptive multi-level workflow scheduling with uncertain task estimates;divisible loads scheduling in hierarchical memory systems with time and energy constraints;extending Gustafson-barsis’s law for dual-architecture computing;free scheduling of tiles based on the transitive closure of dependence graphs;multi-threaded construction of neighbour lists for particle systems in openMP;high productivity and high performance;parallel ant brood graph partitioning in Julia;scalability model based on the concept of granularity;performance and power-aware modeling of MPI applications for cluster computing;running time prediction for web search queries;performance analysis of a parallel, multi-node pipeline for DNA sequencing;parallelising the computation of minimal absent words;modeling and simulations of edge-emitting broad-area semiconductor lasers and amplifiers;application of the parallel inmost platform to subsurface flow and transport modelling;genetic algorithm and exact diagonalization approach for molecular nanomagnets modelling and parallel Monte Carlo simulations for spin models with distributed lattice.
Images are often used as teaching resources in various ways to support teaching. Teachers may use different colors for the same image to teach the same concept. Color is a visual stimulus to the human eye, which may g...
详细信息
Can we tell which parallel algorithm is executing by looking at the performance of the algorithm? In this work, we design and demonstrate a study for parallel algorithm classification of parallel sorting algorithms. W...
详细信息
Computer systems that have been successfully deployed for dense regular workloads fall short of achieving scalability and efficiency when applied to irregular and dynamic graph applications. Conventional computing sys...
详细信息
the article explores character recognition using convolutional neural networks (CNNs) optimized withthe CUDA platform to enhance computational efficiency. It outlines the CNN architecture, methods for leveraging GPU-...
详细信息
ISBN:
(数字)9798331531836
ISBN:
(纸本)9798331531843
the article explores character recognition using convolutional neural networks (CNNs) optimized withthe CUDA platform to enhance computational efficiency. It outlines the CNN architecture, methods for leveraging GPU-based parallel data processing, and presents experimental results derived from the MNIST dataset. the study highlights that implementing CUDA drastically reduces processing time while maintaining a high level of predictive accuracy. the findings emphasize the potential of GPU acceleration in handling intensive computational tasks, making it a promising approach for real-time applications in image recognition and machine learning.
SYCL programming model does not guarantee performance portability across different architectures. However, the HPC community severely needs platform-independent performance portable applications more than ever. theref...
详细信息
Head detection is a challenging and widely applied object detection task. Although previous CNN-based head detectors have made good progress, the inherent locality of CNN restricts the extraction of global contextual ...
详细信息
ISBN:
(纸本)9789819788576;9789819788583
Head detection is a challenging and widely applied object detection task. Although previous CNN-based head detectors have made good progress, the inherent locality of CNN restricts the extraction of global contextual information, which leads to low precision and recall rates in head detection. In this article, we propose an end-to-end high-quality head detector based on Transformer, which effectively models the contextual relationships between heads, other objects and the background. To extract and generate discriminative feature maps suitable for detecting small head targets, we incorporate specific CNN-based auxiliary detector heads for joint training. the GIoU-aware classification loss function is improved to generate bounding boxes with high localization quality and high classification confidence, and a feature fusion module is introduced to enhance the feature representation capabilities of the model. We conduct experiments on COCO 2017 dataset and Brainwash head dataset, and the results demonstrate that our method outperforms in both COCO generalized object detection and Brainwash head detection tasks compared to previous CNN-based detectors as well as other current mainstream Transformer-based object detection models.
A model for simulating pedestrian recreational movement along a tourist trail is introduced. the model reduces the two-dimensional nature of bidirectional flow to one dimension while preserving the inter-agent interac...
详细信息
the progress of Natural Language processing (NLP), although fast in recent years, is not at the same pace for all languages. African languages in particular are still behind and lack automatic processing tools. Some o...
详细信息
暂无评论