The proceedings contain 55 papers. The special focus in this conference is on parallel and distributedcomputing: Applications and Technologies. The topics include: A Meta-reinforcement Learning Framework for Ada...
ISBN:
(纸本)9789819642069
The proceedings contain 55 papers. The special focus in this conference is on parallel and distributedcomputing: Applications and Technologies. The topics include: A Meta-reinforcement Learning Framework for Adaptive Quadrotor UAV Attitude Control;Securing Energy Transactions for Electric Vehicles: The Blockchain Approach and Encrypted NFTs;optimizing Task Allocation in Heterogeneous Agent Manufacturing Systems;MPG: Multi-modal Personal Health Graph for Alzheimer’s Disease Diagnosis;SMAC: A Secure Multi-authority Access Control Scheme with Attribute Unification for Fog Enabled IoT in E-Health;Convolutional Neural Networks Parameter Training for SCM Algorithm Based on Hausdorff Difference;handling Non-stationarity with Distribution Shifts and Data Dependency in Time Series Forecasting;the Two-Stage Stochastic Facility Location Game;regularized Non-monotone γ-weakly Submodular Maximization;fed-MoE: Efficient Federated Learning for Mixture-of-Experts Models via Empirical Pruning;WaitIO-Hybrid: Communication for Coupling MPI Programs Among Heterogeneous Systems;the Material Delivery Route Prediction Method Based on Deep Reinforcement Learning;privacy-Preserving in Medical Image Analysis: A Review of Methods and Applications;research on Task Migration Problem Based on Link Uncertainty in Adversarial Scenarios;optimizing Production Component Scheduling in Multivariate Industrial Networks with Dynamic Changes in Production Costs;multi-agent Collaboration for Time-Sensitive Tasks in Multiple Networked Adversarial Scenarios;containerized Data-Flow Processing for Scalable Real-Time Analytics on Edge Devices;fast Approximation for Scheduling Malleable Jobs on parallel Batch Machines with Rejection;real-Time and In-Situ Temperature Profiling for Determining Detonation of White Dwarf Mergers;accparser: A Standalone OpenACC Parser and Its Usage on Mapping OpenACC to OpenMP Directives;Out-of-Memory GPU Sorting Using Asynchronous CUDA Streams;long-Term and Periodicity-Aware Spatio
The inherent computational complexity of validating and verifying concurrent systems implies a need to be able to exploit parallel and distributedcomputing architectures. We present a new distributed algorithm for st...
详细信息
The inherent computational complexity of validating and verifying concurrent systems implies a need to be able to exploit parallel and distributedcomputing architectures. We present a new distributed algorithm for state space exploration of concurrent systems on computing clusters. Our algorithm relies on Remote Direct Memory Access (RDMA) for low-latency transfer of states between computing elements and on state reconstruction trees for compact representation of states on the computing elements themselves. For the distribution of states between computing elements, we propose a concept of state stealing. We have implemented our proposed algorithm using the OpenSHMEM API for RDMA and experimentally evaluated it on the grid'5000 testbed with a set of benchmark models. The experimental results show that our algorithm scales well with the number of available computing elements and that our state stealing mechanism generally provides a balanced workload distribution.
This special issue is dedicated to examining the rapidly evolving fields of artificial intelligence, mathematical modeling, and optimization, with particular emphasis on their growing importance in computational scien...
详细信息
This special issue is dedicated to examining the rapidly evolving fields of artificial intelligence, mathematical modeling, and optimization, with particular emphasis on their growing importance in computational science. It features the most notable papers from the "Mathematical Modeling and Problem Solving" workshop at PDPTA'24, the 30th internationalconference on parallel and distributed Processing Techniques and Applications. The issue showcases pioneering research in areas such as natural language processing, system optimization, and high-performance computing. The nine selected studies include novel AI-driven methods for chemical compound generation, historical text recognition, and music recommendation, along with advancements in hardware optimization through reconfigurable accelerators and vector register sharing. Additionally, evolutionary and hyper-heuristic algorithms are explored for sophisticated problem-solving in engineering design, and innovative techniques are introduced for high-speed numerical methods in large-scale systems. Collectively, these contributions demonstrate the significance of AI, supercomputing, and advanced algorithms in driving the next generation of scientific discovery.
Simulations of reacting multiphase flows tend to display an inhomogeneously distributed computational intensity over the spatial and temporal domains. The time-to-solution of chemical reaction rates can span multiple ...
详细信息
Simulations of reacting multiphase flows tend to display an inhomogeneously distributed computational intensity over the spatial and temporal domains. The time-to-solution of chemical reaction rates can span multiple orders of magnitude due to the emergence of combustible kernels and thin turbulent reaction zones. Similarly, the time to solve the equation of state (EoS) for non-ideal fluid mixtures deviates substantially between the grid cells. These effects result in a performance profile that is unbalanced and rapidly changing for transient simulations, and therefore beyond the capabilities of traditional (quasi-)static mesh partitioning methods. We analyse this loss of parallel efficiency for large-eddy simulations of the ECN Spray-A benchmark with the multi-physics solver INCA and propose to mitigate the problem by introducing two independent repartitioning stages in addition to the classic domain decomposition for fluid transport: one for the EoS and one for chemical reactions. We explore various scalable repartitioning strategies in this context and observe that rebalancing computational load yields a significant speedup that is robust for various mesh resolutions and process numbers. The dynamic multistage load-balancing thus effectively removes obstacles towards good parallel scaling of INCA and similar solvers for reacting and/or multiphase flows.
In the dispersion problem, a group of k <= n mobile robots, initially placed on the vertices of an anonymous graph G with n vertices, must redistribute themselves so that each vertex hosts no more than one robot. W...
详细信息
ISBN:
(纸本)9783031814037;9783031814044
In the dispersion problem, a group of k <= n mobile robots, initially placed on the vertices of an anonymous graph G with n vertices, must redistribute themselves so that each vertex hosts no more than one robot. We address this challenge on an anonymous triangular grid graph, where each vertex can connect to up to six adjacent vertices. We propose a distributed deterministic algorithm that achieves dispersion on an unoriented triangular grid graph in O(root n) time, where n is the number of vertices. Each robot requires O(log n) bits of memory. The time complexity of our algorithm and the memory usage per robot are optimal. This work builds on previous studies by Kshemkalyani et al. [WALCOM 2020 [17]] and Banerjee et al. [ALGOWIN 2024 [3]]. Importantly, our algorithm terminates without requiring prior knowledge of n and resolves a question posed by Banerjee et al. [ALGOWIN 2024 [3]].
Electric power computing resources are distributed in the respective regions of provincial data centers and transmission and transformation equipment, and the scheduling efficiency of computing resources is limited by...
详细信息
Exploration of a graph network means each node of the graph has to be visited by at least one robot. The problem of exploration has been studied in various networks like rings, trees, finite rectangular grids, etc. If...
详细信息
The proceedings contain 24 papers. The special focus in this conference is on parallel and distributed Processing Techniques. The topics include: parallel N-Body Performance Comparison: Julia, Rust, and More;REFT...
ISBN:
(纸本)9783031856372
The proceedings contain 24 papers. The special focus in this conference is on parallel and distributed Processing Techniques. The topics include: parallel N-Body Performance Comparison: Julia, Rust, and More;REFT: Resource-Efficient Federated Training Framework for Heterogeneous and Resource-Constrained Environments;An Efficient Data Provenance Collection Framework for HPC I/O Workloads;using Minicasts for Efficient Asynchronous Causal Unicast and Byzantine Tolerance;a Comparative Study of Two Matrix Multiplication Algorithms Under Current Hardware Architectures;Is Manual Code Optimization Still Required to Mitigate GPU Thread Divergence? Applying a Flattening Technique to Observe Performance;towards Automatic, Predictable and High-Performance parallel Code Generation;Attack Graph Generation on HPC Clusters;analyzing the Influence of File Formats on I/O Patterns in Deep Learning;inference of Cell–Cell Interactions Through Spatial Transcriptomics Data Using Graph Convolutional Neural Networks;natural Product-Like Compound Generation with Chemical Language Models;improved Early–Modern Japanese Printed Character Recognition Rate with Generated Characters;Improved Method for Similar Music Recommendation Using Spotify API;Reconfigurable Virtual Accelerator (ReVA) for Large-Scale Acceleration Circuits;Building Simulation Environment of Reconfigurable Virtual Accelerator (ReVA);vector Register Sharing Mechanism for High Performance Hardware Acceleration;Efficient Compute Resource Sharing of RISC-V Packed-SIMD Using Simultaneous Multi-threading;introducing Competitive Mechanism to Differential Evolution for Numerical Optimization;hyper-heuristic Differential Evolution with Novel Boundary Repair for Numerical Optimization;jump Like a Frog: Optimization of Renewable Energy Prediction in Smart Gird Based on Ultra Long Term Network;vision Transformer-Based Meta Loss Landscape Exploration with Actor-Critic Method;Fast Computation Method for Stopping Condition of Range Restricted
In the evolving landscape of SAT solving, leveraging parallel computation has become increasingly significant. The portfolio strategy, combined with clause sharing, has emerged as the leading approach for both local a...
详细信息
The proceedings contain 83 papers. The topics discussed include: parallel execution strategies for cellular automata on shared memory architectures;parallel median filter with arbitrary window size and image depth;hig...
ISBN:
(纸本)9798331524937
The proceedings contain 83 papers. The topics discussed include: parallel execution strategies for cellular automata on shared memory architectures;parallel median filter with arbitrary window size and image depth;high performance stingray: fast spectral timing for all;optimizing transitive closure computation for high performance computing and security;a preliminary study on performance modeling at scale for geophysical applications;a multi-level parallel algorithm for detection of single scatterers in SAR tomography;order, unite, and conquer: a group formulation for multi-armed bandits in microservice provisioning;a performance analysis of VM-based trusted execution environments for confidential federated learning;improving cloud energy efficiency through machine learning models;and adaptive AI-based decentralized resource management in the cloud-edge continuum.
暂无评论