In 2021, the country proposed the strategic goal of "carbon peaking and carbon neutrality". To respond actively to this national strategy and carry out its implementation, it was propo...
ISBN: (print) 9783030692438
The proceedings contain 35 papers. The special focus in this conference is on Parallel and Distributed Computing, Applications, and Technologies. The topics include: On the Non-ergodic Convergence Rate of the Directed Nonsmooth Composite Optimization; 6D Pose Estimation Based on the Adaptive Weight of RGB-D Feature; Blockchain-Based Secure Outsourcing of Fully Homomorphic Encryption Using Hidden Ideal Lattice; Multiple Projections Learning for Dimensional Reduction; Preventing DDoS Attacks on Bitcoin Memory Pool by the Dynamic Fee Threshold Mechanism; The Compiler of DFC: A Source Code Converter that Transform the Dataflow Code to the Multi-threaded C Code; Online Learning-Based Co-task Dispatching with Function Configuration in Edge Computing; System-Level FPGA Routing for Logic Verification with Time-Division Multiplexing; Protein Interresidue Contact Prediction Based on Deep Learning and Massive Features from Multi-sequence Alignment; Heterogeneous Software Effort Estimation via Cascaded Adversarial Auto-Encoder; See Fine Color from the Rough Black-and-White; Data Aggregation Aware Routing for Distributed Training; A New Integer Programming Model for the File Transfer Scheduling Problem; Approximation Algorithms for the General Cluster Routing Problem; Maximizing Group Coverage in Social Networks; LightLayers: Parameter Efficient Dense and Convolutional Layers for Image Classification; The Hybrid Navigation Method in Face of Dynamic Obstacles; A Relaxed Balanced Lock-Free Binary Search Tree; A Dynamic Parameter Tuning Method for High Performance SpMM; Data Caching Based Transfer Optimization in Large Scale Networks; A Novel Distributed Reinforcement Learning Method for Classical Chinese Poetry Generation; Second-Order Convolutional Neural Network Based on Cholesky Compression Strategy; Submodular Maximization with Bounded Marginal Values; The Prize-Collecting k-Steiner Tree Problem.
ISBN: (print) 9798400701559
We investigate the timestamp allocation schemes used by classical concurrency control algorithms in database management systems (DBMS) on many-core machines. We then discuss a distributed logical timestamp allocation scheme that provides uniqueness and fairness and improves the performance of DBMS concurrency control algorithms on many-core machines. The proposed logical timestamp generator is free of bottlenecks such as reading the system clock counter, issuing atomic add operations, and synchronization. Finally, we evaluate an optimistic concurrency control algorithm built on the proposed scheme and on other allocation schemes. The results show that the algorithm performs better with the proposed timestamp allocation than with the other schemes, and that it shows better linear scalability under heavy loads.
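As a rough illustration of how logical timestamps can stay unique without a shared counter, the sketch below interleaves thread-local counters by thread ID. This is a generic, well-known construction chosen here as an assumption; the abstract does not describe the paper's actual scheme, and the names `LocalTimestampAllocator`, `thread_id`, and `num_threads` are hypothetical.

```python
class LocalTimestampAllocator:
    """Per-thread logical timestamp generator (illustrative sketch only)."""

    def __init__(self, thread_id: int, num_threads: int) -> None:
        if not 0 <= thread_id < num_threads:
            raise ValueError("thread_id must be in [0, num_threads)")
        self.thread_id = thread_id
        self.num_threads = num_threads
        self.local_counter = 0  # touched only by the owning thread

    def next_timestamp(self) -> int:
        # Thread t issues t, t + n, t + 2n, ... for n threads.
        # Values from different threads differ modulo n, so they never
        # collide, and no clock read, atomic add, or lock is needed.
        ts = self.local_counter * self.num_threads + self.thread_id
        self.local_counter += 1
        return ts


# Usage: four hypothetical worker threads, three timestamps each.
allocators = [LocalTimestampAllocator(t, 4) for t in range(4)]
issued = [a.next_timestamp() for _ in range(3) for a in allocators]
```

A design note on this sketch: uniqueness comes for free, but a thread that issues timestamps faster than its peers drifts ahead of them, which is one reason a real scheme would need an additional fairness mechanism.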
Electric Vehicles (EVs) are stepping into the spotlight as a sustainable alternative to gasoline vehicles. The security of the EV charging infrastructure is increasingly critical, necessitating the deployment a...
The edge computing architecture exposes data transmission and storage to unauthorized access, compromising data integrity. Users typically employ multiple data backups to ensure data reliability and availability, enha...
ISBN: (print) 9798350349740; 9798350349757
Heterogeneous IoT architectures are evolving rapidly, and traditional IoT architectures face various challenges, including the execution time of real-time IoT applications. Parallel computing programming techniques can enhance the performance and efficiency of distributed systems and multicore processors as well as IoT systems. However, parallel computing presents certain difficulties and constraints, including synchronization, communication, security concerns, and load balancing. In this regard, this paper presents a novel IoT workload balancing model for heterogeneous IoT architectures. The model is intended to reduce the execution time of large systems by redistributing part of their functions to other participating IoT nodes. An experiment was conducted to measure the actual load on each IoT node and to rebalance the load using the proposed model. The results were encouraging: the execution time was reduced by about one third on two cores.
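To make the idea of redistributing functions across nodes concrete, here is a minimal sketch of a greedy rebalancing loop. It is an assumption for illustration, not the model from the paper: the function `rebalance`, the node names, and the task costs are all hypothetical, and task costs are assumed to be positive execution-time estimates.

```python
def rebalance(loads: dict[str, list[float]]) -> dict[str, list[float]]:
    """Greedily move tasks from the busiest node to the idlest node."""
    nodes = {name: list(tasks) for name, tasks in loads.items()}
    while True:
        busiest = max(nodes, key=lambda n: sum(nodes[n]))
        idlest = min(nodes, key=lambda n: sum(nodes[n]))
        gap = sum(nodes[busiest]) - sum(nodes[idlest])
        # A task is worth moving only if it is smaller than the gap;
        # otherwise the receiving node would end up at least as loaded
        # as the sender was before the move.
        movable = [t for t in nodes[busiest] if 0 < t < gap]
        if not movable:
            return nodes
        task = max(movable)
        nodes[busiest].remove(task)
        nodes[idlest].append(task)


# Usage: node "a" holds three functions, node "b" holds one.
balanced = rebalance({"a": [4.0, 3.0, 2.0], "b": [1.0]})
```

In this example the total load per node drops from 9 and 1 to 5 and 5. Each move strictly reduces the sum of squared node loads, so the loop terminates.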
With the development of new technologies such as big data, cloud computing, Internet of Things, mobile Internet, and artificial intelligence, the connotation of integrated energy services continues to expand. Integrat...
ISBN: (print) 9798350364613; 9798350364606
Preconditioned iterative methods based on the Krylov subspace technique are widely employed in scientific and technical computing. On large-scale parallel computing systems, the communication overhead tends to increase with the number of nodes, making its reduction a crucial challenge. In parallel finite element methods (FEM) and finite volume methods (FVM), overlapping of halo communication and computation (CC-Overlapping) is commonly employed, often in conjunction with the dynamic loop scheduling feature of OpenMP. This approach has been applied primarily to sparse matrix-vector products (SpMV) and explicit solvers. Previous studies by the author proposed reordering techniques for applying CC-Overlapping to processes involving global data dependencies, such as the Conjugate Gradient method preconditioned by Incomplete Cholesky Factorization (ICCG). Implementations on massively parallel supercomputers demonstrated high parallel performance, but the application of CC-Overlapping was limited to SpMV. In the present work, the author proposes a method for applying CC-Overlapping to the forward and backward substitutions of the IC(0) smoother of the parallel Conjugate Gradient method preconditioned by Multigrid (MGCG). Using up to 4,096 nodes of Wisteria/BDEC-01 (Odyssey) with A64FX, a performance improvement of more than 40% was achieved compared to the original implementation, while the improvement was more than 20% on 1,024 nodes of the Oakbridge-CX system with Intel Xeon CPUs.
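The basic CC-Overlapping pattern for an SpMV-like kernel can be sketched as follows: start the halo exchange, compute the interior rows that need no halo data while it is in flight, then finish the boundary rows. This is a toy illustration under stated assumptions, not the author's implementation: a background thread stands in for a nonblocking MPI exchange, the matrix is a 1D stencil with rows (-1, 2, -1), and `fetch_halo` and `stencil_with_overlap` are hypothetical names.

```python
from concurrent.futures import ThreadPoolExecutor


def fetch_halo(left_value: float, right_value: float) -> tuple[float, float]:
    # Stand-in for receiving one halo value from each neighbor process.
    return left_value, right_value


def stencil_with_overlap(u: list[float], left_value: float,
                         right_value: float) -> list[float]:
    """Apply the (-1, 2, -1) stencil to a local block of at least 2 points."""
    n = len(u)
    if n < 2:
        raise ValueError("local block must hold at least two points")
    y = [0.0] * n
    with ThreadPoolExecutor(max_workers=1) as pool:
        # 1. Start the halo "communication" in the background.
        halo = pool.submit(fetch_halo, left_value, right_value)
        # 2. Interior rows depend only on local data, so they can be
        #    computed while the exchange is in flight.
        for i in range(1, n - 1):
            y[i] = 2.0 * u[i] - u[i - 1] - u[i + 1]
        # 3. Wait for the halo values before touching boundary rows.
        left, right = halo.result()
    y[0] = 2.0 * u[0] - left - u[1]
    y[n - 1] = 2.0 * u[n - 1] - u[n - 2] - right
    return y


# Usage: a linear profile, for which the stencil result is zero everywhere.
result = stencil_with_overlap([1.0, 2.0, 3.0, 4.0], 0.0, 5.0)
```

Forward and backward substitutions are harder to overlap than this kernel because each row depends on previously computed rows, which is why the abstract mentions reordering techniques.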
ISBN: (print) 9783030856649
The proceedings contain 38 papers. The special focus in this conference is on Parallel and Distributed Computing. The topics include: An MPI-based Algorithm for Mapping Complex Networks onto Hierarchical Architectures; Pipelined Model Parallelism: Complexity Results and Memory Considerations; Efficient and Systematic Partitioning of Large and Deep Neural Networks for Parallelization; A GPU Architecture Aware Fine-Grain Pruning Technique for Deep Neural Networks; Towards Flexible and Compiler-Friendly Layer Fusion for CNNs on Multicore CPUs; Smart Distributed DataSets for Stream Processing; Colony: Parallel Functions as a Service on the Cloud-Edge Continuum; Horizontal Scaling in Cloud Using Contextual Bandits; Geo-distribute Cloud Applications at the Edge; Automatic Low-Overhead Load-Imbalance Detection in MPI Applications; A Fault Tolerant and Deadline Constrained Sequence Alignment Application on Cloud-Based Spot GPU Instances; Sustaining Performance While Reducing Energy Consumption: A Control Theory Approach; Algorithm Design for Tensor Units; A Scalable Approximation Algorithm for Weighted Longest Common Subsequence; TSLQueue: An Efficient Lock-Free Design for Priority Queues; G-Morph: Induced Subgraph Isomorphism Search of Labeled Graphs on a GPU; Accelerating Graph Applications Using Phased Transactional Memory; Efficient GPU Computation Using Task Graph Parallelism; Towards High Performance Resilience Using Performance Portable Abstractions; Enhancing Load-Balancing of MPI Applications with Workshare; Trace-Based Workload Generation and Execution; Particle-In-Cell Simulation Using Asynchronous Tasking; Exploiting Co-execution with OneAPI: Heterogeneity from a Modern Perspective; Designing a 3D Parallel Memory-Aware Lattice Boltzmann Algorithm on Manycore Systems; Fault-Tolerant LU Factorization Is Low Cost; Mixed Precision Incomplete and Factorized Sparse Approximate Inverse Preconditioning on GPUs; GPU-Accelerated Mahalanobis-Average Hierarchical Clustering Analysis.
Genetic algorithms have been widely used in intelligent test paper generation systems. However, traditional genetic algorithms cannot ensure that the difficulty of test questions is normally distributed, and are prone...