The proceedings contain 57 papers. The special focus in this conference is on parallelarchitectures, algorithms, and programming . The topics include: Ford motor side-view recognition system based on wavelet entropy ...
ISBN:
(纸本)9789811064418
The proceedings contain 57 papers. The special focus in this conference is on parallelarchitectures, algorithms, and programming . The topics include: Ford motor side-view recognition system based on wavelet entropy and back propagation neural network and levenberg-marquardt algorithm;intrusion detection based on self-adaptive differential evolution extreme learning machine with gaussian kernel;prediction for passenger flow at the airport based on different models;election based pose estimation of moving objects;a novel topology reconfiguration backtracking algorithm for 2d REmesh networks-on-chip;user behaviour authentication model based on stochastic petri net in cloud environment;performance prediction of spark based on the multiple linear regression analysis;exploration of heuristic-based feature selection on classification problems;anti-similarity group shilling attacks;the study of the seabed side-scan acoustic images recognition using bp neural network;node localization of wireless sensor network based on secondary correction error;optimizations of the whole function vectorization based on SIMD characteristics;a stacked denoising autoencoders based collaborative approach for recommender system;research on adaptive canny algorithm based on dual-domain filtering;a dynamic individual recommendation method based on reinforcement learning;research on the pre-distribution model based on seesaw model;an efficient filtration method based on variable-length seeds for sequence alignment;an optimized fusion method for double-wearable-wireless-band platform on cloud-health application;research on concept drift detection for decision tree algorithm in the stream of big data;review of various strategies for gateway discovery mechanisms for integrating internet-MANET.
The proceedings contain 37 papers. The special focus in this conference is on parallelarchitectures, algorithms and programming. The topics include: Analysing and Forecasting Electricity Demand and Price Using Deep L...
ISBN:
(纸本)9789811600098
The proceedings contain 37 papers. The special focus in this conference is on parallelarchitectures, algorithms and programming. The topics include: Analysing and Forecasting Electricity Demand and Price Using Deep Learning Model During the COVID-19 Pandemic;cross-database Micro Expression Recognition Based on Apex Frame Optical Flow and Multi-head Self-attention;GPS Intelligent Solution of Aerial Image Target in State Grid EIA Survey;Encryption and Decryption in Conic Curves Cryptosystem Over Finite Field GF(2n) Using Tile Self-assembly;optimizing Embedding-Related Quantum Annealing Parameters for Reducing Hardware Bias;a Behavioural Network Traffic Novelty Detection for the Internet of Things Infrastructures;a Fast Algorithm for Image Segmentation Based on Global Cosine Fitting Energy Model;household Garbage Classification: A Transfer Learning Based Method and a Benchmark;lightweight Neural Network Based Garbage Image Classification Using a Deep Mutual Learning;on the Decycling Problem in a Torus;VBSSR: Variable Bitrate Encoded Video Streaming with Super-Resolution on HPC Education Platform;An Investigation on the Performance of Highly Congested Home WiFi Networks During the COVID-19 Pandemic;using Feed-Forward Network for Fast Arbitrary Style Transfer with Contextual Loss;enhancing Underwater Image Using Multi-scale Generative Adversarial Networks;Inferring Prerequisite Relationships Among Learning Resources for HPC Education;research on Bank Knowledge Transaction Coverage Model Based on Innovation Capacity Analysis;deep Deterministic Policy Gradient Based Resource Allocation in Internet of Vehicles;a Pufferfish Privacy Mechanism for the Trajectory Clustering Task;a Novel Attention Model of Deep Learning in Image Classification;FDRA: Fully Distributed Routing Architecture for Private Virtual Network in Public Cloud.
Stream processing is a computing paradigm enabling the continuous processing of unbounded data streams. Some classes of stream processing applications can greatly benefit from the parallel processing power and afforda...
详细信息
Stream processing is a computing paradigm enabling the continuous processing of unbounded data streams. Some classes of stream processing applications can greatly benefit from the parallel processing power and affordability offered by GPUs. However, efficient GPU utilization with stream processing applications often requires micro-batching techniques, i.e., the continuous processing of data batches to expose data parallelism opportunities and amortize host-device data transfer overheads. Micro-batching further introduces the challenge of finding suitable micro-batch sizes to maintain low-latency processing under highly dynamic workloads. The research field of self-adaptive software provides different techniques to address such a challenge. Our goal is to assess the performance of six self-adaptive algorithms in meeting latency requirements through micro-batch size adaptation. The algorithms are applied to a GPU-accelerated stream processing benchmark with a highly dynamic workload. Four of the six algorithms have already been evaluated using a smaller workload with the same application. We propose two new algorithms to address the shortcomings detected in the former four. The results demonstrate that a highly dynamic workload is challenging for the evaluated algorithms, as they could not meet the most strict latency requirements for more than 38.5% of the stream data items. Overall, all algorithms performed similarly in meeting the latency requirements. However, one of our proposed algorithms met the requirements for 4% more data items than the best of the previously studied algorithms, demonstrating more effectiveness in highly variable workloads. This effectiveness is particularly evident in segments of the workload with abrupt transitions between low- and high-latency regions, where our proposed algorithms met the requirements for 79% of the data items in those segments, compared to 33% for the best of the earlier algorithms.
This paper investigates the use of High-Level Synthesis (HLS) for designing parallel hardware architectures on FPGAs. HLS compilers, like the one used in Vitis HLS, extract the available parallelism so the HLS languag...
详细信息
This paper investigates the use of High-Level Synthesis (HLS) for designing parallel hardware architectures on FPGAs. HLS compilers, like the one used in Vitis HLS, extract the available parallelism so the HLS languages should be thought as inherently parallel and should be programmed with the target parallel architecture in mind. We discuss how HLS facilitated the development of FIPLib, an image processing library for FPGAs, leveraging the streaming model. This library comprises parallel kernels connected through streams to implement a streaming data-flow computation. Following an overview of the library's functionalities and its parallel implementation, we present the benefits of adopting this FPGA library, particularly in terms of speed and power consumption. We conduct a comparative analysis by implementing two image processing algorithms using both our FPGA library and the equivalent OpenCV CPU and GPU implementation. The results demonstrate that FPGAs programmed through FIPLib can significantly accelerate computations and/or reduce power consumption.
Speculative data-parallel algorithms for language recognition have been widely experimented for various types of finite-state automata (FA), deterministic (DFA) and nondeterministic (NFA), often derived from regular e...
详细信息
The proceedings contain 131 papers. The special focus in this conference is on algorithms and architectures for parallel Processing. The topics include: MARO: Enabling Full MPI Automatic Refactoring in DSL-Based ...
ISBN:
(纸本)9789819615247
The proceedings contain 131 papers. The special focus in this conference is on algorithms and architectures for parallel Processing. The topics include: MARO: Enabling Full MPI Automatic Refactoring in DSL-Based programming Framework;SSC: An SRAM-Based Silence Computing Design for On-chip Memory;TP-BFT: A Faster Asynchronous BFT Consensus with parallel Structure;LTP: A Lightweight On-Chip Temporary Prefetcher for Data-Dependent Memory Accesses;A Neural Network-Based PUF Protection Method Against Machine Learning Attack;Compression Format and Systolic Array Structure Co-design for Accelerating Sparse Matrix Multiplication in DNNs;multidimensional Intrinsic Identity Construction and Dynamic Seamless Authentication Schemes in IoT Environments;invisible Backdoor Attack with Image Contours Triggers;finestra: Multi-aggregator Swarm Learning for Gradient Leakage Defense;DIsFU: Protecting Innocent Clients in Federated Unlearning;multiple-Round Aggregation of Abstract Semantics for Secure Heterogeneous Federated Learning;dynamic Privacy Protection with Large Language Model in Social Networks;a Dynamic Symmetric Searchable Encryption Scheme for Rapid Conjunctive Queries;a Data Watermark Scheme Base on Data Converted Bitmap for Data Trading;distributed Incentive Algorithm for Fine-Grained Offloading in Vehicular Ad Hoc Networks;mitigating Over-Unlearning in Machine Unlearning with Synthetic Data Augmentation;AW-YOLOv9: Adverse Weather Conditions Adaptation for UAV Detection;efficient and Privacy-Preserving Ranking-Based Federated Learning;on-Chain Dynamic Policy Evaluation for Decentralized Access Control;DPG-FairFL: A Dual-Phase GAN-Based Defense Framework Against Image-Based Fairness Data Poisoning Attacks in Federated Learning.
The proceedings contain 131 papers. The special focus in this conference is on algorithms and architectures for parallel Processing. The topics include: MARO: Enabling Full MPI Automatic Refactoring in DSL-Based ...
ISBN:
(纸本)9789819615278
The proceedings contain 131 papers. The special focus in this conference is on algorithms and architectures for parallel Processing. The topics include: MARO: Enabling Full MPI Automatic Refactoring in DSL-Based programming Framework;SSC: An SRAM-Based Silence Computing Design for On-chip Memory;TP-BFT: A Faster Asynchronous BFT Consensus with parallel Structure;LTP: A Lightweight On-Chip Temporary Prefetcher for Data-Dependent Memory Accesses;A Neural Network-Based PUF Protection Method Against Machine Learning Attack;Compression Format and Systolic Array Structure Co-design for Accelerating Sparse Matrix Multiplication in DNNs;multidimensional Intrinsic Identity Construction and Dynamic Seamless Authentication Schemes in IoT Environments;invisible Backdoor Attack with Image Contours Triggers;finestra: Multi-aggregator Swarm Learning for Gradient Leakage Defense;DIsFU: Protecting Innocent Clients in Federated Unlearning;multiple-Round Aggregation of Abstract Semantics for Secure Heterogeneous Federated Learning;dynamic Privacy Protection with Large Language Model in Social Networks;a Dynamic Symmetric Searchable Encryption Scheme for Rapid Conjunctive Queries;a Data Watermark Scheme Base on Data Converted Bitmap for Data Trading;distributed Incentive Algorithm for Fine-Grained Offloading in Vehicular Ad Hoc Networks;mitigating Over-Unlearning in Machine Unlearning with Synthetic Data Augmentation;AW-YOLOv9: Adverse Weather Conditions Adaptation for UAV Detection;efficient and Privacy-Preserving Ranking-Based Federated Learning;on-Chain Dynamic Policy Evaluation for Decentralized Access Control;DPG-FairFL: A Dual-Phase GAN-Based Defense Framework Against Image-Based Fairness Data Poisoning Attacks in Federated Learning.
The proceedings contain 131 papers. The special focus in this conference is on algorithms and architectures for parallel Processing. The topics include: MARO: Enabling Full MPI Automatic Refactoring in DSL-Based ...
ISBN:
(纸本)9789819615414
The proceedings contain 131 papers. The special focus in this conference is on algorithms and architectures for parallel Processing. The topics include: MARO: Enabling Full MPI Automatic Refactoring in DSL-Based programming Framework;SSC: An SRAM-Based Silence Computing Design for On-chip Memory;TP-BFT: A Faster Asynchronous BFT Consensus with parallel Structure;LTP: A Lightweight On-Chip Temporary Prefetcher for Data-Dependent Memory Accesses;A Neural Network-Based PUF Protection Method Against Machine Learning Attack;Compression Format and Systolic Array Structure Co-design for Accelerating Sparse Matrix Multiplication in DNNs;multidimensional Intrinsic Identity Construction and Dynamic Seamless Authentication Schemes in IoT Environments;invisible Backdoor Attack with Image Contours Triggers;finestra: Multi-aggregator Swarm Learning for Gradient Leakage Defense;DIsFU: Protecting Innocent Clients in Federated Unlearning;multiple-Round Aggregation of Abstract Semantics for Secure Heterogeneous Federated Learning;dynamic Privacy Protection with Large Language Model in Social Networks;a Dynamic Symmetric Searchable Encryption Scheme for Rapid Conjunctive Queries;a Data Watermark Scheme Base on Data Converted Bitmap for Data Trading;distributed Incentive Algorithm for Fine-Grained Offloading in Vehicular Ad Hoc Networks;mitigating Over-Unlearning in Machine Unlearning with Synthetic Data Augmentation;AW-YOLOv9: Adverse Weather Conditions Adaptation for UAV Detection;efficient and Privacy-Preserving Ranking-Based Federated Learning;on-Chain Dynamic Policy Evaluation for Decentralized Access Control;DPG-FairFL: A Dual-Phase GAN-Based Defense Framework Against Image-Based Fairness Data Poisoning Attacks in Federated Learning.
暂无评论