Initially the Blockchain technology was used to implement different digital currencies. Nowadays blockchain is adopted as a common technology to be used in various business applications. Blockchain based solutions hav...
详细信息
Client selection (a.k.a., client sampling) is one of the hot topics in Federated Learning (FL). In each communication round, selecting some clients to participate in aggregation can effectively reduce the communicatio...
详细信息
ISBN:
(纸本)9798350302936
Client selection (a.k.a., client sampling) is one of the hot topics in Federated Learning (FL). In each communication round, selecting some clients to participate in aggregation can effectively reduce the communication overhead caused by exchanging model parameters. However, due to statistical heterogeneity in FL, selecting clients randomly may affect the performance of aggregated global models. existing approaches regarding client selection firstly cluster clients and then sample(select) some representative clients from each cluster. However, these clusteringbased approaches may be either time-intensive or high complexity. To address these issues, In this paper, we introduce Bi-level Sampling, a clustering-based approach for client selection. After multinomial distribution sampling, Bi-level Sampling clusters clients based on weighted per-label mean class scores and then selects participating clients for federated learning in each round. Bi-level Sampling can lead to better client representativity and the reduced variance of the client's stochastic aggregation weights in FL. Our approach can be integrated into typical FL frameworks. Experimental results show that, compared with state-of-the-art approaches, our approach demonstrates significantly more stable and accurate convergence behavior-getting higher test accuracy and less training time, especially in highly Non-IID settings.
ZooKeeper Atomic Broadcast (Zab) is a high-performance atomic broadcast protocol, which is a key component of Apache ZooKeeper. By ensuring strong consistency and fault tolerance, the Zab protocol plays a crucial role...
详细信息
As the Function-as-a-Service (FaaS) paradigm enjoys growing popularity within Cloud-based systems, there is increasing interest in moving serverless functions towards the Edge, to better support geo-distributed and pe...
详细信息
ISBN:
(纸本)9781665453783
As the Function-as-a-Service (FaaS) paradigm enjoys growing popularity within Cloud-based systems, there is increasing interest in moving serverless functions towards the Edge, to better support geo-distributed and pervasive applications. However, enjoying both the reduced latency of Edge and the scalability of FaaS requires new architectures and implementations to cope with typical Edge challenges (e.g., nodes with limited computational capacity). While first solutions have been proposed for Edge-based FaaS, including light function sandboxing techniques, we lack a platform with the ability to span both Edge and Cloud and adaptively exploit both. In this paper, we present Serverledge, a FaaS platform designed for the Edge-to-Cloud continuum. Serverledge adopts a decentralized architecture, where function invocation requests can be fully served within Edge nodes. To cope with load peaks, Serverledge also supports vertical (i.e., from Edge to Cloud) and horizontal (i.e., among Edge nodes) computation offloading. Our evaluation shows that Serverledge outperforms Apache OpenWhisk in an Edge-like scenario and has competitive performance with state-of-the-art frameworks optimized for the Edge, with the advantage of built-in support for vertical and horizontal offloading.
This study introduces the thyroid nodule segmentation grid search based local patch learning (GS-LPL) network as an effective IoT solution for real-time, precise thyroid cancer segmentation. Utilizing the Turing PI an...
详细信息
The exponential growth of communication technologies has broadened the cloud computing ecosystem horizon to meet main communication needs. However, in-parallel upsurge in online attacks has alarmed industries to ensur...
详细信息
computing systems always face a "resource allocation dilemma" that shows the great difficulties in trading off resource efficiency for tail latency, due to the internal uncertainty of cluster status and exec...
详细信息
ISBN:
(纸本)9781665473156
computing systems always face a "resource allocation dilemma" that shows the great difficulties in trading off resource efficiency for tail latency, due to the internal uncertainty of cluster status and execution behavior. Inspired by the imaginary "Maxwell's demon" in thermodynamics who can reduce the uncertainty through a per-gas molecule-level control policy, we consider the "one-to-one mapping" feature of serverless computing and build a novel resource allocator, named Maxwell, that can achieve low tail latency and high resource efficiency in serverless simultaneously. Like the "Maxwell's demon", Maxwell is able to optimize the resource allocation for every request. It observes the state of each request and makes decisions about the minimum resource allocation through a reinforcement learning predictor. As the per-request-grained control incurs significant overhead, we further design a pipeline for avoiding the accumulated effect on a workflow. Experimental results show that Maxwell not only saves up to 31% CPU resources but also reduces the standard deviation of latency by 1.9x. Its time overhead is negligible and the resource overhead is also limited when the query per second <= 500.
Accurately calculating the electronic structure of strongly correlated chemical systems necessitates a detailed description of both static and dynamical electron correlations, posing a significant challenge in ab init...
详细信息
ISBN:
(纸本)9798400717932
Accurately calculating the electronic structure of strongly correlated chemical systems necessitates a detailed description of both static and dynamical electron correlations, posing a significant challenge in ab initio quantum chemistry. Although the high memory and computational demands generally limit these calculations to relatively modest systems, the advanced computational capabilities of modern GPUs provide new avenues to expand these limits. However, complex control flows inherent to computation notably impair performance on GPUs. Furthermore, the significant disparity in computational load across different branches leads to load imbalance, challenging the large-scale simulations. In this work, we introduce PASCI, a heterogeneous parallelcomputing framework designed to quickly and efficiently parallelize the computation of dynamical correlation energy based on determinants. The features of the PASCI framework include (1) a divergence-avoiding GPU algorithm, (2) a three-level load-mapping strategy to ensure load balance across processors, GPU warps, and GPU threads, (3) performance models for memory footprint and computation, and (4) seamless integration with existing quantum chemistry software. Experimental results using an NVIDIA A100 GPU demonstrate that our new GPU algorithm achieves an average 6.6x (up to 13.8x) peak performance increase and 2-4 orders of magnitude speedup in practical usage compared to its original GPU implementation. Moreover, PASCI exhibits excellent scalability, highlighting its potential as a powerful high-performance computing tool in complex quantum chemistry research.
The development of the Internet of Things has increased the demand for real-time communication and computing in user equipments (UEs), which is a challenge for the limited battery capacity and computing power of UEs. ...
详细信息
In healthcare, safeguarding patient data integrity is paramount, necessitating robust deep learning models. This research focuses on bolstering the cyber security defenses of these models, particularly against adversa...
详细信息
ISBN:
(纸本)9798350372816
In healthcare, safeguarding patient data integrity is paramount, necessitating robust deep learning models. This research focuses on bolstering the cyber security defenses of these models, particularly against adversarial attacks, through the application of the Localized Adversarial Feature Attack (LAFEAT) technique. Leveraging Graphics Processing Units (GPU) and the TensorFlow framework, the study explores algorithmic enhancements like the Fast Gradient Sign Method (FGSM) and Model-Agnostic Meta-Learning (MAML). These techniques perturb input data gradients or adapt model parameters, enhancing resilience against adversarial perturbations. Additionally, Particle Swarm Optimization (PSO), a metaheuristic optimization technique, is examined to fortify defense mechanisms further. parallel processing techniques, utilizing parallel GPU or distributed clusters, are implemented to expedite the optimization process, reducing computational burdens and enhancing scalability. Despite advancements, gaps remain in understanding how these models can be optimized for real-world healthcare applications, particularly in terms of balancing computational efficiency and robustness. The research demonstrates tangible benefits, with a significant 10% increase in success rates against adversarial attacks compared to baseline methods. Moreover, parallel processing implementation results in a 30% reduction in optimization time, improving the efficiency of cyber security defenses for deep learning models in healthcare. These numerical enhancements underscore the research's value in fortifying healthcare systems against adversarial threats, ensuring the security and integrity of patient data while addressing existing gaps in the optimization of adversarial defenses in healthcare applications. The strengths of this research lie in its robust adversarial attack optimization techniques, integration of parallel processing for efficiency, and application to critical healthcare tasks. Comprehensive
暂无评论