Federated learning (FL) is a promising decentralized machine learning approach that enables multiple distributed clients to train a model jointly while keeping their data private. However, in real-world scenarios, the...
详细信息
Federated learning (FL) is a promising decentralized machine learning approach that enables multiple distributed clients to train a model jointly while keeping their data private. However, in real-world scenarios, the supervised training data stored in local clients inevitably suffer from imperfect annotations, resulting in subjective, inconsistent and biased labels. These noisy labels can harm the collaborative aggregation process of FL by inducing inconsistent decision boundaries. Unfortunately, few attempts have been made towards noise-tolerant federated learning, with most of them relying on the strategy of transmitting overhead messages to assist noisy labels detection and correction, which increases the communication burden as well as privacy risks. In this paper, we propose a simple yet effective method for noise-tolerant FL based on the well-established co-training framework. Our method leverages the inherent discrepancy in the learning ability of the local and global models in FL, which can be regarded as two complementary views. By iteratively exchanging samples with their high confident predictions, the two models “teach each other” to suppress the influence of noisy labels. The proposed scheme enjoys the benefit of overhead cost-free and can serve as a robust and efficient baseline for noise-tolerant federated learning. Experimental results demonstrate that our method outperforms existing approaches, highlighting the superiority of our method.
Learning the accurate dynamics of robotic systems directly from the trajectory data is currently a prominent research *** physics-enforced networks,exemplified by Hamiltonian neural networks and Lagrangian neural netw...
详细信息
Learning the accurate dynamics of robotic systems directly from the trajectory data is currently a prominent research *** physics-enforced networks,exemplified by Hamiltonian neural networks and Lagrangian neural networks,demonstrate proficiency in modeling ideal physical systems,but face limitations when applied to systems with uncertain non-conservative dynamics due to the inherent constraints of the conservation laws *** this paper,we present a novel augmented deep Lagrangian network,which seamlessly integrates a deep Lagrangian network with a standard deep *** fusion aims to effectively model uncertainties that surpass the limitations of conventional Lagrangian *** proposed network is applied to learn inverse dynamics model of two multi-degree manipulators including a 6-dof UR-5 robot and a 7-dof SARCOS manipulator under *** experimental results clearly demonstrate that our approach exhibits superior modeling precision and enhanced physical credibility.
Dear Editor,This letter presents a new transfer learning framework for the deep multi-agent reinforcement learning(DMARL) to reduce the convergence difficulty and training time when applying DMARL to a new scenario [1...
详细信息
Dear Editor,This letter presents a new transfer learning framework for the deep multi-agent reinforcement learning(DMARL) to reduce the convergence difficulty and training time when applying DMARL to a new scenario [1], [2].
Robots are increasingly being deployed in densely populated environments, such as homes, hotels, and office buildings, where they rely on explicit instructions from humans to perform tasks. However, complex tasks ofte...
详细信息
Robots are increasingly being deployed in densely populated environments, such as homes, hotels, and office buildings, where they rely on explicit instructions from humans to perform tasks. However, complex tasks often require multiple instructions and prolonged monitoring, which can be time-consuming and demanding for users. Despite this, there is limited research on enabling robots to autonomously generate tasks based on real-life scenarios. Advanced intelligence necessitates robots to autonomously observe and analyze their environment and then generate tasks autonomously to fulfill human requirements without explicit commands. To address this gap, we propose the autonomous generation of navigation tasks using natural language dialogues. Specifically, a robot autonomously generates tasks by analyzing dialogues involving multiple persons in a real office environment to facilitate the completion of item transportation between various *** propose the leveraging of a large language model(LLM) through chain-of-thought prompting to generate a navigation sequence for a robot from dialogues. We also construct a benchmark dataset consisting of 625 multiperson dialogues using the generation capability of LLMs. Evaluation results and real-world experiments in an office building demonstrate the effectiveness of the proposed method.
Software defect prediction plays a critical role in software development and quality assurance processes. Effective defect prediction enables testers to accurately prioritize testing efforts and enhance defect detecti...
详细信息
Software defect prediction plays a critical role in software development and quality assurance processes. Effective defect prediction enables testers to accurately prioritize testing efforts and enhance defect detection efficiency. Additionally, this technology provides developers with a means to quickly identify errors, thereby improving software robustness and overall quality. However, current research in software defect prediction often faces challenges, such as relying on a single data source or failing to adequately account for the characteristics of multiple coexisting data sources. This approach may overlook the differences and potential value of various data sources, affecting the accuracy and generalization performance of prediction results. To address this issue, this study proposes a multivariate heterogeneous hybrid deep learning algorithm for defect prediction (DP-MHHDL). Initially, Abstract Syntax Tree (AST), Code Dependency Network (CDN), and code static quality metrics are extracted from source code files and used as inputs to ensure data diversity. Subsequently, for the three types of heterogeneous data, the study employs a graph convolutional network optimization model based on adjacency and spatial topologies, a Convolutional Neural Network-Bidirectional Long Short-Term Memory (CNN-BiLSTM) hybrid neural network model, and a TabNet model to extract data features. These features are then concatenated and processed through a fully connected neural network for defect prediction. Finally, the proposed framework is evaluated using ten promise defect repository projects, and performance is assessed with three metrics: F1, Area under the curve (AUC), and Matthews correlation coefficient (MCC). The experimental results demonstrate that the proposed algorithm outperforms existing methods, offering a novel solution for software defect prediction.
Aiming at the problem of low precision and large cleaning error in multi-link similar data mining, a multi-link similar data mining cleaning method based on Bayesian algorithm is proposed in this paper. On the basis o...
详细信息
Tactile sensing plays a crucial role in enabling robots to safely interact with objects in dynamic environments [1].Given that potential physical contact can occur at any location during robot interaction, there is a ...
详细信息
Tactile sensing plays a crucial role in enabling robots to safely interact with objects in dynamic environments [1].Given that potential physical contact can occur at any location during robot interaction, there is a need for a tactile sensor that can be deployed extensively across the robot's body.
This paper focuses on the optimal output synchronization control problem of heterogeneous multiagent systems(HMASs) subject to nonidentical communication delays by a reinforcement learning *** with existing studies as...
详细信息
This paper focuses on the optimal output synchronization control problem of heterogeneous multiagent systems(HMASs) subject to nonidentical communication delays by a reinforcement learning *** with existing studies assuming that the precise model of the leader is globally or distributively accessible to all or some of the followers, the leader's precise dynamical model is entirely inaccessible to all the followers in this paper. A data-based learning algorithm is first proposed to reconstruct the leader's unknown system matrix online. A distributed predictor subject to communication delays is further devised to estimate the leader's state, where interaction delays are allowed to be nonidentical. Then, a learning-based local controller, together with a discounted performance function, is projected to reach the optimal output synchronization. Bellman equations and game algebraic Riccati equations are constructed to learn the optimal solution by developing a model-based reinforcement learning(RL) algorithm online without solving regulator equations, which is followed by a model-free off-policy RL algorithm to relax the requirement of all agents' dynamics faced by the model-based RL algorithm. The optimal tracking control of HMASs subject to unknown leader dynamics and communication delays is shown to be solvable under the proposed RL algorithms. Finally, the effectiveness of theoretical analysis is verified by numerical simulations.
This paper presents a refinement of a method that simulates flow- and pressure-regulating valves by replacing them with pipes and adjusting the resistances (diameters) of those pipes to meet the valve settings. The me...
详细信息
Semantic segmentation plays an important role in computer perception tasks. Integrating the rich details of RGB images with the illumination robustness of thermal infrared (TIR) images is a promising approach for achi...
详细信息
Semantic segmentation plays an important role in computer perception tasks. Integrating the rich details of RGB images with the illumination robustness of thermal infrared (TIR) images is a promising approach for achieving reliable semantic scene understanding. Current approaches for RGB-Thermal semantic segmentation often overlook the unique characteristics exhibited by each modality at different encoding layers and underutilize the complementary information between the two modalities during decoding. To acquire complementary cross-modality encoding and decoding features, we propose a multi-branch differential bidirectional fusion network known as MDBFNet. Firstly, it models the dependencies between the modality-specific characteristics and the different encoding layers, and designs a TIR-led detail enhancement module (TDE) and an RGB-led semantic enhancement module (RSE) to guide distinguishable fusion for different layer features. Secondly, a three-branch fusion decoder with three supervision (TFDS) is proposed to thoroughly explore the complementary decoding features between two modalities. Experiments on MFNet and PST900 datasets show that our method surpasses state-of-the-art methods by a clear margin. IEEE
暂无评论