检索结果-内蒙古大学图书馆

A Transfer Learning Framework for Deep Multi-Agent Reinforcement Learning

IEEE/CAA Journal of Automatica Sinica 2024年第11期11卷 2346-2348页

作者： Yi Liu Xiang Wu Yuming Bo Jiacun Wang Lifeng Ma the School of Automation Nanjing University of Science and Technology the Department of Computer Science and Software Engineering Monmouth University

Dear Editor,This letter presents a new transfer learning framework for the deep multi-agent reinforcement learning(DMARL) to reduce the convergence difficulty and training time when applying DMARL to a new scenario [1... 详细信息

关键词： Deep agent Framework

来源：评论

学校读者我要写书评

暂无评论

Dynamic Modeling of Robotic Manipulator via an Augmented Deep Lagrangian Network

引用

Tsinghua science and Technology 2024年第5期29卷 1604-1614页

作者： Shuangshuang Wu Zhiming Li Wenbai Chen Fuchun Sun School of Automation Beijing Information Science and Technology UniversityBeijing 100192China Department of Computer Science and Technology Tsinghua UniversityBeijing 100084China

Learning the accurate dynamics of robotic systems directly from the trajectory data is currently a prominent research *** physics-enforced networks,exemplified by Hamiltonian neural networks and Lagrangian neural networks,demonstrate proficiency in modeling ideal physical systems,but face limitations when applied to systems with uncertain non-conservative dynamics due to the inherent constraints of the conservation laws *** this paper,we present a novel augmented deep Lagrangian network,which seamlessly integrates a deep Lagrangian network with a standard deep *** fusion aims to effectively model uncertainties that surpass the limitations of conventional Lagrangian *** proposed network is applied to learn inverse dynamics model of two multi-degree manipulators including a 6-dof UR-5 robot and a 7-dof SARCOS manipulator under *** experimental results clearly demonstrate that our approach exhibits superior modeling precision and enhanced physical credibility.

关键词： deep Lagrangian network nonconservative dynamics multi-degree manipulator inverse dynamic modeling

来源：评论

学校读者我要写书评

暂无评论

Human as Points: Explicit Point-based 3D Human Reconstruction from Single-view RGB Images

引用

IEEE Transactions on Pattern Analysis and Machine Intelligence 2025年第7期47卷 5884-5900页

作者： Tang, Yingzhi Zhang, Qijian Liu, Yebin Hou, Junhui City University of Hong Kong Department of Computer Science Hong Kong Tsinghua University Department of Automation Beijing China

The latest trends in the research field of single-view human reconstruction are devoted to learning deep implicit functions constrained by explicit body shape priors. Despite the remarkable performance improvements compared with traditional processing pipelines, existing learning approaches still exhibit limitations in terms of flexibility, generalizability, robustness, and/or representation capability. To comprehensively address the above issues, in this paper, we investigate an explicit point-based human reconstruction framework named HaP, which utilizes point clouds as the intermediate representation of the target geometric structure. Technically, our approach features fully explicit point cloud estimation (exploiting depth and SMPL), manipulation (SMPL rectification), generation (built upon diffusion), and refinement (displacement learning and depth replacement) in the 3D geometric space, instead of an implicit learning process that can be ambiguous and less controllable. Extensive experiments demonstrate that our framework achieves quantitative performance improvements of 20% to 40% over current state-of-the-art methods, and better qualitative results. Our promising results may indicate a paradigm rollback to the fully-explicit and geometry-centric algorithm design. In addition, we newly contribute a real-scanned 3D human dataset featuring more intricate geometric details. We will make our code and data publicly available at https://***/yztang4/HaP. © 1979-2012 IEEE.

关键词： 3D reconstruction

来源：评论

学校读者我要写书评

暂无评论

Overhead-free Noise-tolerant Federated Learning: A New Baseline

引用

Machine Intelligence Research 2024年第3期21卷 526-537页

作者： Shiyi Lin Deming Zhai Feilong Zhang Junjun Jiang Xianming Liu Xiangyang Ji Department of Computer Science and Technology Harbin Institute of TechnologyHarbin150000China Department of Automation Tsinghua UniversityBeijing100084China

Federated learning (FL) is a promising decentralized machine learning approach that enables multiple distributed clients to train a model jointly while keeping their data private. However, in real-world scenarios, the supervised training data stored in local clients inevitably suffer from imperfect annotations, resulting in subjective, inconsistent and biased labels. These noisy labels can harm the collaborative aggregation process of FL by inducing inconsistent decision boundaries. Unfortunately, few attempts have been made towards noise-tolerant federated learning, with most of them relying on the strategy of transmitting overhead messages to assist noisy labels detection and correction, which increases the communication burden as well as privacy risks. In this paper, we propose a simple yet effective method for noise-tolerant FL based on the well-established co-training framework. Our method leverages the inherent discrepancy in the learning ability of the local and global models in FL, which can be regarded as two complementary views. By iteratively exchanging samples with their high confident predictions, the two models “teach each other” to suppress the influence of noisy labels. The proposed scheme enjoys the benefit of overhead cost-free and can serve as a robust and efficient baseline for noise-tolerant federated learning. Experimental results demonstrate that our method outperforms existing approaches, highlighting the superiority of our method.

关键词： Federated learning noise-label learning privacy-preserving machine learning edge intelligence distributed machine learning

来源：评论

学校读者我要写书评

暂无评论

Reinforcement Learning Algorithms with Graph Convolution Networks for Traffic Signal Control 8th

Reinforcement Learning Algorithms with Graph Convolution Ne...

引用

8th International Conference on Intelligent Transport Systems, INTSYS 2024

作者： Salmalge, Shreya Bhatnagar, Shalabh Department of Computer Science and Automation Indian Institute of Science Bengaluru India

ISBN: (纸本)9783031863691

Traffic congestion is the root cause of various social and economic problems like longer travel times, increased pollution, and fuel or energy consumption. Addressing the issue is becoming increasingly crucial with rising city traffic and limited road infrastructure. The way we change traffic signals has a significant impact on congestion in road networks. We implement reinforcement learning algorithms for controlling traffic signals adaptive to congestion in incoming roads at junctions. Road networks can be viewed as graphs with intersections as nodes and roads as edges. This motivates us to use graph convolutional networks (GCN) as function approximators in various RL algorithms applied to traffic signal control. We implement Deep Q-learning (DQN), Graph Convolutional Q-learning (GCQN), Graph Convolutional Actor-Critic (GCAC), and individual-DQN models to learn a deterministic policy for adaptive traffic signal control. We also present a comparison of the performances of these models and infer that GCQN models are better suited to work for large road networks. To the best of our knowledge, the Graph Convolutional Actor-Critic model is not used in any existing traffic signal control method. We also compare the GCQN and GCAC models against existing and state-of-the-art approaches. Experimental evaluation shows that our proposed method achieves performance levels comparable to the state-of-the-art techniques. © ICST Institute for computer sciences, Social Informatics and Telecommunications Engineering 2025.

关键词： Travel time

来源：评论

学校读者我要写书评

暂无评论

Random-Order Online Independent Set of Intervals and Hyperrectangles 32

Random-Order Online Independent Set of Intervals and Hyperre...

引用

32nd Annual European Symposium on Algorithms, ESA 2024

作者： Garg, Mohit Kar, Debajyoti Khan, Arindam Department of Computer Science and Automation Indian Institute of Science Bengaluru India

ISBN: (纸本)9783959773386

In the Maximum Independent Set of Hyperrectangles problem, we are given a set of n (possibly overlapping) d-dimensional axis-aligned hyperrectangles, and the goal is to find a subset of non-overlapping hyperrectangles of maximum cardinality. For d = 1, this corresponds to the classical Interval Scheduling problem, where a simple greedy algorithm returns an optimal solution. In the offline setting, for d-dimensional hyperrectangles, polynomial time (log n)O(d)-approximation algorithms are known [16]. However, the problem becomes notably challenging in the online setting, where the input objects (hyperrectangles) appear one by one in an adversarial order, and on the arrival of an object, the algorithm needs to make an immediate and irrevocable decision whether or not to select the object while maintaining the feasibility. Even for interval scheduling, an Ω(n) lower bound is known on the competitive ratio. To circumvent these negative results, in this work, we study the online maximum independent set of axis-aligned hyperrectangles in the random-order arrival model, where the adversary specifies the set of input objects which then arrive in a uniformly random order. Starting from the prototypical secretary problem, the random-order model has received significant attention to study algorithms beyond the worst-case competitive analysis (see the survey by Gupta and Singla [40]). Surprisingly, we show that the problem in the random-order model almost matches the best-known offline approximation guarantees, up to polylogarithmic factors. In particular, we give a simple (log n)O(d)competitive algorithm for d-dimensional hyperrectangles in this model, which runs in Õd(n) time. Our approach also yields (log n)O(d)-competitive algorithms in the random-order model for more general objects such as d-dimensional fat objects and ellipsoids. Furthermore, all our competitiveness guarantees hold with high probability, and not just in expectation. © Mohit Garg, Debajyoti Kar, and Arind

关键词： Approximation algorithms

来源：评论

学校读者我要写书评

暂无评论

Reinforcement learning-based unknown reference tracking control of HMASs with nonidentical communication delays

引用

science China(Information sciences) 2023年第7期66卷 46-57页

作者： Yong XU Zheng-Guang WU Wei-Wei CHE Deyuan MENG School of Automation Beijing Institute of Technology Institute of Cyber-Systems and Control Zhejiang University College of Mathematics and Computer Science Zhejiang Normal University Department of Automation Qingdao University School of Automation Science and Electrical Engineering Beihang University(BUAA)

This paper focuses on the optimal output synchronization control problem of heterogeneous multiagent systems(HMASs) subject to nonidentical communication delays by a reinforcement learning *** with existing studies assuming that the precise model of the leader is globally or distributively accessible to all or some of the followers, the leader's precise dynamical model is entirely inaccessible to all the followers in this paper. A data-based learning algorithm is first proposed to reconstruct the leader's unknown system matrix online. A distributed predictor subject to communication delays is further devised to estimate the leader's state, where interaction delays are allowed to be nonidentical. Then, a learning-based local controller, together with a discounted performance function, is projected to reach the optimal output synchronization. Bellman equations and game algebraic Riccati equations are constructed to learn the optimal solution by developing a model-based reinforcement learning(RL) algorithm online without solving regulator equations, which is followed by a model-free off-policy RL algorithm to relax the requirement of all agents' dynamics faced by the model-based RL algorithm. The optimal tracking control of HMASs subject to unknown leader dynamics and communication delays is shown to be solvable under the proposed RL algorithms. Finally, the effectiveness of theoretical analysis is verified by numerical simulations.

关键词： heterogeneous multiagent systems HMAS reinforcement learning RL optimal output synchronization communication delays

来源：评论

学校读者我要写书评

暂无评论

Editorial: Artificial intelligence in biomedical big data and digital healthcare

引用

Future Generation computer Systems 2024年 152卷 343-345页

作者： Lim, Kiho Esposito, Christian Wang, Tian Choi, Chang Department of Computer Science William Paterson University of New Jersey United States Department of Computer Science University of Salerno Italy School of Automation Science and Electrical Engineering Beihang University China Department of Computer Engineering Gachon University Korea Republic of

来源：评论

学校读者我要写书评

暂无评论

Multi-branch Differential Bidirectional Fusion Network for RGB-T Semantic Segmentation

IEEE Transactions on Intelligent Vehicles

引用

IEEE Transactions on Intelligent Vehicles 2024年 1-11页

作者： Liang, Wenli Shan, Caifeng Yang, Yuanjian Han, Jungong College of Electrical Engineering and Automation Shandong University of Science and Technology Qingdao China Department of Computer Science University of Sheffield U.K

Semantic segmentation plays an important role in computer perception tasks. Integrating the rich details of RGB images with the illumination robustness of thermal infrared (TIR) images is a promising approach for achieving reliable semantic scene understanding. Current approaches for RGB-Thermal semantic segmentation often overlook the unique characteristics exhibited by each modality at different encoding layers and underutilize the complementary information between the two modalities during decoding. To acquire complementary cross-modality encoding and decoding features, we propose a multi-branch differential bidirectional fusion network known as MDBFNet. Firstly, it models the dependencies between the modality-specific characteristics and the different encoding layers, and designs a TIR-led detail enhancement module (TDE) and an RGB-led semantic enhancement module (RSE) to guide distinguishable fusion for different layer features. Secondly, a three-branch fusion decoder with three supervision (TFDS) is proposed to thoroughly explore the complementary decoding features between two modalities. Experiments on MFNet and PST900 datasets show that our method surpasses state-of-the-art methods by a clear margin. IEEE

关键词： Semantics

来源：评论

学校读者我要写书评

暂无评论

Adaptive control of Quadruped robot under varying load conditions 10

Adaptive control of Quadruped robot under varying load condi...

引用

10th Indian Control Conference, ICC 2024

作者： Kurva, Vamshi Kumar Kolathaya, Shishir The department of Computer Science and Automation Indian Institute of Science Bangalore India The Robert Bosch Center for Cyber Physical Systems The Department of Computer Science & Automation Indian Institute of Science Bangalore India

ISBN: (纸本)9798331517212

Control frameworks for legged robots often rely on accurate dynamic models. However, these models often proves to be inaccurate due to factors such as mechanical wear and tear, and unforeseen changes such as the addition of extra payloads during deployment. Significant deviations in the dynamics can severely impact the controller’s performance. Our goal is to enhance the controller’s model in real-time during deployment using onboard sensors and online learning. Specifically, our work focuses on quadruped locomotion under varying load conditions. This paper presents an adaptive force control framework for quadruped robots, enhanced with online system identification, to handle significant changes in both mass and center of mass (CoM). The proposed approach demonstrates superior velocity and height tracking, even under extreme load conditions, showing promise for applications in logistics, military, and rescue missions. © 2024 IEEE.

关键词： Multipurpose robots

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：