版权所有:内蒙古大学图书馆 技术提供:维普资讯• 智图
内蒙古自治区呼和浩特市赛罕区大学西街235号 邮编: 010021
作者机构:Southwest Univ Chongqing Key Lab Nonlinear Circuits & Intelligent Tiansheng St Chongqing 400715 Peoples R China Chongqing Univ Posts & Telecommun Chongqing Key Lab Image Cognit Chongwen Rd Chongqing 400065 Peoples R China
出 版 物:《NEUROCOMPUTING》 (神经计算)
年 卷 期:2024年第600卷
核心收录:
学科分类:08[工学] 0812[工学-计算机科学与技术(可授工学、理学学位)]
基 金:National Natural Science Foundation of China
主 题:Adaptive dynamic programming Optimal consensus control Dynamic event-triggered mechanism Heterogeneous multi-agent systems
摘 要:We investigate the distributed optimal consensus control problem based on action-dependent heuristic dynamic programming (ADHDP). In order to strike a stable balance between the learning cost of reinforcement learning and the resource utilization efficiency of the hybrid-order multi-agent systems (MASs), we propose an improved dynamic event-triggered ADHDP (dET-ADHDP) method. This approach can non-periodically explore the control policy distribution using the online action-dependent actor-critic (ADAC) learning framework. Meanwhile, it can dynamically adjust the trigger lower bound by exploiting the designed trigger threshold function, and adaptively decide the signal trigger moment during the ADAC learning process. In addition, we demonstrate the boundedness of the ADAC network weights and show that under the designed dynamic event-triggering rules, the MASs can asymptotically achieve optimal tracking control without Zeno phenomenon. Finally, compared with the traditional static counterparts, simulation experiments demonstrate that the proposed dynamic eventtriggered ADAC (dET-ADAC) algorithm has more efficient resource utilization while maintaining satisfactory learning performance.