Search results - Inner Mongolia University Library

Model-free adaptive optimal control for nonlinear multiplayer games with input disturbances

NEUROCOMPUTING, 2024, vol. 580

Authors: Shi, Jing; Peng, Chen; Zhang, Jin; Zhang, Zhihao; Xie, Xiangpeng. Affiliations: Shanghai Univ Sch Mechatron Engn & Automat Shanghai 200444 Peoples R China; Nanjing Univ Posts & Telecommun Inst Carbon Neutral Adv Technol Nanjing 210023 Peoples R China

In this paper, we investigate a model-free identifier-critic-based optimal adaptive controller for multiplayer games with input disturbances. Specifically, we first adopt an identifier neural network to identify the system dynamics. Simultaneously, we use a critic neural network to estimate the optimal cost function, thereby obtaining the estimated optimal controller. To account for the input disturbances, we add a feedback gain to the estimated optimal controller to obtain the final controller. The identifier and critic networks are trained online and simultaneously. Then, we analyze the stability of the proposed approach. Finally, the simulation results illustrate the validity of the proposed controller.

Keywords: adaptive dynamic programming; Multiplayer game; Input disturbances; adaptive control

Guaranteed cost tracking control of constrained input nonlinear uncertain systems using event-based ADP

NEUROCOMPUTING, 2024, vol. 572

Authors: Dahal, Raju; Kar, Indrani. Affiliation: Indian Inst Technol Guwahati Dept Elect & Elect Engn Kamrup 781039 Assam India

This article investigates the guaranteed cost robust tracking control problem of nonlinear systems subjected to input constraint and unmatched uncertainty. The event-based adaptive dynamic programming (ADP) approach is utilized to address this problem. First, the tracking error and reference trajectory are combined to form an augmented uncertain system. Then, by decomposing the uncertainty into the matched and unmatched parts, the original tracking problem is converted into the optimal regulation problem of an auxiliary system. The cost function for the auxiliary system is defined, and the associated Hamilton-Jacobi-Bellman (HJB) equation is solved using a single critic neural network (NN). Moreover, a novel event-triggering rule is formulated, and it is shown that the designed event-based controller guarantees that the tracking error is uniformly ultimately bounded. The derivation of event-based guaranteed cost and its relation with the time-based counterpart is presented. The exclusion of the infamous Zeno behavior is guaranteed. The uniform ultimate boundedness of the critic weight estimation error is shown. Finally, the effectiveness of the proposed event-triggered ADP method is illustrated through simulations of the spring-mass-damper system and Van der Pol's oscillator with unmatched uncertainty.

Keywords: Guaranteed cost tracking; Constrained input; Unmatched uncertainty; Neural networks; adaptive dynamic programming; Event-triggered

Fully distributed adaptive control for output consensus of uncertain discrete-time linear multi-agent systems

AUTOMATICA, 2024, vol. 162

Authors: Jiang, Yi; Liu, Lu; Feng, Gang. Affiliation: City Univ Hong Kong Dept Biomed Engn Hong Kong Peoples R China

This work investigates the adaptive output consensus problem for uncertain discrete-time linear multi-agent systems on directed graphs when both Laplacian matrices for communication graphs and agent system matrices are not available. Firstly, a fully distributed algorithm is proposed to estimate the Laplacian matrix for each agent. Then, based on the proposed estimation algorithm, two fully distributed adaptive control algorithms, one for state feedback and the other for output feedback, are developed to acquire the desired controller parameters by utilizing the so-called adaptive dynamic programming techniques. It is shown that the output consensus is achieved for the resulting closed-loop multi-agent system. Simulation results demonstrate the efficacy of the proposed fully distributed adaptive controllers resulting from those adaptive control algorithms. (c) 2024 Elsevier Ltd. All rights reserved.

Keywords: Output consensus; Fully distributed adaptive control; Discrete-time linear multi-agent system; adaptive dynamic programming

Distributed output data-driven optimal robust synchronization of heterogeneous multi-agent systems

AUTOMATICA, 2023, vol. 153, no. 1

Authors: Chen, Ci; Lewis, Frank L.; Xie, Kan; Lyu, Yi; Xie, Shengli. Affiliations: Guangdong Univ Technol Sch Automation Guangzhou Peoples R China; Guangdong Key Lab IoT Informat Technol Guangzhou Peoples R China; Minist Educ Key Lab Intelligent Informat Proc & Syst Integrat Beijing Peoples R China; Univ Texas Arlington UTA Res Inst Ft Worth TX USA; 111 Ctr Intelligent Batch Mfg Based IoT Technol Guangzhou Peoples R China; Univ Elect Sci & Technol China Zhongshan Inst Sch Comp Chengdu Peoples R China; Guangdong HongKong Macao Joint Lab Smart Discrete Guangzhou Guangdong Peoples R China

This work presents an output-feedback policy learning algorithm underlining input-output system data for distributed robust optimal synchronization of heterogeneous multi-agent systems. The output-feedback synchronization problem in the context of this work is formulated via robust output regulation and reinforcement learning modeling the interactions among agents by a zero-sum game. The proposed learning and control structure only requires the local system data for each agent and distributed output data among communicating neighbors. We utilize system-level synchysis for the continuous-time state reconstruction for the distributed learning with convergence and stability proofs under the proposed output-feedback policy for solving the zero-sum game. We further show that policy learning is assured under the proposed data criteria relating to input-output data only rather than any inter-immediate gains from policy iterations. Based on the cooperative robust output regulation, this work gains robustness after the learning is complete and establishes an output data-driven distributed optimal robust synchronization without knowing accurate system dynamics. A numerical example shows the effectiveness of the proposed learning algorithm. (c) 2023 Elsevier Ltd. All rights reserved.

Keywords: adaptive dynamic programming; Heterogeneous multi-agent systems; Output synchronization; Reinforcement learning; Zero-sum game

Optimal containment control for a class of heterogeneous multi-agent systems with actuator faults

INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2024, vol. 34, no. 2, pp. 849-865

Authors: Fan, Sijia; Wang, Tong; Wu, Ju; Qiu, Jianbin. Affiliations: Harbin Inst Technol Res Inst Intelligent Control & Syst Harbin 150080 Peoples R China; Ecole Polytech Fed Lausanne EPFL Sch Engn Lausanne Switzerland

This article investigates the optimal containment control problem for a class of heterogeneous multi-agent systems with time-varying actuator faults and unmatched disturbances based on adaptive dynamic programming. Since there exist unknown input signals in each leader, distributed observers are utilized to estimate trajectories in the convex hull spanned by leaders. The containment control problem is then transformed into an optimal tracking problem. To compensate for the actuator faults and unmatched disturbances, a novel performance index function is designed. We prove that the optimal control policy can ensure that the tracking error system of each follower is uniformly ultimately bounded. The online policy iteration algorithm is implemented using critic neural networks to obtain the optimal control policy. A numerical example is provided to demonstrate the effectiveness of the proposed control policy.

Keywords: actuator faults; adaptive dynamic programming; containment control; heterogeneous multi-agent system; unmatched disturbances

Backstepping based neural H∞ optimal tracking control for nonlinear state constrained systems with input delay and disturbances

NEUROCOMPUTING, 2024, vol. 595

Authors: Huang, Yuzhu; Zhang, Zhaoyan; Yang, Xiong. Affiliations: Hebei Univ Coll Elect & Informat Engn Baoding 071002 Hebei Peoples R China; Tianjin Univ Sch Elect & Informat Engn Tianjin 300072 Peoples R China

This paper focuses on the problem of H∞ optimal tracking control for a class of nonlinear state constrained systems with input delay and disturbances. With the aid of Padé approximation, an auxiliary variable is devised to eliminate the effects of input delay. Combining barrier Lyapunov functions (BLFs) with the backstepping design technique, a feedforward adaptive controller is designed to transform the tracking control problem of the nonlinear state constrained system into an equivalent H∞ control problem of an input-affine error system without state constraints, wherein neural networks (NNs) are employed to approximate unknown system dynamics. Then, based on single-network adaptive dynamic programming (ADP), an H∞ optimal feedback controller is developed by utilizing a single critic network to learn the Nash equilibrium related to the Hamilton-Jacobi-Isaacs (HJI) equation. Therefore, the whole tracking controller can be constructed by integrating the feedforward adaptive controller with the optimal feedback controller. Moreover, it is proven by Lyapunov's theory that all signals within the closed-loop system are uniformly ultimately bounded (UUB), and the tracking error converges to a small neighborhood of the origin without violating any state constraints. Two simulation examples are also presented to validate the effectiveness of the proposed approach.

Keywords: Optimal tracking control; Neural network; Backstepping design; adaptive dynamic programming; State constrained systems

Dynamic event-triggered neuro-optimal control for uncertain nonlinear systems with unknown dead-zone constraint

COMMUNICATIONS IN NONLINEAR SCIENCE AND NUMERICAL SIMULATION, 2024, vol. 139

Authors: Zhang, Shunchao; Zhuang, Jiawei; Zhang, Yongwei. Affiliations: Guangdong Univ Finance Sch Internet Finance & Informat Engn Guangzhou 510521 Peoples R China; South China Agr Univ Coll Math & Informat Guangzhou Peoples R China

In this article, we propose a dynamic event-triggered neuro-optimal control scheme (DETNOC) for uncertain nonlinear systems subject to unknown dead-zone and disturbances through the design of a composite control law. An integral sliding mode-based discontinuous control law is utilized to compensate for the effects of the unknown dead-zone, the disturbance, and a component of the uncertainties. As a result, system dynamics that evolve free of these effects during the sliding mode are obtained. Then, an adaptive dynamic programming-based dynamic event-triggered optimal control law is designed to stabilize the sliding mode dynamics with the help of a critic-only neural network architecture. Finally, stability analysis of the closed-loop system is provided and the effectiveness of the developed DETNOC scheme is verified.

Keywords: adaptive dynamic programming; Integral sliding mode control; Neural networks; Uncertain nonlinear systems; Unknown dead-zone

Multi-objective optimization of actuators and consensus ADP-based vibration control for the large flexible space structures

AEROSPACE SCIENCE AND TECHNOLOGY, 2023, vol. 137, no. 1

Authors: Tian, Dalong; Guo, Jianguo; Guo, Zongyi. Affiliation: Northwestern Polytech Univ Sch Astronaut Inst Precis Guidance & Control Xian 710072 Peoples R China

In this study, the multi-objective optimization and decision-making for optimal positions of actuators and consensus adaptive dynamic programming (CADP) are investigated to mitigate the vibration of large flexible space structures (LFSS). The optimization of the actuator positions maintains a balance between maximizing actuation efficiency and maximizing input voltage decoupling. Meanwhile, the CADP control method accelerates the attenuation of vibration when agents collaborate in the designed communication topology network. First, the electromechanical coupled dynamic model of the LFSS is built by the finite element method. Subsequently, the multi-objective optimization criteria are proposed, which maximize the actuation efficiency and decoupling of control inputs. Moreover, the multi-objective optimization and decision-making, which is based on the non-dominated sorting differential evolutionary algorithm (NSDE) and technique for order preference by similarity to ideal solution (TOPSIS), respectively, are performed to rapidly find the optimal position of actuators. In addition, the CADP control algorithm is designed and its stability is proven. Finally, for harmonic excitation under multi-frequency superposition, simulation comparisons based on the CADP and adaptive dynamic programming (ADP) are performed. Simulation results verify the effectiveness of the proposed optimization criterion of actuators and the CADP algorithm for vibration mitigation of LFSS.(c) 2023 Elsevier Masson SAS. All rights reserved.

Keywords: Large flexible space structures; Active vibration control; adaptive dynamic programming; Consensus control
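
The abstract above names TOPSIS as the decision-making step that picks one actuator layout from the optimized candidates. A minimal sketch of the standard TOPSIS ranking follows, assuming two benefit criteria (higher is better) and equal weights. The candidate scores, the weights, and the function name `topsis` are hypothetical and are not taken from the paper.

```python
import math

def topsis(matrix, weights):
    """Rank alternatives with TOPSIS, treating every criterion as a benefit.

    matrix:  one row per alternative, one column per criterion.
    weights: one non-negative weight per criterion.
    Returns a closeness score in [0, 1] per alternative (higher is better).
    Assumes the alternatives are not all identical.
    """
    n_crit = len(weights)
    # Vector-normalize each column, then apply the criterion weights.
    norms = [math.sqrt(sum(row[j] ** 2 for row in matrix)) for j in range(n_crit)]
    weighted = [[weights[j] * row[j] / norms[j] for j in range(n_crit)]
                for row in matrix]
    # Ideal best and ideal worst points, taken column by column.
    best = [max(row[j] for row in weighted) for j in range(n_crit)]
    worst = [min(row[j] for row in weighted) for j in range(n_crit)]
    scores = []
    for row in weighted:
        d_best = math.dist(row, best)    # Euclidean distance to the ideal point
        d_worst = math.dist(row, worst)  # Euclidean distance to the anti-ideal point
        scores.append(d_worst / (d_best + d_worst))
    return scores

# Hypothetical candidates: (actuation efficiency, input decoupling).
candidates = [[0.9, 0.2], [0.6, 0.6], [0.2, 0.9], [0.1, 0.1]]
scores = topsis(candidates, weights=[0.5, 0.5])
best_index = max(range(len(scores)), key=scores.__getitem__)
print(best_index, [round(s, 3) for s in scores])
```

With equal weights the balanced candidate at index 1 ranks first, and the candidate that is worst on both criteria scores zero. In practice the candidate set would come from the Pareto front produced by the multi-objective optimizer.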

Adaptive dynamic event-triggered control for constrained modular reconfigurable robot

KNOWLEDGE-BASED SYSTEMS, 2022, vol. 254

Authors: Song, Ruizhuo; Liu, Lu; Xu, Zhen. Affiliations: Univ Sci & Technol Beijing Beijing Engn Res Ctr Ind Spectrum Imaging Sch Automat & Elect Engn Beijing 100083 Peoples R China; Univ Sci & Technol Beijing Res Inst Urbanizat & Urban Safety Sch Civil & Resource Engn Beijing 100083 Peoples R China

Compared with traditional robots, the modular reconfigurable robot (MRR) offers strong environmental adaptability and flexible task completion. For the optimal tracking control problem (OTCP) of the MRR under some restricted conditions, this paper puts forward a constrained dynamic event-triggered control (DETC) for the MRR system with disturbance through adaptive dynamic programming (ADP), which can minimize the amount of information exchanged while preserving system stability and the expected control effect. In view of the uncertainty of the model coupling part, an identification network is used to estimate the dynamics of the MRR, and the estimation error is proved to be uniformly ultimately bounded (UUB). The other three groups of critic, action and disturbance neural networks (NNs) are established by the approximation principle of ADP. The optimal control pair is obtained through policy iteration (PI) with DETC, and the triggering condition is designed based on the asymptotic stability of the MRR system. Finally, the strengths of the proposed algorithm are validated through simulation experiments. (C) 2022 Elsevier B.V. All rights reserved.

Keywords: adaptive dynamic programming; Constrained input; dynamic event-triggered control; Modular reconfigurable robot

Novel single-loop policy iteration for linear zero-sum games

AUTOMATICA, 2024, vol. 163

Authors: Zhao, Jianguo; Yang, Chunyu; Gao, Weinan; Park, Ju H. Affiliations: China Univ Min & Technol Sch Informat & Control Engn Xuzhou 221116 Peoples R China; Northeastern Univ State Key Lab Synthet Automat Proc Ind Shenyang 110819 Peoples R China; Yeungnam Univ Dept Elect Engn Kyongsan 38541 South Korea

The infinite-horizon zero-sum game of a linear system can be reduced to solving a game algebraic Riccati equation (GARE) with an indefinite quadratic term. A double-loop policy iteration algorithm is often used to find the solution of such a GARE, but its calculation is usually time-consuming. In this work, we propose a novel model-based single-loop policy iteration algorithm to solve the GARE, and the convergence of the algorithm is guaranteed by the boundedness of the iterative sequence and the comparison result. Furthermore, we devise a data-driven single-loop policy iteration algorithm for solving linear zero-sum games, without requiring knowledge of the system dynamics. Compared to the existing Newton's single-loop methods, the initialization of our algorithms is significantly relaxed and easier to implement. Two numerical examples are included to illustrate the proposed algorithms. (c) 2024 Published by Elsevier Ltd.

Keywords: Policy iteration; Zero-sum games; Game algebraic Riccati equations; adaptive dynamic programming
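
To make the GARE in the abstract above concrete, here is a minimal scalar sketch of the Newton-type single-loop iteration that the abstract mentions as the existing baseline. It is not the algorithm proposed in the paper. It assumes the scalar system dx/dt = a*x + b*u + d*w with cost integrand q*x^2 + r*u^2 - gamma^2*w^2, so the GARE reads 2*a*P + q - s*P^2 = 0 with s = b^2/r - d^2/gamma^2. All numeric values and the function name are hypothetical.

```python
import math

def newton_gare_scalar(a, b, d, q, r, gamma, p0, tol=1e-12, max_iter=50):
    """Newton-type single-loop iteration for the scalar GARE
    2*a*P + q - s*P**2 = 0, where s = b**2/r - d**2/gamma**2.

    Each step updates the control and disturbance gains together from the
    current P and then solves the resulting scalar Lyapunov equation.
    """
    s = b ** 2 / r - d ** 2 / gamma ** 2
    p = p0
    for _ in range(max_iter):
        # Closed loop under u = -(b*p/r)*x and w = (d*p/gamma**2)*x.
        closed_loop = a - s * p
        if closed_loop >= 0:
            # Each Newton iterate, including the initial guess, must be stabilizing.
            raise ValueError("current P is not stabilizing")
        # Lyapunov step: 2*closed_loop*p_next + q + s*p**2 = 0.
        p_next = (q + s * p ** 2) / (-2.0 * closed_loop)
        if abs(p_next - p) < tol:
            return p_next
        p = p_next
    return p

# Hypothetical data: open-loop unstable plant with attenuation level gamma = 2.
A, B, D, Q, R, GAMMA = 1.0, 1.0, 1.0, 1.0, 1.0, 2.0
P = newton_gare_scalar(A, B, D, Q, R, GAMMA, p0=2.0)
S = B ** 2 / R - D ** 2 / GAMMA ** 2
residual = 2 * A * P + Q - S * P ** 2
print(P, residual)
```

The check on `closed_loop` shows why initialization matters for this baseline: the initial guess has to satisfy s*p0 > a, that is, it must already be stabilizing. The abstract states that the proposed algorithms relax this kind of requirement.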
