This article proposes an adaptive, optimal, data-driven control approach based on reinforcement learning and adaptive dynamic programming for the three-phase grid-connected inverter employed in virtual synchronous generators (VSGs). The article takes into account unknown system dynamics and different grid conditions, including balanced/unbalanced grids, voltage drop/sag, and weak grids. The proposed method is based on value iteration, which does not rely on an initial admissible control policy for learning. Under the premise that the VSG control should stabilize the closed-loop dynamics, the VSG outputs are optimally regulated through the proposed adaptive optimal control strategy. Comparative simulations and experimental results validate the method's effectiveness and demonstrate its practicality and ease of implementation.
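The value-iteration idea this abstract highlights (learning without an initial admissible, i.e. stabilizing, policy) can be illustrated on the simpler linear-quadratic case: starting from P0 = 0, the Riccati recursion is iterated until it converges. This is a generic sketch, not the paper's VSG controller; the plant matrices below are illustrative.

```python
import numpy as np

# Value iteration for a discrete-time LQR problem. Starting from P0 = 0
# (which corresponds to no admissible initial policy), iterate
#   P_{k+1} = Q + A'P_k A - A'P_k B (R + B'P_k B)^{-1} B'P_k A
# until P converges; the optimal gain is K = (R + B'P B)^{-1} B'P A.

A = np.array([[1.0, 0.1],        # hypothetical plant: a discretized
              [0.0, 1.0]])       # double integrator, not from the paper
B = np.array([[0.0], [0.1]])
Q = np.eye(2)
R = np.array([[1.0]])

P = np.zeros((2, 2))             # P0 = 0: no stabilizing policy assumed
for _ in range(500):
    K = np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A)
    P_next = Q + A.T @ P @ A - A.T @ P @ B @ K
    if np.max(np.abs(P_next - P)) < 1e-10:
        P = P_next
        break
    P = P_next

K = np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A)
# The closed loop A - B K should be stable (spectral radius below 1).
print(np.max(np.abs(np.linalg.eigvals(A - B @ K))))
```

In an ADP/data-driven setting the same recursion is carried out from measured data rather than from known (A, B); the fixed point is the same.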
This paper proposes a data-driven adaptive optimal control approach for the constant-voltage, constant-frequency (CVCF) inverter based on reinforcement learning and adaptive dynamic programming (ADP). Unlike the existing literature, the load is treated as a dynamic uncertainty and a robust optimal state-feedback controller is proposed. The stability of the inverter-load system is rigorously analyzed. To obtain an accurate differential signal of the output current, a tracking differentiator is designed. The proposed output-feedback controllers ensure that the tracking error converges asymptotically to zero. A standard proportional-integral controller and a linear active disturbance rejection control strategy are also designed for comparison. Simulation results show that the proposed controller is inherently robust and does not require retuning across different applications.
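A tracking differentiator estimates a signal's derivative without direct (noise-amplifying) differentiation. The abstract does not give the paper's design; the following is a minimal linear second-order sketch with a single bandwidth gain r, in which x1 tracks the input v and x2 tracks its derivative.

```python
import math

# Linear second-order tracking differentiator (illustrative sketch):
#   x1' = x2
#   x2' = -r**2 * (x1 - v) - 2*r*x2
# Both closed-loop poles sit at -r, so larger r means faster, less
# filtered derivative estimates.

r, h = 50.0, 1e-3            # bandwidth gain and Euler integration step
x1, x2 = 0.0, 0.0
errs = []
for k in range(10000):
    t = k * h
    v = math.sin(t)          # measured signal; true derivative is cos(t)
    # explicit Euler update (tuple assignment uses the old state on both)
    x1, x2 = x1 + h * x2, x2 + h * (-r**2 * (x1 - v) - 2.0 * r * x2)
    if t > 2.0:              # after the transient, x2 should be near cos(t)
        errs.append(abs(x2 - math.cos(t)))
print(max(errs))
```

The residual error comes from the filter's phase lag (about 2/r radians at low frequency) plus discretization; raising r reduces it at the cost of noise sensitivity.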
Wireless Sensor Networks (WSNs) play a pivotal role in enabling Internet of Things (IoT) devices with sensing and actuation capabilities. Operating in remote and resource-constrained environments, these IoT devices fac...
The past decade has witnessed a surge in research activities related to adaptive dynamic programming (ADP) and reinforcement learning (RL), particularly for control applications. Several books [items 1)–5) in the Appendix] and survey papers [items 6)–10) in the Appendix] have been published on the subject. Both ADP and RL provide approximate solutions to dynamic programming problems. In a 1995 article [item 11) in the Appendix], Barto et al. introduced the so-called "adaptive real-time dynamic programming," specifically applying ADP to real-time control. Later, in 2002, Murray et al. [item 12) in the Appendix] developed an ADP algorithm for optimal control of continuous-time affine nonlinear systems. On the other hand, the most famous algorithms in RL are the temporal difference algorithm [item 13) in the Appendix] and the Q-learning algorithm [items 14) and 15) in the Appendix].
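Of the algorithms the survey names, Q-learning is the most compact to demonstrate. Below is a standard tabular sketch on a toy 5-state chain (the environment is invented for illustration): action 1 moves right, action 0 moves left, and reaching state 4 gives reward 1 and ends the episode.

```python
import random

# Tabular Q-learning with epsilon-greedy exploration. The update rule is
#   Q(s,a) <- Q(s,a) + alpha * (r + gamma * max_a' Q(s',a') - Q(s,a)),
# with no bootstrap term on the terminal transition.

random.seed(0)
n_states, gamma, alpha, eps = 5, 0.9, 0.1, 0.2
Q = [[0.0, 0.0] for _ in range(n_states)]

for _ in range(5000):
    s = 0
    while s != n_states - 1:
        if random.random() < eps:                      # explore
            a = random.randrange(2)
        else:                                          # exploit
            a = max((0, 1), key=lambda x: Q[s][x])
        s_next = min(s + 1, n_states - 1) if a == 1 else max(s - 1, 0)
        r = 1.0 if s_next == n_states - 1 else 0.0
        target = r if s_next == n_states - 1 else r + gamma * max(Q[s_next])
        Q[s][a] += alpha * (target - Q[s][a])
        s = s_next

# Greedy policy learned for each non-terminal state (1 = move right).
greedy = [max((0, 1), key=lambda x: Q[s][x]) for s in range(n_states - 1)]
print(greedy)
```

With the discount factor 0.9, the learned values approach gamma**(3-s) for the rightward action, so the greedy policy moves right everywhere.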
ISBN: (Print) 9781479945535
The proceedings contain 42 papers. The topics discussed include: approximate real-time optimal control based on sparse Gaussian process models; subspace identification for predictive state representation by nuclear norm minimization; active learning for classification: an optimistic approach; convergent reinforcement learning control with neural networks and continuous action search; theoretical analysis of a reinforcement learning based switching scheme; an analysis of optimistic, best-first search for minimax sequential decision making; information-theoretic stochastic optimal control via incremental sampling-based algorithms; policy gradient approaches for multi-objective sequential decision making: a comparison; and cognitive control in cognitive dynamic systems: a new way of thinking inspired by the brain.
Deep reinforcement learning is a focus research area in artificial intelligence. The principle of optimality in dynamic programming is a key to the success of reinforcement learning methods. The principle of adaptive ...
ISBN: (Print) 9781467359252
The proceedings contain 28 papers. The topics discussed include: local stability analysis of high-order recurrent neural networks with multi-step piecewise linear activation functions; finite-horizon optimal control design for uncertain linear discrete-time systems; adaptive optimal control for nonlinear discrete-time systems; optimal control for a class of nonlinear system with controller constraints based on finite-approximation-errors ADP algorithm; finite horizon stochastic optimal control of uncertain linear networked control system; real-time tracking on adaptive critic design with uniformly ultimately bounded condition; a novel approach for constructing basis functions in approximate dynamic programming for feedback control; and a combined hierarchical reinforcement learning based approach for multi-robot cooperative target searching in complex unknown environments.
ISBN: (Print) 9781479945528
In this paper, an approximate optimal control method based on adaptive dynamic programming (ADP) is discussed for completely unknown nonlinear systems. An online critic-action-identifier algorithm is developed using neural networks, where the critic and action networks approximate the optimal value function and the optimal control, and two further neural networks approximate the unknown system. Furthermore, adaptive tuning laws are given based on a Lyapunov approach, which ensures uniform ultimate boundedness of the closed-loop system. Finally, the effectiveness is demonstrated by a simulation example.
ISBN: (Print) 9781479945528
An online reinforcement learning algorithm that directly and efficiently utilizes online data is proposed in this paper for continuous deterministic systems with unknown system parameters. Dependence on specific approximation structures limits the wide application of online reinforcement learning algorithms. We utilize the online data directly with the kd-tree technique to remove this limitation. Moreover, we design the algorithm according to the Probably Approximately Correct (PAC) principle. Two examples are simulated to verify the algorithm's good performance.
ISBN: (Print) 9781479945528
For the optimal tracking control problem of affine nonlinear systems, a general value iteration algorithm based on adaptive dynamic programming is proposed in this paper. By a system transformation, the optimal tracking problem is converted into an optimal regulation problem for the tracking error dynamics. Then, the general value iteration algorithm is developed to obtain the optimal control, together with a convergence analysis. Considering the advantages of the echo state network, we use three echo state networks with the Levenberg-Marquardt (LM) adjusting algorithm to approximate the system, the cost function, and the control law. A simulation example is given to demonstrate the effectiveness of the presented scheme.
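An echo state network keeps a fixed random recurrent reservoir and trains only a linear readout. The sketch below uses a ridge-regression readout rather than the paper's Levenberg-Marquardt training, and all sizes and signals are illustrative: the readout is trained to output cos(t) from the input sin(t), which requires the reservoir's memory.

```python
import numpy as np

# Echo state network sketch: random reservoir, trained linear readout.
rng = np.random.default_rng(0)
n_res = 100
Win = rng.uniform(-0.5, 0.5, (n_res, 1))          # fixed input weights
W = rng.uniform(-0.5, 0.5, (n_res, n_res))        # fixed recurrent weights
W *= 0.9 / np.max(np.abs(np.linalg.eigvals(W)))   # spectral radius 0.9
                                                  # (echo state property)
t = np.arange(0.0, 20.0, 0.02)
u = np.sin(t)                  # input sequence
y = np.cos(t)                  # target: the 90-degree-shifted component

x = np.zeros(n_res)
states = []
for uk in u:                   # drive the reservoir, record its states
    x = np.tanh(Win @ np.array([uk]) + W @ x)
    states.append(x.copy())
X = np.array(states)

washout = 100                  # discard the initial transient
Xw, yw = X[washout:], y[washout:]
# Ridge-regression readout: Wout = (X'X + lam I)^{-1} X'y
Wout = np.linalg.solve(Xw.T @ Xw + 1e-6 * np.eye(n_res), Xw.T @ yw)
rmse = np.sqrt(np.mean((Xw @ Wout - yw) ** 2))
print(rmse)
```

Only Wout is learned; that is what makes the reservoir approach cheap enough to run three networks (system, cost, control) side by side, whichever readout-training method is used.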