This paper presents two novel deep reinforcement learning (DRL) approaches aimed at solving complex power system control problems in a data-driven sense to maintain the stability of power systems. Specifically, we pro...
详细信息
This paper presents two novel deep reinforcement learning (DRL) approaches aimed at solving complex power system control problems in a data-driven sense to maintain the stability of power systems. Specifically, we propose, respectively, SACPER (Soft Actor-Critic (SAC) with Prioritized Experience Replay (PER)) and Constrained Variational Policy Optimization (CVPO) DRL algorithms to address the sequential decision-making problem of active network management (ANM) in distributed power systems and optimizing emergency load shedding (ELS) control problems. First, we propose SACPER for the ANM problem, which prioritizes the training of samples with large errors and poor policy performance. Evaluation of SACPER in terms of stability improvement and convergence speed shows that the ANM problem is optimized and energy loss and operational constraint violations are minimized. Next, we introduce CVPO for the ELS control problem, which is formulated as the Safe Reinforcement learning (SRL) framework to address safety constraint prioritization issues in power systems. We consider additional voltage variables in the network as strong constraints for SRL to achieve fast voltage recovery and minimize unnecessary energy loss, while ensuring good training performance and efficiency. To demonstrate the performances of SACPER, we apply it to ANM6-Easy environment. The CVPO algorithm is applied to ieee 39-Bus and ieee 300-Bus systems. The simulation results of SACPER and CVPO are validated through extensive comparisons with other state-of-the-art DRL approaches.
This paper investigates the problem of iterative learningcontrol in sensor/actuator networks for hyperbolic distributed parameter systems with time delays. For systems with known delays, this study firstly proposes a...
详细信息
To address the problem of lifting weight sway generated by underactuated gantry cranes during operation, considering the load diversity of the portal machine and other disturbance effects, this paper proposes a contro...
详细信息
ISBN:
(纸本)9798350321050
To address the problem of lifting weight sway generated by underactuated gantry cranes during operation, considering the load diversity of the portal machine and other disturbance effects, this paper proposes a control method that does not rely on an exact model. Firstly, a simple kinematic model of the gantry cranes is established and dynamically analyzed, then a compact form dynamic linearization is performed to obtain the data model, a pseudo-partial derivative estimation law is computed online, a model-free adaptive control based gantry cranes anti-sway controller is designed, and finally a simulation analysis is performed. The simulation results show that the load swing angle amplitude is effectively suppressed and quickly dissipated, while it has strong robustness to external unknown disturbances. Compared with the PID control simulation analysis, this control algorithm has more effective anti-sway control effect and can reach the stable state faster.
In this paper, the problem of consensus tracking for multi-agent systems in the presence of noise inter-ference is investigated. Unlike the traditional model-based approach, this paper assumes that the dynamics of all...
详细信息
This article proposes an output feedback fault-tolerant control (FTC) method to compensate for connection interruption faults and actuator faults present in interconnected systems. An intermediate estimator based faul...
详细信息
ISBN:
(纸本)9798350321050
This article proposes an output feedback fault-tolerant control (FTC) method to compensate for connection interruption faults and actuator faults present in interconnected systems. An intermediate estimator based fault reconstruction method and a decentralized FTC scheme are designed, using Kirchhoff matrix tree lemma to eliminate coupling effects between subsystems in interconnected systems. In addition, by combining the intermediate estimation algorithm and Lyapunov function method, it is proven that the global error system is uniformly ultimately bounded (UUB). This article also discusses the occurrence of connection failures and proves that our proposed method is still effective as long as at least one cycle can include all nodes. In comparison to previous results, the proposed decentralized FTC approach deals with interconnected dynamical systems that have unknown nonlinear interconnections and unreliable connections. Finally and most importantly, the rationality of the proposed FTC method has been verified using a networked chemical reactor recovery servo system.
This paper studies the synthesis of robust control for a satellite attitude control system without pre-knowledge of model dynamics. First, by exploring the time-series data of satellite angular velocity, a lifted disc...
详细信息
Model predictive control (MPC) has simple control structure and can achieve dynamic optimization control under constraint conditions. Due to the predictive control based on the system model, MPC has a high requirement...
详细信息
ISBN:
(纸本)9798350321050
Model predictive control (MPC) has simple control structure and can achieve dynamic optimization control under constraint conditions. Due to the predictive control based on the system model, MPC has a high requirement for the accuracy of model parameters. This paper proposes a recursive least squares method based on the mathematical model of the motor for online parameter identification of permanent magnet synchronous motor (PMSM), which addresses the issue of sudden parameter changes that affect control effectiveness during operation. Through online identification, it is possible to effectively track changes in motor parameters and obtain real-time and accurate motor models. In addition, the updated motor model can serve as a predictive model for MPC. Thus, MPC can achieve more dynamic accurate prediction. Finally, the simulation is carried out to verify the effectiveness of the method that is proposed in this paper.
A linear active disturbance rejection control (LADRC) strategy is proposed to suppress the structural vibration caused by external excitations and internal uncertainties in intelligent structures under complex working...
详细信息
ISBN:
(纸本)9798350321050
A linear active disturbance rejection control (LADRC) strategy is proposed to suppress the structural vibration caused by external excitations and internal uncertainties in intelligent structures under complex working conditions via an Anlu EG4S20B256 chip. First, the electromechanical coupling model of the whole vibration control system is obtained based on the dynamic equations of the all-clamped plate structure and the electromagnetic equations of the inertial actuator. Second, based on the system model, a third-order extended state observer (ESO) is designed to estimate the internal modelling errors and external excitation perturbations of the system in real time. In addition, the influence of internal and external disturbances on the control effect in the experiment is offset by a feedforward compensation. Finally, a vibration control platform based on the Anlu FPGA chip is built to verify the control effect of the proposed vibration active control strategy through physical real-time simulation.
In the last decade, data-driven approaches have become popular choices for quadrotor control, thanks to their ability to facilitate the adaptation to unknown or uncertain flight conditions. Among the different data-dr...
详细信息
ISBN:
(纸本)9798350377712;9798350377705
In the last decade, data-driven approaches have become popular choices for quadrotor control, thanks to their ability to facilitate the adaptation to unknown or uncertain flight conditions. Among the different data-driven paradigms, Deep Reinforcement learning (DRL) is currently one of the most explored. However, the design of DRL agents for Micro Aerial Vehicles (MAVs) remains an open challenge. While some works have studied the output configuration of these agents (i.e., what kind of control to compute), there is no general consensus on the type of input data these approaches should employ. Multiple works simply provide the DRL agent with full state information, without questioning if this might be redundant and unnecessarily complicate the learning process, or pose superfluous constraints on the availability of such information in real platforms. In this work, we provide an in-depth benchmark analysis of different configurations of the observation space. We optimize multiple DRL agents in simulated environments with different input choices and study their robustness and their sim-to-real transfer capabilities with zero-shot adaptation. We believe that the outcomes and discussions presented in this work supported by extensive experimental results could be an important milestone in guiding future research on the development of DRL agents for aerial robot tasks.
As the main means of transportation in global trade, ships have received great attention in industry and academia. Aiming at the automatic control of the ship in the ocean, an adaptive fast terminal sliding mode contr...
详细信息
ISBN:
(纸本)9798350321050
As the main means of transportation in global trade, ships have received great attention in industry and academia. Aiming at the automatic control of the ship in the ocean, an adaptive fast terminal sliding mode control (FTSMC) method is proposed and regulates the ship course effectively. In particular, considering the nonlinear dynamics of the ship and unexpected external disturbances, the linear extended state observer (LESO) is employed to estimate and compensate for the total disturbance in ship course control. Specifically, an adaptive control gain is elaborately designed to eliminate the tracking error caused by the estimation error of the total disturbance, which ensures the asymptotic convergence of the closed-loop system. Based on the designed sliding manifold, the proposed method can achieve the control objective without singularity. Furthermore, the closed-loop stability are proved by rigorous Lyapunov-based analysis. In addition, the simulations with comparative results are provided to validate the effectiveness and robustness of the proposed method.
暂无评论