作者:
Bo PangZhong-Ping JiangControl and Networks Lab
Department of Electrical and Computer Engineering Tandon School of Engineering New York University Six Metrotech Center Brooklyn NY 11201 USA
This paper studies the robustness of reinforcement learning for discrete-time linear stochastic systems with multiplicative noise evolving in continuous state and action spaces. As one of the popular methods in reinfo...
详细信息
This paper studies the robustness of reinforcement learning for discrete-time linear stochastic systems with multiplicative noise evolving in continuous state and action spaces. As one of the popular methods in reinforcement learning, the robustness of policy iteration is a longstanding open issue for the stochastic linear quadratic regulator (LQR) problem with multiplicative noise. A solution in the spirit of small-disturbance input-to-state stability is given, guaranteeing that the solutions of the policy iteration algorithm are bounded and enter a small neighborhood of the optimal solution, whenever the error in each iteration is bounded and small. In addition, a novel off-policy multiple-trajectory optimistic least-squares policy iteration algorithm is proposed, to learn a near-optimal solution of the stochastic LQR problem directly from online input/state data, without explicitly identifying the system matrices. The efficacy of the proposed algorithm is supported by rigorous convergence analysis and numerical results on a second-order example.
With the advent of the global aging era, the impact of stroke on humans is continuously increasing. Therefore, new therapeutic approaches to enhance patient recovery are receiving growing attention. Stroke is a severe...
详细信息
This paper presents joint motion planning of a vehicle with an attachedrotating turret. The turret has a limited range as well as the field of *** objective is capture a maneuvering target such that at the terminal ti...
详细信息
In this work, a predictor-based fixed-time sliding mode control is designed to tackle the problem of achieving precise trajectory tracking control of multiple unmanned aerial vehicles formation flight. The proposed ap...
详细信息
Asynchronously switched sampled-data systems can help model power systems and vehicles that evolve in continuous-time with switching behavior and discrete time measurements. We address the problem of jointly estimatin...
详细信息
ISBN:
(数字)9781665467612
ISBN:
(纸本)9781665467629
Asynchronously switched sampled-data systems can help model power systems and vehicles that evolve in continuous-time with switching behavior and discrete time measurements. We address the problem of jointly estimating a switching signal, with uncertainty in the exact switching times, as well as the continuous states of the system. We prove stability of the standard Kalman Filter under uncertainty in the switching times, with statistical bounds relating to the sampling period. We then propose a method for estimation of switching times as well as a method for efficient joint estimation of the state and switching signal inspired by the interacting multiple-model extended-Viterbi algorithm. We validate our algorithms in simulation for a power converter and a maneuvering vehicle.
This paper considers the global finite time stabilization of linear systems subject to actuator saturation. We show that a controllable SISO system whose eigenvalues are real and negative can be globally stabilized in...
详细信息
ISBN:
(数字)9798350382655
ISBN:
(纸本)9798350382662
This paper considers the global finite time stabilization of linear systems subject to actuator saturation. We show that a controllable SISO system whose eigenvalues are real and negative can be globally stabilized in finite time by a homogeneous feedback law. To obviate numerical issues, this homogeneous feedback law is implemented by discretizing a compact set. Numerical simulation validates the effectiveness of the proposed approach.
作者:
Shang, JunLi, YuzheChen, TongwenTongji University
Department of Control Science and Engineering Shanghai Institute of Intelligent Science and Technology National Key Laboratory of Autonomous Intelligent Unmanned Systems Frontiers Science Center for Intelligent Autonomous Systems Shanghai200092 China Northeastern University
State Key Laboratory of Synthetical Automation for Process Industries Shenyang110004 China University of Alberta
Department of Electrical and Computer Engineering EdmontonABT6G 1H9 Canada
This paper investigates stealthy attacks on sampled-data control systems, where a continuous process is sampled periodically, and the resultant discrete output and control signals are transmitted through dual channels...
详细信息
Can deception exist in differential games? We provide a case study for a Turret-Attacker differential game, where two Attackers seek to score points by reaching a target region while a Turret tries to minimize the sco...
详细信息
This paper investigates a partial-information pursuit evasion game in which the Pursuer has a limited-range sensor to detect the Evader. Given a fixed final time, we derive the optimal evasion strategy for the Evader ...
详细信息
Average consensus protocols emerge with a central role in distributed systems and decision-making such as distributed information fusion, distributed optimization, distributed estimation, and control. A key advantage ...
详细信息
暂无评论