Decentralized optimization has become a standard paradigm for solving large-scale decision-making problems and training large machine learning models without centralizing data. However, this paradigm introduces new pr...
详细信息
— Ensuring safety is a crucial challenge when deploying reinforcement learning (RL) to real-world systems. We develop confidence-based safety filters, a control-theoretic approach for certifying state safety constrai...
详细信息
In this paper, we present a methodology that ensures a priori that all possible unknown dynamics of the system within a compact set of operation will be excited. A controller is used to make sure that the system with ...
详细信息
ISBN:
(数字)9781665465076
ISBN:
(纸本)9781665465083
In this paper, we present a methodology that ensures a priori that all possible unknown dynamics of the system within a compact set of operation will be excited. A controller is used to make sure that the system with unknown dynamics will follow the reference trajectory and Radial Basis Function (RBF) neural networks are employed to estimate the unknown nonlinearities. The persistency of excitation condition is guaranteed as a prerequisite to achieve accurate estimation of the unknown nonlinear terms and efficient learning. A simulation example clarifies the proposed approach and verifies the aforementioned assertions.
Advanced Air Mobility (AAM) envisages a sustainable, safe, convenient, and affordable air transport system. In socio-technical transition of AAM, there are a number of trade-offs in ecosystem that need to be studied. ...
详细信息
Advanced Air Mobility (AAM) envisages a sustainable, safe, convenient, and affordable air transport system. In socio-technical transition of AAM, there are a number of trade-offs in ecosystem that need to be studied. Three perspectives on economic feasibility are explored: first, based on history of VTOL services and value of time estimates, we discuss whether AAM can provide customers with competitive mobility services; second, what are the stakeholders’ insights on the deployment of AAM; last, the experience in the development of autonomous driving technology, such as parallel intelligence, can inform future AAM research.
In this work, an Integral Reinforcement Learning (RL) framework is employed to provide provably safe, convergent and almost globally optimal policies in a novel Off-Policy Iterative method for simply-connected workspa...
In this work, an Integral Reinforcement Learning (RL) framework is employed to provide provably safe, convergent and almost globally optimal policies in a novel Off-Policy Iterative method for simply-connected workspaces. This restriction stems from the impossibility of strictly global navigation in multiply connected manifolds, and is necessary for formulating continuous solutions. The current method generalizes and improves upon previous results, where parametrized controllers hindered the method in scope and results. Through enhancing the traditional reactive paradigm with RL, the proposed scheme is demonstrated to outperform both previous reactive methods as well as an RRT* method in path length, cost function values and execution times, indicating almost global optimality.
The paper discusses the problem of automatic tuning of the PID controller. The auto-tuning algorithm of the PID controller based on one machine learning method, which is equivalent to the steepest descent, is proposed...
详细信息
The objective of this work is to simultaneously control and identify the nonlinear longitudinal dynamics of small-scale fixed-wing Unmanned Aerial Vehicles (UAVs). The main difficulty in this endeavor lies in the sati...
详细信息
ISBN:
(数字)9781665465076
ISBN:
(纸本)9781665465083
The objective of this work is to simultaneously control and identify the nonlinear longitudinal dynamics of small-scale fixed-wing Unmanned Aerial Vehicles (UAVs). The main difficulty in this endeavor lies in the satisfaction of the Persistence of Excitation (PE) condition, which eventually ensures accurate learning. Towards this direction, our key components comprise Radial Basis Function - Neural Networks (RBF-NNs), which are suitable mathematical models for universal function approximation, alongside with: i) the recently developed Dynamic Regression Extension and Mixing (DREM) technique; a new procedure for designing parameter estimators with enhanced performance, as well as ii) a novel control design for the longitudinal UAV dynamics utilizing the Prescribed Performance control (PPC) methodology, which enables robust trajectory tracking with predetermined transient and steady state quality, even in the presence of model uncertainties.
In this paper we investigate the design of optimal spatially distributed controllers for a linear and spatially invariant reaction-diffusion process over the real line. The controller receives state measurements from ...
详细信息
ISBN:
(数字)9783907144107
ISBN:
(纸本)9798331540920
In this paper we investigate the design of optimal spatially distributed controllers for a linear and spatially invariant reaction-diffusion process over the real line. The controller receives state measurements from different spatial locations with non-negligible delays. In this set-up and for the class of proportional spatially invariant state feedback controllers, the optimal control synthesis problem is equivalent to a feedback gain optimization for a spatially distributed delay system. We show that the spatial locality of optimal feedback gains is affected not only by diffusion and reaction coefficients, but also by the parameter representing communication time-delay that causes a sharp flattening of the control gains. In the expensive control regime, the optimal controller is solved analytically, yielding some practical design guidelines.
While Koopman-based techniques like extended Dynamic Mode Decomposition are nowadays ubiquitous in the data-driven approximation of dynamical systems, quantitative error estimates were only recently established. To th...
详细信息
While Koopman-based techniques like extended Dynamic Mode Decomposition are nowadays ubiquitous in the data-driven approximation of dynamical systems, quantitative error estimates were only recently established. To this end, both sources of error resulting from a finite dictionary and only finitely-many data points in the generation of the surrogate model have to be taken into account. We generalize the rigorous analysis of the approximation error to the control setting while simultaneously reducing the impact of the curse of dimensionality by using a recently proposed bilinear approach. In particular, we establish uniform bounds on the approximation error of state-dependent quantities like constraints or a performance index enabling data-based optimal and predictive control with guarantees.
This paper proposes a distributed prescribed-time observer for nonlinear systems representable in a block-triangular observable canonical form. Using a weighted average of neighbor estimates exchanged over a strongly ...
详细信息
暂无评论