Lately, nonlinear model predictive control (NMPC) has been successfully applied to (semi-)autonomous driving problems and has proven to be a very promising technique. However, accurate control models for real vehicles can require costly and time-consuming dedicated measurements. To address this problem, the exploitation of system data to complement or derive the prediction model of the NMPC has been explored, employing learned-dynamics approaches within learning-based NMPC (LbNMPC). Its application to the automotive field has focused on discrete gray-box modeling, in which a nominal dynamics model is enhanced by a data-driven component. In this manuscript, we present an LbNMPC controller for a real go-kart based on a continuous black-box model of the accelerations obtained by Gaussian processes (GP). We show the effectiveness of the proposed approach by testing the controller on a real go-kart vehicle, highlighting the approximation steps required to obtain a GP model exploitable in a real-time application.
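As a rough illustration of the black-box acceleration modeling described in this abstract, the following Python sketch fits a Gaussian process to logged accelerations and uses its posterior mean inside a one-step prediction of the kind an NMPC would roll out. The feature choice, the scikit-learn toolchain, and the placeholder data are my assumptions; the paper's actual real-time approximation steps (e.g., sparse or reduced GP variants) are not reproduced here.

# Minimal sketch (assumed setup, not the authors' code): a GP maps a vehicle
# state/input feature vector to measured accelerations; its posterior mean acts
# as a continuous black-box acceleration model inside an NMPC prediction step.
import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import RBF, WhiteKernel

# Logged driving data: features = [v_long, v_lat, yaw_rate, steering, throttle]
X_log = np.random.rand(500, 5)      # placeholder for real measurements
a_log = np.random.rand(500, 2)      # measured [a_long, a_lat]

kernel = RBF(length_scale=np.ones(5)) + WhiteKernel(noise_level=1e-2)
gp_accel = GaussianProcessRegressor(kernel=kernel, normalize_y=True).fit(X_log, a_log)

def predict_state(x, u, dt=0.02):
    """One Euler step of a toy state [px, py, yaw, v_long, v_lat, yaw_rate]
    using GP-predicted accelerations instead of an identified dynamics model."""
    feats = np.array([[x[3], x[4], x[5], u[0], u[1]]])
    a_long, a_lat = gp_accel.predict(feats)[0]
    px, py, yaw, vx, vy, r = x
    return np.array([
        px + dt * (vx * np.cos(yaw) - vy * np.sin(yaw)),
        py + dt * (vx * np.sin(yaw) + vy * np.cos(yaw)),
        yaw + dt * r,
        vx + dt * a_long,
        vy + dt * a_lat,
        r,  # yaw acceleration omitted in this toy model
    ])

In an NMPC loop, this prediction would be evaluated repeatedly over the horizon, which is why the paper's approximation steps for real-time GP evaluation matter.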
ISBN (print): 9781713872344
This paper proposes a method to encourage safety in model predictive control (MPC)-based reinforcement learning (RL) via Gaussian process (GP) regression. The framework consists of 1) a parametric MPC scheme that is employed as a model-based controller with approximate knowledge of the real system's dynamics, 2) an episodic RL algorithm tasked with adjusting the MPC parametrization in order to increase its performance, and 3) GP regressors used to estimate, directly from data, constraints on the MPC parameters capable of predicting, up to some probability, whether the parametrization is likely to yield a safe or unsafe policy. These constraints are then enforced on the RL updates in an effort to enhance the learning method with a probabilistic safety mechanism. Compared to other recent publications combining safe RL with MPC, our method does not require further assumptions on, e.g., the prediction model in order to retain computational tractability. We illustrate the results of our method in a numerical example on the control of a quadrotor drone in a safety-critical environment. Copyright (c) 2023 The Authors. This is an open access article under the CC BY-NC-ND license (https://***/licenses/by-nc-nd/4.0/)
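A minimal sketch of the probabilistic safety mechanism this abstract describes, under my own assumptions: a GP regressor maps MPC parameter vectors to an observed safety margin, and an RL parameter step is accepted only if the GP's lower confidence bound at the candidate parametrization is nonnegative. The margin definition, the 3-dimensional parameter vector, and the confidence threshold are illustrative choices, not taken from the paper.

# Hedged sketch (my construction, not the paper's implementation): accept an RL
# update only if the GP predicts the new MPC parametrization to be safe up to a
# beta-sigma confidence level.
import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import Matern

thetas  = np.random.uniform(-1, 1, size=(30, 3))   # past MPC parametrizations
margins = np.random.uniform(-0.2, 0.5, size=30)    # observed safety margins (>0 = safe episode)

gp_safe = GaussianProcessRegressor(kernel=Matern(nu=2.5), normalize_y=True).fit(thetas, margins)

def safe_update(theta, rl_step, beta=2.0):
    """Apply the RL parameter step only if the GP's lower confidence bound on the
    candidate's safety margin is nonnegative; otherwise keep the current theta."""
    candidate = theta + rl_step
    mu, sigma = gp_safe.predict(candidate.reshape(1, -1), return_std=True)
    if mu[0] - beta * sigma[0] >= 0.0:   # probabilistic safety constraint
        return candidate
    return theta

theta_new = safe_update(np.zeros(3), np.array([0.05, -0.02, 0.01]))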
This paper introduces a predictive congestion pricing method for cities in which tolls vary from region to region. We consider a large urban network partitioned into multiple regions, each with a well-defined Macroscopic Fundamental Diagram (MFD), where multiple routes exist between each origin and destination region. The proposed cordon pricing method is designed to (i) minimize vehicles' total time spent in the network and (ii) aim for revenue-neutral tolling. A controller based on a model predictive control (MPC) approach is proposed to determine the (possibly negative) optimal time- and region-varying tolls. The MPC controller comprises a regional MFD-based traffic model that requires no destination information and a long short-term memory neural network (LSTM-NN) that provides an accurate estimate of inter-region transfer flows. Results of numerical experiments indicate the effectiveness of the proposed congestion pricing method in achieving the two objectives simultaneously, compared with no-toll and reactive feedback controllers.
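To make the transfer-flow estimation step concrete, here is an assumed PyTorch sketch of an LSTM-NN mapping a short history of regional vehicle accumulations to a predicted region-to-region transfer-flow matrix, which an MFD-based MPC could consume in its prediction model. Region count, window length, and layer sizes are arbitrary choices, not the paper's architecture.

# Illustrative sketch (assumed architecture): predict next-step inter-region
# transfer flows from a window of regional accumulations.
import torch
import torch.nn as nn

N_REGIONS, WINDOW = 4, 10

class TransferFlowLSTM(nn.Module):
    def __init__(self, hidden=64):
        super().__init__()
        self.lstm = nn.LSTM(input_size=N_REGIONS, hidden_size=hidden, batch_first=True)
        # Predict a full region-to-region transfer matrix, flattened
        self.head = nn.Linear(hidden, N_REGIONS * N_REGIONS)

    def forward(self, accumulations):                       # (batch, WINDOW, N_REGIONS)
        _, (h, _) = self.lstm(accumulations)
        flows = self.head(h[-1])                            # (batch, N_REGIONS**2)
        return flows.view(-1, N_REGIONS, N_REGIONS).relu()  # transfer flows are nonnegative

model = TransferFlowLSTM()
history = torch.rand(1, WINDOW, N_REGIONS)   # measured vehicle accumulations per region
predicted_flows = model(history)             # fed to the MPC's regional MFD model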