
arXiv

Monte Carlo Grid Dynamic Programming: Almost Sure Convergence and Probability Constraints

Authors: Ramadan, Mohammad S.; Al-Tawaha, Ahmad; Shouman, Mohamed; Atallah, Ahmed; Jin, Ming

Author affiliations: Mathematics and Computer Science Division, Argonne National Laboratory, Lemont, IL 60439, United States; Department of Electrical and Computer Engineering, Virginia Tech, Blacksburg, VA, United States; Aichi, Nagoya, Japan; Department of Mechanical & Aerospace Engineering, University of California San Diego, La Jolla, CA 92093-0411, United States

Publication: arXiv

Year/volume/issue: 2023


Subject: Stochastic models

Abstract: Dynamic Programming (DP) suffers from the well-known curse of dimensionality, further exacerbated by the need to compute expectations over process noise in stochastic models. This paper presents a Monte Carlo-based sampling approach for the state space and an interpolation procedure for the resulting value function, dependent on the process noise density, in a self-approximating fashion, eliminating the need for ordering or set-membership tests. We provide proof of almost sure convergence for the value iteration (and consequently, policy iteration) procedure. The proposed meshless sampling and interpolation algorithm alleviates the burden of gridding the state space, traditionally required in DP, and avoids constructing a piecewise constant value function over a grid. Moreover, we demonstrate that the proposed interpolation procedure is well suited for handling probabilistic constraints by sampling both infeasible and feasible regions. The curse of dimensionality cannot be avoided; however, this approach offers a practical framework for addressing lower-order stochastic nonlinear systems with probabilistic constraints, while eliminating the need for linear interpolations and set-membership tests. Numerical examples are presented to further explain and illustrate the convenience of the proposed algorithms. © 2023, CC BY.
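The sampling-and-interpolation idea in the abstract can be illustrated with a minimal sketch. It assumes one plausible reading of "interpolation dependent on the process noise density": the expected next-step value is approximated by a self-normalized average of the value at the sampled states, weighted by the noise density evaluated at each sample. This is an illustration under that assumption, not the authors' algorithm. The scalar system `x+ = 0.8 x + u + w`, the quadratic stage cost, the control grid, and the function name `mc_grid_value_iteration` are all hypothetical.

```python
import numpy as np

def mc_grid_value_iteration(n_samples=200, n_iters=100, gamma=0.9, sigma=0.3, seed=0):
    """Value iteration on randomly sampled states (hypothetical 1-D example)."""
    rng = np.random.default_rng(seed)
    # Monte Carlo "grid": states drawn at random, no ordering or mesh required.
    xs = rng.uniform(-3.0, 3.0, size=n_samples)
    us = np.linspace(-2.0, 2.0, 21)  # candidate controls (assumed finite set)

    # Noise-free next state for every (state, control) pair, shape (N, M).
    mean_next = 0.8 * xs[:, None] + us[None, :]

    # Gaussian noise density (up to a constant) evaluated at x_j - mean_next[i, k].
    diff = xs[None, None, :] - mean_next[:, :, None]  # shape (N, M, N)
    weights = np.exp(-0.5 * (diff / sigma) ** 2)
    # Self-normalization: each (i, k) row of weights sums to one.
    weights /= weights.sum(axis=2, keepdims=True)

    stage_cost = xs[:, None] ** 2 + us[None, :] ** 2  # shape (N, M)

    V = np.zeros(n_samples)
    for _ in range(n_iters):
        # E[V(x+)] is approximated by a density-weighted average over the samples.
        Q = stage_cost + gamma * (weights @ V)
        V = Q.min(axis=1)

    policy = us[Q.argmin(axis=1)]
    return xs, V, policy

if __name__ == "__main__":
    xs, V, policy = mc_grid_value_iteration()
    i0 = np.argmin(np.abs(xs))
    print("state nearest the origin:", xs[i0], "value:", V[i0], "control:", policy[i0])
```

Because the weights are computed directly from the noise density at the sampled points, the sketch needs no sorting of samples and no test of which grid cell a next state falls in, which is the property the abstract emphasizes. Probabilistic constraints are not handled in this sketch.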
