文献详情 >Dynamic Inventory Optimization... 收藏

Dynamic Inventory Optimization with Learning and Model Ambig...

Dynamic Inventory Optimization with Learning and Model Ambiguity

学位级别：博士

授予年度：2019年

主题：Bayesian dynamic programming Bayesian learning inventory management

摘要：Classic inventory control problems typically assume that the demand distribution is known a priori. In reality, this assumption is not always satisfied. Motivated by this concern, the joint optimization of learn- ing and control is studied. We first consider the situation where parameters of the demand distribution are not known a priori, but need to be learned using right-censored sales data. A Bayesian framework is adopted for demand learning and the corresponding control problem is analyzed via Bayesian dynamic programming (BDP). Structural results of the optimal policy are established. In particular, we show that the BDP-optimal decisions can be expressed as the sum of a myopic-optimal decision plus a non- negative exploration boost which is proportional to the posterior index of dispersion of the unknown mean demand. This structure clearly articulates the manner in which the statistical learning and inven- tory control are jointly optimized. Next, we study an optimal inventory control problem in the presence of model miss-specification. In this problem, decision makers account for the miss-specification via solv- ing a worst-case problem against an adversary, nature, who has the ability to alter the underlying demand distribution so as to minimize the decision maker s expected reward. We show that the decision maker s robust-optimal decisions are bounded above by the optimal solutions of the nominal model. This structural result clearly explains the trade-off between optimization and risk aversion. In the last chapter, we attempt to incorporate the elements of the Bayesian and robust approaches, namely robust Bayesian optimization. In particular, we are interested in how decision makers can remain robust to model uncertainty while also learning at the same time. We establish an analytical upper bound of the decision maker s optimal decisions, which can be expressed as the sum of a myopic-optimal decision plus an exploration boost and minus a risk aversion adj

本地馆藏 | 借阅须知 | 我要预约

已订购，未入库

sda

目录详情 | 试阅读 |

读者评论与其他读者分享你的观点

学校读者

用户名:未登录

我的评分

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

看过本文的还看了

相关文献

该作者的其他文献

CADAL相关文献

Dynamic Inventory Optimization with Learning and Model Ambiguity

读者评论与其他读者分享你的观点

请选择收藏分类：

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

看过本文的还看了

相关文献

该作者的其他文献

CADAL相关文献

Dynamic Inventory Optimization with Learning and Model Ambiguity

读者评论 与其他读者分享你的观点

请选择收藏分类： 新增自定义分类 确定 取消

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

读者评论与其他读者分享你的观点

请选择收藏分类：