Search query: subject term = "Reinforcement learning in continuous time"
1 record (showing 1-10)
Control Randomisation Approach for Policy Gradient and Application to Reinforcement Learning in Optimal Switching
Applied Mathematics and Optimization, 2025, vol. 91, no. 1, pp. 1-33
Authors: Denkert, Robert; Pham, Huyen; Warin, Xavier
Affiliations: Humboldt Univ Dept Math Berlin Germany; Ecole Polytech CMAP Palaiseau France; Lab Finance Marches Energie EDF R&D Palaiseau France; Lab Finance Marches Energie FiME Palaiseau France
Abstract: We propose a comprehensive framework for policy gradient methods tailored to continuous time reinforcement learning. This is based on the connection between stochastic control problems and randomised problems, enablin...