咨询与建议

限定检索结果

文献类型

  • 2 篇 期刊文献
  • 1 篇 会议

馆藏范围

  • 3 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 3 篇 理学
    • 3 篇 数学
    • 1 篇 统计学(可授理学、...
  • 1 篇 工学
    • 1 篇 计算机科学与技术...
  • 1 篇 管理学
    • 1 篇 管理科学与工程(可...

主题

  • 3 篇 control randomiz...
  • 1 篇 vy process
  • 1 篇 memory reduction
  • 1 篇 backward stochas...
  • 1 篇 reinforcement le...
  • 1 篇 stochastic contr...
  • 1 篇 actor-critic alg...
  • 1 篇 impulse control
  • 1 篇 non-markovian
  • 1 篇 l & eacute
  • 1 篇 policy gradient
  • 1 篇 real option
  • 1 篇 optimal stopping
  • 1 篇 monte carlo
  • 1 篇 optimal switchin...

机构

  • 1 篇 humboldt univ de...
  • 1 篇 lab finance marc...
  • 1 篇 csiro n ryde nsw...
  • 1 篇 linnaeus univ va...
  • 1 篇 ecole polytech c...
  • 1 篇 lab finance marc...
  • 1 篇 csiro clayton vi...

作者

  • 1 篇 cooksey m.
  • 1 篇 langrene n.
  • 1 篇 denkert robert
  • 1 篇 tarnopolskaya t.
  • 1 篇 perninge magnus
  • 1 篇 chen w.
  • 1 篇 zhu z.
  • 1 篇 pham huyen
  • 1 篇 warin xavier

语言

  • 2 篇 其他
  • 1 篇 英文
检索条件"主题词=Control randomization"
3 条 记 录,以下是1-10 订阅
排序:
control Randomisation Approach for Policy Gradient and Application to Reinforcement Learning in Optimal Switching
收藏 引用
APPLIED MATHEMATICS AND OPTIMIZATION 2025年 第1期91卷 1-33页
作者: Denkert, Robert Pham, Huyen Warin, Xavier Humboldt Univ Dept Math Berlin Germany Ecole Polytech CMAP Palaiseau France Lab Finance Marches Energie EDF R&D Palaiseau France Lab Finance Marches Energie FiME Palaiseau France
We propose a comprehensive framework for policy gradient methods tailored to continuous time reinforcement learning. This is based on the connection between stochastic control problems and randomised problems, enablin... 详细信息
来源: 评论
Optimal stopping of BSDEs with constrained jumps and related zero-sum games
收藏 引用
STOCHASTIC PROCESSES AND THEIR APPLICATIONS 2024年 173卷
作者: Perninge, Magnus Linnaeus Univ Vaxjo Sweden
In this paper, we introduce a non -linear Snell envelope which at each time represents the maximal value that can be achieved by stopping a BSDE with constrained jumps. We establish the existence of the Snell envelope... 详细信息
来源: 评论
New Regression Monte Carlo Methods for High-dimensional Real Options Problems in Minerals industry  21
New Regression Monte Carlo Methods for High-dimensional Real...
收藏 引用
21st International Congress on Modelling and Simulation (MODSIM) held jointly with the 23rd National Conference of the Australian-Society-for-Operations-Research / DSTO led Defence Operations Research Symposium (DORS
作者: Langrene, N. Tarnopolskaya, T. Chen, W. Zhu, Z. Cooksey, M. CSIRO Clayton Vic 3168 Australia CSIRO N Ryde NSW 2113 Australia
Mining operations are affected by significant uncertainty in commodity prices, combined with geological uncertainties (both in quantity and quality of the available reserves). Technical difficulties and costs associat... 详细信息
来源: 评论