Search query: subject term = "simulation-based algorithms"
7 records

Simulation-Based Optimization Algorithms for Finite-Horizon Markov Decision Processes
SIMULATION-TRANSACTIONS OF THE SOCIETY FOR MODELING AND SIMULATION INTERNATIONAL, 2008, Vol. 84, No. 12, pp. 577-600
Authors: Bhatnagar, Shalabh; Abdulla, Mohammed Shahid. Affiliations: Indian Inst Sci, Dept Comp Sci & Automat, Bangalore 560012, Karnataka, India; Gen Motors India Sci Lab, Bangalore, Karnataka, India
We develop four simulation-based algorithms for finite-horizon Markov decision processes. Two of these algorithms are developed for finite state and compact action spaces while the other two are for finite state and f...

ON THE RATES OF CONVERGENCE OF SIMULATION-BASED OPTIMIZATION ALGORITHMS FOR OPTIMAL STOPPING PROBLEMS
ANNALS OF APPLIED PROBABILITY, 2011, Vol. 21, No. 1, pp. 215-239
Author: Belomestny, Denis. Affiliation: Weierstrass Inst Appl Anal & Stochast, D-10117 Berlin, Germany
In this paper, we study simulation-based optimization algorithms for solving discrete time optimal stopping problems. Using large deviation theory for the increments of empirical processes, we derive optimal convergen...

Learning algorithms for Markov decision processes with average cost
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2001, Vol. 40, No. 3, pp. 681-698
Authors: Abounadi, J; Bertsekas, D; Borkar, VS. Affiliations: MIT, Informat & Decis Syst Lab, Cambridge, MA 02139, USA; Tata Inst Fundamental Res, Sch Technol & Comp Sci, Mumbai 400005, India
This paper gives the first rigorous convergence analysis of analogues of Watkins's Q-learning algorithm, applied to average cost control of finite-state Markov chains. We discuss two algorithms which may be viewed a...

Evolutionary Markov chain Monte Carlo algorithms for optimal monitoring network designs
STATISTICAL METHODOLOGY, 2012, Vol. 9, No. 1-2, pp. 185-194
Authors: Ruiz-Cardenas, Ramiro; Ferreira, Marco A. R.; Schmidt, Alexandra M. Affiliations: Univ Missouri, Dept Stat, Columbia, MO 65211, USA; Univ Fed Minas Gerais, Dept Estat, Belo Horizonte, MG, Brazil; Univ Fed Rio de Janeiro, Inst Matemat, BR-21941 Rio De Janeiro, Brazil
We propose an evolutionary Markov chain Monte Carlo (eMCMC) framework for optimal design of large-scale monitoring networks. From a Bayesian decision theoretical perspective, the optimal design is the design that maxi...

The actor-critic algorithm as multi-time-scale stochastic approximation
SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 1997, Vol. 22, No. 4, pp. 525-543
Authors: Borkar, VS; Konda, VR. Affiliation: Indian Inst Sci, Dept Comp Sci & Automat, Bangalore 560012, Karnataka, India
The actor-critic algorithm of Barto and others for simulation-based optimization of Markov decision processes is cast as a two-time-scale stochastic approximation. Convergence analysis, approximation issues and an exa...

Network Coding-Based Wireless Media Transmission Using POMDP
17th International Packet Video Workshop
Authors: Nguyen, Dong; Nguyen, Thinh. Affiliation: Oregon State Univ, Sch Elect Engn & Comp Sci, Corvallis, OR 97331, USA
We consider the problem of joint network coding and packet scheduling for multimedia transmission from the Access Point (AP) to multiple receivers in 802.11 networks. The state of receivers is described by a hidden Ma...

SOLVING OPTIMAL STOPPING PROBLEMS VIA EMPIRICAL DUAL OPTIMIZATION
ANNALS OF APPLIED PROBABILITY, 2013, Vol. 23, No. 5, pp. 1988-2019
Author: Belomestny, Denis. Affiliation: Duisburg Essen Univ, D-45127 Essen, Germany
In this paper we consider a method of solving optimal stopping problems in discrete and continuous time based on their dual representation. A novel and generic simulation-based optimization algorithm not involving nes...