咨询与建议

限定检索结果

文献类型

  • 61 篇 期刊文献
  • 21 篇 会议

馆藏范围

  • 82 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 74 篇 工学
    • 47 篇 计算机科学与技术...
    • 37 篇 控制科学与工程
    • 31 篇 电气工程
    • 6 篇 软件工程
    • 5 篇 机械工程
    • 3 篇 信息与通信工程
    • 2 篇 仪器科学与技术
    • 2 篇 航空宇航科学与技...
    • 1 篇 电子科学与技术(可...
    • 1 篇 化学工程与技术
    • 1 篇 交通运输工程
    • 1 篇 环境科学与工程(可...
  • 15 篇 理学
    • 6 篇 数学
    • 6 篇 系统科学
    • 3 篇 物理学
    • 2 篇 化学
    • 1 篇 生物学
    • 1 篇 生态学
  • 10 篇 管理学
    • 10 篇 管理科学与工程(可...
    • 2 篇 工商管理
  • 2 篇 经济学
    • 2 篇 应用经济学
  • 1 篇 法学
    • 1 篇 法学
  • 1 篇 教育学
    • 1 篇 教育学
  • 1 篇 军事学

主题

  • 82 篇 neuro-dynamic pr...
  • 28 篇 optimal control
  • 24 篇 reinforcement le...
  • 20 篇 approximate dyna...
  • 19 篇 adaptive critic ...
  • 18 篇 neural networks
  • 15 篇 adaptive dynamic...
  • 12 篇 nonlinear system...
  • 11 篇 dynamic programm...
  • 9 篇 adaptive dynamic...
  • 6 篇 function approxi...
  • 6 篇 policy iteration
  • 5 篇 scheduling
  • 4 篇 markov chains
  • 4 篇 generalized poli...
  • 3 篇 value iteration
  • 3 篇 temporal-differe...
  • 3 篇 q-learning
  • 2 篇 plug-in hybrid e...
  • 2 篇 differential gam...

机构

  • 21 篇 chinese acad sci...
  • 10 篇 univ sci & techn...
  • 8 篇 guangdong univ t...
  • 4 篇 beijing normal u...
  • 3 篇 alphatech inc bu...
  • 2 篇 guangdong univ t...
  • 2 篇 mit informat & d...
  • 2 篇 georgia inst tec...
  • 2 篇 school of automa...
  • 2 篇 mit dept elect e...
  • 2 篇 northeastern uni...
  • 2 篇 univ texas arlin...
  • 2 篇 southern univ sc...
  • 2 篇 univ illinois de...
  • 2 篇 changchun univ t...
  • 2 篇 rzeszow univ tec...
  • 1 篇 univ sci & techn...
  • 1 篇 princeton univ d...
  • 1 篇 univ chinese aca...
  • 1 篇 chinese acad sci...

作者

  • 19 篇 liu derong
  • 18 篇 wei qinglai
  • 7 篇 song ruizhuo
  • 5 篇 zhao bo
  • 5 篇 wang ding
  • 3 篇 bertsekas dp
  • 3 篇 tsitsiklis jn
  • 3 篇 jay h. lee
  • 3 篇 yang xiong
  • 3 篇 lee jh
  • 3 篇 lee jm
  • 3 篇 yan pengfei
  • 2 篇 burghardt andrze...
  • 2 篇 lewis frank l.
  • 2 篇 li yuanchun
  • 2 篇 an tianjiao
  • 2 篇 niket s. kaisare
  • 2 篇 vanroy b
  • 2 篇 szuster marcin
  • 2 篇 lin hanquan

语言

  • 74 篇 英文
  • 4 篇 其他
  • 4 篇 中文
检索条件"主题词=Neuro-Dynamic Programming"
82 条 记 录,以下是61-70 订阅
排序:
Analysis and optimization of service availability in an HA cluster with load-dependent machine availability
收藏 引用
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS 2007年 第9期18卷 1307-1319页
作者: Ang, Chee-Wei Tham, Chen-Khong Inst Infocomm Res Singapore 119613 Singapore Natl Univ Singapore Dept Elect & Comp Engn Singapore 119260 Singapore
Calculations of service availability of a High-Availability (HA) cluster are usually based on the assumption of load-independent machine availabilities. In this paper, we study the issues and show how the service avai... 详细信息
来源: 评论
Asymptotic analysis of temporal-difference learning algorithms with constant step-sizes
收藏 引用
MACHINE LEARNING 2006年 第2期63卷 107-133页
作者: Tadic, VB Univ Sheffield Dept Automat Control & Syst Engn Sheffield S1 3JD S Yorkshire England
The mean-square asymptotic behavior of temporal-difference learning algorithrns with constant step-sizes and linear function approximation is analyzed in this paper. The analysis is carried out for the case of discoun... 详细信息
来源: 评论
Relative value function approximation for the capacitated re-entrant line scheduling problem
收藏 引用
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING 2005年 第3期2卷 285-299页
作者: Choi, JY Reveliotis, S Georgia Inst Technol Sch Ind & Syst Engn Atlanta GA 30332 USA
The problem addressed in this study is that of determining how to allocate the workstation processing and buffering capacity in a capacitated re-entrant line to the job instances competing for it, in order to maximize... 详细信息
来源: 评论
Approximate dynamic programming strategies and their applicability for process control: A review and future directions
收藏 引用
INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS 2004年 第3期2卷 263-278页
作者: Lee, JM Lee, JH Georgia Inst Technol Sch Chem & Biomol Engn Atlanta GA 30332 USA
This paper reviews dynamic programming (DP), surveys approximate solution methods for it, and considers their applicability to process control problems. Reinforcement Learning (RL) and neuro-dynamic programming (NDP),... 详细信息
来源: 评论
Valuation of American options via basis functions
收藏 引用
IEEE TRANSACTIONS ON AUTOMATIC CONTROL 2004年 第3期49卷 374-385页
作者: Lai, TL Wong, SPS Stanford Univ Dept Stat Stanford CA 94305 USA Hong Kong Univ Sci & Technol Dept Informat & Syst Management Hong Kong Hong Kong Peoples R China
After a brief review of recent developments in the pricing and hedging of American options, this paper modifies the basis function approach to adaptive control and neuro-dynamic programming, and applies it to develop:... 详细信息
来源: 评论
Simulation-based learning of cost-to-go for control of nonlinear processes
收藏 引用
KOREAN JOURNAL OF CHEMICAL ENGINEERING 2004年 第2期21卷 338-344页
作者: Lee, JM Lee, JH Georgia Inst Technol Sch Chem & Biomol Engn Atlanta GA 30332 USA
In this paper, we present a simulation-based dynamic programming method that learns the 'cost-to-go' function in an iterative manner. The method is intended to combat two important drawbacks of the conventiona... 详细信息
来源: 评论
neuro-dynamic programming method for MPC 1
收藏 引用
IFAC Proceedings Volumes 2001年 第25期34卷 143-148页
作者: Jong Min Lee Jay H. Lee School of Chemical Engineering Georgia Institute of Technology Atlanta GA 30332 U.S.A
In this paper, we present how the approach of neuro-dynamic programming(NDP) can be used to combat two important deficiencies of the conventional Model Predictive Control (MPC) formulation, the sometimes exorbitant on... 详细信息
来源: 评论
Optimization of a Fed-Batch Bioreactor Using Simulation-Based Approach
收藏 引用
IFAC Proceedings Volumes 2004年 第1期37卷 347-352页
作者: Catalina Valencia Peroni Jay H. Lee Niket S. Kaisare Universitat Rovira I Virgili Tarragona Catalunya Spain School of Chemical Engineering Georgia Institute of Technology Atlanta CA 30332 U.S.A.
We use simulation-based approach to find the optimal feeding strategy for cloned invertase expression in Saccharomyces cerevisiae in a fed-batch bioreactor. The optimal strategy maximizes the productivity and minimize... 详细信息
来源: 评论
Simulation based strategy for nonlinear optimal control: application to a microbial cell reactor
收藏 引用
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL 2003年 第3-4期13卷 347-363页
作者: Kaisare, NS Lee, JM Lee, JH Georgia Inst Technol Sch Chem Engn Atlanta GA 30332 USA
Optimal control of systems with complex nonlinear behaviour such as steady state multiplicity results in a nonlinear optimization problem that needs to be solved online at each sample time. We present an approach base... 详细信息
来源: 评论
Markov decision processes with delays and asynchronous cost collection
收藏 引用
IEEE TRANSACTIONS ON AUTOMATIC CONTROL 2003年 第4期48卷 568-574页
作者: Katsikopoulos, KV Engelbrecht, SE Univ Massachusetts Dept Mech & Ind Engn Amherst MA 01003 USA Univ Massachusetts Dept Comp Sci Amherst MA 01003 USA
Markov decision processes (MDPs) may involve three types of delays. First, state information, rather than being available instantaneously, may arrive with a delay (observation delay). Second, an action may take effect... 详细信息
来源: 评论