Search criteria: "Subject term = Approximate Dynamic Programming"
985 records; showing 891-900

Dynamic Asset Allocation Approaches for Counter-Piracy Operations
International Conference on Information Fusion
Authors: Woosun An, Diego Fernando Martinez Ayala, David Sidoti, Manisha Mishra, Xu Han, Krishna R. Pattipati, Eva D. Regnier, David L. Kleinman, James A. Hansen. Affiliations: Dept. Electrical and Computer Engineering, University of Connecticut, Connecticut, United States; Naval Postgraduate School, California, United States; Naval Research Laboratory, California, United States
Piracy on the high seas is a problem of world-wide concern. In response to this threat, the US Navy has developed a visualization tool known as the Pirate Attack Risk Surface (PARS) that integrates intelligence data, ...

ACCOUNTING RISK IN MULTISTAGE STOCHASTIC PROBLEMS USING APPROXIMATE DYNAMIC PROGRAMMING
IFAC Proceedings Volumes, 2007, Vol. 40, No. 5, pp. 153-158
Authors: Nikolaos E. Pratikakis, Matthew J. Realff, Jay H. Lee. Affiliation: Chemical and Biomolecular Engineering, Georgia Institute of Technology, 311 Ferst Drive, Atlanta, GA 30332-0100, USA
This work proposes a methodology to generate risk averse policies for Markov Decision Processes (MDPs). This methodology is based on modifying the one stage reward or cost to weigh the trade-off between expected perfor...

Robust Approximate Bilinear Programming for Value Function Approximation
JOURNAL OF MACHINE LEARNING RESEARCH, 2011, Vol. 12, pp. 3027-3063
Authors: Petrik, Marek; Zilberstein, Shlomo. Affiliations: IBM Corp, Thomas J Watson Res Ctr, Yorktown Hts, NY 10598 USA; Univ Massachusetts, Dept Comp Sci, Amherst, MA 01003 USA
Value function approximation methods have been successfully used in many applications, but the prevailing techniques often lack useful a priori error bounds. We propose a new approximate bilinear programming formulati...

Adaptive Dynamic Programming for Finite-Horizon Optimal Control of Discrete-Time Nonlinear Systems with ε-Error Bound
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2011, Vol. 22, No. 1, pp. 24-36
Authors: Wang, Fei-Yue; Jin, Ning; Liu, Derong; Wei, Qinglai. Affiliations: Chinese Acad Sci, Inst Automat, Key Lab Complex Syst & Intelligence Sci, Beijing 100190, Peoples R China; Univ Illinois, Dept Elect & Comp Engn, Chicago, IL 60607 USA
In this paper, we study the finite-horizon optimal control problem for discrete-time nonlinear systems using the adaptive dynamic programming (ADP) approach. The idea is to use an iterative ADP algorithm to obtain the...

Fast Evaluation of Quadratic Control-Lyapunov Policy
IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2011, Vol. 19, No. 4, pp. 939-946
Authors: Wang, Yang; Boyd, Stephen. Affiliation: Stanford Univ, Dept Elect Engn, Stanford, CA 94305 USA
The evaluation of a control-Lyapunov policy, with quadratic Lyapunov function, requires the solution of a quadratic program (QP) at each time step. For small problems this QP can be solved explicitly; for larger proble...

Optimal Tracking Control for a Class of Nonlinear Discrete-Time Systems with Time Delays Based on Heuristic Dynamic Programming
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2011, Vol. 22, No. 12, pp. 1851-1862
Authors: Zhang, Huaguang; Song, Ruizhuo; Wei, Qinglai; Zhang, Tieyan. Affiliations: Northeastern Univ, Sch Informat Sci & Engn, Shenyang 110004, Peoples R China; Northeastern Univ, Natl Educ Minist Key Lab Integrated Automat Proc Ind, Shenyang 110004, Peoples R China; Chinese Acad Sci, Inst Automat, Key Lab Complex Syst & Intelligence Sci, Beijing 100190, Peoples R China; Shenyang Inst Engn, Dept Elect Engn, Shenyang 110136, Peoples R China
In this paper, a novel heuristic dynamic programming (HDP) iteration algorithm is proposed to solve the optimal tracking control problem for a class of nonlinear discrete-time systems with time delays. The novel algor...

The Effect of Robust Decisions on the Cost of Uncertainty in Military Airlift Operations
ACM TRANSACTIONS ON MODELING AND COMPUTER SIMULATION, 2011, Vol. 22, No. 1, pp. 1-19
Authors: Powell, Warren B.; Bouzaiene-Ayari, Belgacem; Berger, Jean; Boukhtouta, Abdeslem; George, Abraham P. Affiliations: Princeton Univ, Dept Operat Res & Financial Engn, Princeton, NJ 08544 USA; DRDC Valcartier, Quebec City, ON, Canada
There are a number of sources of randomness that arise in military airlift operations. However, the cost of uncertainty can be difficult to estimate, and is easy to overestimate if we use simplistic decision rules. Us...

Linear programming based decomposition methods for inventory distribution systems
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2011, Vol. 211, No. 2, pp. 282-297
Authors: Kunnumkal, Sumit; Topaloglu, Huseyin. Affiliations: Cornell Univ, Sch Operat Res & Informat Engn, Ithaca, NY 14853 USA; Indian Sch Business, Hyderabad 500032, Andhra Pradesh, India
We consider an inventory distribution system consisting of one warehouse and multiple retailers. The retailers face random demand and are supplied by the warehouse. The warehouse replenishes its stock from an external...

A review of stochastic algorithms with continuous value function approximation and some new approximate policy iteration algorithms for multidimensional continuous applications
Journal of Control Theory and Applications (控制理论与应用, English edition), 2011, Vol. 9, No. 3, pp. 336-352
Author: Warren B. POWELL. Affiliation: Department of Operations Research and Financial Engineering, Princeton University
We review the literature on approximate dynamic programming, with the goal of better understanding the theory behind practical algorithms for solving dynamic programs with continuous and vector-valued states and action...

Semi-Markov adaptive critic heuristics with application to airline revenue management
Journal of Control Theory and Applications (控制理论与应用, English edition), 2011, Vol. 9, No. 3, pp. 421-430
Authors: Ketaki KULKARNI, Abhijit GOSAVI, Susan MURRAY, Katie GRANTHAM. Affiliation: Department of Engineering Management and Systems Engineering, Missouri University of Science and Technology
The adaptive critic heuristic has been a popular algorithm in reinforcement learning (RL) and approximate dynamic programming (ADP) *** is one of the first RL and ADP *** and ADP algorithms are particularly useful for so...