咨询与建议

限定检索结果

文献类型

  • 747 篇 期刊文献
  • 208 篇 会议
  • 23 篇 学位论文
  • 1 册 图书

馆藏范围

  • 979 篇 电子文献
  • 1 种 纸本馆藏

日期分布

学科分类号

  • 746 篇 工学
    • 307 篇 计算机科学与技术...
    • 271 篇 电气工程
    • 249 篇 控制科学与工程
    • 86 篇 交通运输工程
    • 50 篇 机械工程
    • 42 篇 石油与天然气工程
    • 40 篇 土木工程
    • 38 篇 软件工程
    • 31 篇 信息与通信工程
    • 26 篇 化学工程与技术
    • 25 篇 动力工程及工程热...
    • 16 篇 仪器科学与技术
    • 8 篇 环境科学与工程(可...
    • 4 篇 力学(可授工学、理...
    • 4 篇 电子科学与技术(可...
    • 4 篇 建筑学
  • 356 篇 管理学
    • 339 篇 管理科学与工程(可...
    • 52 篇 工商管理
    • 6 篇 公共管理
  • 231 篇 理学
    • 196 篇 数学
    • 65 篇 系统科学
    • 11 篇 统计学(可授理学、...
    • 9 篇 物理学
    • 7 篇 生物学
    • 4 篇 生态学
  • 79 篇 经济学
    • 55 篇 应用经济学
    • 25 篇 理论经济学
  • 18 篇 医学
    • 11 篇 基础医学(可授医学...
    • 10 篇 临床医学
    • 7 篇 公共卫生与预防医...
  • 8 篇 军事学
  • 7 篇 农学
  • 3 篇 法学

主题

  • 979 篇 approximate dyna...
  • 142 篇 reinforcement le...
  • 141 篇 optimal control
  • 83 篇 adaptive dynamic...
  • 77 篇 neural networks
  • 64 篇 adaptive critic ...
  • 62 篇 markov decision ...
  • 59 篇 dynamic programm...
  • 50 篇 markov decision ...
  • 36 篇 nonlinear system...
  • 29 篇 adaptive dynamic...
  • 22 篇 uncertainty
  • 22 篇 adaptive control
  • 21 篇 neural network
  • 21 篇 policy iteration
  • 20 篇 neuro-dynamic pr...
  • 19 篇 linear programmi...
  • 18 篇 value function a...
  • 17 篇 value iteration
  • 17 篇 optimization

机构

  • 63 篇 chinese acad sci...
  • 33 篇 univ sci & techn...
  • 18 篇 princeton univ d...
  • 12 篇 georgia inst tec...
  • 11 篇 tsinghua univ de...
  • 10 篇 school of automa...
  • 9 篇 northeastern uni...
  • 9 篇 cornell univ sch...
  • 9 篇 univ rhode isl d...
  • 8 篇 air force instit...
  • 7 篇 the state key la...
  • 7 篇 south china univ...
  • 7 篇 univ illinois de...
  • 6 篇 univ chicago boo...
  • 6 篇 tsinghua univ sc...
  • 6 篇 univ chinese aca...
  • 6 篇 chinese acad sci...
  • 6 篇 univ chinese aca...
  • 5 篇 natl univ singap...
  • 5 篇 univ illinois de...

作者

  • 65 篇 wei qinglai
  • 58 篇 liu derong
  • 29 篇 song ruizhuo
  • 22 篇 powell warren b.
  • 21 篇 wang ding
  • 16 篇 lee jay h.
  • 15 篇 ulmer marlin w.
  • 13 篇 lee jong min
  • 12 篇 lewis frank l.
  • 12 篇 zhang huaguang
  • 11 篇 li hongliang
  • 10 篇 robbins matthew ...
  • 9 篇 lygeros john
  • 9 篇 derong liu
  • 8 篇 xu xin
  • 8 篇 lunday brian j.
  • 8 篇 topaloglu huseyi...
  • 8 篇 thomas barrett w...
  • 8 篇 huang zhijian
  • 8 篇 mattfeld dirk c.

语言

  • 923 篇 英文
  • 49 篇 其他
  • 4 篇 中文
  • 2 篇 西班牙文
  • 1 篇 德文
  • 1 篇 法文
  • 1 篇 俄文
检索条件"主题词=Approximate Dynamic Programming"
979 条 记 录,以下是681-690 订阅
排序:
Identifying cost-effective dynamic policies to control epidemics
收藏 引用
STATISTICS IN MEDICINE 2016年 第28期35卷 5189-5209页
作者: Yaesoubi, Reza Cohen, Ted Yale Sch Publ Hlth Hlth Policy & Management 60 Coll St New Haven CT 06520 USA Yale Sch Publ Hlth Epidemiol Microbial Dis 60 Coll St New Haven CT 06520 USA
We describe a mathematical decision model for identifying dynamic health policies for controlling epidemics. These dynamic policies aim to select the best current intervention based on accumulating epidemic data and t... 详细信息
来源: 评论
Improving Quality of Prediction in Highly dynamic Environments Using approximate dynamic programming
收藏 引用
QUALITY AND RELIABILITY ENGINEERING INTERNATIONAL 2010年 第7期26卷 717-732页
作者: Ganesan, Rajesh Balakrishna, Poornima Sherry, Lance George Mason Univ Ctr Air Transportat Syst Res Dept Syst Engn & Operat Res Fairfax VA 22030 USA
In many applications, decision making under uncertainty often involves two steps-prediction of a certain quality parameter or indicator of the system under study and the subsequent use of the prediction in choosing ac... 详细信息
来源: 评论
approximate linear programming for networks: Average cost bounds
收藏 引用
COMPUTERS & OPERATIONS RESEARCH 2015年 63卷 32-45页
作者: Veatch, Michael H. Gordon Coll Dept Math Wenham MA 01984 USA
This paper uses approximate linear programming (ALP) to compute average cost bounds for queueing network control problems. Like most approximate dynamic programming (ADP) methods, ALP approximates the differential cos... 详细信息
来源: 评论
Discrete-Time Optimal Control Scheme Based on Q-Learning Algorithm  7
Discrete-Time Optimal Control Scheme Based on <i>Q</i>-Learn...
收藏 引用
7th International Conference on Intelligent Control and Information Processing (ICICIP)
作者: Wei, Qinglai Liu, Derong Song, Ruizhuo Chinese Acad Sci Inst Automat State Key Lab Management & Control Complex Syst Beijing 100190 Peoples R China Univ Sci & Technol Beijing Sch Automat & Elect Engn Beijing 100083 Peoples R China
This paper is concerned with optimal control problems of discrete-time nonlinear systems via a novel Q-learning algorithm. In the newly developed Q-learning algorithm, the iterative Q function in each iteration is req... 详细信息
来源: 评论
Algorithmic Solutions for Optimal Switching Problems  2
Algorithmic Solutions for Optimal Switching Problems
收藏 引用
2nd International Symposium on Stochastic Models in Reliability Engineering, Life Science, and Operations Management (SMRLO)
作者: Hinz, Juri Yee, Jeremy Univ Technol Sydney Sch Math Sydney NSW Australia
In practice, optimal control problems of stochastic switching are notoriously challenging from a computational viewpoint, since typical real-world applications are high dimensional. In this approach, we suggest an alg... 详细信息
来源: 评论
Improving Quality of Prediction in Highly dynamic Environments Using approximate dynamic programming
Improving Quality of Prediction in Highly Dynamic Environmen...
收藏 引用
INFORMS Annual Meeting on Recent Advancements in Quality and Reliability
作者: Ganesan, Rajesh Balakrishna, Poornima Sherry, Lance George Mason Univ Ctr Air Transportat Syst Res Dept Syst Engn & Operat Res Fairfax VA 22030 USA
In many applications, decision making under uncertainty often involves two steps-prediction of a certain quality parameter or indicator of the system under study and the subsequent use of the prediction in choosing ac... 详细信息
来源: 评论
Discrete-Time Two-Player Zero-Sum Games for Nonlinear Systems Using Iterative Adaptive dynamic programming  13th
Discrete-Time Two-Player Zero-Sum Games for Nonlinear System...
收藏 引用
13th International Symposium on Neural Networks (ISNN)
作者: Wei, Qinglai Liu, Derong Chinese Acad Sci Inst Automat State Key Lab Management & Control Complex Syst Beijing 100190 Peoples R China Univ Sci & Technol Beijing Sch Automat & Elect Engn Beijing 100083 Peoples R China
This paper is concerned with a discrete-time two-player zero-sum game of nonlinear systems, which is solved by a new iterative adaptive dynamic programming (ADP) method. In the present iterative ADP algorithm, two ite... 详细信息
来源: 评论
Dual MPC with Reinforcement Learning
Dual MPC with Reinforcement Learning
收藏 引用
11th IFAC Symposium on dynamics and Control of Process Systems including Biosystems
作者: Morinelly, Juan E. Ydstie, B. Erik Carnegie Mellon Univ Dept Chem Engn Pittsburgh PA 15213 USA
An adaptive optimal control algorithm for system with uncertain dynamics is formulated under a Reinforcement Learning framework. An embedded exploratory component, is included explicitly in the objective function of a... 详细信息
来源: 评论
Conversion of MDP Problems into Heuristics Based Planning Problems using Temporal Decomposition  13
Conversion of MDP Problems into Heuristics Based Planning Pr...
收藏 引用
13th International Bhurban Conference on Applied Sciences and Technology (IBCAST)
作者: Gillani, Rida Nasir, Ali Univ Cent Punjab Dept Elect Engn Lahore Pakistan
This paper presents an approach for recasting Markov Decision Process (MDP) problems into heuristics based planning problems. The basic idea is to use temporal decomposition of the state space based on a subset of sta... 详细信息
来源: 评论
Itinerary-based nesting control with upsell
收藏 引用
JOURNAL OF REVENUE AND PRICING MANAGEMENT 2016年 第2期15卷 107-137页
作者: Pun, Chan Seng Klabjan, Diego Karaesmen, Fikri Shebalov, Sergey Northwestern Univ Evanston IL 60208 USA Sabre Holdings Southlake TX USA Koc Univ Istanbul Turkey
In order to accept future high-yield booking requests, airlines protect seats from low-yield passengers. More seats may be reserved when passengers faced with closed fare classes can upsell to open higher fare classes... 详细信息
来源: 评论