Search query: Subject = "Approximate Dynamic Programming"
982 records; showing 691-700
Dual MPC with Reinforcement Learning
11th IFAC Symposium on Dynamics and Control of Process Systems including Biosystems
Authors: Morinelly, Juan E.; Ydstie, B. Erik (Carnegie Mellon Univ, Dept Chem Engn, Pittsburgh, PA 15213, USA)
An adaptive optimal control algorithm for systems with uncertain dynamics is formulated under a Reinforcement Learning framework. An embedded exploratory component is included explicitly in the objective function of a...
Conversion of MDP Problems into Heuristics Based Planning Problems using Temporal Decomposition
13th International Bhurban Conference on Applied Sciences and Technology (IBCAST)
Authors: Gillani, Rida; Nasir, Ali (Univ Cent Punjab, Dept Elect Engn, Lahore, Pakistan)
This paper presents an approach for recasting Markov Decision Process (MDP) problems into heuristics based planning problems. The basic idea is to use temporal decomposition of the state space based on a subset of sta...
Itinerary-based nesting control with upsell
JOURNAL OF REVENUE AND PRICING MANAGEMENT, 2016, Vol. 15, No. 2, pp. 107-137
Authors: Pun, Chan Seng; Klabjan, Diego; Karaesmen, Fikri; Shebalov, Sergey (Northwestern Univ, Evanston, IL 60208, USA; Sabre Holdings, Southlake, TX, USA; Koc Univ, Istanbul, Turkey)
In order to accept future high-yield booking requests, airlines protect seats from low-yield passengers. More seats may be reserved when passengers faced with closed fare classes can upsell to open higher fare classes...
Solving Control Problems with Linear State Dynamics - a Practical User Guide
2nd International Symposium on Stochastic Models in Reliability Engineering, Life Science, and Operations Management (SMRLO)
Authors: Hinz, Juri; Yee, Jeremy (Univ Technol Sydney, Sch Math, Sydney, NSW, Australia)
In industrial applications, practitioners usually face a considerable complexity when optimizing operating strategies under uncertainty. Typical real-world problems arising in practice are notoriously challenging from...
Energy management of PV-storage systems: ADP approach with temporal difference learning
19th Power Systems Computation Conference (PSCC)
Authors: Keerthisinghe, Chanaka; Verbic, Gregor; Chapman, Archie C. (Univ Sydney, Sch Elect & Informat Engn, Sydney, NSW, Australia)
In the future, residential energy users can seize the full potential of demand response schemes by using an automated home energy management system (HEMS) to schedule their distributed energy resources. In order to ge...
Stochastic Approximate Dynamic Programming with Link Estimation for High Quality Path Selection in Wireless Mesh Networks
Globecom Workshops
Authors: Oliveira, Talmai; Agrawal, Dharma P. (Univ Cincinnati, Sch Comp Sci & Informat, Ctr Distributed & Mobile Comp, Cincinnati, OH 45221, USA)
A lot of work has recently been published regarding metrics that could identify high quality paths in Wireless Mesh Networks (WMN). While results are encouraging, no optimal strategy has yet been identified that could...
Differential TD Learning for Value Function Approximation
55th IEEE Conference on Decision and Control (CDC)
Authors: Devraj, Adithya M.; Meyn, Sean P. (Univ Florida, Dept Elect & Comp Engn, Gainesville, FL 32611, USA)
Value functions arise as a component of algorithms as well as performance metrics in statistics and engineering applications. Computation of the associated Bellman equations is numerically challenging in all but a few...
Risk-Averse Anticipation for Dynamic Vehicle Routing
10th International Conference on Learning and Intelligent Optimization (LION)
Authors: Ulmer, Marlin W.; Voss, Stefan (Tech Univ Carolo Wilhelmina Braunschweig, Muhlenpfordtstr 23, D-38106 Braunschweig, Germany; Univ Hamburg, Von Melle Pk 5, D-20146 Hamburg, Germany)
In the field of dynamic vehicle routing, the importance to integrate stochastic information about possible future events in current decision making increases. Integration is achieved by anticipatory solution approache...
ADP Based Long-Term Renewable Generation Planning While Considering the Effects of Hourly SCUC
IEEE-Power-and-Energy-Society General Meeting (PESGM)
Authors: Chen, Zhi; Wu, Lei (Arkansas Tech Univ, Dept Elect Engn, Russellville, AR 72801, USA; Clarkson Univ, Dept Elect & Comp Engn, Potsdam, NY, USA)
This paper proposes an approximate dynamic programming (ADP) based approach for solving the long-term renewable generation expansion planning problem by considering the effects of hourly security-constrained unit comm...
LEARNING IN CONSTRAINED STOCHASTIC DYNAMIC POTENTIAL GAMES
41st IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Authors: Macua, Sergio Valcarcel; Zazo, Santiago; Zazo, Javier (Univ Politecn Madrid, E-28040 Madrid, Spain)
We extend earlier works on continuous potential games to the most general case: stochastic time varying environment, stochastic rewards, non-reduced form and constrained state-action sets. We provide conditions for a ...