咨询与建议

限定检索结果

文献类型

  • 747 篇 期刊文献
  • 208 篇 会议
  • 23 篇 学位论文
  • 1 册 图书

馆藏范围

  • 979 篇 电子文献
  • 1 种 纸本馆藏

日期分布

学科分类号

  • 746 篇 工学
    • 307 篇 计算机科学与技术...
    • 271 篇 电气工程
    • 249 篇 控制科学与工程
    • 86 篇 交通运输工程
    • 50 篇 机械工程
    • 42 篇 石油与天然气工程
    • 40 篇 土木工程
    • 38 篇 软件工程
    • 31 篇 信息与通信工程
    • 26 篇 化学工程与技术
    • 25 篇 动力工程及工程热...
    • 16 篇 仪器科学与技术
    • 8 篇 环境科学与工程(可...
    • 4 篇 力学(可授工学、理...
    • 4 篇 电子科学与技术(可...
    • 4 篇 建筑学
  • 356 篇 管理学
    • 339 篇 管理科学与工程(可...
    • 52 篇 工商管理
    • 6 篇 公共管理
  • 231 篇 理学
    • 196 篇 数学
    • 65 篇 系统科学
    • 11 篇 统计学(可授理学、...
    • 9 篇 物理学
    • 7 篇 生物学
    • 4 篇 生态学
  • 79 篇 经济学
    • 55 篇 应用经济学
    • 25 篇 理论经济学
  • 18 篇 医学
    • 11 篇 基础医学(可授医学...
    • 10 篇 临床医学
    • 7 篇 公共卫生与预防医...
  • 8 篇 军事学
  • 7 篇 农学
  • 3 篇 法学

主题

  • 979 篇 approximate dyna...
  • 142 篇 reinforcement le...
  • 141 篇 optimal control
  • 83 篇 adaptive dynamic...
  • 77 篇 neural networks
  • 64 篇 adaptive critic ...
  • 62 篇 markov decision ...
  • 59 篇 dynamic programm...
  • 50 篇 markov decision ...
  • 36 篇 nonlinear system...
  • 29 篇 adaptive dynamic...
  • 22 篇 uncertainty
  • 22 篇 adaptive control
  • 21 篇 neural network
  • 21 篇 policy iteration
  • 20 篇 neuro-dynamic pr...
  • 19 篇 linear programmi...
  • 18 篇 value function a...
  • 17 篇 value iteration
  • 17 篇 optimization

机构

  • 63 篇 chinese acad sci...
  • 33 篇 univ sci & techn...
  • 18 篇 princeton univ d...
  • 12 篇 georgia inst tec...
  • 11 篇 tsinghua univ de...
  • 10 篇 school of automa...
  • 9 篇 northeastern uni...
  • 9 篇 cornell univ sch...
  • 9 篇 univ rhode isl d...
  • 8 篇 air force instit...
  • 7 篇 the state key la...
  • 7 篇 south china univ...
  • 7 篇 univ illinois de...
  • 6 篇 univ chicago boo...
  • 6 篇 tsinghua univ sc...
  • 6 篇 univ chinese aca...
  • 6 篇 chinese acad sci...
  • 6 篇 univ chinese aca...
  • 5 篇 natl univ singap...
  • 5 篇 univ illinois de...

作者

  • 65 篇 wei qinglai
  • 58 篇 liu derong
  • 29 篇 song ruizhuo
  • 22 篇 powell warren b.
  • 21 篇 wang ding
  • 16 篇 lee jay h.
  • 15 篇 ulmer marlin w.
  • 13 篇 lee jong min
  • 12 篇 lewis frank l.
  • 12 篇 zhang huaguang
  • 11 篇 li hongliang
  • 10 篇 robbins matthew ...
  • 9 篇 lygeros john
  • 9 篇 derong liu
  • 8 篇 xu xin
  • 8 篇 lunday brian j.
  • 8 篇 topaloglu huseyi...
  • 8 篇 thomas barrett w...
  • 8 篇 huang zhijian
  • 8 篇 mattfeld dirk c.

语言

  • 923 篇 英文
  • 49 篇 其他
  • 4 篇 中文
  • 2 篇 西班牙文
  • 1 篇 德文
  • 1 篇 法文
  • 1 篇 俄文
检索条件"主题词=Approximate Dynamic Programming"
979 条 记 录,以下是851-860 订阅
排序:
Performance Guarantee of a Sub-Optimal Policy for a Robotic Surveillance Application *
收藏 引用
IFAC Proceedings Volumes 2013年 第30期46卷 283-290页
作者: Myoungkuk Park Krishnamoorthy Kalyanam Swaroop Darbha P.P. Khargonekar P.R. Chandler M. Pachter Department of Mechanical Engineering Texas A&M University College Station TX 77843 USA Infoscitex Corporation Dayton OH 45431 USA Department of Electrical Engineering University of Florida Gainesville FL 32525 Autonomous Control Branch Air Force Research Laboratory Wright-Patterson A.F.B. OH 45433 Department of Electrical Engineering Air Force Institute of Technology Wright-Patterson A.F.B. OH 45433
This paper focuses on the development and analysis of sub-optimal decision algorithms for a collection of robots that assist a remotely located operator in perimeter surveillance. The operator is tasked with the class... 详细信息
来源: 评论
Online Partially Model-Free Solution of Two-Player Zero Sum Differential Games
收藏 引用
IFAC Proceedings Volumes 2013年 第32期46卷 696-701页
作者: P Praveen Shubhendu Bhasin Department of Electrical Engineering Indian Institute of Technology Delhi India
An online adaptive dynamic programming based iterative algorithm is proposed for a two-player zero sum linear differential game problem arising in the control of process systems affected by disturbances. The objective... 详细信息
来源: 评论
A Data-driven Model for Large Wildfire Behaviour Prediction in Europe
收藏 引用
Procedia Computer Science 2013年 18卷 1861-1870页
作者: Dario Rodriguez-Aseretto Daniele de Rigo Margherita Di Leo Ana Cortés Jesús San-Miguel-Ayanz European Commission Joint Research Centre Institute for Environment and Sustainability Via E. Fermi 2749 I-21027 Ispra (VA) Italy Politecnico di Milano Dipartimento di Elettronica e Informazione Via Ponzio 34/5 I-20133 Milano Italy Universitat Autonoma de Barcelona Computer Architecture and Operating Systems Campus Bellaterra Cerdanyola 08193 Spain
The European Forest Fire Information System (EFFIS) has been established by the Joint Research Centre (JRC) and the Directorate General for Environment (DG ENV) of the European Commission (EC) in close collaboration w... 详细信息
来源: 评论
A UNIFIED FRAMEWORK FOR LINEAR FUNCTION APPROXIMATION OF VALUE FUNCTIONS IN STOCHASTIC CONTROL
A UNIFIED FRAMEWORK FOR LINEAR FUNCTION APPROXIMATION OF VAL...
收藏 引用
European Signal Processing Conference
作者: Matilde Sanchez-Fernandez Sergio Valcarcel Santiago Zazoy Universidad Carlos III de Madrid Signal Theory & Communictions Dept. Universidad Politecnica de Madrid Signals Systems & Radiocommunications Dept. Av. Complutense Universidad Politecnica de Madrid Signals Systems & Radiocommunications Dept. Av. Complutense
This paper contributes with a unified formulation that merges previous analysis on the prediction of the performance (value function) of certain sequence of actions (policy) when an agent operates a Markov decision pr... 详细信息
来源: 评论
On Integral Value Iteration for Continuous-Time Linear Systems
On Integral Value Iteration for Continuous-Time Linear Syste...
收藏 引用
American Control Conference
作者: Jae Young Lee Jin Bae Park Yoon Ho Choi Department of Electrical and Electronic Engineering Yonsei University Shinchon-Dong Seodaemum-Gu Seoul 120-749 Korea Department of Electronic Engineering Kyonggi University Suwon Kyonggi-Do 443-760 Korea
This paper investigates the properties of integral value iteration (I-VI) which is one of the reinforcement learning (RL) technique for solving online the continuous-time (CT) optimal control problems without using th... 详细信息
来源: 评论
Lagrangian relaxation and constraint generation for allocation and advanced scheduling
收藏 引用
COMPUTERS & OPERATIONS RESEARCH 2012年 第10期39卷 2323-2336页
作者: Gocgun, Yasin Ghate, Archis Univ Washington Seattle WA 98195 USA Univ British Columbia Sauder Sch Business Vancouver BC V5Z 1M9 Canada
Diverse applications in manufacturing, logistics, health care, telecommunications, and computing require that renewable resources be dynamically scheduled to handle distinct classes of job service requests arriving ra... 详细信息
来源: 评论
Metamodeling and the Critic-based approach to multi-level optimization
收藏 引用
NEURAL NETWORKS 2012年 第Aug.期32卷 179-185页
作者: Werbos, Ludmilla Kozma, Robert Silva-Lugo, Rodrigo Pazienza, Giovanni E. Werbos, Paul J. IntControl LLC Memphis TN 38152 USA Univ Memphis CLION Memphis TN 38152 USA Natl Sci Fdn Arlington VA 22230 USA
Large-scale networks with hundreds of thousands of variables and constraints are becoming more and more common in logistics, communications, and distribution domains. Traditionally, the utility functions defined on su... 详细信息
来源: 评论
Finite-horizon neuro-optimal tracking control for a class of discrete-time nonlinear systems using adaptive dynamic programming approach
收藏 引用
NEUROCOMPUTING 2012年 第1期78卷 14-22页
作者: Wang, Ding Liu, Derong Wei, Qinglai Chinese Acad Sci Inst Automat State Key Lab Intelligent Control & Management Co Beijing 100190 Peoples R China Univ Illinois Dept Elect & Comp Engn Chicago IL 60607 USA
In this paper, a finite-horizon neuro-optimal tracking control strategy for a class of discrete-time nonlinear systems is proposed. Through system transformation, the optimal tracking problem is converted into designi... 详细信息
来源: 评论
Constrained adaptive optimal control using a reinforcement learning agent
收藏 引用
AUTOMATICA 2012年 第10期48卷 2614-2619页
作者: Lin, Wei-Song Zheng, Chen-Hong NTUEE Taipei 106 Taiwan Natl Taiwan Univ Dept Elect Engn Taipei Taiwan
To synthesize the optimal control strategies of nonlinear systems on infinite horizon while subject to mixed equality and inequality constraints has been a challenge to control engineers. This paper regards it as a pr... 详细信息
来源: 评论
A least squares temporal difference actor-critic algorithm with applications to warehouse management
收藏 引用
NAVAL RESEARCH LOGISTICS 2012年 第3-4期59卷 197-211页
作者: Estanjini, Reza Moazzez Li, Keyong Paschalidis, Ioannis Ch Boston Univ Dept Elect & Comp Engn Div Syst Engn Boston MA 02215 USA Boston Univ Ctr Informat & Syst Engn Boston MA 02215 USA
This article develops a new approximate dynamic programming (DP) algorithm for Markov decision problems and applies it to a vehicle dispatching problem arising in warehouse management. The algorithm is of the actor-cr... 详细信息
来源: 评论