Search query: "Any field = 2009 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, ADPRL 2009"
232 records; showing results 41-50
Algorithm and Stability of ATC Receding Horizon Control
IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning
Authors: Zhang, Hongwei; Huang, Jie; Lewis, Frank L.
Affiliations: Chinese Univ Hong Kong, Dept Mech & Automat Engn, Shatin, Hong Kong, Peoples R China; Univ Texas Arlington, Automat & Robot Res Inst, Ft Worth, TX 76118, USA
Receding horizon control (RHC), also known as model predictive control (MPC), is a suboptimal control scheme that solves a finite horizon open-loop optimal control problem in an infinite horizon context and yields a m...
Data-Driven Partially Observable Dynamic Processes Using Adaptive Dynamic Programming
IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)
Authors: Zhong, Xiangnan; Ni, Zhen; Tang, Yufei; He, Haibo
Affiliation: Univ Rhode Isl, Dept Elect Comp & Biomed Engn, Kingston, RI 02881, USA
Adaptive dynamic programming (ADP) has been widely recognized as one of the "core methodologies" to achieve optimal control for intelligent systems in Markov decision process (MDP). Generally, ADP control de...
Model-Based Multi-Objective Reinforcement Learning
IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)
Authors: Wiering, Marco A.; Withagen, Maikel; Drugan, Madalina M.
Affiliations: Univ Groningen, Inst Artificial Intelligence, NL-9700 AB Groningen, Netherlands; Vrije Univ Brussel, Artificial Intelligence Lab, Ixelles, Belgium
This paper describes a novel multi-objective reinforcement learning algorithm. The proposed algorithm first learns a model of the multi-objective sequential decision making problem, after which this learned model is u...
Adaptive Dynamic Programming for Discrete-time LQR Optimal Tracking Control Problems with Unknown Dynamics
IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)
Authors: Liu, Yang; Luo, Yanhong; Zhang, Huaguang
Affiliation: Northeastern Univ, Sch Informat Sci & Engn, Shenyang 110819, Liaoning, Peoples R China
In this paper, an optimal tracking control approach based on adaptive dynamic programming (ADP) algorithm is proposed to solve the linear quadratic regulation (LQR) problems for unknown discrete-time systems in an onl...
Structure search of probabilistic models and data correction for EDA-RL
IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning
Author: Handa, Hisashi
Affiliation: Graduate School of Natural Science and Technology, Okayama University, Tsushima-naka 3-1-1, Okayama 700-8530, Japan
We have proposed a novel Estimation of Distribution Algorithm for solving reinforcement learning problems: EDA-RL. The EDA-RL can perform well if the complexity of the structure of the probabilistic model is adapted t...
Continuous-Time Differential Dynamic Programming with Terminal Constraints
IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)
Authors: Sun, Wei; Theodorou, Evangelos A.; Tsiotras, Panagiotis
In this work, we revisit the continuous-time Differential Dynamic Programming (DDP) approach for solving optimal control problems with terminal state constraints. We derive two algorithms, each for different order of ...
Cognitive Control in Cognitive Dynamic Systems: A New Way of Thinking Inspired by The Brain
IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)
Authors: Haykin, Simon; Amiri, Ashkan; Fatemi, Mehdi
Affiliation: McMaster Univ, Cognit Syst Lab, Hamilton, ON L8S 4K1, Canada
Briefly, main purpose of the paper is fourfold: a) Cognitive perception, which consists of two functional blocks: improved sparse-coding under the influence of perceptual attention for extracting relevant information ...
Using Approximate Dynamic Programming for Estimating the Revenues of a Hydrogen-based High-Capacity Storage Device
IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)
Authors: Francois-Lavet, Vincent; Fonteneau, Raphael; Ernst, Damien
Affiliation: Univ Liege, Dept Elect Engn & Comp Sci, B-4000 Liege, Belgium
This paper proposes a methodology to estimate the maximum revenue that can be generated by a company that operates a high-capacity storage device to buy or sell electricity on the day-ahead electricity market. The met...
Higher order Q-Learning
IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning
Authors: Edwards, Ashley; Pottenger, William M.
Affiliations: Department of Computer Science, University of Georgia, Athens, GA 30606, United States; Department of Computer Science and DIMACS, Rutgers University, Piscataway, NJ 08854, United States
Higher order learning is a statistical relational learning framework in which relationships between different instances of the same class are leveraged (Ganiz, Lytkin and Pottenger, 2009). Learning can be supervised o...
Supervised adaptive dynamic programming based adaptive cruise control
Symposium Series on Computational Intelligence, IEEE SSCI2011 - 2011 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, ADPRL 2011
Authors: Zhao, Dongbin; Hu, Zhaohui
Affiliation: Key Laboratory of Complex Systems and Intelligence Science, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China
This paper proposes a supervised adaptive dynamic programming (SADP) algorithm for the full range adaptive cruise control (ACC) system. The full range ACC system considers both the ACC situation in highway system and ...