Refine search results

Document type

  • 228 conference papers
  • 4 journal articles

Collection scope

  • 232 electronic documents
  • 0 print holdings

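Subject classification

  • 98 papers: Engineering
    • 93 papers: Computer Science and Technology...
    • 40 papers: Software Engineering
    • 25 papers: Electrical Engineering
    • 14 papers: Control Science and Engineering
    • 4 papers: Mechanical Engineering
    • 1 paper: Mechanics (degree may be awarded in Engineering, Sci...
    • 1 paper: Information and Communication Engineering
    • 1 paper: Architecture
    • 1 paper: Chemical Engineering and Technology
    • 1 paper: Transportation Engineering
  • 23 papers: Science
    • 23 papers: Mathematics
    • 6 papers: Statistics (degree may be awarded in Science, ...
    • 4 papers: Systems Science
    • 1 paper: Chemistry
    • 1 paper: Atmospheric Science
  • 9 papers: Management
    • 7 papers: Management Science and Engineering (degree may...
    • 3 papers: Business Administration
    • 2 papers: Library, Information and Archives Manage...
  • 2 papers: Economics
    • 2 papers: Applied Economics
  • 1 paper: Law
    • 1 paper: Sociology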

Topics

  • 95 papers: dynamic programm...
  • 52 papers: learning
  • 46 papers: optimal control
  • 37 papers: reinforcement le...
  • 34 papers: learning (artifi...
  • 27 papers: equations
  • 22 papers: heuristic algori...
  • 21 papers: control systems
  • 20 papers: convergence
  • 19 papers: neural networks
  • 18 papers: function approxi...
  • 17 papers: mathematical mod...
  • 16 papers: approximation al...
  • 15 papers: vectors
  • 14 papers: markov processes
  • 14 papers: artificial neura...
  • 14 papers: cost function
  • 13 papers: stochastic proce...
  • 12 papers: algorithm design...
  • 12 papers: adaptive control

Institutions

  • 5 papers: school of inform...
  • 4 papers: northeastern uni...
  • 4 papers: department of el...
  • 4 papers: department of in...
  • 3 papers: department of el...
  • 3 papers: automation and r...
  • 3 papers: northeastern uni...
  • 3 papers: robotics institu...
  • 3 papers: key laboratory o...
  • 3 papers: univ illinois de...
  • 2 papers: department of ar...
  • 2 papers: school of electr...
  • 2 papers: univ groningen i...
  • 2 papers: univ texas autom...
  • 2 papers: colorado state u...
  • 2 papers: guangxi univ sch...
  • 2 papers: national science...
  • 2 papers: informatics inst...
  • 2 papers: college of infor...
  • 2 papers: school of automa...

Authors

  • 7 papers: hado van hasselt
  • 7 papers: lewis frank l.
  • 7 papers: marco a. wiering
  • 7 papers: dongbin zhao
  • 6 papers: liu derong
  • 5 papers: huaguang zhang
  • 5 papers: zhang huaguang
  • 5 papers: derong liu
  • 5 papers: warren b. powell
  • 4 papers: xu xin
  • 4 papers: vrabie draguna
  • 4 papers: jagannathan s.
  • 4 papers: frank l. lewis
  • 4 papers: yanhong luo
  • 4 papers: damien ernst
  • 4 papers: jan peters
  • 4 papers: peters jan
  • 4 papers: zhao dongbin
  • 3 papers: xu hao
  • 3 papers: martin riedmille...

Language

  • 232 papers: English

Search criteria: "Any field = 2009 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, ADPRL 2009"
232 records; results 1-10 are listed below

2009 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, ADPRL 2009 - Proceedings
Source: 2009 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, ADPRL 2009
The proceedings contain 34 papers. The topics discussed include: a unified framework for temporal difference methods; efficient data reuse in value function approximation; constrained optimal control of affine nonlinear...

2009 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, ADPRL 2009 - Proceedings: Welcome Message
Source: 2009 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, ADPRL 2009 - Proceedings, 2009, p. viii
Authors: Liu, Derong

IEEE SSCI 2014 - 2014 IEEE Symposium Series on Computational Intelligence - ADPRL 2014: 2014 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, Proceedings
Source: 2014 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, ADPRL 2014
The proceedings contain 42 papers. The topics discussed include: approximate real-time optimal control based on sparse Gaussian process models; subspace identification for predictive state representation by nuclear nor...

IEEE SSCI 2011: Symposium Series on Computational Intelligence - ADPRL 2011: 2011 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning
Source: Symposium Series on Computational Intelligence, IEEE SSCI2011 - 2011 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, ADPRL 2011
The proceedings contain 45 papers. The topics discussed include: active learning for personalizing treatment; active exploration by searching for experiments that falsify the computed control policy; optimistic planning...

Proceedings of the 2013 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, ADPRL 2013 - 2013 IEEE Symposium Series on Computational Intelligence, SSCI 2013
Source: 2013 4th IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, ADPRL 2013
The proceedings contain 28 papers. The topics discussed include: local stability analysis of high-order recurrent neural networks with multi-step piecewise linear activation functions; finite-horizon optimal control de...

Symposium on adaptive dynamic programming and reinforcement learning (IEEE ADPRL 2011)
Source: IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)
ADPRL 2011 is the third IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning. The area of approximate dynamic programming and reinforcement learning is a fusion of a number of res...

Proceedings of the 2007 IEEE Symposium on Approximate Dynamic Programming and Reinforcement Learning (ADPRL 2007)
Source: 2007 IEEE Symposium on Approximate Dynamic Programming and Reinforcement Learning, ADPRL 2007
The proceedings contain 49 papers. The topics discussed include: fitted Q iteration with CMACs; reinforcement-learning-based magneto-hydrodynamic control hypersonic flows; a novel fuzzy reinforcement learning approach i...

Feature Discovery in Approximate Dynamic Programming
Source: IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning
Authors: Preux, Philippe; Girgin, Sertan; Loth, Manuel
Affiliations: Univ Lille Lab Informat Fondamentale Lille Comp Sci Lab CNRS Lille France INRIA Paris France
Feature discovery aims at finding the best representation of data. This is a very important topic in machine learning, and in reinforcement learning in particular. Based on our recent work on feature discovery in the ...

Integrating Sporadic Imitation in Reinforcement Learning Robots
Source: IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning
Authors: Richert, Willi; Scheller, Ulrich; Koch, Markus; Kleinjohann, Bernd; Stern, Claudius
Affiliations: Univ Gesamthsch Paderborn Fac Comp Sci Elect Engn & Math D-33102 Paderborn Germany
Although the combination of reinforcement learning and imitation has been already considered in recent research, it always revolved around fixed settings where demonstrator and imitator are fixed and the imitation pro...

Exploring the Relationship of Reward and Punishment in Reinforcement Learning Evolving Action Meta-learning Functions in Goal Navigation
Source: 4th IEEE International Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)
Authors: Lowe, Robert; Ziemke, Tom
Affiliations: Univ Skovde Interact Lab Skovde Sweden
We present a reinforcement learning algorithm based on Dyna-Sarsa that utilizes separate representations of reward and punishment when guiding state-action value learning and action selection. The adoption of policy m...