咨询与建议

限定检索结果

文献类型

  • 748 篇 会议
  • 271 篇 期刊文献
  • 4 册 图书

馆藏范围

  • 1,023 篇 电子文献
  • 1 种 纸本馆藏

日期分布

学科分类号

  • 712 篇 工学
    • 520 篇 计算机科学与技术...
    • 381 篇 电气工程
    • 278 篇 控制科学与工程
    • 153 篇 软件工程
    • 79 篇 信息与通信工程
    • 40 篇 交通运输工程
    • 23 篇 仪器科学与技术
    • 20 篇 机械工程
    • 9 篇 生物工程
    • 8 篇 电子科学与技术(可...
    • 7 篇 力学(可授工学、理...
    • 7 篇 土木工程
    • 6 篇 动力工程及工程热...
    • 6 篇 石油与天然气工程
    • 4 篇 生物医学工程(可授...
    • 3 篇 材料科学与工程(可...
    • 3 篇 化学工程与技术
    • 3 篇 航空宇航科学与技...
    • 3 篇 安全科学与工程
  • 118 篇 理学
    • 98 篇 数学
    • 32 篇 系统科学
    • 22 篇 统计学(可授理学、...
    • 10 篇 生物学
    • 8 篇 物理学
    • 4 篇 化学
  • 66 篇 管理学
    • 63 篇 管理科学与工程(可...
    • 14 篇 工商管理
    • 5 篇 图书情报与档案管...
  • 5 篇 经济学
    • 4 篇 应用经济学
  • 3 篇 法学
    • 3 篇 社会学
  • 2 篇 医学
  • 1 篇 教育学

主题

  • 313 篇 reinforcement le...
  • 216 篇 dynamic programm...
  • 206 篇 optimal control
  • 107 篇 adaptive dynamic...
  • 104 篇 adaptive dynamic...
  • 97 篇 learning
  • 88 篇 neural networks
  • 78 篇 heuristic algori...
  • 68 篇 reinforcement le...
  • 58 篇 learning (artifi...
  • 54 篇 nonlinear system...
  • 53 篇 convergence
  • 51 篇 control systems
  • 51 篇 mathematical mod...
  • 48 篇 approximate dyna...
  • 44 篇 approximation al...
  • 43 篇 equations
  • 42 篇 adaptive control
  • 41 篇 artificial neura...
  • 41 篇 cost function

机构

  • 41 篇 chinese acad sci...
  • 27 篇 univ rhode isl d...
  • 17 篇 tianjin univ sch...
  • 16 篇 univ sci & techn...
  • 16 篇 univ illinois de...
  • 15 篇 northeastern uni...
  • 14 篇 beijing normal u...
  • 13 篇 northeastern uni...
  • 13 篇 guangdong univ t...
  • 12 篇 northeastern uni...
  • 9 篇 natl univ def te...
  • 8 篇 ieee
  • 8 篇 univ chinese aca...
  • 7 篇 univ chinese aca...
  • 7 篇 cent south univ ...
  • 7 篇 southern univ sc...
  • 7 篇 beijing univ tec...
  • 6 篇 chinese acad sci...
  • 6 篇 missouri univ sc...
  • 5 篇 nanjing univ pos...

作者

  • 54 篇 liu derong
  • 37 篇 wei qinglai
  • 29 篇 he haibo
  • 22 篇 wang ding
  • 21 篇 xu xin
  • 19 篇 jiang zhong-ping
  • 17 篇 lewis frank l.
  • 17 篇 yang xiong
  • 17 篇 zhang huaguang
  • 17 篇 ni zhen
  • 16 篇 zhao bo
  • 15 篇 gao weinan
  • 14 篇 zhao dongbin
  • 13 篇 derong liu
  • 13 篇 zhong xiangnan
  • 12 篇 si jennie
  • 10 篇 jagannathan s.
  • 10 篇 dongbin zhao
  • 10 篇 song ruizhuo
  • 9 篇 abouheaf mohamme...

语言

  • 992 篇 英文
  • 25 篇 其他
  • 6 篇 中文
检索条件"任意字段=IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning"
1023 条 记 录,以下是831-840 订阅
排序:
Coupling perception and action using minimax optimal control
Coupling perception and action using minimax optimal control
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (ADPRL)
作者: Tom Erez William D. Smart Washington University Saint Louis MO USA
This paper proposes a novel approach for coupling perception and action through minimax dynamic programming. We tackle domains where the agent has some control over the observation process (e.g. via the manipulation o... 详细信息
来源: 评论
Cooperative retransmissions using Markov decision process with reinforcement learning
Cooperative retransmissions using Markov decision process wi...
收藏 引用
ieee International symposium on Personal, Indoor and Mobile Radio Communications (PIMRC)
作者: Ghasem Naddafzadeh Shirazi Peng-Yong Kong Chen-Khong Tham Institute for Infocomm Research Agency for Science Technology & Research (A*STAR) Singapore
In cooperative retransmissions, nodes with better channel qualities help other nodes in retransmitting a failed packet to its intended destination. In this paper, we propose a cooperative retransmission scheme where e... 详细信息
来源: 评论
Proceedings of the 2007 ieee symposium on Approximate dynamic programming and reinforcement learning (ADPRL 2007)
Proceedings of the 2007 IEEE Symposium on Approximate Dynami...
收藏 引用
2007 ieee symposium on Approximate dynamic programming and reinforcement learning, ADPRL 2007
The proceedings contain 49 papers. The topics discussed include: fitted Q iteration with CMACs;reinforcement-learning-based magneto-hydrodynamic control hypersonic flows;a novel fuzzy reinforcement learning approach i... 详细信息
来源: 评论
adaptive autonomous control using online value iteration with gaussian processes
Adaptive autonomous control using online value iteration wit...
收藏 引用
ieee International Conference on Robotics and Automation (ICRA)
作者: Axel Rottmann Wolfram Burgard Department of Computer Science University of Freiburg Freiburg im Breisgau Germany
In this paper, we present a novel approach to controlling a robotic system online from scratch based on the reinforcement learning principle. In contrast to other approaches, our method learns the system dynamics and ... 详细信息
来源: 评论
Special issue on adaptive dynamic programming and reinforcement learning in feedback control
收藏 引用
ieee TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS 2008年 第4期38卷 896-897页
作者: Lewis, F. L. Liu, Derong Lendaris, George G. Univ Texas Arlington Dept Elect Engn Automat & Robot Res Inst Arlington TX 76019 USA Univ Illinois Dept Elect & Comp Engn Chicago IL 60607 USA Portland State Univ Dept Elect & Comp Engn Syst Sci Grad Program Portland OR 97207 USA
The 18 papers in this special issue focus on adaptive dynamic programming and reinforcement learning in feedback control.
来源: 评论
2007 ieee international symposium on approximate dynamic programming and reinforcement learning
Proceedings of the 2007 IEEE Symposium on Approximate Dynami...
收藏 引用
Proceedings of the 2007 ieee symposium on Approximate dynamic programming and reinforcement learning, ADPRL 2007 2007年
作者: Liu, Derong Munos, Remi Si, Jennie Wunsch, II, Donald C.
No abstract available
来源: 评论
reinforcement learning of adaptive Longitudinal Vehicle Control for dynamic Collaborative Driving
Reinforcement Learning of Adaptive Longitudinal Vehicle Cont...
收藏 引用
ieee Intelligent Vehicles symposium
作者: Ng, Luke Clark, Christopher M. Huissoon, Jan P. Univ Waterloo Dept Mech & Mechatron Engn Waterloo ON N2L 3G1 Canada Calif Polytech State Univ San Luis Obispo Dept Comp Sci San Luis Obispo CA 93407 USA
dynamic collaborative driving involves the motion coordination of multiple vehicles using shared information from vehicles instrumented to perceive their surroundings in order to improve road usage and safety. A basic... 详细信息
来源: 评论
adaptive critic-based neurofuzzy controller for the steam generator water level
收藏 引用
ieee TRANSACTIONS ON NUCLEAR SCIENCE 2008年 第3期55卷 1678-1685页
作者: Fakhrazari, Amin Boroushaki, Mehrdad Sharif Univ Technol Dept Mech Engn Tehran Iran
In this paper, an adaptive critic-based neurofuzzy controller is presented for water level regulation of nuclear steam generators. The problem has been of great concern for many years as the steam generator is a highl... 详细信息
来源: 评论
Higher level application of ADP: A next phase for the control field?
收藏 引用
ieee TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS 2008年 第4期38卷 901-912页
作者: Lendaris, George G. Portland State Univ Dept Elect & Comp Engn NW Computat Intelligence Lab Syst Sci Grad Program Portland OR 97207 USA
Two distinguishing features of humanlike control vis-a-vis current technological control are the ability to make use of experience while selecting a control policy for distinct situations and the ability to do so fast... 详细信息
来源: 评论
Control of nonaffine nonlinear discrete-time systems using reinforcement-learning-based linearly parameterized neural networks
收藏 引用
ieee TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS 2008年 第4期38卷 994-1001页
作者: Yang, Qinmin Vance, Jonathan Blake Jagannathan, S. Missouri Univ Sci & Technol Dept Elect & Comp Engn Rolla MO 65409 USA
A nonaffine discrete-time system represented by the nonlinear autoregressive moving average with eXogenous input (NARMAX) representation with unknown nonlinear system dynamics is considered. An equivalent affinelike r... 详细信息
来源: 评论