咨询与建议

限定检索结果

文献类型

  • 748 篇 会议
  • 271 篇 期刊文献
  • 4 册 图书

馆藏范围

  • 1,023 篇 电子文献
  • 1 种 纸本馆藏

日期分布

学科分类号

  • 712 篇 工学
    • 520 篇 计算机科学与技术...
    • 381 篇 电气工程
    • 278 篇 控制科学与工程
    • 153 篇 软件工程
    • 79 篇 信息与通信工程
    • 40 篇 交通运输工程
    • 23 篇 仪器科学与技术
    • 20 篇 机械工程
    • 9 篇 生物工程
    • 8 篇 电子科学与技术(可...
    • 7 篇 力学(可授工学、理...
    • 7 篇 土木工程
    • 6 篇 动力工程及工程热...
    • 6 篇 石油与天然气工程
    • 4 篇 生物医学工程(可授...
    • 3 篇 材料科学与工程(可...
    • 3 篇 化学工程与技术
    • 3 篇 航空宇航科学与技...
    • 3 篇 安全科学与工程
  • 118 篇 理学
    • 98 篇 数学
    • 32 篇 系统科学
    • 22 篇 统计学(可授理学、...
    • 10 篇 生物学
    • 8 篇 物理学
    • 4 篇 化学
  • 66 篇 管理学
    • 63 篇 管理科学与工程(可...
    • 14 篇 工商管理
    • 5 篇 图书情报与档案管...
  • 5 篇 经济学
    • 4 篇 应用经济学
  • 3 篇 法学
    • 3 篇 社会学
  • 2 篇 医学
  • 1 篇 教育学

主题

  • 313 篇 reinforcement le...
  • 216 篇 dynamic programm...
  • 206 篇 optimal control
  • 107 篇 adaptive dynamic...
  • 104 篇 adaptive dynamic...
  • 97 篇 learning
  • 88 篇 neural networks
  • 78 篇 heuristic algori...
  • 68 篇 reinforcement le...
  • 58 篇 learning (artifi...
  • 54 篇 nonlinear system...
  • 53 篇 convergence
  • 51 篇 control systems
  • 51 篇 mathematical mod...
  • 48 篇 approximate dyna...
  • 44 篇 approximation al...
  • 43 篇 equations
  • 42 篇 adaptive control
  • 41 篇 artificial neura...
  • 41 篇 cost function

机构

  • 41 篇 chinese acad sci...
  • 27 篇 univ rhode isl d...
  • 17 篇 tianjin univ sch...
  • 16 篇 univ sci & techn...
  • 16 篇 univ illinois de...
  • 15 篇 northeastern uni...
  • 14 篇 beijing normal u...
  • 13 篇 northeastern uni...
  • 13 篇 guangdong univ t...
  • 12 篇 northeastern uni...
  • 9 篇 natl univ def te...
  • 8 篇 ieee
  • 8 篇 univ chinese aca...
  • 7 篇 univ chinese aca...
  • 7 篇 cent south univ ...
  • 7 篇 southern univ sc...
  • 7 篇 beijing univ tec...
  • 6 篇 chinese acad sci...
  • 6 篇 missouri univ sc...
  • 5 篇 nanjing univ pos...

作者

  • 54 篇 liu derong
  • 37 篇 wei qinglai
  • 29 篇 he haibo
  • 22 篇 wang ding
  • 21 篇 xu xin
  • 19 篇 jiang zhong-ping
  • 17 篇 lewis frank l.
  • 17 篇 yang xiong
  • 17 篇 zhang huaguang
  • 17 篇 ni zhen
  • 16 篇 zhao bo
  • 15 篇 gao weinan
  • 14 篇 zhao dongbin
  • 13 篇 derong liu
  • 13 篇 zhong xiangnan
  • 12 篇 si jennie
  • 10 篇 jagannathan s.
  • 10 篇 dongbin zhao
  • 10 篇 song ruizhuo
  • 9 篇 abouheaf mohamme...

语言

  • 992 篇 英文
  • 25 篇 其他
  • 6 篇 中文
检索条件"任意字段=IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning"
1023 条 记 录,以下是861-870 订阅
排序:
DHP adaptive critic motion control of autonomous wheeled mobile robot
DHP adaptive critic motion control of autonomous wheeled mob...
收藏 引用
ieee International symposium on Approximate dynamic programming and reinforcement learning
作者: Lin, Wei-Song Yang, Ping-Chieh Natl Taiwan Univ Dept Elect Engn Inst Elect Engn 1 Sec 4Roosevelt Rd Taipei 106 Taiwan
Autonomous drive of wheeled mobile robot (WMR) needs implementing velocity and path tracking control subject to complex dynamical constraints. Conventionally, this control design is obtained by analysis and synthesis ... 详细信息
来源: 评论
adaptive critic designs for discrete-time zero-sum games with application to H control
收藏 引用
ieee TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS 2007年 第1期37卷 240-247页
作者: Al-Tamimi, Asma Abu-Khalaf, Murad Lewis, Frank L. Univ Texas Automat & Robot Res Inst Ft Worth TX 76118 USA
In this correspondence, adaptive critic approximate dynamic programming designs are derived to solve the discrete-time zero-sum game in which the state and action spaces are continuous. This results in a forward-in-ti... 详细信息
来源: 评论
Dual representations for dynamic programming and reinforcement learning
Dual representations for dynamic programming and reinforceme...
收藏 引用
ieee International symposium on Approximate dynamic programming and reinforcement learning
作者: Wang, Tao Bowling, Michael Schuurmans, Dale Univ Alberta Dept Comp Sci Edmonton AB Canada
We investigate the dual approach to dynamic programming and reinforcement learning, based on maintaining an explicit representation of stationary distributions as opposed to value functions. A significant advantage of... 详细信息
来源: 评论
Toward effective combination of off-line and on-line training in ADP framework
Toward effective combination of off-line and on-line trainin...
收藏 引用
ieee International symposium on Approximate dynamic programming and reinforcement learning
作者: Prokhorov, Danil Toyota Technol Ctr Ann Arbor MI 48105 USA
We are interested in finding the most effective combination between off-line and on-line/real-time training in approximate dynamic programming. We introduce our approach of combining proven off-line methods of trainin... 详细信息
来源: 评论
Randomly sampling actions in dynamic programming
Randomly sampling actions in dynamic programming
收藏 引用
ieee International symposium on Approximate dynamic programming and reinforcement learning
作者: Atkeson, Christopher G. Carnegie Mellon Univ Inst Robot Pittsburgh PA 15213 USA
We describe an approach towards reducing the curse of dimensionality for deterministic dynamic programming with continuous actions by randomly sampling actions while computing a steady state value function and policy.... 详细信息
来源: 评论
Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof
Discrete-time nonlinear HJB solution using approximate dynam...
收藏 引用
ieee International symposium on Approximate dynamic programming and reinforcement learning
作者: Al-Tamimi, Asma Lewis, Frank Univ Texas Automat & Robot Res Inst Ft Worth TX 76118 USA Univ Texas Arlington Automat & Robot Res Inst Ft Worth TX 76118 USA
In this paper, a greedy iteration scheme based on approximate dynamic programming (ADP), namely Heuristic dynamic programming (HDP), is used to solve for the value function of the Hamilton Jacobi Bellman equation (HJB... 详细信息
来源: 评论
The knowledge gradient policy for offline learning with independent normal rewards
The knowledge gradient policy for offline learning with inde...
收藏 引用
ieee International symposium on Approximate dynamic programming and reinforcement learning
作者: Frazier, Peter Powell, Warren Princeton Univ Dept Operat Res & Financial Engn Princeton NJ 08544 USA
We define a new type of policy, the knowledge gradient policy, in the context of an offline learning problem. We show how to compute the knowledge gradient policy efficiently and demonstrate through Monte Carlo simula... 详细信息
来源: 评论
On a successful application of multi-agent reinforcement learning to operations research benchmarks
On a successful application of multi-agent reinforcement lea...
收藏 引用
ieee International symposium on Approximate dynamic programming and reinforcement learning
作者: Gabel, Thomas Riedmiller, Martin Univ Osnabruck Dept Math & Comp Sci Inst Cognit Sci D-49069 Osnabruck Germany
In this paper, we suggest and analyze the use of approximate reinforcement learning techniques for a new category of challenging benchmark problems from the field of Operations Research. We demonstrate that interpreti... 详细信息
来源: 评论
Model-based reinforcement learning in factored-state MDPs
Model-based reinforcement learning in factored-state MDPs
收藏 引用
ieee International symposium on Approximate dynamic programming and reinforcement learning
作者: Strehl, Alexander L. Rutgers State Univ Dept Comp Sci Piscataway NJ 08854 USA
We consider the problem of learning in a factored state Markov Decision Process that is structured to allow a compact representation. We show that the well-known algorithm, factored Rmax, performs near-optimally on al... 详细信息
来源: 评论
reinforcement learning in continuous action spaces
Reinforcement learning in continuous action spaces
收藏 引用
ieee International symposium on Approximate dynamic programming and reinforcement learning
作者: van Hasselt, Hado Wiering, Marco A. Univ Utrecht Dept Informat & Comp Sci Intelligent Syst Grp Padualaan 14 NL-3508 TB Utrecht Netherlands
Quite some research has been done on reinforcement learning in continuous environments, but the research on problems where the actions can also be chosen from a continuous space is much more limited. We present a new ... 详细信息
来源: 评论