Search query: "Any field = IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning"
1,023 records; showing results 641-650
A Combined Hierarchical Reinforcement Learning Based Approach For Multi-robot Cooperative Target Searching in Complex Unknown Environments
4th IEEE International Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)
Authors: Cai, Yifan; Yang, Simon X.; Xu, Xin. Affiliations: Univ Guelph Sch Engn Guelph ON N1G 2W1 Canada; Natl Univ Def Technol Coll Mechatron & Automat Changsha 410073 Hunan Peoples R China
Effective cooperation of multi-robots in unknown environments is essential in many robotic applications, such as environment exploration and target searching. In this paper, a combined hierarchical reinforcement learn...
Optimistic Planning for Continuous-Action Deterministic Systems
4th IEEE International Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)
Authors: Busoniu, Lucian; Daniels, Alexander; Munos, Remi; Babuska, Robert. Affiliations: Univ Lorraine CRAN UMR 7039 Nancy France; CNRS CRAN UMR 7039 Nancy France; Delft Univ Technol DCSC Delft Netherlands; INRIA Lille Nord Europe Team SequeL Lille France
We consider the class of online planning algorithms for optimal control, which compared to dynamic programming are relatively unaffected by large state dimensionality. We introduce a novel planning algorithm called SO...
Delayed Insertion and Rule Effect Moderation of Domain Knowledge for Reinforcement Learning
4th IEEE International Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)
Authors: Teng, Teck-Hou; Tan, Ah-Hwee. Affiliations: Nanyang Technol Univ Sch Comp Engn Ctr Computat Intelligence Singapore Singapore; Nanyang Technol Univ Sch Comp Engn Singapore Singapore
Though not a fundamental pre-requisite to efficient machine learning, insertion of domain knowledge into adaptive virtual agent is nonetheless known to improve learning efficiency and reduce model complexity. Conventi...
The Second Order Temporal Difference Error for Sarsa(λ)
4th IEEE International Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)
Authors: Fu, Qiming; Liu, Quan; Xiao, Fei; Chen, Guixin. Affiliation: Soochow Univ Dept Comp Sci & Technol Suzhou Peoples R China
Traditional reinforcement learning algorithms, such as Q-learning, Q(lambda), Sarsa, and Sarsa(lambda), update the action value function using temporal difference (TD) error, which is computed by the last action value...
Reinforcement Learning to Train Ms. Pac-Man Using Higher-order Action-relative Inputs
4th IEEE International Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)
Authors: Bom, Luuk; Henken, Ruud; Wiering, Marco. Affiliation: Univ Groningen Inst Artificial Intelligence & Cognit Engn Fac Math & Nat Sci NL-9700 AB Groningen Netherlands
Reinforcement learning algorithms enable an agent to optimize its behavior from interacting with a specific environment. Although some very successful applications of reinforcement learning algorithms have been develo...
A novel approach for constructing basis functions in approximate dynamic programming for feedback control
2013 4th IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, ADPRL 2013
Authors: Wang, Jian; Huang, Zhenhua; Xu, Xin. Affiliations: College of Mechatronics and Automation National University of Defense Tech Changsha 410073 China; Xi'an Air Force Military Representative Office Xi'an China
This paper presents a novel approach for constructing basis functions in approximate dynamic programming (ADP) through the locally linear embedding (LLE) process. It considers the experience (sample) data as a high-di...
Adaptive Learning in Tracking Control Based on the Dual Critic Network Design
IEEE Transactions on Neural Networks and Learning Systems, 2013, vol. 24, no. 6, pp. 913-928
Authors: Ni, Zhen; He, Haibo; Wen, Jinyu. Affiliations: Univ Rhode Isl Dept Elect Comp & Biomed Engn Kingston RI 02881 USA; Huazhong Univ Sci & Technol Coll Elect Elect & Engn Wuhan 430074 Peoples R China
In this paper, we present a new adaptive dynamic programming approach by integrating a reference network that provides an internal goal representation to help the systems learning and optimization. Specifically, we bu...
Goal Representation Heuristic Dynamic Programming on Maze Navigation
IEEE Transactions on Neural Networks and Learning Systems, 2013, vol. 24, no. 12, pp. 2038-2050
Authors: Ni, Zhen; He, Haibo; Wen, Jinyu; Xu, Xin. Affiliations: Univ Rhode Isl Dept Elect Comp & Biomed Engn Kingston RI 02881 USA; Huazhong Univ Sci & Technol Sch Elect & Elect Engn State Key Lab Adv Electromagnet Engn & Technol Wuhan 430074 Peoples R China; Natl Univ Def Technol Coll Mechatron & Automat Changsha 410073 Hunan Peoples R China
Goal representation heuristic dynamic programming (GrHDP) is proposed in this paper to demonstrate online learning in the Markov decision process. In addition to the (external) reinforcement signal in literature, we d...
Reinforcement learning and approximate dynamic programming for feedback control
2013
Edited by Frank L. Lewis and Derong Liu.
Source: Inner Mongolia University Library (book holdings)
Design and real-time implementation of optimal power system wide area system-centric controller based on temporal difference learning
Conference Record of the IEEE Industry Applications Society Annual Meeting (IAS)
Authors: Reza Yousefian; Sukumar Kamalasadan. Affiliation: Department of Electrical and Computer Engineering University of North Carolina at Charlotte Charlotte NC
In this paper a new method for designing and implementing coordinated wide area controller architecture is presented and tested using real-time digital simulation on a benchmark two area power system model for improve...