咨询与建议

限定检索结果

文献类型

  • 746 篇 会议
  • 270 篇 期刊文献
  • 4 册 图书

馆藏范围

  • 1,020 篇 电子文献
  • 1 种 纸本馆藏

日期分布

学科分类号

  • 711 篇 工学
    • 520 篇 计算机科学与技术...
    • 380 篇 电气工程
    • 278 篇 控制科学与工程
    • 153 篇 软件工程
    • 79 篇 信息与通信工程
    • 40 篇 交通运输工程
    • 23 篇 仪器科学与技术
    • 20 篇 机械工程
    • 9 篇 生物工程
    • 8 篇 电子科学与技术(可...
    • 7 篇 力学(可授工学、理...
    • 7 篇 土木工程
    • 6 篇 动力工程及工程热...
    • 6 篇 石油与天然气工程
    • 4 篇 生物医学工程(可授...
    • 3 篇 材料科学与工程(可...
    • 3 篇 化学工程与技术
    • 3 篇 航空宇航科学与技...
    • 3 篇 安全科学与工程
  • 118 篇 理学
    • 98 篇 数学
    • 32 篇 系统科学
    • 22 篇 统计学(可授理学、...
    • 10 篇 生物学
    • 8 篇 物理学
    • 4 篇 化学
  • 66 篇 管理学
    • 63 篇 管理科学与工程(可...
    • 14 篇 工商管理
    • 5 篇 图书情报与档案管...
  • 5 篇 经济学
    • 4 篇 应用经济学
  • 3 篇 法学
    • 3 篇 社会学
  • 2 篇 医学
  • 1 篇 教育学

主题

  • 312 篇 reinforcement le...
  • 216 篇 dynamic programm...
  • 206 篇 optimal control
  • 107 篇 adaptive dynamic...
  • 104 篇 adaptive dynamic...
  • 97 篇 learning
  • 88 篇 neural networks
  • 78 篇 heuristic algori...
  • 68 篇 reinforcement le...
  • 58 篇 learning (artifi...
  • 54 篇 nonlinear system...
  • 53 篇 convergence
  • 51 篇 control systems
  • 51 篇 mathematical mod...
  • 48 篇 approximate dyna...
  • 44 篇 approximation al...
  • 43 篇 equations
  • 42 篇 adaptive control
  • 41 篇 artificial neura...
  • 41 篇 cost function

机构

  • 41 篇 chinese acad sci...
  • 27 篇 univ rhode isl d...
  • 17 篇 tianjin univ sch...
  • 16 篇 univ sci & techn...
  • 16 篇 univ illinois de...
  • 15 篇 northeastern uni...
  • 14 篇 beijing normal u...
  • 13 篇 northeastern uni...
  • 13 篇 guangdong univ t...
  • 12 篇 northeastern uni...
  • 9 篇 natl univ def te...
  • 8 篇 ieee
  • 8 篇 univ chinese aca...
  • 7 篇 univ chinese aca...
  • 7 篇 cent south univ ...
  • 7 篇 southern univ sc...
  • 7 篇 beijing univ tec...
  • 6 篇 chinese acad sci...
  • 6 篇 missouri univ sc...
  • 5 篇 nanjing univ pos...

作者

  • 54 篇 liu derong
  • 37 篇 wei qinglai
  • 29 篇 he haibo
  • 22 篇 wang ding
  • 21 篇 xu xin
  • 19 篇 jiang zhong-ping
  • 17 篇 lewis frank l.
  • 17 篇 yang xiong
  • 17 篇 zhang huaguang
  • 17 篇 ni zhen
  • 16 篇 zhao bo
  • 15 篇 gao weinan
  • 14 篇 zhao dongbin
  • 13 篇 zhong xiangnan
  • 12 篇 si jennie
  • 12 篇 derong liu
  • 10 篇 jagannathan s.
  • 10 篇 dongbin zhao
  • 10 篇 song ruizhuo
  • 9 篇 abouheaf mohamme...

语言

  • 994 篇 英文
  • 20 篇 其他
  • 6 篇 中文
检索条件"任意字段=IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning"
1020 条 记 录,以下是731-740 订阅
排序:
High-order local dynamic programming
High-order local dynamic programming
收藏 引用
作者: Tassa, Yuval Todorov, Emanuel Interdisciplinary Center for Neural Computation Hebrew University Jerusalem Israel Applied Mathematics and Computer Science and Engineering University of Washington Seattle United States
We describe a new local dynamic programming algorithm for solving stochastic continuous Optimal Control problems. We use cubature integration to both propagate the state distribution and perform the Bellman backup. Th... 详细信息
来源: 评论
Online adaptive learning of optimal control solutions using integral reinforcement learning
Online adaptive learning of optimal control solutions using ...
收藏 引用
作者: Vamvoudakis, Kyriakos G. Vrabie, Draguna Lewis, Frank L. Automation and Robotics Research Institute University of Texas at Arlington Fort Worth TX 76118 United States
In this paper we introduce an online algorithm that uses integral reinforcement knowledge for learning the continuous-time optimal control solution for nonlinear systems with infinite horizon costs and partial knowled... 详细信息
来源: 评论
A Neural Architecture to Address reinforcement learning Problems
A Neural Architecture to Address Reinforcement Learning Prob...
收藏 引用
International Joint Conference on Neural Networks (IJCNN)
作者: de Arruda, Rodrigo L. S. Von Zuben, Fernando J. Univ Campinas UNICAMP Sch Elect & Comp Engn FEEC Dept Comp Engn & Ind Automat DCA Lab Bioinformat & Bioinspired Comp LBiC Campinas SP Brazil
In this paper, the reinforcement learning problem is formulated equivalently to a Markov Decision Process. We address the solution of such problem using a novel adaptive dynamic programming algorithm which is based on... 详细信息
来源: 评论
Optimal Control for a Class of Unknown Nonlinear Systems via the Iterative GDHP Algorithm
Optimal Control for a Class of Unknown Nonlinear Systems via...
收藏 引用
8th International symposium on Neural Networks
作者: Wang, Ding Liu, Derong Chinese Acad Sci Inst Automat Beijing 100190 Peoples R China
Using the neural-network-based iterative adaptive dynamic programming (ADP) algorithm, an optimal control scheme for a class of unknown discrete-time nonlinear systems with discount factor in the cost function is prop... 详细信息
来源: 评论
adaptive Dual Heuristic programming Based on Delta-Bar-Delta learning Rule
Adaptive Dual Heuristic Programming Based on Delta-Bar-Delta...
收藏 引用
8th International symposium on Neural Networks
作者: Wu, Jun Xu, Xin Lian, Chuanqiang Huang, Yan Natl Univ Def Technol Coll Mechatron & Automat Inst Automat Changsha 410073 Hunan Peoples R China
Dual Heuristic programming (DHP) is a class of approximate dynamic programming methods using neural networks. Although there have been some successful applications of DHP, its performance and convergence are greatly i... 详细信息
来源: 评论
reinforcement learning with adaptive Kanerva Coding for Xpilot Game AI
Reinforcement Learning with Adaptive Kanerva Coding for Xpil...
收藏 引用
ieee Congress on Evolutionary Computation (CEC)
作者: Allen, Martin Fritzsche, Phil Univ Wisconsin Dept Comp Sci La Crosse WI 54601 USA Coll New London Comp Sci Dept Connecticut New London CT USA
The Xpilot-AI video game platform allows the creation of artificially intelligent and autonomous control agents. At the same time, the Xpilot environment is highly complex, with very many state variables and action ch... 详细信息
来源: 评论
A new approach for power management in sensor node based on reinforcement learning
A new approach for power management in sensor node based on ...
收藏 引用
International symposium on Computer Networks and Distributed Systems
作者: Kianpisheh, Somayeh Charkari, Nasrolah Moghadam Faculty of Electrical and Computer Engineering Tarbiat Modares University Tehran Iran
Wireless sensor networks are composed of small nodes with limited battery life and computational ability. Energy reduction in these networks is an important issue to extend network lifetime. dynamic power management i... 详细信息
来源: 评论
Hierarchical Approximate Policy Iteration with Binary-Tree State Space Decomposition
收藏 引用
ieee TRANSACTIONS ON NEURAL NETWORKS 2011年 第12期22卷 1863-1877页
作者: Xu, Xin Liu, Chunming Yang, Simon X. Hu, Dewen Natl Univ Def Technol Coll Mechatron & Automat Changsha 410073 Hunan Peoples R China Univ Guelph Sch Engn Guelph ON N1G 2W1 Canada
In recent years, approximate policy iteration (API) has attracted increasing attention in reinforcement learning (RL), e. g., least-squares policy iteration (LSPI) and its kernelized version, the kernel-based LSPI alg... 详细信息
来源: 评论
Bayesian active learning with basis functions
Bayesian active learning with basis functions
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (ADPRL)
作者: Ilya O. Ryzhov Warren B. Powell Operations Research and Financial Engineering Princeton University Princeton NJ USA
A common technique for dealing with the curse of dimensionality in approximate dynamic programming is to use a parametric value function approximation, where the value of being in a state is assumed to be a linear com... 详细信息
来源: 评论
adaptive dynamic programming with balanced weights seeking strategy
Adaptive dynamic programming with balanced weights seeking s...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (ADPRL)
作者: Jian Fu Haibo He Zhen Ni School of Automation Wuhan University of Technology Wuhan Hubei China Department of Electrical Computer and Biomedical Engineering University of Rhode Island Kingston RI USA
In this paper we propose to integrate the recursive Levenberg-Marquardt method into the adaptive dynamic programming (ADP) design for improved learning and adaptive control performance. Our key motivation is to consid... 详细信息
来源: 评论