咨询与建议

限定检索结果

文献类型

  • 748 篇 会议
  • 271 篇 期刊文献
  • 4 册 图书

馆藏范围

  • 1,023 篇 电子文献
  • 1 种 纸本馆藏

日期分布

学科分类号

  • 712 篇 工学
    • 520 篇 计算机科学与技术...
    • 381 篇 电气工程
    • 278 篇 控制科学与工程
    • 153 篇 软件工程
    • 79 篇 信息与通信工程
    • 40 篇 交通运输工程
    • 23 篇 仪器科学与技术
    • 20 篇 机械工程
    • 9 篇 生物工程
    • 8 篇 电子科学与技术(可...
    • 7 篇 力学(可授工学、理...
    • 7 篇 土木工程
    • 6 篇 动力工程及工程热...
    • 6 篇 石油与天然气工程
    • 4 篇 生物医学工程(可授...
    • 3 篇 材料科学与工程(可...
    • 3 篇 化学工程与技术
    • 3 篇 航空宇航科学与技...
    • 3 篇 安全科学与工程
  • 118 篇 理学
    • 98 篇 数学
    • 32 篇 系统科学
    • 22 篇 统计学(可授理学、...
    • 10 篇 生物学
    • 8 篇 物理学
    • 4 篇 化学
  • 66 篇 管理学
    • 63 篇 管理科学与工程(可...
    • 14 篇 工商管理
    • 5 篇 图书情报与档案管...
  • 5 篇 经济学
    • 4 篇 应用经济学
  • 3 篇 法学
    • 3 篇 社会学
  • 2 篇 医学
  • 1 篇 教育学

主题

  • 313 篇 reinforcement le...
  • 216 篇 dynamic programm...
  • 206 篇 optimal control
  • 107 篇 adaptive dynamic...
  • 104 篇 adaptive dynamic...
  • 97 篇 learning
  • 88 篇 neural networks
  • 78 篇 heuristic algori...
  • 68 篇 reinforcement le...
  • 58 篇 learning (artifi...
  • 54 篇 nonlinear system...
  • 53 篇 convergence
  • 51 篇 control systems
  • 51 篇 mathematical mod...
  • 48 篇 approximate dyna...
  • 44 篇 approximation al...
  • 43 篇 equations
  • 42 篇 adaptive control
  • 41 篇 artificial neura...
  • 41 篇 cost function

机构

  • 41 篇 chinese acad sci...
  • 27 篇 univ rhode isl d...
  • 17 篇 tianjin univ sch...
  • 16 篇 univ sci & techn...
  • 16 篇 univ illinois de...
  • 15 篇 northeastern uni...
  • 14 篇 beijing normal u...
  • 13 篇 northeastern uni...
  • 13 篇 guangdong univ t...
  • 12 篇 northeastern uni...
  • 9 篇 natl univ def te...
  • 8 篇 ieee
  • 8 篇 univ chinese aca...
  • 7 篇 univ chinese aca...
  • 7 篇 cent south univ ...
  • 7 篇 southern univ sc...
  • 7 篇 beijing univ tec...
  • 6 篇 chinese acad sci...
  • 6 篇 missouri univ sc...
  • 5 篇 nanjing univ pos...

作者

  • 54 篇 liu derong
  • 37 篇 wei qinglai
  • 29 篇 he haibo
  • 22 篇 wang ding
  • 21 篇 xu xin
  • 19 篇 jiang zhong-ping
  • 17 篇 lewis frank l.
  • 17 篇 yang xiong
  • 17 篇 zhang huaguang
  • 17 篇 ni zhen
  • 16 篇 zhao bo
  • 15 篇 gao weinan
  • 14 篇 zhao dongbin
  • 13 篇 derong liu
  • 13 篇 zhong xiangnan
  • 12 篇 si jennie
  • 10 篇 jagannathan s.
  • 10 篇 dongbin zhao
  • 10 篇 song ruizhuo
  • 9 篇 abouheaf mohamme...

语言

  • 992 篇 英文
  • 25 篇 其他
  • 6 篇 中文
检索条件"任意字段=IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning"
1023 条 记 录,以下是771-780 订阅
排序:
ieee SSCI 2011 - symposium Series on Computational Intelligence - ieee ALIFE 2011: 2011 ieee symposium on Artificial Life
IEEE SSCI 2011 - Symposium Series on Computational Intellige...
收藏 引用
symposium Series on Computational Intelligence, ieee SSCI 2011 - 2011 ieee symposium on Artificial Life, ieee ALIFE 2011
The proceedings contain 30 papers. The topics discussed include: computation of population spatial distribution in individual-based ecosystem simulation;towards imitation-enhanced reinforcement learning in multi-agent...
来源: 评论
Active exploration for robot parameter selection in episodic reinforcement learning
Active exploration for robot parameter selection in episodic...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (ADPRL)
作者: Oliver Kroemer Jan Peters Max-Planck Institute Tubingen Germany
As the complexity of robots and other autonomous systems increases, it becomes more important that these systems can adapt and optimize their settings actively. However, such optimization is rarely trivial. Sampling f... 详细信息
来源: 评论
Directed exploration of policy space using support vector classifiers
Directed exploration of policy space using support vector cl...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (ADPRL)
作者: Ioannis Rexakis Michail G. Lagoudakis Department of Electronic and Computer Engineering Technical University of Crete Crete Greece
Good policies in reinforcement learning problems typically exhibit significant structure. Several recent learning approaches based on the approximate policy iteration scheme suggest the use of classifiers for capturin... 详细信息
来源: 评论
2011 ieee International symposium on Intelligent Control, ISIC 2011
2011 IEEE International Symposium on Intelligent Control, IS...
收藏 引用
2011 ieee International symposium on Intelligent Control, ISIC 2011
The proceedings contain 39 papers. The topics discussed include: optimal network localization by particle swarm optimization;a framework for adaptive tuning of distributed model predictive controllers by Lagrange mult...
来源: 评论
Approximate dynamic programming for stochastic systems with additive and multiplicative noise
Approximate dynamic programming for stochastic systems with ...
收藏 引用
ieee International symposium on Intelligent Control (ISIC)
作者: Yu Jiang Zhong-Ping Jiang Department of Electrical and Computer Engineering Polytechnic Institute of New York University Brooklyn NY USA College of Engineering Beijing University China
This paper studies the stochastic optimal control problem with additive and multiplicative noise via reinforcement learning (RL) and approximate/adaptive dynamic programming (ADP). Using Itô calculus, a policy it... 详细信息
来源: 评论
Model-free H∞ stochastic optimal design for unknown linear networked control system zero-sum games via Q-learning
Model-free H∞ stochastic optimal design for unknown linear ...
收藏 引用
ieee International symposium on Intelligent Control (ISIC)
作者: Hao Xu S. Jagannathan Department of Electrical and Computer Engineering Missouri University of Science and Technology USA
In this paper, stochastic optimal strategy for unknown linear networked control system (NCS) quadratic zero-sum games related to H ∞ optimal control in the presence of random delays and packet losses is solved in fo... 详细信息
来源: 评论
Direct adaptive control of a flexible robot using reinforcement learning
Direct adaptive control of a flexible robot using reinforcem...
收藏 引用
International Conference on Industrial Electronics, Control and Robotics
作者: Subudhi, Bidyadhar Pradhan, Santanu Kumar
This paper proposes a new adaptive control using the concept of reinforcement learning to address adaptivity for varied payload conditions for a two-link flexible manipulator (TLFM). The application of reinforcement l... 详细信息
来源: 评论
adaptive Fuzzy Control of Switched Objective Functions in Pursuit-Evasion Scenarios  49
Adaptive Fuzzy Control of Switched Objective Functions in Pu...
收藏 引用
49th ieee Conference on Decision and Control (CDC)
作者: Goode, Brian Kurdila, Andrew Roan, Mike Virginia Polytech Inst & State Univ Dept Mech Engn Blacksburg VA 24060 USA
In recent efforts, the authors have derived simple switched control schemes that qualitatively yield an attractive performance in two player pursuit-evasion games. A drawback of these methods is that detailed knowledg... 详细信息
来源: 评论
A hierarchical learning architecture with multiple-goal representations based on adaptive dynamic programming
A hierarchical learning architecture with multiple-goal repr...
收藏 引用
2010 International Conference on Networking, Sensing and Control, ICNSC 2010
作者: He, Haibo Liu, Bo Department of Electrical Computer and Biomedical Engineering University of Rhode Island Kingston RI 02881 United States Department of Electrical and Computer Engineering Stevens Institute of Technology Hoboken NJ 07030 United States
In this paper we propose a hierarchical learning architecture with multiple-goal representations based on adaptive dynamic programming (ADP). The key idea of this architecture is to integrate a reference network to pr... 详细信息
来源: 评论
Iterative learning Control of A Class of Fractional Order Nonlinear Systems
Iterative Learning Control of A Class of Fractional Order No...
收藏 引用
ieee International Conference on Control Applications Part of 2010 ieee Multi-Conference on Systems and Control
作者: Li, Yan Ahn, Hyo-Sung Chen, YangQuan Shandong Univ Sch Control Sci & Engn Jinan 250061 Shandong Peoples R China Gwangju Inst Sci & Technol 1 Oryong Dong Gwangju South Korea Utah State Univ Ctr Self Organizing & Intelligent Syst Dept Elect & Comp Engn Logan UT 84322 USA
This paper firstly addresses the convergence analysis of iterative learning control of a class of fractional order nonlinear systems using the generalized Gronwall-Bellman lemma. Detailed problem definition and conver... 详细信息
来源: 评论