咨询与建议

限定检索结果

文献类型

  • 746 篇 会议
  • 270 篇 期刊文献
  • 4 册 图书

馆藏范围

  • 1,020 篇 电子文献
  • 1 种 纸本馆藏

日期分布

学科分类号

  • 711 篇 工学
    • 520 篇 计算机科学与技术...
    • 380 篇 电气工程
    • 278 篇 控制科学与工程
    • 153 篇 软件工程
    • 79 篇 信息与通信工程
    • 40 篇 交通运输工程
    • 23 篇 仪器科学与技术
    • 20 篇 机械工程
    • 9 篇 生物工程
    • 8 篇 电子科学与技术(可...
    • 7 篇 力学(可授工学、理...
    • 7 篇 土木工程
    • 6 篇 动力工程及工程热...
    • 6 篇 石油与天然气工程
    • 4 篇 生物医学工程(可授...
    • 3 篇 材料科学与工程(可...
    • 3 篇 化学工程与技术
    • 3 篇 航空宇航科学与技...
    • 3 篇 安全科学与工程
  • 118 篇 理学
    • 98 篇 数学
    • 32 篇 系统科学
    • 22 篇 统计学(可授理学、...
    • 10 篇 生物学
    • 8 篇 物理学
    • 4 篇 化学
  • 66 篇 管理学
    • 63 篇 管理科学与工程(可...
    • 14 篇 工商管理
    • 5 篇 图书情报与档案管...
  • 5 篇 经济学
    • 4 篇 应用经济学
  • 3 篇 法学
    • 3 篇 社会学
  • 2 篇 医学
  • 1 篇 教育学

主题

  • 312 篇 reinforcement le...
  • 216 篇 dynamic programm...
  • 206 篇 optimal control
  • 107 篇 adaptive dynamic...
  • 104 篇 adaptive dynamic...
  • 97 篇 learning
  • 88 篇 neural networks
  • 78 篇 heuristic algori...
  • 68 篇 reinforcement le...
  • 58 篇 learning (artifi...
  • 54 篇 nonlinear system...
  • 53 篇 convergence
  • 51 篇 control systems
  • 51 篇 mathematical mod...
  • 48 篇 approximate dyna...
  • 44 篇 approximation al...
  • 43 篇 equations
  • 42 篇 adaptive control
  • 41 篇 artificial neura...
  • 41 篇 cost function

机构

  • 41 篇 chinese acad sci...
  • 27 篇 univ rhode isl d...
  • 17 篇 tianjin univ sch...
  • 16 篇 univ sci & techn...
  • 16 篇 univ illinois de...
  • 15 篇 northeastern uni...
  • 14 篇 beijing normal u...
  • 13 篇 northeastern uni...
  • 13 篇 guangdong univ t...
  • 12 篇 northeastern uni...
  • 9 篇 natl univ def te...
  • 8 篇 ieee
  • 8 篇 univ chinese aca...
  • 7 篇 univ chinese aca...
  • 7 篇 cent south univ ...
  • 7 篇 southern univ sc...
  • 7 篇 beijing univ tec...
  • 6 篇 chinese acad sci...
  • 6 篇 missouri univ sc...
  • 5 篇 nanjing univ pos...

作者

  • 54 篇 liu derong
  • 37 篇 wei qinglai
  • 29 篇 he haibo
  • 22 篇 wang ding
  • 21 篇 xu xin
  • 19 篇 jiang zhong-ping
  • 17 篇 lewis frank l.
  • 17 篇 yang xiong
  • 17 篇 zhang huaguang
  • 17 篇 ni zhen
  • 16 篇 zhao bo
  • 15 篇 gao weinan
  • 14 篇 zhao dongbin
  • 13 篇 zhong xiangnan
  • 12 篇 si jennie
  • 12 篇 derong liu
  • 10 篇 jagannathan s.
  • 10 篇 dongbin zhao
  • 10 篇 song ruizhuo
  • 9 篇 abouheaf mohamme...

语言

  • 994 篇 英文
  • 20 篇 其他
  • 6 篇 中文
检索条件"任意字段=IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning"
1020 条 记 录,以下是611-620 订阅
排序:
A comparison of approximate dynamic programming techniques on benchmark energy storage problems: Does anything work?
A comparison of approximate dynamic programming techniques o...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (ADPRL)
作者: Daniel R. Jiang Thuy V. Pham Warren B. Powell Daniel F. Salas Warren R. Scott Department of Electrical & Electronics Enzineering Dehradun India Graphic Era University Dehradun India School of Rlectronics Dehradun India Graphic Era Hill University Bhimtal India
As more renewable, yet volatile, forms of energy like solar and wind are being incorporated into the grid, the problem of finding optimal control policies for energy storage is becoming increasingly important. These s... 详细信息
来源: 评论
Multi-objective reinforcement learning for AUV thruster failure recovery
Multi-objective reinforcement learning for AUV thruster fail...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (ADPRL)
作者: Seyed Reza Ahmadzadeh Petar Kormushev Darwin G. Caldwell Department of Advanced Robotics Istituto Italiano di Tecnologia Genova
This paper investigates learning approaches for discovering fault-tolerant control policies to overcome thruster failures in Autonomous Underwater Vehicles (AUV). The proposed approach is a model-based direct policy s... 详细信息
来源: 评论
Pseudo-MDPs and factored linear action models
Pseudo-MDPs and factored linear action models
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (ADPRL)
作者: Hengshuai Yao Csaba Szepesvári Bernardo Ávila Pires Xinhua Zhang Department of Computing Science University of Alberta Edmonton Alberta Canada Machine Learning Research Group National ICT Australia Sydney and Canberra Australia
In this paper we introduce the concept of pseudo-MDPs to develop abstractions. Pseudo-MDPs relax the requirement that the transition kernel has to be a probability kernel. We show that the new framework captures many ... 详细信息
来源: 评论
Accelerated gradient temporal difference learning algorithms
Accelerated gradient temporal difference learning algorithms
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (ADPRL)
作者: Dominik Meyer Rémy Degenne Ahmed Omrane Hao Shen Institute for Data Processing Technische Universität München Germany
In this paper we study Temporal Difference (TD) learning with linear value function approximation. The classic TD algorithm is known to be unstable with linear function approximation and off-policy learning. Recently ... 详细信息
来源: 评论
A two stage learning technique for dual learning in the pursuit-evasion differential game
A two stage learning technique for dual learning in the purs...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (ADPRL)
作者: Ahmad A. Al-Talabi Howard M. Schwartz Mechatronics Engineering Department Baghdad University Baghdad Iraq Department of Systems and Computer Engineering Carleton University Ottawa ON Canada
This paper addresses the case of dual learning in the pursuit-evasion (PE) differential game and examines how fast the players can learn their default control strategies. The players should learn their default control... 详细信息
来源: 评论
Clipping in Neurocontrol by adaptive dynamic programming
收藏 引用
ieee TRANSACTIONS ON NEURAL NETWORKS AND learning SYSTEMS 2014年 第10期25卷 1909-1920页
作者: Fairbank, Michael Prokhorov, Danil Alonso, Eduardo City Univ London Sch Informat Dept Comp Sci London EC1V OHB England Toyota Res Inst NA Ann Arbor MI 48105 USA
In adaptive dynamic programming, neurocontrol, and reinforcement learning, the objective is for an agent to learn to choose actions so as to minimize a total cost function. In this paper, we show that when discretized... 详细信息
来源: 评论
2014 ieee International symposium on Intelligent Control, ISIC 2014
2014 IEEE International Symposium on Intelligent Control, IS...
收藏 引用
2014 ieee International symposium on Intelligent Control, ISIC 2014
The proceedings contain 56 papers. The topics discussed include: consensus with convergence rate in directed networks with multiple non-differentiable input delays;differentiated consensuses in a stochastic network wi...
来源: 评论
Beyond exponential utility functions: A variance-adjusted approach for risk-averse reinforcement learning
Beyond exponential utility functions: A variance-adjusted ap...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (ADPRL)
作者: Abhijit A. Gosavi Sajal K. Das Susan L. Murray Department of Engineering Management and Systems Engineering Missouri University of Science and Technology Rolla MO Department of Computer Science Missouri University of Science and Technology Rolla MO
Utility theory has served as a bedrock for modeling risk in economics. Where risk is involved in decision-making, for solving Markov decision processes (MDPs) via utility theory, the exponential utility (EU) function ... 详细信息
来源: 评论
Design and real-time implementation of optimal power system wide area system-centric controller based on temporal difference learning
Design and real-time implementation of optimal power system ...
收藏 引用
2014 ieee Industry Application Society Annual Meeting, IAS 2014
作者: Yousefian, Reza Kamalasadan, Sukumar Department of Electrical and Computer Engineering University of North Carolina at Charlotte CharlotteNC United States
In this paper a new method for designing and implementing coordinated wide area controller architecture is presented and tested using real-time digital simulation on a benchmark two area power system model for improve... 详细信息
来源: 评论
symposium on adaptive dynamic programming and reinforcement learning (ieee ADPRL 2011)
Symposium on adaptive dynamic programming and reinforcement ...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (ADPRL)
ADPRL 2011 is the third ieee International symposium on Approximate dynamic programming and reinforcement learning. The area of approximate dynamic programming and reinforcement learning is a fusion of a number of res...
来源: 评论