咨询与建议

限定检索结果

文献类型

  • 229 篇 会议
  • 18 篇 期刊文献

馆藏范围

  • 247 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 113 篇 工学
    • 103 篇 计算机科学与技术...
    • 42 篇 软件工程
    • 38 篇 电气工程
    • 23 篇 控制科学与工程
    • 5 篇 信息与通信工程
    • 3 篇 机械工程
    • 2 篇 力学(可授工学、理...
    • 1 篇 仪器科学与技术
    • 1 篇 建筑学
    • 1 篇 化学工程与技术
    • 1 篇 交通运输工程
  • 27 篇 理学
    • 25 篇 数学
    • 7 篇 系统科学
    • 6 篇 统计学(可授理学、...
    • 1 篇 物理学
    • 1 篇 化学
    • 1 篇 大气科学
  • 10 篇 管理学
    • 8 篇 管理科学与工程(可...
    • 3 篇 工商管理
    • 2 篇 图书情报与档案管...
  • 2 篇 经济学
    • 2 篇 应用经济学
  • 1 篇 法学
    • 1 篇 社会学

主题

  • 95 篇 dynamic programm...
  • 54 篇 optimal control
  • 51 篇 learning
  • 44 篇 reinforcement le...
  • 35 篇 learning (artifi...
  • 27 篇 equations
  • 25 篇 neural networks
  • 22 篇 heuristic algori...
  • 20 篇 convergence
  • 20 篇 control systems
  • 18 篇 function approxi...
  • 18 篇 mathematical mod...
  • 16 篇 approximation al...
  • 15 篇 vectors
  • 15 篇 cost function
  • 14 篇 markov processes
  • 14 篇 nonlinear system...
  • 14 篇 artificial neura...
  • 13 篇 stochastic proce...
  • 12 篇 adaptive dynamic...

机构

  • 10 篇 chinese acad sci...
  • 5 篇 school of inform...
  • 4 篇 northeastern uni...
  • 4 篇 department of el...
  • 4 篇 department of in...
  • 3 篇 department of el...
  • 3 篇 automation and r...
  • 3 篇 department of el...
  • 3 篇 robotics institu...
  • 3 篇 key laboratory o...
  • 3 篇 natl univ def te...
  • 3 篇 univ illinois de...
  • 2 篇 department of ar...
  • 2 篇 school of electr...
  • 2 篇 univ groningen i...
  • 2 篇 univ texas autom...
  • 2 篇 colorado state u...
  • 2 篇 guangxi univ sch...
  • 2 篇 national science...
  • 2 篇 informatics inst...

作者

  • 13 篇 liu derong
  • 7 篇 hado van hasselt
  • 7 篇 marco a. wiering
  • 7 篇 dongbin zhao
  • 6 篇 zhao dongbin
  • 5 篇 xu xin
  • 5 篇 lewis frank l.
  • 5 篇 huaguang zhang
  • 5 篇 wei qinglai
  • 5 篇 derong liu
  • 5 篇 warren b. powell
  • 4 篇 haibo he
  • 4 篇 jagannathan s.
  • 4 篇 frank l. lewis
  • 4 篇 zhang huaguang
  • 4 篇 ni zhen
  • 4 篇 yanhong luo
  • 4 篇 wang ding
  • 4 篇 he haibo
  • 4 篇 damien ernst

语言

  • 246 篇 英文
  • 1 篇 其他
检索条件"任意字段=2014 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, ADPRL 2014"
247 条 记 录,以下是131-140 订阅
排序:
Longitudinal Control of Hypersonic Vehicles Based on Direct Heuristic dynamic programming Using ANFIS
Longitudinal Control of Hypersonic Vehicles Based on Direct ...
收藏 引用
International Joint Conference on Neural Networks (IJCNN)
作者: Luo, Xiong Chen, Yi Si, Jennie Liu, Feng USTB Sch Comp & Commun Engn Beijing 100083 Peoples R China Arizona State Univ Sch Elect Comp & Energy Engn Tempe AZ 85287 USA
Since the launch of the scramjet, recent years have witnessed a growing interest in the study of airbreathing hypersonic vehicles. Due to its strong coupling characteristics, high nonlinearity, and uncertain parameter... 详细信息
来源: 评论
Beyond exponential utility functions: A variance-adjusted approach for risk-averse reinforcement learning
Beyond exponential utility functions: A variance-adjusted ap...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (adprl)
作者: Abhijit A. Gosavi Sajal K. Das Susan L. Murray Department of Engineering Management and Systems Engineering Missouri University of Science and Technology Rolla MO Department of Computer Science Missouri University of Science and Technology Rolla MO
Utility theory has served as a bedrock for modeling risk in economics. Where risk is involved in decision-making, for solving Markov decision processes (MDPs) via utility theory, the exponential utility (EU) function ... 详细信息
来源: 评论
A comparison of approximate dynamic programming techniques on benchmark energy storage problems: Does anything work?
A comparison of approximate dynamic programming techniques o...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (adprl)
作者: Daniel R. Jiang Thuy V. Pham Warren B. Powell Daniel F. Salas Warren R. Scott Department of Electrical & Electronics Enzineering Dehradun India Graphic Era University Dehradun India School of Rlectronics Dehradun India Graphic Era Hill University Bhimtal India
As more renewable, yet volatile, forms of energy like solar and wind are being incorporated into the grid, the problem of finding optimal control policies for energy storage is becoming increasingly important. These s... 详细信息
来源: 评论
Finite-horizon optimal control design for uncertain linear discrete-time systems
Finite-horizon optimal control design for uncertain linear d...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (adprl)
作者: Qiming Zhao Hao Xu S. Jagannathan Department of Electrical and Computer Engineering Missouri University of Science and Technology Rolla MO USA
In this paper, the finite-horizon optimal adaptive control design for linear discrete-time systems with unknown system dynamics by using adaptive dynamic programming (ADP) is presented. In the presence of full state f... 详细信息
来源: 评论
High-order local dynamic programming
High-order local dynamic programming
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (adprl)
作者: Yuval Tassa Emanuel Todorov Interdisciplinary Center of Neural Computation Hebrew University Jerusalem Israel Applied Mathematics and Computer Science & Engineering University of Washington Seattle USA
We describe a new local dynamic programming algorithm for solving stochastic continuous Optimal Control problems. We use cubature integration to both propagate the state distribution and perform the Bellman backup. Th... 详细信息
来源: 评论
Bayesian active learning with basis functions
Bayesian active learning with basis functions
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (adprl)
作者: Ilya O. Ryzhov Warren B. Powell Operations Research and Financial Engineering Princeton University Princeton NJ USA
A common technique for dealing with the curse of dimensionality in approximate dynamic programming is to use a parametric value function approximation, where the value of being in a state is assumed to be a linear com... 详细信息
来源: 评论
N-step optimal time-invariant trajectory tracking control for a class of nonlinear systems
N-step optimal time-invariant trajectory tracking control fo...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (adprl)
作者: Ruizhuo Song Huaguang Zhang School of Information Science and Engineering Northeastern University Shenyang China
In this paper, the time-invariant trajectory tracking control problem under N-step control is solved by finite horizon approximate dynamic programming (ADP) algorithms. At first, we convert the tracking control proble... 详细信息
来源: 评论
Optimal control for a class of nonlinear systems with state delay based on adaptive dynamic programming with ε-error bound
Optimal control for a class of nonlinear systems with state ...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (adprl)
作者: Xiaofeng Lin Nuyun Cao Yuzhang Lin School of Electrical Engineering Guangxi University Nanning China Department of Electrical Engineering Tsinghua University Beijing China
In this paper, a finite-horizon ε-optimal control for a class of nonlinear systems with state delay is proposed by adaptive dynamic programming (ADP) algorithm. First of all, the performance index function is defined... 详细信息
来源: 评论
Feedback controller parameterizations for reinforcement learning
Feedback controller parameterizations for Reinforcement Lear...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (adprl)
作者: John W. Roberts Ian R. Manchester Russ Tedrake MIT CSAIL Cambridge MA USA
reinforcement learning offers a very general framework for learning controllers, but its effectiveness is closely tied to the controller parameterization used. Especially when learning feedback controllers for weakly ... 详细信息
来源: 评论
adaptive dynamic programming for optimal control of unknown nonlinear discrete-time systems
Adaptive dynamic programming for optimal control of unknown ...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (adprl)
作者: Derong Liu Ding Wang Dongbin Zhao Key Laboratory of Complex Systems and Intelligence Science Institute of Automation Chinese Academy and Sciences Beijing China
An intelligent optimal control scheme for unknown nonlinear discrete-time systems with discount factor in the cost function is proposed in this paper. An iterative adaptive dynamic programming (ADP) algorithm via glob... 详细信息
来源: 评论