咨询与建议

限定检索结果

文献类型

  • 228 篇 会议
  • 4 篇 期刊文献

馆藏范围

  • 232 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 98 篇 工学
    • 93 篇 计算机科学与技术...
    • 40 篇 软件工程
    • 25 篇 电气工程
    • 14 篇 控制科学与工程
    • 4 篇 机械工程
    • 1 篇 力学(可授工学、理...
    • 1 篇 信息与通信工程
    • 1 篇 建筑学
    • 1 篇 化学工程与技术
    • 1 篇 交通运输工程
  • 23 篇 理学
    • 23 篇 数学
    • 6 篇 统计学(可授理学、...
    • 4 篇 系统科学
    • 1 篇 化学
    • 1 篇 大气科学
  • 9 篇 管理学
    • 7 篇 管理科学与工程(可...
    • 3 篇 工商管理
    • 2 篇 图书情报与档案管...
  • 2 篇 经济学
    • 2 篇 应用经济学
  • 1 篇 法学
    • 1 篇 社会学

主题

  • 95 篇 dynamic programm...
  • 52 篇 learning
  • 46 篇 optimal control
  • 37 篇 reinforcement le...
  • 34 篇 learning (artifi...
  • 27 篇 equations
  • 22 篇 heuristic algori...
  • 21 篇 control systems
  • 20 篇 convergence
  • 19 篇 neural networks
  • 18 篇 function approxi...
  • 17 篇 mathematical mod...
  • 16 篇 approximation al...
  • 15 篇 vectors
  • 14 篇 markov processes
  • 14 篇 artificial neura...
  • 14 篇 cost function
  • 13 篇 stochastic proce...
  • 12 篇 algorithm design...
  • 12 篇 adaptive control

机构

  • 5 篇 school of inform...
  • 4 篇 northeastern uni...
  • 4 篇 department of el...
  • 4 篇 department of in...
  • 3 篇 department of el...
  • 3 篇 automation and r...
  • 3 篇 northeastern uni...
  • 3 篇 robotics institu...
  • 3 篇 key laboratory o...
  • 3 篇 univ illinois de...
  • 2 篇 department of ar...
  • 2 篇 school of electr...
  • 2 篇 univ groningen i...
  • 2 篇 univ texas autom...
  • 2 篇 colorado state u...
  • 2 篇 guangxi univ sch...
  • 2 篇 national science...
  • 2 篇 informatics inst...
  • 2 篇 college of infor...
  • 2 篇 school of automa...

作者

  • 7 篇 hado van hasselt
  • 7 篇 lewis frank l.
  • 7 篇 marco a. wiering
  • 7 篇 dongbin zhao
  • 6 篇 liu derong
  • 5 篇 huaguang zhang
  • 5 篇 zhang huaguang
  • 5 篇 derong liu
  • 5 篇 warren b. powell
  • 4 篇 xu xin
  • 4 篇 vrabie draguna
  • 4 篇 jagannathan s.
  • 4 篇 frank l. lewis
  • 4 篇 yanhong luo
  • 4 篇 damien ernst
  • 4 篇 jan peters
  • 4 篇 peters jan
  • 4 篇 zhao dongbin
  • 3 篇 xu hao
  • 3 篇 martin riedmille...

语言

  • 232 篇 英文
检索条件"任意字段=2009 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, ADPRL 2009"
232 条 记 录,以下是121-130 订阅
排序:
adaptive dynamic programming-based optimal tracking control for nonlinear systems using general value iteration
Adaptive dynamic programming-based optimal tracking control ...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (adprl)
作者: Xiaofeng Lin Qiang Ding Weikai Kong Chunning Song Qingbao Huang School of Electrical Engineering Guangxi University Nanning China
For the optimal tracking control problem of affine nonlinear systems, a general value iteration algorithm based on adaptive dynamic programming is proposed in this paper. By system transformation, the optimal tracking... 详细信息
来源: 评论
Optimal control for a class of nonlinear systems with state delay based on adaptive dynamic programming with ε-error bound
Optimal control for a class of nonlinear systems with state ...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (adprl)
作者: Xiaofeng Lin Nuyun Cao Yuzhang Lin School of Electrical Engineering Guangxi University Nanning China Department of Electrical Engineering Tsinghua University Beijing China
In this paper, a finite-horizon ε-optimal control for a class of nonlinear systems with state delay is proposed by adaptive dynamic programming (ADP) algorithm. First of all, the performance index function is defined... 详细信息
来源: 评论
ADP-based optimal control for a class of nonlinear discrete-time systems with inequality constraints
ADP-based optimal control for a class of nonlinear discrete-...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (adprl)
作者: Yanhong Luo Geyang Xiao College of Information Science and Engineering Northeastern University
In this paper, the adaptive dynamic programming (ADP) approach is utilized to design a neural-network-based optimal controller for a class of nonlinear discrete-time (DT) systems with inequality constraints. To begin ... 详细信息
来源: 评论
Feedback controller parameterizations for reinforcement learning
Feedback controller parameterizations for Reinforcement Lear...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (adprl)
作者: John W. Roberts Ian R. Manchester Russ Tedrake MIT CSAIL Cambridge MA USA
reinforcement learning offers a very general framework for learning controllers, but its effectiveness is closely tied to the controller parameterization used. Especially when learning feedback controllers for weakly ... 详细信息
来源: 评论
adaptive dynamic programming for optimal control of unknown nonlinear discrete-time systems
Adaptive dynamic programming for optimal control of unknown ...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (adprl)
作者: Derong Liu Ding Wang Dongbin Zhao Key Laboratory of Complex Systems and Intelligence Science Institute of Automation Chinese Academy and Sciences Beijing China
An intelligent optimal control scheme for unknown nonlinear discrete-time systems with discount factor in the cost function is proposed in this paper. An iterative adaptive dynamic programming (ADP) algorithm via glob... 详细信息
来源: 评论
Bayesian active learning with basis functions
Bayesian active learning with basis functions
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (adprl)
作者: Ilya O. Ryzhov Warren B. Powell Operations Research and Financial Engineering Princeton University Princeton NJ USA
A common technique for dealing with the curse of dimensionality in approximate dynamic programming is to use a parametric value function approximation, where the value of being in a state is assumed to be a linear com... 详细信息
来源: 评论
Supervised adaptive dynamic programming based adaptive cruise control
Supervised adaptive dynamic programming based adaptive cruis...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (adprl)
作者: Dongbin Zhao Zhaohui Hu Key Laboratory of Complex Systems and Intelligence Science Institute of Automation Chinese Academy and Sciences Beijing China
This paper proposes a supervised adaptive dynamic programming (SADP) algorithm for the full range adaptive cruise control (ACC) system. The full range ACC system considers both the ACC situation in highway system and ... 详细信息
来源: 评论
N-step optimal time-invariant trajectory tracking control for a class of nonlinear systems
N-step optimal time-invariant trajectory tracking control fo...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (adprl)
作者: Ruizhuo Song Huaguang Zhang School of Information Science and Engineering Northeastern University Shenyang China
In this paper, the time-invariant trajectory tracking control problem under N-step control is solved by finite horizon approximate dynamic programming (ADP) algorithms. At first, we convert the tracking control proble... 详细信息
来源: 评论
Kalman Temporal Differences: The deterministic case
Kalman Temporal Differences: The deterministic case
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (adprl)
作者: Matthieu Geist Olivier Pietquin Gabriel Fricout IMS Research Group Supélec Metz France IMS Research Group Metz France MC cluster ArcelorMittal Research Maizieres-Les-Metz France
This paper deals with value function and Q-function approximation in deterministic Markovian decision processes. A general statistical framework based on the Kalman filtering paradigm is introduced. Its principle is t... 详细信息
来源: 评论
A reinforcement learning approach for sequential mastery testing
A reinforcement learning approach for sequential mastery tes...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (adprl)
作者: El-Sayed M. El-Alfy College of Computer Sciences and Engineering King Fahd University of Petroleum and Minerals Dhahran Saudi Arabia
This paper explores a novel application for reinforcement learning (RL) techniques to sequential mastery testing. In such systems, the goal is to classify each examined person, using the minimal number of test items, ... 详细信息
来源: 评论