咨询与建议

限定检索结果

文献类型

  • 229 篇 会议
  • 18 篇 期刊文献

馆藏范围

  • 247 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 113 篇 工学
    • 103 篇 计算机科学与技术...
    • 42 篇 软件工程
    • 38 篇 电气工程
    • 23 篇 控制科学与工程
    • 5 篇 信息与通信工程
    • 3 篇 机械工程
    • 2 篇 力学(可授工学、理...
    • 1 篇 仪器科学与技术
    • 1 篇 建筑学
    • 1 篇 化学工程与技术
    • 1 篇 交通运输工程
  • 27 篇 理学
    • 25 篇 数学
    • 7 篇 系统科学
    • 6 篇 统计学(可授理学、...
    • 1 篇 物理学
    • 1 篇 化学
    • 1 篇 大气科学
  • 10 篇 管理学
    • 8 篇 管理科学与工程(可...
    • 3 篇 工商管理
    • 2 篇 图书情报与档案管...
  • 2 篇 经济学
    • 2 篇 应用经济学
  • 1 篇 法学
    • 1 篇 社会学

主题

  • 95 篇 dynamic programm...
  • 54 篇 optimal control
  • 51 篇 learning
  • 44 篇 reinforcement le...
  • 35 篇 learning (artifi...
  • 27 篇 equations
  • 25 篇 neural networks
  • 22 篇 heuristic algori...
  • 20 篇 convergence
  • 20 篇 control systems
  • 18 篇 function approxi...
  • 18 篇 mathematical mod...
  • 16 篇 approximation al...
  • 15 篇 vectors
  • 15 篇 cost function
  • 14 篇 markov processes
  • 14 篇 nonlinear system...
  • 14 篇 artificial neura...
  • 13 篇 stochastic proce...
  • 12 篇 adaptive dynamic...

机构

  • 10 篇 chinese acad sci...
  • 5 篇 school of inform...
  • 4 篇 northeastern uni...
  • 4 篇 department of el...
  • 4 篇 department of in...
  • 3 篇 department of el...
  • 3 篇 automation and r...
  • 3 篇 department of el...
  • 3 篇 robotics institu...
  • 3 篇 key laboratory o...
  • 3 篇 natl univ def te...
  • 3 篇 univ illinois de...
  • 2 篇 department of ar...
  • 2 篇 school of electr...
  • 2 篇 univ groningen i...
  • 2 篇 univ texas autom...
  • 2 篇 colorado state u...
  • 2 篇 guangxi univ sch...
  • 2 篇 national science...
  • 2 篇 informatics inst...

作者

  • 13 篇 liu derong
  • 7 篇 hado van hasselt
  • 7 篇 marco a. wiering
  • 7 篇 dongbin zhao
  • 6 篇 zhao dongbin
  • 5 篇 xu xin
  • 5 篇 lewis frank l.
  • 5 篇 huaguang zhang
  • 5 篇 wei qinglai
  • 5 篇 derong liu
  • 5 篇 warren b. powell
  • 4 篇 haibo he
  • 4 篇 jagannathan s.
  • 4 篇 frank l. lewis
  • 4 篇 zhang huaguang
  • 4 篇 ni zhen
  • 4 篇 yanhong luo
  • 4 篇 wang ding
  • 4 篇 he haibo
  • 4 篇 damien ernst

语言

  • 246 篇 英文
  • 1 篇 其他
检索条件"任意字段=2014 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, ADPRL 2014"
247 条 记 录,以下是141-150 订阅
排序:
Supervised adaptive dynamic programming based adaptive cruise control
Supervised adaptive dynamic programming based adaptive cruis...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (adprl)
作者: Dongbin Zhao Zhaohui Hu Key Laboratory of Complex Systems and Intelligence Science Institute of Automation Chinese Academy and Sciences Beijing China
This paper proposes a supervised adaptive dynamic programming (SADP) algorithm for the full range adaptive cruise control (ACC) system. The full range ACC system considers both the ACC situation in highway system and ... 详细信息
来源: 评论
A reinforcement learning approach for sequential mastery testing
A reinforcement learning approach for sequential mastery tes...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (adprl)
作者: El-Sayed M. El-Alfy College of Computer Sciences and Engineering King Fahd University of Petroleum and Minerals Dhahran Saudi Arabia
This paper explores a novel application for reinforcement learning (RL) techniques to sequential mastery testing. In such systems, the goal is to classify each examined person, using the minimal number of test items, ... 详细信息
来源: 评论
adaptive dynamic programming with balanced weights seeking strategy
Adaptive dynamic programming with balanced weights seeking s...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (adprl)
作者: Jian Fu Haibo He Zhen Ni School of Automation Wuhan University of Technology Wuhan Hubei China Department of Electrical Computer and Biomedical Engineering University of Rhode Island Kingston RI USA
In this paper we propose to integrate the recursive Levenberg-Marquardt method into the adaptive dynamic programming (ADP) design for improved learning and adaptive control performance. Our key motivation is to consid... 详细信息
来源: 评论
An adaptive-learning framework for semi-cooperative multi-agent coordination
An adaptive-learning framework for semi-cooperative multi-ag...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (adprl)
作者: Abdeslem Boukhtouta Jean Berger Warren B. Powell Abraham George Defence Research and Development Canada QUE Canada Department of Operations Research and Financial Engineering Princeton University Princeton NJ USA
Complex problems involving multiple agents exhibit varying degrees of cooperation. The levels of cooperation might reflect both differences in information as well as differences in goals. In this research, we develop ... 详细信息
来源: 评论
An approximate dynamic programming based controller for an underactuated 6DoF quadrotor
An approximate Dynamic Programming based controller for an u...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (adprl)
作者: Emanuel Stingu Frank L. Lewis Automation & Robotics Research Institute University of Texas Arlington Arlington TX USA
This paper discusses how the principles of adaptive dynamic programming (ADP) can be applied to the control of a quadrotor helicopter platform flying in an uncontrolled environment and subjected to various disturbance... 详细信息
来源: 评论
Optimistic planning for belief-augmented Markov Decision Processes
Optimistic planning for belief-augmented Markov Decision Pro...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (adprl)
作者: Raphael Fonteneau Lucian Buşoniu Rémi Munos Department of Electrical Engineering and Computer Science University of Liège BELGIUM Universite de Lorraine CRAN FRANCE SequeL Team Inria Lille FRANCE
This paper presents the Bayesian Optimistic Planning (BOP) algorithm, a novel model-based Bayesian reinforcement learning approach. BOP extends the planning approach of the Optimistic Planning for Markov Decision Proc... 详细信息
来源: 评论
An integrated design for intensified direct heuristic dynamic programming
An integrated design for intensified direct heuristic dynami...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (adprl)
作者: Xiong Luo Jennie Si Yuchao Zhou School of Computer and Communication Engineering University of Science and Technology Beijing (USTB) Beijing China Arizona State University Tempe AZ US
There has been a growing interest in the study of adaptive/approximate dynamic programming (ADP) in recent years. The ADP technique provides a powerful tool to understand and improve the principled technologies of mac... 详细信息
来源: 评论
adaptive dynamic programming for terminally constrained finite-horizon optimal control problems  53
Adaptive dynamic programming for terminally constrained fini...
收藏 引用
53rd ieee Annual Conference on Decision and Control (CDC)
作者: Andrews, L. Klotz, J. R. Kamalapurkar, R. Dixon, W. E. Univ Florida Dept Mech & Aerosp Engn Gainesville FL USA
adaptive dynamic programming is applied to control-affine nonlinear systems with uncertain drift dynamics to obtain a near-optimal solution to a finite-horizon optimal control problem with hard terminal constraints. A... 详细信息
来源: 评论
Protecting against evaluation overfitting in empirical reinforcement learning
Protecting against evaluation overfitting in empirical reinf...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (adprl)
作者: Shimon Whiteson Brian Tanner Matthew E. Taylor Peter Stone Informatics Institute University of Amsterdam Netherlands Department of Computing Science University of Alberta Canada Department of Computer Science Lafayette College USA Department of Computer Science University of Texas Austin USA
Empirical evaluations play an important role in machine learning. However, the usefulness of any evaluation depends on the empirical methodology employed. Designing good empirical methodologies is difficult in part be... 详细信息
来源: 评论
Higher-level application of adaptive dynamic programming/reinforcement learning - a next phase for controls and system identification?
Higher-level application of Adaptive Dynamic Programming/Rei...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (adprl)
作者: George G. Lendaris Systems Science Graduate Program Portland State University Portland OR USA
In previous work it was shown that adaptive-Critic-type Approximate dynamic programming could be applied in a “higher-level” way to create autonomous agents capable of using experience to discern context and select ... 详细信息
来源: 评论