咨询与建议

限定检索结果

文献类型

  • 229 篇 会议
  • 18 篇 期刊文献

馆藏范围

  • 247 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 113 篇 工学
    • 103 篇 计算机科学与技术...
    • 42 篇 软件工程
    • 38 篇 电气工程
    • 23 篇 控制科学与工程
    • 5 篇 信息与通信工程
    • 3 篇 机械工程
    • 2 篇 力学(可授工学、理...
    • 1 篇 仪器科学与技术
    • 1 篇 建筑学
    • 1 篇 化学工程与技术
    • 1 篇 交通运输工程
  • 27 篇 理学
    • 25 篇 数学
    • 7 篇 系统科学
    • 6 篇 统计学(可授理学、...
    • 1 篇 物理学
    • 1 篇 化学
    • 1 篇 大气科学
  • 10 篇 管理学
    • 8 篇 管理科学与工程(可...
    • 3 篇 工商管理
    • 2 篇 图书情报与档案管...
  • 2 篇 经济学
    • 2 篇 应用经济学
  • 1 篇 法学
    • 1 篇 社会学

主题

  • 95 篇 dynamic programm...
  • 54 篇 optimal control
  • 51 篇 learning
  • 44 篇 reinforcement le...
  • 35 篇 learning (artifi...
  • 27 篇 equations
  • 25 篇 neural networks
  • 22 篇 heuristic algori...
  • 20 篇 convergence
  • 20 篇 control systems
  • 18 篇 function approxi...
  • 18 篇 mathematical mod...
  • 16 篇 approximation al...
  • 15 篇 vectors
  • 15 篇 cost function
  • 14 篇 markov processes
  • 14 篇 nonlinear system...
  • 14 篇 artificial neura...
  • 13 篇 stochastic proce...
  • 12 篇 adaptive dynamic...

机构

  • 10 篇 chinese acad sci...
  • 5 篇 school of inform...
  • 4 篇 northeastern uni...
  • 4 篇 department of el...
  • 4 篇 department of in...
  • 3 篇 department of el...
  • 3 篇 automation and r...
  • 3 篇 department of el...
  • 3 篇 robotics institu...
  • 3 篇 key laboratory o...
  • 3 篇 natl univ def te...
  • 3 篇 univ illinois de...
  • 2 篇 department of ar...
  • 2 篇 school of electr...
  • 2 篇 univ groningen i...
  • 2 篇 univ texas autom...
  • 2 篇 colorado state u...
  • 2 篇 guangxi univ sch...
  • 2 篇 national science...
  • 2 篇 informatics inst...

作者

  • 13 篇 liu derong
  • 7 篇 hado van hasselt
  • 7 篇 marco a. wiering
  • 7 篇 dongbin zhao
  • 6 篇 zhao dongbin
  • 5 篇 xu xin
  • 5 篇 lewis frank l.
  • 5 篇 huaguang zhang
  • 5 篇 wei qinglai
  • 5 篇 derong liu
  • 5 篇 warren b. powell
  • 4 篇 haibo he
  • 4 篇 jagannathan s.
  • 4 篇 frank l. lewis
  • 4 篇 zhang huaguang
  • 4 篇 ni zhen
  • 4 篇 yanhong luo
  • 4 篇 wang ding
  • 4 篇 he haibo
  • 4 篇 damien ernst

语言

  • 246 篇 英文
  • 1 篇 其他
检索条件"任意字段=2014 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, ADPRL 2014"
247 条 记 录,以下是161-170 订阅
排序:
Real-time tracking on adaptive critic design with uniformly ultimately bounded condition
Real-time tracking on adaptive critic design with uniformly ...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (adprl)
作者: Zhen Ni Xiao Fang Haibo He Dongbin Zhao Xin Xu Department of Electrical University of Rhode Island Kingston RI USA Institute of Automation Chinese Academy of Sciences Beijing China Institute of Automation National University of Defense Technology Changsha China
In this paper, we proposed a new nonlinear tracking controller based on heuristic dynamic programming (HDP) with the tracking filter. Specifically, we integrate a goal network into the regular HDP design and provide t... 详细信息
来源: 评论
reinforcement learning in the game of Othello: learning against a fixed opponent and learning from self-play
Reinforcement learning in the game of Othello: Learning agai...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (adprl)
作者: Michiel van der Ree Marco Wiering Faculty of Mathematics and Natural Sciences University of Groningen Institute of Artificial Intelligence and Cognitive Engineering The Netherlands
This paper compares three strategies in using reinforcement learning algorithms to let an artificial agent learn to play the game of Othello. The three strategies that are compared are: learning by self-play, learning... 详细信息
来源: 评论
reinforcement learning algorithms for solving classification problems
Reinforcement learning algorithms for solving classification...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (adprl)
作者: Marco A. Wiering Hado van Hasselt Auke-Dirk Pietersma Lambert Schomaker Department of Artificial Intelligence University of Groningam Netherlands Multi-agent and Adaptive Computation Centrum Wiskunde and Informatica Netherlands
We describe a new framework for applying reinforcement learning (RL) algorithms to solve classification tasks by letting an agent act on the inputs and learn value functions. This paper describes how classification pr... 详细信息
来源: 评论
Self-learning Cruise Control Using Kernel-Based Least Squares Policy Iteration
收藏 引用
ieee TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY 2014年 第3期22卷 1078-1087页
作者: Wang, Jian Xu, Xin Liu, Daxue Sun, Zhenping Chen, Qingyang Natl Univ Def Technol Coll Mechatron & Automat Changsha 410073 Hunan Peoples R China
This paper presents a novel learning-based cruise controller for autonomous land vehicles (ALVs) with unknown dynamics and external disturbances. The learning controller consists of a time-varying proportional-integra... 详细信息
来源: 评论
A reinforcement learning algorithm developed to model GenCo strategic bidding behavior in multidimensional and continuous state and action spaces
A reinforcement learning algorithm developed to model GenCo ...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (adprl)
作者: Alfred Yong Fu Lau Dipti Srinivasan Thomas Reindl National University of Singapore Singapore SG Department of Electrical Computer Engineering National University of Singapore Singapore Solar Energy Research Institute of Singapore National University of Singapore Singapore
The electricity market has provided a complex economic environment, and consequently has increased the requirement for advancement of learning methods. In the agent-based modeling and simulation framework of this econ... 详细信息
来源: 评论
adaptive computation of optimal nonrandomized policies in constrained average-reward MDPs
Adaptive computation of optimal nonrandomized policies in co...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (adprl)
作者: Eugene A. Feinberg Department of Applied Mathematics and Statistics Stony Brook University Stony Brook NY USA
This paper deals with computation of optimal nonrandomized nonstationary policies and mixed stationary policies for average-reward Markov decision processes with multiple criteria and constraints. We consider problems... 详细信息
来源: 评论
Optimistic planning for continuous-action deterministic systems
Optimistic planning for continuous-action deterministic syst...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (adprl)
作者: Lucian Buşoniu Alexander Daniels Rémi Munos Robert Babuška Department of Automation Technical University of Cluj-Napoca Romania France DCSC Delft University of Technology the Netherlands Team SequeL INRIA Lille-Nord Europe France
We consider the class of online planning algorithms for optimal control, which compared to dynamic programming are relatively unaffected by large state dimensionality. We introduce a novel planning algorithm called SO... 详细信息
来源: 评论
reinforcement learning to train Ms. Pac-Man using higher-order action-relative inputs
Reinforcement learning to train Ms. Pac-Man using higher-ord...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (adprl)
作者: Luuk Bom Ruud Henken Marco Wiering Faculty of Mathematics and Natural Sciences University of Groningen The Netherlands
reinforcement learning algorithms enable an agent to optimize its behavior from interacting with a specific environment. Although some very successful applications of reinforcement learning algorithms have been develo... 详细信息
来源: 评论
Delayed insertion and rule effect moderation of domain knowledge for reinforcement learning
Delayed insertion and rule effect moderation of domain knowl...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (adprl)
作者: Teck-Hou Teng Ah-Hwee Tan School of Computer Engineering Center for Computational Intelligence School of Computer Engineering Nanyang Technological University
Though not a fundamental pre-requisite to efficient machine learning, insertion of domain knowledge into adaptive virtual agent is nonetheless known to improve learning efficiency and reduce model complexity. Conventi... 详细信息
来源: 评论
Particle Swarn Optimized adaptive dynamic programming
Particle Swarn Optimized Adaptive Dynamic Programming
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (adprl)
作者: Dongbin Zhao Jianqiang Yi Derong Liu Key Laboratory of Complex Systems and Intelligence Science Institute of Automation Chinese Academy and Sciences Beijing China Department of Electrical and Computer Engineering University of Illinois Chicago Chicago IL USA
Particle swarm optimization is used for the training of the action network and critic network of the adaptive dynamic programming approach. The typical structures of the adaptive dynamic programming and particle swarm... 详细信息
来源: 评论