咨询与建议

限定检索结果

文献类型

  • 228 篇 会议
  • 4 篇 期刊文献

馆藏范围

  • 232 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 98 篇 工学
    • 93 篇 计算机科学与技术...
    • 40 篇 软件工程
    • 25 篇 电气工程
    • 14 篇 控制科学与工程
    • 4 篇 机械工程
    • 1 篇 力学(可授工学、理...
    • 1 篇 信息与通信工程
    • 1 篇 建筑学
    • 1 篇 化学工程与技术
    • 1 篇 交通运输工程
  • 23 篇 理学
    • 23 篇 数学
    • 6 篇 统计学(可授理学、...
    • 4 篇 系统科学
    • 1 篇 化学
    • 1 篇 大气科学
  • 9 篇 管理学
    • 7 篇 管理科学与工程(可...
    • 3 篇 工商管理
    • 2 篇 图书情报与档案管...
  • 2 篇 经济学
    • 2 篇 应用经济学
  • 1 篇 法学
    • 1 篇 社会学

主题

  • 95 篇 dynamic programm...
  • 52 篇 learning
  • 46 篇 optimal control
  • 37 篇 reinforcement le...
  • 34 篇 learning (artifi...
  • 27 篇 equations
  • 22 篇 heuristic algori...
  • 21 篇 control systems
  • 20 篇 convergence
  • 19 篇 neural networks
  • 18 篇 function approxi...
  • 17 篇 mathematical mod...
  • 16 篇 approximation al...
  • 15 篇 vectors
  • 14 篇 markov processes
  • 14 篇 artificial neura...
  • 14 篇 cost function
  • 13 篇 stochastic proce...
  • 12 篇 algorithm design...
  • 12 篇 adaptive control

机构

  • 5 篇 school of inform...
  • 4 篇 northeastern uni...
  • 4 篇 department of el...
  • 4 篇 department of in...
  • 3 篇 department of el...
  • 3 篇 automation and r...
  • 3 篇 northeastern uni...
  • 3 篇 robotics institu...
  • 3 篇 key laboratory o...
  • 3 篇 univ illinois de...
  • 2 篇 department of ar...
  • 2 篇 school of electr...
  • 2 篇 univ groningen i...
  • 2 篇 univ texas autom...
  • 2 篇 colorado state u...
  • 2 篇 guangxi univ sch...
  • 2 篇 national science...
  • 2 篇 informatics inst...
  • 2 篇 college of infor...
  • 2 篇 school of automa...

作者

  • 7 篇 hado van hasselt
  • 7 篇 lewis frank l.
  • 7 篇 marco a. wiering
  • 7 篇 dongbin zhao
  • 6 篇 liu derong
  • 5 篇 huaguang zhang
  • 5 篇 zhang huaguang
  • 5 篇 derong liu
  • 5 篇 warren b. powell
  • 4 篇 xu xin
  • 4 篇 vrabie draguna
  • 4 篇 jagannathan s.
  • 4 篇 frank l. lewis
  • 4 篇 yanhong luo
  • 4 篇 damien ernst
  • 4 篇 jan peters
  • 4 篇 peters jan
  • 4 篇 zhao dongbin
  • 3 篇 xu hao
  • 3 篇 martin riedmille...

语言

  • 232 篇 英文
检索条件"任意字段=2009 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, ADPRL 2009"
232 条 记 录,以下是161-170 订阅
排序:
reinforcement learning in the game of Othello: learning against a fixed opponent and learning from self-play
Reinforcement learning in the game of Othello: Learning agai...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (adprl)
作者: Michiel van der Ree Marco Wiering Faculty of Mathematics and Natural Sciences University of Groningen Institute of Artificial Intelligence and Cognitive Engineering The Netherlands
This paper compares three strategies in using reinforcement learning algorithms to let an artificial agent learn to play the game of Othello. The three strategies that are compared are: learning by self-play, learning... 详细信息
来源: 评论
reinforcement learning algorithms for solving classification problems
Reinforcement learning algorithms for solving classification...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (adprl)
作者: Marco A. Wiering Hado van Hasselt Auke-Dirk Pietersma Lambert Schomaker Department of Artificial Intelligence University of Groningam Netherlands Multi-agent and Adaptive Computation Centrum Wiskunde and Informatica Netherlands
We describe a new framework for applying reinforcement learning (RL) algorithms to solve classification tasks by letting an agent act on the inputs and learn value functions. This paper describes how classification pr... 详细信息
来源: 评论
A reinforcement learning algorithm developed to model GenCo strategic bidding behavior in multidimensional and continuous state and action spaces
A reinforcement learning algorithm developed to model GenCo ...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (adprl)
作者: Alfred Yong Fu Lau Dipti Srinivasan Thomas Reindl National University of Singapore Singapore SG Department of Electrical Computer Engineering National University of Singapore Singapore Solar Energy Research Institute of Singapore National University of Singapore Singapore
The electricity market has provided a complex economic environment, and consequently has increased the requirement for advancement of learning methods. In the agent-based modeling and simulation framework of this econ... 详细信息
来源: 评论
Optimistic planning for continuous-action deterministic systems
Optimistic planning for continuous-action deterministic syst...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (adprl)
作者: Lucian Buşoniu Alexander Daniels Rémi Munos Robert Babuška Department of Automation Technical University of Cluj-Napoca Romania France DCSC Delft University of Technology the Netherlands Team SequeL INRIA Lille-Nord Europe France
We consider the class of online planning algorithms for optimal control, which compared to dynamic programming are relatively unaffected by large state dimensionality. We introduce a novel planning algorithm called SO... 详细信息
来源: 评论
Cognitive control in cognitive dynamic systems: A new way of thinking inspired by the brain
Cognitive control in cognitive dynamic systems: A new way of...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (adprl)
作者: Simon Haykin Ashkan Amiri Mehdi Fatemi Cognitive Systems Laboratory McMaster University Hamilton Ontario Canada
Briefly, main purpose of the paper is fourfold: a) Cognitive perception, which consists of two functional blocks: improved sparse-coding under the influence of perceptual attention for extracting relevant information ... 详细信息
来源: 评论
adaptive fault identification for a class of nonlinear dynamic systems
Adaptive fault identification for a class of nonlinear dynam...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (adprl)
作者: Li-Bing Wu Dan Ye Xin-Gang Zhao College of Information Science and Engineering Northeastern University Shenyang Liaoning P. R. China College of Sciences University of Science and Technology Liaoning Anshan Liaoning P. R. China State Key Laboratory of Robotics and Shenyang Institute of Automation CAS Shenyang Liaoning P. R. China
This paper is concerned with the diagnosis problem of actuator faults for a class of nonlinear systems. It is assumed that the upper bound of the Lipschtiz constant of the nonlinearity in the faulty system is unknown.... 详细信息
来源: 评论
reinforcement learning to train Ms. Pac-Man using higher-order action-relative inputs
Reinforcement learning to train Ms. Pac-Man using higher-ord...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (adprl)
作者: Luuk Bom Ruud Henken Marco Wiering Faculty of Mathematics and Natural Sciences University of Groningen The Netherlands
reinforcement learning algorithms enable an agent to optimize its behavior from interacting with a specific environment. Although some very successful applications of reinforcement learning algorithms have been develo... 详细信息
来源: 评论
Neuro-controller of cement rotary kiln temperature with adaptive critic designs
Neuro-controller of cement rotary kiln temperature with adap...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (adprl)
作者: Xiaofeng Lin Tangbo Liu Shaojian Song Chunning Song College of Electrical Engineering Guangxi University Nanning China College of Electrical Engineering Guangxi University China
The production process of the cement rotary kiln is a typical engineering thermodynamics with large inertia, lagging and nonlinearity. So it is very difficult to control this process accurately using traditional contr... 详细信息
来源: 评论
Delayed insertion and rule effect moderation of domain knowledge for reinforcement learning
Delayed insertion and rule effect moderation of domain knowl...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (adprl)
作者: Teck-Hou Teng Ah-Hwee Tan School of Computer Engineering Center for Computational Intelligence School of Computer Engineering Nanyang Technological University
Though not a fundamental pre-requisite to efficient machine learning, insertion of domain knowledge into adaptive virtual agent is nonetheless known to improve learning efficiency and reduce model complexity. Conventi... 详细信息
来源: 评论
Particle Swarn Optimized adaptive dynamic programming
Particle Swarn Optimized Adaptive Dynamic Programming
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (adprl)
作者: Dongbin Zhao Jianqiang Yi Derong Liu Key Laboratory of Complex Systems and Intelligence Science Institute of Automation Chinese Academy and Sciences Beijing China Department of Electrical and Computer Engineering University of Illinois Chicago Chicago IL USA
Particle swarm optimization is used for the training of the action network and critic network of the adaptive dynamic programming approach. The typical structures of the adaptive dynamic programming and particle swarm... 详细信息
来源: 评论