Search query: "Any field = 2014 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, ADPRL 2014"
247 records; showing 101-110
An integrated design for intensified direct heuristic dynamic programming
IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)
Authors: Xiong Luo, Jennie Si, Yuchao Zhou
Affiliations: School of Computer and Communication Engineering, University of Science and Technology Beijing (USTB), Beijing, China; Arizona State University, Tempe, AZ, US
There has been a growing interest in the study of adaptive/approximate dynamic programming (ADP) in recent years. The ADP technique provides a powerful tool to understand and improve the principled technologies of mac...
Adaptive optimal control for nonlinear discrete-time systems
IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)
Authors: Chunbin Qin, Huaguang Zhang, Yanhong Luo
Affiliations: School of Information Science and Engineering, Northeastern University, Shenyang, China; Basic Experiment Teaching Center, Henan University, Kaifeng, China
This paper proposes an on-line near-optimal control scheme that uses the function-approximation capabilities of neural networks (NNs) to obtain the on-line solution of the optimal control problem for nonlinear discrete-ti...
Real-time tracking on adaptive critic design with uniformly ultimately bounded condition
IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)
Authors: Zhen Ni, Xiao Fang, Haibo He, Dongbin Zhao, Xin Xu
Affiliations: Department of Electrical, University of Rhode Island, Kingston, RI, USA; Institute of Automation, Chinese Academy of Sciences, Beijing, China; Institute of Automation, National University of Defense Technology, Changsha, China
In this paper, we propose a new nonlinear tracking controller based on heuristic dynamic programming (HDP) with a tracking filter. Specifically, we integrate a goal network into the regular HDP design and provide t...
Scalarized multi-objective reinforcement learning: Novel design techniques
IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)
Authors: Kristof Van Moffaert, Madalina M. Drugan, Ann Nowé
Affiliations: Department of Computer Science, Vrije Universiteit Brussel, Brussels, Belgium
In multi-objective problems, it is key to find compromising solutions that balance different objectives. The linear scalarization function is often utilized to translate the multi-objective nature of a problem into a ...
Optimistic planning for continuous-action deterministic systems
IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)
Authors: Lucian Buşoniu, Alexander Daniels, Rémi Munos, Robert Babuška
Affiliations: Department of Automation, Technical University of Cluj-Napoca, Romania; France; DCSC, Delft University of Technology, the Netherlands; Team SequeL, INRIA Lille-Nord Europe, France
We consider the class of online planning algorithms for optimal control, which, compared to dynamic programming, are relatively unaffected by large state dimensionality. We introduce a novel planning algorithm called SO...
A combined hierarchical reinforcement learning based approach for multi-robot cooperative target searching in complex unknown environments
IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)
Authors: Yifan Cai, Simon X. Yang, Xin Xu
Affiliations: The School of Engineering, University of Guelph, Guelph, Ontario, Canada; The College of Mechatronics and Automation, National University of Defense Technology, Changsha, Hunan Province, China
Effective cooperation of multi-robots in unknown environments is essential in many robotic applications, such as environment exploration and target searching. In this paper, a combined hierarchical reinforcement learn...
Reinforcement learning in the game of Othello: Learning against a fixed opponent and learning from self-play
IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)
Authors: Michiel van der Ree, Marco Wiering
Affiliations: Faculty of Mathematics and Natural Sciences, University of Groningen, Institute of Artificial Intelligence and Cognitive Engineering, The Netherlands
This paper compares three strategies in using reinforcement learning algorithms to let an artificial agent learn to play the game of Othello. The three strategies that are compared are: learning by self-play, learning...
Delayed insertion and rule effect moderation of domain knowledge for reinforcement learning
IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)
Authors: Teck-Hou Teng, Ah-Hwee Tan
Affiliations: School of Computer Engineering; Center for Computational Intelligence, School of Computer Engineering, Nanyang Technological University
Though not a fundamental pre-requisite to efficient machine learning, insertion of domain knowledge into an adaptive virtual agent is nonetheless known to improve learning efficiency and reduce model complexity. Conventi...
A reinforcement learning algorithm developed to model GenCo strategic bidding behavior in multidimensional and continuous state and action spaces
IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)
Authors: Alfred Yong Fu Lau, Dipti Srinivasan, Thomas Reindl
Affiliations: National University of Singapore, Singapore, SG; Department of Electrical and Computer Engineering, National University of Singapore, Singapore; Solar Energy Research Institute of Singapore, National University of Singapore, Singapore
The electricity market has provided a complex economic environment, and consequently has increased the requirement for advancement of learning methods. In the agent-based modeling and simulation framework of this econ...
Reinforcement learning to train Ms. Pac-Man using higher-order action-relative inputs
IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)
Authors: Luuk Bom, Ruud Henken, Marco Wiering
Affiliations: Faculty of Mathematics and Natural Sciences, University of Groningen, The Netherlands
Reinforcement learning algorithms enable an agent to optimize its behavior from interacting with a specific environment. Although some very successful applications of reinforcement learning algorithms have been develo...