Search query: "Any field = 2014 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, ADPRL 2014"
247 records; showing results 151-160
Application of reinforcement learning-based algorithms in CO2 allowance and electricity markets
IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)
Authors: Vishnuteja Nanduri
Affiliations: Department of Industrial & Manufacturing Engineering, University of Wisconsin Milwaukee, Milwaukee, WI, USA
Climate change is one of the most important challenges faced by the world this century. In the U.S., the electric power industry is the largest emitter of CO2, contributing to the climate crisis. Federal emissions c...
Bias-corrected Q-learning to control max-operator bias in Q-learning
IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)
Authors: Donghun Lee, Boris Defourny, Warren B. Powell
Affiliations: Department of Computer Science, Princeton University, Princeton, NJ, USA; Operations Research and Financial Engineering, Princeton University, Princeton, NJ, USA
We identify a class of stochastic control problems with highly random rewards and high discount factor which induce high levels of statistical error in the estimated action-value function. This produces significant le...
A novel approach for constructing basis functions in approximate dynamic programming for feedback control
IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)
Authors: Jian Wang, Zhenhua Huang, Xin Xu
Affiliations: College of Mechatronics and Automation, National University of Defense Tech, Changsha, P. R. China
This paper presents a novel approach for constructing basis functions in approximate dynamic programming (ADP) through the locally linear embedding (LLE) process. It considers the experience (sample) data as a high-di...
Online adaptive learning of optimal control solutions using integral reinforcement learning
IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)
Authors: Kyriakos G. Vamvoudakis, Draguna Vrabie, Frank L. Lewis
Affiliations: Automation and Robotics Research Institute, University of Texas Arlington, Fort Worth, TX, USA
In this paper we introduce an online algorithm that uses integral reinforcement knowledge for learning the continuous-time optimal control solution for nonlinear systems with infinite horizon costs and partial knowled...
Scalarized multi-objective reinforcement learning: Novel design techniques
IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)
Authors: Kristof Van Moffaert, Madalina M. Drugan, Ann Nowé
Affiliations: Department of Computer Science, Vrije Universiteit Brussel, Brussels, Belgium
In multi-objective problems, it is key to find compromising solutions that balance different objectives. The linear scalarization function is often utilized to translate the multi-objective nature of a problem into a ...
A combined hierarchical reinforcement learning based approach for multi-robot cooperative target searching in complex unknown environments
IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)
Authors: Yifan Cai, Simon X. Yang, Xin Xu
Affiliations: The School of Engineering, University of Guelph, Guelph, Ontario, Canada; The College of Mechatronics and Automation, National University of Defense Technology, Changsha, Hunan Province, China
Effective cooperation of multi-robots in unknown environments is essential in many robotic applications, such as environment exploration and target searching. In this paper, a combined hierarchical reinforcement learn...
Adaptive optimal control for nonlinear discrete-time systems
IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)
Authors: Chunbin Qin, Huaguang Zhang, Yanhong Luo
Affiliations: School of Information Science and Engineering, Northeastern University, Shenyang, China; Basic Experiment Teaching Center, Henan University, Kaifeng, China
This paper proposes an on-line near-optimal control scheme based on capabilities of neural networks (NNs), in function approximation, to attain the on-line solution of optimal control problem for nonlinear discrete-ti...
Impact of signal transmission delays on power system damping control using heuristic dynamic programming
2014 IEEE Symposium Series on Computational Intelligence, IEEE SSCI 2014 - 2014 IEEE Symposium on Computational Intelligence Applications in Smart Grid, CIASG 2014
Authors: Tang, Yufei; Zhong, Xiangnan; Ni, Zhen; Yan, Jun; He, Haibo
Affiliations: Department of Electrical, Computer and Biomedical Engineering, University of Rhode Island, Kingston, RI 02881, United States
In this paper, the impact of signal transmission delays on static VAR compensator (SVC) based power system damping control using reinforcement learning is investigated. The SVC is used to damp low-frequency oscillatio...
Decentralized Stabilization for a Class of Continuous-Time Nonlinear Interconnected Systems Using Online Learning Optimal Control Approach
IEEE Transactions on Neural Networks and Learning Systems, 2014, vol. 25, no. 2, pp. 418-428
Authors: Liu, Derong; Wang, Ding; Li, Hongliang
Affiliations: Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
In this paper, using a neural-network-based online learning optimal control approach, a novel decentralized control strategy is developed to stabilize a class of continuous-time nonlinear interconnected large-scale sy...
Approximate reinforcement learning: An overview
IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)
Authors: Lucian Buşoniu, Damien Ernst, Bart De Schutter, Robert Babuška
Affiliations: Delft Center of Systems & Control, Delft University of Technology, Netherlands; FRS-FNRS, Systems and Modeling Unit, University of Liège, Belgium
Reinforcement learning (RL) allows agents to learn how to optimally interact with complex environments. Fueled by recent advances in approximation-based algorithms, RL has obtained impressive successes in robotics, ar...