咨询与建议

限定检索结果

文献类型

  • 16 篇 会议

馆藏范围

  • 16 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 15 篇 工学
    • 14 篇 电气工程
    • 14 篇 计算机科学与技术...
    • 1 篇 控制科学与工程
  • 1 篇 理学
    • 1 篇 数学
  • 1 篇 管理学
    • 1 篇 图书情报与档案管...

主题

  • 4 篇 learning
  • 4 篇 learning (artifi...
  • 3 篇 neural network
  • 2 篇 reinforcement le...
  • 2 篇 adaptive dynamic...
  • 2 篇 dynamic programm...
  • 1 篇 sarsa
  • 1 篇 swarm robotics
  • 1 篇 eligibility trac...
  • 1 篇 clustering
  • 1 篇 adaptive dynamic...
  • 1 篇 flaws
  • 1 篇 evolutionary rob...
  • 1 篇 networked contro...
  • 1 篇 game animals
  • 1 篇 behavior analysi...
  • 1 篇 second order td ...
  • 1 篇 electricity mark...
  • 1 篇 strategic biddin...
  • 1 篇 planning

机构

  • 2 篇 univ groningen i...
  • 1 篇 nanyang technol ...
  • 1 篇 northeastern uni...
  • 1 篇 guangxi univ sch...
  • 1 篇 xi'an air force ...
  • 1 篇 univ skovde inte...
  • 1 篇 hiroshima univ g...
  • 1 篇 natl univ singap...
  • 1 篇 cnrs cran umr 70...
  • 1 篇 arizona state un...
  • 1 篇 univ sci & techn...
  • 1 篇 vrije univ bruss...
  • 1 篇 natl univ singap...
  • 1 篇 shinshu univ fac...
  • 1 篇 delft univ techn...
  • 1 篇 univ guelph sch ...
  • 1 篇 univ lorraine cr...
  • 1 篇 nanyang technol ...
  • 1 篇 missouri univ sc...
  • 1 篇 tsinghua univ de...

作者

  • 2 篇 xu xin
  • 2 篇 wiering marco
  • 1 篇 fu qiming
  • 1 篇 huang zhenhua
  • 1 篇 bom luuk
  • 1 篇 xu hao
  • 1 篇 wang jian
  • 1 篇 zhou yuchao
  • 1 篇 luo xiong
  • 1 篇 lau alfred yong ...
  • 1 篇 jagannathan s.
  • 1 篇 munos remi
  • 1 篇 nowe ann
  • 1 篇 qin chunbin
  • 1 篇 babuska robert
  • 1 篇 zhang huaguang
  • 1 篇 si jennie
  • 1 篇 cao nuyun
  • 1 篇 yasuda toshiyuki
  • 1 篇 chen guixin

语言

  • 16 篇 英文
检索条件"任意字段=2013 4th IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, ADPRL 2013"
16 条 记 录,以下是1-10 订阅
排序:
Exploring the Relationship of Reward and Punishment in reinforcement learning Evolving Action Meta-learning Functions in Goal Navigation
Exploring the Relationship of Reward and Punishment in Reinf...
收藏 引用
4th ieee International symposium on adaptive dynamic programming and reinforcement learning (adprl)
作者: Lowe, Robert Ziemke, Tom Univ Skovde Interact Lab Skovde Sweden
We present a reinforcement learning algorithm based on Dyna-Sarsa that utilizes separate representations of reward and punishment when guiding state-action value learning and action selection. the adoption of policy m... 详细信息
来源: 评论
adaptive Optimal Control for Nonlinear Discrete-Time Systems
Adaptive Optimal Control for Nonlinear Discrete-Time Systems
收藏 引用
4th ieee International symposium on adaptive dynamic programming and reinforcement learning (adprl)
作者: Qin, Chunbin Zhang, Huaguang Luo, Yanhong Northeastern Univ Sch Informat Sci & Engn Shenyang 110004 Peoples R China
this paper proposes an on-line near-optimal control scheme based on capabilities of neural networks (NNs), in function approximation, to attain the on-line solution of optimal control problem for nonlinear discrete-ti... 详细信息
来源: 评论
Optimal control for a class of nonlinear systems with state delay based on adaptive dynamic programming with ε-error bound
Optimal control for a class of nonlinear systems with state ...
收藏 引用
4th ieee International symposium on adaptive dynamic programming and reinforcement learning (adprl)
作者: Lin, Xiaofeng Cao, Nuyun Lin, Yuzhang Guangxi Univ Sch Elect Engn Nanning 530004 Peoples R China Tsinghua Univ Dept Elect Engn Beijing Peoples R China
In this paper, a finite-horizon epsilon-optimal control for a class of nonlinear systems with state delay is proposed by adaptive dynamic programming (ADP) algorithm. First of all, the performance index function is de... 详细信息
来源: 评论
Finite Horizon Stochastic Optimal Control of Uncertain Linear Networked Control System
Finite Horizon Stochastic Optimal Control of Uncertain Linea...
收藏 引用
4th ieee International symposium on adaptive dynamic programming and reinforcement learning (adprl)
作者: Xu, Hao Jagannathan, S. Missouri Univ Sci & Technol Dept Elect & Comp Engn Rolla MO 65409 USA
In this paper, finite horizon stochastic optimal control issue has been studied for linear networked control system (LNCS) in the presence of network imperfections such as network-induced delays and packet losses by u... 详细信息
来源: 评论
A reinforcement learning algorithm developed to model GenCo strategic bidding behavior in multidimensional and continuous state and action spaces
A reinforcement learning algorithm developed to model GenCo ...
收藏 引用
4th ieee International symposium on adaptive dynamic programming and reinforcement learning (adprl)
作者: Lau, Alfred Yong Fu Srinivasan, Dipti Reindl, thomas Natl Univ Singapore Dept Elect Comp Engn 4 Engn Dr 3 Singapore 117576 Singapore Natl Univ Singapore Solar Energy Res Inst Singapore 117574 Singapore
the electricity market have provided a complex economic environment, and consequently have increased the requirement for advancement of learning methods. In the agent-based modeling and simulation framework of this ec... 详细信息
来源: 评论
Scalarized Multi-Objective reinforcement learning: Novel Design Techniques
Scalarized Multi-Objective Reinforcement Learning: Novel Des...
收藏 引用
4th ieee International symposium on adaptive dynamic programming and reinforcement learning (adprl)
作者: Van Moffaert, Kristof Drugan, Madalina M. Nowe, Ann Vrije Univ Brussel Dept Comp Sci B-1050 Brussels Belgium
In multi-objective problems, it is key to find compromising solutions that balance different objectives. the linear scalarization function is often utilized to translate the multi-objective nature of a problem into a ... 详细信息
来源: 评论
A Combined Hierarchical reinforcement learning Based Approach For Multi-robot Cooperative Target Searching in Complex Unknown Environments
A Combined Hierarchical Reinforcement Learning Based Approac...
收藏 引用
4th ieee International symposium on adaptive dynamic programming and reinforcement learning (adprl)
作者: Cai, Yifan Yang, Simon X. Xu, Xin Univ Guelph Sch Engn Guelph ON N1G 2W1 Canada Natl Univ Def Technol Coll Mechatron & Automat Changsha 410073 Hunan Peoples R China
Effective cooperation of multi-robots in unknown environments is essential in many robotic applications, such as environment exploration and target searching. In this paper, a combined hierarchical reinforcement learn... 详细信息
来源: 评论
Analyzing Collective Behavior in Evolutionary Swarm Robotic Systems Based on an Ethological Approach
Analyzing Collective Behavior in Evolutionary Swarm Robotic ...
收藏 引用
4th ieee International symposium on adaptive dynamic programming and reinforcement learning (adprl)
作者: Yasuda, Toshiyuki Wada, Nanami Ohkura, Kazuhiro Matsumura, Yoshiyuki Hiroshima Univ Grad Sch Engn 1-4-1 Kagamiyama Higashihiroshima 7398527 Japan Shinshu Univ Fac Text Sci & Technol Ueda Nagano 3868567 Japan
Swarm robotic systems are a type of multi-robot systems which generally consist of many homogeneous autonomous robots without any type of global controllers. Swarm robotics aims at designing desired collective behavio... 详细信息
来源: 评论
the Second Order Temporal Difference Error for Sarsa(λ)
The Second Order Temporal Difference Error for Sarsa(λ)
收藏 引用
4th ieee International symposium on adaptive dynamic programming and reinforcement learning (adprl)
作者: Fu, Qiming Liu, Quan Xiao, Fei Chen, Guixin Soochow Univ Dept Comp Sci & Technol Suzhou Peoples R China
Traditional reinforcement learning algorithms, such as Q-learning, Q(lambda), Sarsa, and Sarsa(lambda), update the action value function using temporal difference (TD) error, which is computed by the last action value... 详细信息
来源: 评论
Optimistic Planning for Continuous-Action Deterministic Systems
Optimistic Planning for Continuous-Action Deterministic Syst...
收藏 引用
4th ieee International symposium on adaptive dynamic programming and reinforcement learning (adprl)
作者: Busoniu, Lucian Daniels, Alexander Munos, Remi Babuska, Robert Univ Lorraine CRAN UMR 7039 Nancy France CNRS CRAN UMR 7039 Nancy France Delft Univ Technol DCSC Delft Netherlands INRIA Lille Nord Europe Team SequeL Lille France
We consider the class of online planning algorithms for optimal control, which compared to dynamic programming are relatively unaffected by large state dimensionality. We introduce a novel planning algorithm called SO... 详细信息
来源: 评论