咨询与建议

限定检索结果

文献类型

  • 229 篇 会议
  • 18 篇 期刊文献

馆藏范围

  • 247 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 113 篇 工学
    • 103 篇 计算机科学与技术...
    • 42 篇 软件工程
    • 38 篇 电气工程
    • 23 篇 控制科学与工程
    • 5 篇 信息与通信工程
    • 3 篇 机械工程
    • 2 篇 力学(可授工学、理...
    • 1 篇 仪器科学与技术
    • 1 篇 建筑学
    • 1 篇 化学工程与技术
    • 1 篇 交通运输工程
  • 27 篇 理学
    • 25 篇 数学
    • 7 篇 系统科学
    • 6 篇 统计学(可授理学、...
    • 1 篇 物理学
    • 1 篇 化学
    • 1 篇 大气科学
  • 10 篇 管理学
    • 8 篇 管理科学与工程(可...
    • 3 篇 工商管理
    • 2 篇 图书情报与档案管...
  • 2 篇 经济学
    • 2 篇 应用经济学
  • 1 篇 法学
    • 1 篇 社会学

主题

  • 95 篇 dynamic programm...
  • 54 篇 optimal control
  • 51 篇 learning
  • 44 篇 reinforcement le...
  • 35 篇 learning (artifi...
  • 27 篇 equations
  • 25 篇 neural networks
  • 22 篇 heuristic algori...
  • 20 篇 convergence
  • 20 篇 control systems
  • 18 篇 function approxi...
  • 18 篇 mathematical mod...
  • 16 篇 approximation al...
  • 15 篇 vectors
  • 15 篇 cost function
  • 14 篇 markov processes
  • 14 篇 nonlinear system...
  • 14 篇 artificial neura...
  • 13 篇 stochastic proce...
  • 12 篇 adaptive dynamic...

机构

  • 10 篇 chinese acad sci...
  • 5 篇 school of inform...
  • 4 篇 northeastern uni...
  • 4 篇 department of el...
  • 4 篇 department of in...
  • 3 篇 department of el...
  • 3 篇 automation and r...
  • 3 篇 department of el...
  • 3 篇 robotics institu...
  • 3 篇 key laboratory o...
  • 3 篇 natl univ def te...
  • 3 篇 univ illinois de...
  • 2 篇 department of ar...
  • 2 篇 school of electr...
  • 2 篇 univ groningen i...
  • 2 篇 univ texas autom...
  • 2 篇 colorado state u...
  • 2 篇 guangxi univ sch...
  • 2 篇 national science...
  • 2 篇 informatics inst...

作者

  • 13 篇 liu derong
  • 7 篇 hado van hasselt
  • 7 篇 marco a. wiering
  • 7 篇 dongbin zhao
  • 6 篇 zhao dongbin
  • 5 篇 xu xin
  • 5 篇 lewis frank l.
  • 5 篇 huaguang zhang
  • 5 篇 wei qinglai
  • 5 篇 derong liu
  • 5 篇 warren b. powell
  • 4 篇 haibo he
  • 4 篇 jagannathan s.
  • 4 篇 frank l. lewis
  • 4 篇 zhang huaguang
  • 4 篇 ni zhen
  • 4 篇 yanhong luo
  • 4 篇 wang ding
  • 4 篇 he haibo
  • 4 篇 damien ernst

语言

  • 246 篇 英文
  • 1 篇 其他
检索条件"任意字段=2014 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, ADPRL 2014"
247 条 记 录,以下是71-80 订阅
排序:
Approximate reinforcement learning: An overview
Approximate reinforcement learning: An overview
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning
作者: Buşoniu, Lucian Ernst, Damien De Schutter, Bart Babuška, Robert Delft Center for Systems and Control Delft Univ. of Technology Netherlands Research Associate of the FRS-FNRS Systems and Modeling Unit University of Liège Liège Belgium
reinforcement learning (RL) allows agents to learn how to optimally interact with complex environments. Fueled by recent advances in approximation-based algorithms, RL has obtained impressive successes in robotics, ar... 详细信息
来源: 评论
Grounding subgoals in information transitions
Grounding subgoals in information transitions
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning
作者: Van Dijk, Sander G. Polani, Daniel Adaptive Systems Research Group University of Hertfordshire Hatfield United Kingdom
In reinforcement learning problems, the construction of subgoals has been identified as an important step to speed up learning and to enable skill transfer. For this purpose, one typically extracts states from various... 详细信息
来源: 评论
Discrete-time adaptive dynamic programming using wavelet basis function neural networks
Discrete-time adaptive dynamic programming using wavelet bas...
收藏 引用
ieee International symposium on Approximate dynamic programming and reinforcement learning
作者: Jin, Ning Liu, Derong Huang, Ting Pang, Zhongyu Univ Illinois Dept Elect & Comp Engn Chicago IL 60607 USA
dynamic programming for discrete time systems is difficult due to the "curse of dimensionality": one has to find a series of control actions that must be taken in sequence, hoping that this sequence will lea... 详细信息
来源: 评论
Annealing-Pareto Multi-Objective Multi-Armed Bandit Algorithm
Annealing-Pareto Multi-Objective Multi-Armed Bandit Algorith...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning (adprl)
作者: Yahyaa, Saba Q. Drugan, Madalina M. Manderick, Bernard Vrije Univ Brussel Dept Comp Sci Pl Laan 2 B-1050 Brussels Belgium
In the stochastic multi-objective multi-armed bandit (or MOMAB), arms generate a vector of stochastic rewards, one per objective, instead of a single scalar reward. As a result, there is not only one optimal arm, but ... 详细信息
来源: 评论
Neuro-controller of Cement Rotary Kiln Temperature with adaptive Critic Designs
Neuro-controller of Cement Rotary Kiln Temperature with Adap...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning
作者: Lin, Xiaofeng Liu, Tangbo Song, Shaojian Song, Chunning Guangxi Univ Coll Elect Engn Nanning 530004 Peoples R China
The production process of the cement rotary kiln is a typical engineering thermodynamics with large inertia, lagging and nonlinearity. So it is very difficult to control this process accurately using traditional contr... 详细信息
来源: 评论
Using ADP to understand and replicate brain intelligence: the next level design
Using ADP to understand and replicate brain intelligence: th...
收藏 引用
ieee International symposium on Approximate dynamic programming and reinforcement learning
作者: Werbos, Paul J. Natl Sci Fdn Arlington VA 22203 USA
Since the 1960's I proposed that we could understand and replicate the highest level of intelligence seen in the brain, by building ever more capable and general systems for adaptive dynamic programming (ADP) - li... 详细信息
来源: 评论
Protecting against evaluation overfitting in empirical reinforcement learning
Protecting against evaluation overfitting in empirical reinf...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning
作者: Whiteson, Shimon Tanner, Brian Taylor, Matthew E. Stone, Peter Informatics Institute University of Amsterdam Netherlands Department of Computing Science University of Alberta Canada Department of Computer Science Lafayette College United States Department of Computer Science University of Texas Austin United States
Empirical evaluations play an important role in machine learning. However, the usefulness of any evaluation depends on the empirical methodology employed. Designing good empirical methodologies is difficult in part be... 详细信息
来源: 评论
reinforcement learning algorithms for solving classification problems
Reinforcement learning algorithms for solving classification...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning
作者: Wiering, Marco A. Van Hasselt, Hado Pietersma, Auke-Dirk Schomaker, Lambert Dept. of Artificial Intelligence University of Groningen Netherlands Multi-agent and Adaptive Computation Centrum Wiskunde en Informatica Netherlands
We describe a new framework for applying reinforcement learning (RL) algorithms to solve classification tasks by letting an agent act on the inputs and learn value functions. This paper describes how classification pr... 详细信息
来源: 评论
Using reward-weighted regression for reinforcement learning of task space control
Using reward-weighted regression for reinforcement learning ...
收藏 引用
ieee International symposium on Approximate dynamic programming and reinforcement learning
作者: Peters, Jan Schaal, Stefan Univ So Calif Los Angeles CA 90089 USA
Many robot control problems of practical importance, including task or operational space control, can be reformulated as immediate reward reinforcement learning problems. However, few of the known optimization or rein... 详细信息
来源: 评论
dynamic lead time promising
Dynamic lead time promising
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning
作者: Reindorp, Matthew J. Fu, Michael C. Department of Industrial Engineering and Innovation Sciences Eindhoven University of Technology Netherlands Robert H. Smith School of Business Institute for Systems Research University of Maryland United States
We consider a make-to-order business that serves customers in multiple priority classes. Orders from customers in higher classes bring greater revenue, but they expect shorter lead times than customers in lower classe... 详细信息
来源: 评论