Search query: any field = "2014 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, ADPRL 2014"
247 records; showing 21-30

Tunable and Generic Problem Instance Generation for Multi-objective Reinforcement Learning
IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)
Authors: Garrett, Deon; Bieger, Jordi; Throisson, Kristinn R.
Affiliations: Reykjavik Univ Iceland Inst Intelligent Machines Reykjavik Iceland; Reykjavik Univ Reykjavik Iceland
A significant problem facing researchers in reinforcement learning, and particularly in multi-objective learning, is the dearth of good benchmarks. In this paper, we present a method and software tool enabling the cre...

Using Approximate Dynamic Programming for Estimating the Revenues of a Hydrogen-based High-Capacity Storage Device
IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)
Authors: Francois-Lavet, Vincent; Fonteneau, Raphael; Ernst, Damien
Affiliations: Univ Liege Dept Elect Engn & Comp Sci B-4000 Liege Belgium
This paper proposes a methodology to estimate the maximum revenue that can be generated by a company that operates a high-capacity storage device to buy or sell electricity on the day-ahead electricity market. The met...

Cognitive Control in Cognitive Dynamic Systems: A New Way of Thinking Inspired by The Brain
IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)
Authors: Haykin, Simon; Amiri, Ashkan; Fatemi, Mehdi
Affiliations: McMaster Univ Cognit Syst Lab Hamilton ON L8S 4K1 Canada
Briefly, the main purpose of the paper is fourfold: a) Cognitive perception, which consists of two functional blocks: improved sparse-coding under the influence of perceptual attention for extracting relevant information ...

Heuristics for Multiagent Reinforcement Learning in Decentralized Decision Problems
IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)
Authors: Allen, Martin W.; Hahn, David; MacFarland, Douglas C.
Affiliations: Univ Wisconsin Dept Comp Sci La Crosse WI 54601 USA
Decentralized partially observable Markov decision processes (Dec-POMDPs) model cooperative multiagent scenarios, providing a powerful general framework for team-based artificial intelligence. While optimal algorithms...

Using supervised training signals of observable state dynamics to speed-up and improve reinforcement learning
IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)
Authors: Elliott, Daniel L.; Anderson, Charles
Affiliations: Colorado State Univ Dept Comp Sci Ft Collins CO 80523 USA
A common complaint about reinforcement learning (RL) is that it is too slow to learn a value function which gives good performance. This issue is exacerbated in continuous state spaces. This paper presents a straight-...

A Comparison of Approximate Dynamic Programming Techniques on Benchmark Energy Storage Problems: Does Anything Work?
IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)
Authors: Jiang, Daniel R.; Pham, Thuy V.; Powell, Warren B.; Salas, Daniel F.; Scott, Warren R.
As more renewable, yet volatile, forms of energy like solar and wind are being incorporated into the grid, the problem of finding optimal control policies for energy storage is becoming increasingly important. These s...

Event-based Optimal Regulator Design for Nonlinear Networked Control Systems
IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)
Authors: Sahoo, Avimanyu; Xu, Hao; Jagannathan, S.
Affiliations: Missouri Univ Sc & Tech Dept Elect & Comp Engn Rolla MO 65409 USA; Texas A&M Univ Coll Sci & Engn Dept Elect Engn Corpus Christi TX USA
This paper presents a novel stochastic event-based near optimal control strategy to regulate a networked control system (NCS) represented as an uncertain nonlinear continuous time system. An online stochastic actor-cr...

Convergence of Value Iterations for Total-Cost MDPs and POMDPs with General State and Action Sets
IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)
Authors: Feinberg, Eugene A.; Kasyanov, Pavlo O.; Zgurovsky, Michael Z.
Affiliations: SUNY Stony Brook Dept Appl Math & Stat Stony Brook NY 11794 USA; Natl Tech Univ Ukraine Kyiv Polytech Inst Inst Appl Syst Anal UA-03056 Kiev Ukraine; Natl Tech Univ Ukraine Kyiv Polytech Inst UA-03056 Kiev Ukraine
This paper describes conditions for convergence to optimal values of the dynamic programming algorithm applied to total-cost Markov Decision Processes (MDPs) with Borel state and action sets and with possibly unbound...

Reinforcement Learning-based Optimal Control Considering L Computation Time Delay of Linear Discrete-time Systems
IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)
Authors: Fujita, Taishi; Ushio, Toshimitsu
In embedded control systems, the control input is computed based on sensing data of a plant in a processor and there is a delay, called the computation time delay, due to the computation and the data transmission. Whe...

Beyond Exponential Utility Functions: A Variance-Adjusted Approach for Risk-Averse Reinforcement Learning
IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)
Authors: Gosavi, Abhijit A.; Das, Sajal K.; Murray, Susan L.
Affiliations: Missouri Univ Sci & Technol Dept Engn Management & Syst Engn Rolla MO 65409 USA; Missouri Univ Sci & Technol Dept Comp Sci Rolla MO 65409 USA
Utility theory has served as a bedrock for modeling risk in economics. Where risk is involved in decision-making, for solving Markov decision processes (MDPs) via utility theory, the exponential utility (EU) function ...