咨询与建议

限定检索结果

文献类型

  • 299 篇 会议
  • 8 篇 期刊文献

馆藏范围

  • 307 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 180 篇 工学
    • 158 篇 计算机科学与技术...
    • 56 篇 电气工程
    • 48 篇 软件工程
    • 47 篇 控制科学与工程
    • 13 篇 信息与通信工程
    • 10 篇 机械工程
    • 6 篇 仪器科学与技术
    • 4 篇 力学(可授工学、理...
    • 4 篇 生物工程
    • 3 篇 动力工程及工程热...
    • 2 篇 交通运输工程
    • 2 篇 核科学与技术
    • 2 篇 生物医学工程(可授...
    • 1 篇 建筑学
    • 1 篇 化学工程与技术
    • 1 篇 航空宇航科学与技...
    • 1 篇 食品科学与工程(可...
  • 40 篇 理学
    • 35 篇 数学
    • 9 篇 系统科学
    • 8 篇 统计学(可授理学、...
    • 4 篇 物理学
    • 4 篇 生物学
    • 1 篇 化学
    • 1 篇 天文学
    • 1 篇 大气科学
    • 1 篇 地球物理学
    • 1 篇 地质学
  • 18 篇 管理学
    • 17 篇 管理科学与工程(可...
    • 7 篇 工商管理
  • 4 篇 经济学
    • 4 篇 应用经济学
  • 1 篇 医学

主题

  • 115 篇 dynamic programm...
  • 76 篇 reinforcement le...
  • 67 篇 learning
  • 47 篇 optimal control
  • 30 篇 neural networks
  • 27 篇 control systems
  • 21 篇 approximate dyna...
  • 21 篇 approximation al...
  • 20 篇 function approxi...
  • 20 篇 equations
  • 17 篇 convergence
  • 16 篇 adaptive dynamic...
  • 16 篇 state-space meth...
  • 16 篇 heuristic algori...
  • 14 篇 mathematical mod...
  • 13 篇 stochastic proce...
  • 12 篇 learning (artifi...
  • 12 篇 adaptive control
  • 12 篇 cost function
  • 11 篇 algorithm design...

机构

  • 5 篇 arizona state un...
  • 4 篇 department of el...
  • 4 篇 school of inform...
  • 4 篇 department of in...
  • 4 篇 univ sci & techn...
  • 4 篇 chinese acad sci...
  • 4 篇 department of el...
  • 3 篇 princeton univ d...
  • 3 篇 northeastern uni...
  • 3 篇 national science...
  • 3 篇 robotics institu...
  • 3 篇 univ illinois de...
  • 3 篇 univ utrecht dep...
  • 2 篇 univ groningen i...
  • 2 篇 sharif univ tech...
  • 2 篇 univ texas autom...
  • 2 篇 pengcheng labora...
  • 2 篇 guangxi univ sch...
  • 2 篇 chinese acad sci...
  • 2 篇 cemagref lisc au...

作者

  • 14 篇 liu derong
  • 9 篇 wei qinglai
  • 8 篇 si jennie
  • 7 篇 xu xin
  • 5 篇 derong liu
  • 4 篇 lewis frank l.
  • 4 篇 martin riedmille...
  • 4 篇 huaguang zhang
  • 4 篇 jennie si
  • 4 篇 marco a. wiering
  • 4 篇 xin xu
  • 4 篇 zhang huaguang
  • 4 篇 dongbin zhao
  • 4 篇 lei yang
  • 4 篇 powell warren b.
  • 4 篇 riedmiller marti...
  • 3 篇 hado van hasselt
  • 3 篇 van hasselt hado
  • 3 篇 jagannathan s.
  • 3 篇 munos remi

语言

  • 305 篇 英文
  • 1 篇 其他
  • 1 篇 中文
检索条件"任意字段=IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning"
307 条 记 录,以下是171-180 订阅
排序:
Policy Iteration Algorithm for Constrained Cost Optimal Control of Discrete-Time Nonlinear System
Policy Iteration Algorithm for Constrained Cost Optimal Cont...
收藏 引用
international Joint Conference on Neural Networks (IJCNN)
作者: Li, Tao Wei, Qinglai Li, Hongyang Song, Ruizhuo Chinese Acad Sci Inst Automat State Key Lab Management & Control Complex Syst Beijing Peoples R China Univ Chinese Acad Sci Sch Artificial Intelligence Beijing Peoples R China Univ Sci & Technol Beijing Sch Automat Beijing Peoples R China
In this paper, optimal control problems with constraints on summation of auxiliary utility function are called constrained cost optimal control problems and a constrained cost policy iteration adaptive dynamic program... 详细信息
来源: 评论
Efficient learning in Cellular Simultaneous Recurrent Neural Networks - The Case of Maze Navigation Problem
Efficient Learning in Cellular Simultaneous Recurrent Neural...
收藏 引用
ieee symposium on Adaptive dynamic programming and reinforcement learning, (ADPRL)
作者: Roman Ilin Robert Kozma Paul J. Werbos Department of Mathematical Sciences University of Memphis Memphis TN USA National Science Foundation Arlington VA USA
Cellular simultaneous recurrent neural networks (SRN) show great promise in solving complex function approximation problems. In particular, approximate dynamic programming is an important application area where SRNs h... 详细信息
来源: 评论
DATE: Disturbance-Aware Traffic Engineering with reinforcement learning in Software-Defined Networks  29
DATE: Disturbance-Aware Traffic Engineering with Reinforceme...
收藏 引用
29th ieee/ACM international symposium on Quality of Service (IWQOS)
作者: Ye, Minghao Zhang, Junjie Guo, Zehua Chao, H. Jonathan NYU Dept Elect & Comp Engn New York NY 11201 USA Fortinet Inc Sunnyvale CA 94086 USA Beijing Inst Technol Beijing 100081 Peoples R China
Traffic Engineering (TE) has been applied to optimize network performance by routing/rerouting flows based on traffic loads and network topologies. To cope with network dynamics from emerging applications, it is essen... 详细信息
来源: 评论
Deep reinforcement learning based finite-horizon optimal tracking control for nonlinear system
Deep reinforcement learning based finite-horizon optimal tra...
收藏 引用
Joint Meeting of the 2nd IFAC Workshop on Linear Parameter Varying Systems (LPVS) / 9th IFAC symposium on Robust Control Design (ROCOND)
作者: Kim, Jong Woo Park, Byung Jun Yoo, Haeun Lee, Jay H. Lee, Jong Min Seoul Natl Univ Inst Chem Proc Sch Chem & Biol Engn 1 Gwanak Ro Seoul 08826 South Korea Korea Adv Inst Sci & Technol Chem & Biomol Engn Dept Daejeon 3041 South Korea
reinforcement learning (RL) can be used to obtain an approximate numerical solution to the Hamilton-Jacobi-Bellman (HJB) equation. Recent advances in machine learning community enable the use of deep neural networks (... 详细信息
来源: 评论
An Optimal ADP Algorithm for a High-Dimensional Stochastic Control Problem
An Optimal ADP Algorithm for a High-Dimensional Stochastic C...
收藏 引用
ieee symposium on Adaptive dynamic programming and reinforcement learning, (ADPRL)
作者: Juliana Nascimento Warren Powell Department of Operations Research and Financial Engineering Princeton University Engineering Princeton NJ USA
We propose a provably optimal approximate dynamic programming algorithm for a class of multistage stochastic problems, taking into account that the probability distribution of the underlying stochastic process is not ... 详细信息
来源: 评论
***: Power-Aware Traffic Engineering via Deep reinforcement learning  29
***: Power-Aware Traffic Engineering via Deep Reinforcement ...
收藏 引用
29th ieee/ACM international symposium on Quality of Service (IWQOS)
作者: Pan, Tian Peng, Xiaoyu Shi, Qianqian Bian, Zizheng Lin, Xingchen Song, Enge Li, Fuliang Xu, Yang Huang, Tao BUPT State Key Lab Networking & Switching Technol Beijing Peoples R China Sci & Technol Commun Networks Lab Shijiazhuang Hebei Peoples R China Northeastern Univ Shenyang Liaoning Peoples R China Fudan Univ Shanghai Peoples R China
Power-aware traffic engineering via coordinated sleeping is usually formulated into Integer programming problems, which are generally NP-hard with unbounded computation time for large-scale networks. This results in d... 详细信息
来源: 评论
Deep reinforcement learning for Perishable Inventory Optimization Problem
Deep Reinforcement Learning for Perishable Inventory Optimiz...
收藏 引用
2023 ieee international Conference on Industrial Engineering and Engineering Management, IEEM 2023
作者: Nomura, Yusuke Liu, Ziang Nishi, Tatsushi Graduate School of Environmental Life Natural Science and Technology Okayama University 3-1-1 Tsushima-Naka Kita-ku Okayama Okayama City Japan
While global attention on reducing food waste has increased, the demand for perishable commodities such as food and pharmaceuticals is growing. This emphasizes the need for effective perishable inventory management, w... 详细信息
来源: 评论
Model-Based reinforcement learning in Factored-State MDPs
Model-Based Reinforcement Learning in Factored-State MDPs
收藏 引用
ieee symposium on Adaptive dynamic programming and reinforcement learning, (ADPRL)
作者: Alexander L. Strehl Department of Computer Science Rutgers University Piscataway NJ USA
We consider the problem of learning in a factored-state Markov decision process that is structured to allow a compact representation. We show that the well-known algorithm, factored Rmax, performs near-optimally on al... 详细信息
来源: 评论
Adaptive critic-based neurofuzzy controller for the steam generator water level
Adaptive critic-based neurofuzzy controller for the steam ge...
收藏 引用
15th international Workshop on Room-Temperature Semiconductor X- and Gamma-Ray Detectors/ 2006 ieee Nuclear Science symposium
作者: Fakhrazari, Amin Boroushaki, Mehrdad Sharif Univ Technol Dept Mech Engn Tehran Iran
In this paper, an adaptive critic-based neurofuzzy controller is presented for water level regulation of nuclear steam generators. The problem has been of great concern for many years as the steam generator is a highl... 详细信息
来源: 评论
An approximate dynamic programming Approach for Job Releasing and Sequencing in a Reentrant Manufacturing Line
An Approximate Dynamic Programming Approach for Job Releasin...
收藏 引用
ieee symposium on Adaptive dynamic programming and reinforcement learning, (ADPRL)
作者: Jose A. Ramirez-Hernandez Emmanuel Fernandez Department of Electrical & Computer Engineering University of Cincinnati OH USA
This paper presents the application of an approximate dynamic programming (ADP) algorithm to the problem of job releasing and sequencing of a benchmark reentrant manufacturing line (RML). The ADP approach is based on ... 详细信息
来源: 评论