咨询与建议

限定检索结果

文献类型

  • 299 篇 会议
  • 8 篇 期刊文献

馆藏范围

  • 307 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 180 篇 工学
    • 158 篇 计算机科学与技术...
    • 56 篇 电气工程
    • 48 篇 软件工程
    • 47 篇 控制科学与工程
    • 13 篇 信息与通信工程
    • 10 篇 机械工程
    • 6 篇 仪器科学与技术
    • 4 篇 力学(可授工学、理...
    • 4 篇 生物工程
    • 3 篇 动力工程及工程热...
    • 2 篇 交通运输工程
    • 2 篇 核科学与技术
    • 2 篇 生物医学工程(可授...
    • 1 篇 建筑学
    • 1 篇 化学工程与技术
    • 1 篇 航空宇航科学与技...
    • 1 篇 食品科学与工程(可...
  • 40 篇 理学
    • 35 篇 数学
    • 9 篇 系统科学
    • 8 篇 统计学(可授理学、...
    • 4 篇 物理学
    • 4 篇 生物学
    • 1 篇 化学
    • 1 篇 天文学
    • 1 篇 大气科学
    • 1 篇 地球物理学
    • 1 篇 地质学
  • 18 篇 管理学
    • 17 篇 管理科学与工程(可...
    • 7 篇 工商管理
  • 4 篇 经济学
    • 4 篇 应用经济学
  • 1 篇 医学

主题

  • 115 篇 dynamic programm...
  • 76 篇 reinforcement le...
  • 67 篇 learning
  • 47 篇 optimal control
  • 30 篇 neural networks
  • 27 篇 control systems
  • 21 篇 approximate dyna...
  • 21 篇 approximation al...
  • 20 篇 function approxi...
  • 20 篇 equations
  • 17 篇 convergence
  • 16 篇 adaptive dynamic...
  • 16 篇 state-space meth...
  • 16 篇 heuristic algori...
  • 14 篇 mathematical mod...
  • 13 篇 stochastic proce...
  • 12 篇 learning (artifi...
  • 12 篇 adaptive control
  • 12 篇 cost function
  • 11 篇 algorithm design...

机构

  • 5 篇 arizona state un...
  • 4 篇 department of el...
  • 4 篇 school of inform...
  • 4 篇 department of in...
  • 4 篇 univ sci & techn...
  • 4 篇 chinese acad sci...
  • 4 篇 department of el...
  • 3 篇 princeton univ d...
  • 3 篇 northeastern uni...
  • 3 篇 national science...
  • 3 篇 robotics institu...
  • 3 篇 univ illinois de...
  • 3 篇 univ utrecht dep...
  • 2 篇 univ groningen i...
  • 2 篇 sharif univ tech...
  • 2 篇 univ texas autom...
  • 2 篇 pengcheng labora...
  • 2 篇 guangxi univ sch...
  • 2 篇 chinese acad sci...
  • 2 篇 cemagref lisc au...

作者

  • 14 篇 liu derong
  • 9 篇 wei qinglai
  • 8 篇 si jennie
  • 7 篇 xu xin
  • 5 篇 derong liu
  • 4 篇 lewis frank l.
  • 4 篇 martin riedmille...
  • 4 篇 huaguang zhang
  • 4 篇 jennie si
  • 4 篇 marco a. wiering
  • 4 篇 xin xu
  • 4 篇 zhang huaguang
  • 4 篇 dongbin zhao
  • 4 篇 lei yang
  • 4 篇 powell warren b.
  • 4 篇 riedmiller marti...
  • 3 篇 hado van hasselt
  • 3 篇 van hasselt hado
  • 3 篇 jagannathan s.
  • 3 篇 munos remi

语言

  • 305 篇 英文
  • 1 篇 其他
  • 1 篇 中文
检索条件"任意字段=IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning"
307 条 记 录,以下是141-150 订阅
排序:
Mobile-Aware Online Task Offloading Based on Deep reinforcement learning in Mobile Edge Computing Networks  34
Mobile-Aware Online Task Offloading Based on Deep Reinforcem...
收藏 引用
ieee 34th Annual international symposium on Personal, Indoor and Mobile Radio Communications (PIMRC)
作者: Li, Yuting Liu, Yitong Liu, Xingcheng Tu, Qiang Xie, Yi Sun Yat Sen Univ Sch Elect & Informat Technol Guangzhou Peoples R China Sun Yat Sen Univ Sch Comp Sci & Engn Guangzhou Peoples R China Jiangsu Viscore Technol Co Ltd Suzhou Peoples R China
Mobile Edge Computing (MEC) is one of the key enabling technologies for future 6G wireless networks that can provide lower latency service and more efficient resource utilization for future intelligent applications an... 详细信息
来源: 评论
Development of a real-time learning scheduler using reinforcement learning concepts
Development of a real-time learning scheduler using reinforc...
收藏 引用
Proceedings of the 1994 ieee international symposium on Intelligent Control
作者: Rabelo, Luis C. Jones, Albert Yih, Yuehwern Ohio Univ Athens United States
A scheme for the scheduling of Flexible Manufacturing Systems (FMS) has been developed which divides the scheduling function (built upon a generic controller architecture) into four different steps: candidate rule sel... 详细信息
来源: 评论
Adaptive critic-based neurofuzzy controller for the steam generator water level
收藏 引用
ieee TRANSACTIONS ON NUCLEAR SCIENCE 2008年 第3期55卷 1678-1685页
作者: Fakhrazari, Amin Boroushaki, Mehrdad Sharif Univ Technol Dept Mech Engn Tehran Iran
In this paper, an adaptive critic-based neurofuzzy controller is presented for water level regulation of nuclear steam generators. The problem has been of great concern for many years as the steam generator is a highl... 详细信息
来源: 评论
Bridging Hamilton-Jacobi Safety Analysis and reinforcement learning
Bridging Hamilton-Jacobi Safety Analysis and Reinforcement L...
收藏 引用
ieee international Conference on Robotics and Automation (ICRA)
作者: Fisac, Jaime E. Lugovoy, Neil E. Rubies-Royo, Vicenc Ghosh, Shromona Tomlin, Claire J. Univ Calif Berkeley Dept Elect Engn & Comp Sci Berkeley CA 94720 USA
Safety analysis is a necessary component in the design and deployment of autonomous robotic systems. Techniques from robust optimal control theory, such as Hamilton-Jacobi reachability analysis, allow a rigorous forma... 详细信息
来源: 评论
RLS Algorithms and Convergence Analysis Method for Online DLQR Control Design via Heuristic dynamic programming  16
RLS Algorithms and Convergence Analysis Method for Online DL...
收藏 引用
16th UKSim-AMSS international Conference on Computer Modelling and Simulation (UKSim)
作者: Santos, Watson R. M. Queiroz, Jonathan A. Neto, Joao Viana da F. Rego, Patricia H. M. Santana, Ewaldo Andrade, Gustavo Univ Estadual Maranhao Fed Univ Maranhao Fed Inst Maranhao Embedded Syst & Intelligent Control Lab Sao Luis Maranhao Brazil
In this paper, a method to design online optimal policies that encompasses Hamilton-Jacobi-Bellman (HJB) equation solution approximation and heuristic dynamic programming (HDP) approach is proposed. Recursive least sq... 详细信息
来源: 评论
Optimal Control of a Wind Generator System Using Non-Squares Estimators  24
Optimal Control of a Wind Generator System Using Non-Squares...
收藏 引用
24th ieee international symposium on Industrial Electronics (ISIE)
作者: Queiroz, Jonathan Araujo Barros, Allan Kardec Neto, Joao Viana da F. Santana, Ewaldo Univ Fed Maranhao Biol Informat Proc Lab Sao Luis Brazil
The control of eolic and solar energy systems demands methods and technics adapted to the high degree of environment non-stationarities whose adjustments are carried out via adaptive filters. Among the best known are ... 详细信息
来源: 评论
On a Successful Application of Multi-Agent reinforcement learning to Operations Research Benchmarks
On a Successful Application of Multi-Agent Reinforcement Lea...
收藏 引用
ieee symposium on Adaptive dynamic programming and reinforcement learning, (ADPRL)
作者: Thomas Gabel Martin Riedmiller Department of Mathematics and Computer Science Institute of Cognitive Science University of Osnabrück Osnabruck Germany
In this paper, we suggest and analyze the use of approximate reinforcement learning techniques for a new category of challenging benchmark problems from the field of operations research. We demonstrate that interpreti... 详细信息
来源: 评论
Solving PBQP-Based Register Allocation using Deep reinforcement learning  22
Solving PBQP-Based Register Allocation using Deep Reinforcem...
收藏 引用
20th ieee/ACM international symposium on Code Generation and Optimization (CGO)
作者: Kim, Minsu Park, Jeong-Keun Moon, Soo-Mook Seoul Natl Univ Dept Elect & Comp Engn Seoul South Korea
Irregularly structured registers are hard to abstract and allocate. Partitioned Boolean quadratic programming (PBQP) is a useful abstraction to represent complex register constraints, even those in highly irregular pr... 详细信息
来源: 评论
A performance gradient perspective on approximate dynamic programming and its application to partially observable Markov decision processes
A performance gradient perspective on approximate dynamic pr...
收藏 引用
2006 ieee international symposium on Intelligent Control, ISIC 2006
作者: Dankert, James Lei, Yang Si, Jennie Department of Electrical Engineering Arizona State University Tempe AZ 85287-5706
This paper shows an approach to integrating common approximate dynamic programming (ADP) algorithms into a theoretical framework to address both analytical characteristicsand algorithmic features. Several important in... 详细信息
来源: 评论
A New Discrete-Time Iterative Adaptive dynamic programming Algorithm Based on Q-learning  12th
收藏 引用
12th international symposium on Neural Networks (ISNN)
作者: Wei, Qinglai Liu, Derong Chinese Acad Sci Inst Automat State Key Lab Management & Control Complex Syst Beijing 100190 Peoples R China Univ Sci & Technol Beijing Sch Automat & Elect Engn Beijing 100083 Peoples R China
In this paper, a novel Q-learning based policy iteration adaptive dynamic programming (ADP) algorithm is developed to solve the optimal control problems for discrete-time nonlinear systems. The idea is to use a policy... 详细信息
来源: 评论