Refine search results

Document type

  • 299 conference papers
  • 8 journal articles

Holdings

  • 307 electronic documents
  • 0 print holdings

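Subject classification

  • 180 papers: Engineering
    • 158 papers: Computer Science and Technology...
    • 56 papers: Electrical Engineering
    • 48 papers: Software Engineering
    • 47 papers: Control Science and Engineering
    • 13 papers: Information and Communication Engineering
    • 10 papers: Mechanical Engineering
    • 6 papers: Instrument Science and Technology
    • 4 papers: Mechanics (degrees in engineering, sci...
    • 4 papers: Bioengineering
    • 3 papers: Power Engineering and Engineering Thermo...
    • 2 papers: Transportation Engineering
    • 2 papers: Nuclear Science and Technology
    • 2 papers: Biomedical Engineering (degrees in...
    • 1 paper: Architecture
    • 1 paper: Chemical Engineering and Technology
    • 1 paper: Aeronautical and Astronautical Science and Tech...
    • 1 paper: Food Science and Engineering (deg...
  • 40 papers: Science
    • 35 papers: Mathematics
    • 9 papers: Systems Science
    • 8 papers: Statistics (degrees in science, ...
    • 4 papers: Physics
    • 4 papers: Biology
    • 1 paper: Chemistry
    • 1 paper: Astronomy
    • 1 paper: Atmospheric Science
    • 1 paper: Geophysics
    • 1 paper: Geology
  • 18 papers: Management
    • 17 papers: Management Science and Engineering (deg...
    • 7 papers: Business Administration
  • 4 papers: Economics
    • 4 papers: Applied Economics
  • 1 paper: Medicine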

Topics

  • 115 papers: dynamic programm...
  • 76 papers: reinforcement le...
  • 67 papers: learning
  • 47 papers: optimal control
  • 30 papers: neural networks
  • 27 papers: control systems
  • 21 papers: approximate dyna...
  • 21 papers: approximation al...
  • 20 papers: function approxi...
  • 20 papers: equations
  • 17 papers: convergence
  • 16 papers: adaptive dynamic...
  • 16 papers: state-space meth...
  • 16 papers: heuristic algori...
  • 14 papers: mathematical mod...
  • 13 papers: stochastic proce...
  • 12 papers: learning (artifi...
  • 12 papers: adaptive control
  • 12 papers: cost function
  • 11 papers: algorithm design...

Institutions

  • 5 papers: arizona state un...
  • 4 papers: department of el...
  • 4 papers: school of inform...
  • 4 papers: department of in...
  • 4 papers: univ sci & techn...
  • 4 papers: chinese acad sci...
  • 4 papers: department of el...
  • 3 papers: princeton univ d...
  • 3 papers: northeastern uni...
  • 3 papers: national science...
  • 3 papers: robotics institu...
  • 3 papers: univ illinois de...
  • 3 papers: univ utrecht dep...
  • 2 papers: univ groningen i...
  • 2 papers: sharif univ tech...
  • 2 papers: univ texas autom...
  • 2 papers: pengcheng labora...
  • 2 papers: guangxi univ sch...
  • 2 papers: chinese acad sci...
  • 2 papers: cemagref lisc au...

Authors

  • 14 papers: liu derong
  • 9 papers: wei qinglai
  • 8 papers: si jennie
  • 7 papers: xu xin
  • 5 papers: derong liu
  • 4 papers: lewis frank l.
  • 4 papers: martin riedmille...
  • 4 papers: huaguang zhang
  • 4 papers: jennie si
  • 4 papers: marco a. wiering
  • 4 papers: xin xu
  • 4 papers: zhang huaguang
  • 4 papers: dongbin zhao
  • 4 papers: lei yang
  • 4 papers: powell warren b.
  • 4 papers: riedmiller marti...
  • 3 papers: hado van hasselt
  • 3 papers: van hasselt hado
  • 3 papers: jagannathan s.
  • 3 papers: munos remi

Language

  • 305 papers: English
  • 1 paper: Other
  • 1 paper: Chinese
Search query: "Any field = IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning"
307 records; showing 121-130

Effective Elastic Scaling of Deep Learning Workloads
28th IEEE International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems (IEEE MASCOTS)
Authors: Saxena, Vaibhav; Jayaram, K. R.; Basu, Saurav; Sabharwal, Yogish; Verma, Ashish
Affiliations: IBM Res Delhi India; Microsoft Hyderabad India
We examine the elastic scaling of Deep Learning (DL) jobs and propose a novel resource allocation strategy for DL training jobs, resulting in improved job run time performance as well as increased cluster utilization....

Revisiting Maximum Entropy Inverse Reinforcement Learning: New Perspectives and Algorithm
IEEE Symposium Series on Computational Intelligence (IEEE SSCI)
Authors: Snoswell, Aaron J.; Singh, Surya P. N.; Ye, Nan
Affiliations: Univ Queensland Sch Informat Technol & Elect Engn Brisbane Qld Australia; Intuit Surg Sunnyvale CA USA; Univ Queensland Sch Math & Phys Brisbane Qld Australia
We provide new perspectives and inference algorithms for Maximum Entropy (MaxEnt) Inverse Reinforcement Learning (IRL), which provides a principled method to find a most non-committal reward function consistent with g...

2007 IEEE ADPRL International Program Committee Members
IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)
Provides a listing of current committee members.

A Reinforcement Learning-based Bi-Objective Routing Algorithm for Energy Harvesting Mobile Ad-hoc Networks
7th International Symposium on Telecommunications (IST)
Authors: Maleki, Meisam; Hakami, Vesal; Dehghan, Mehdi
Affiliation: Amirkabir Univ Technol Dept Comp Engn Tehran Iran
Dynamic topology, lack of a fixed infrastructure and limited energy in mobile ad-hoc networks (MANETs) give rise to a challenging operational environment. MANET routing protocols should consider dynamic network change...

Fuzzy Q-learning: a new approach for fuzzy dynamic programming
Proceedings of the 3rd IEEE Conference on Fuzzy Systems. Part 3 (of 3)
Author: Berenji, Hamid R.
Affiliation: NASA Ames Research Cent Mountain View United States
Fuzzy Reinforcement Learning (FRL) involves 'jump starting' reinforcement learning with fuzzy logic rules. By using FRL, prior domain knowledge, which may be very approximate and imprecise, can be expressed in...

A self-learning reactive navigation method for mobile robots
International Conference on Machine Learning and Cybernetics
Authors: Xu, X; Wang, XN; He, HG
Affiliation: Natl Univ Def Technol Sch Comp Changsha 410073 Peoples R China
This paper addresses the navigation problem of mobile robots in unknown environments, where global path planning methods cannot be applied. In such cases, reactive navigation controllers are commonly employed to deal ...

Trajectory Tracking of Underactuated Sea Vessels With Uncertain Dynamics: An Integral Reinforcement Learning Approach
IEEE International Conference on Systems, Man, and Cybernetics (SMC)
Authors: Abouheaf, Mohammed; Gueaieb, Wail; Miah, Md Suruz; Spinello, Davide
Affiliations: Univ Ottawa Sch Elect Engn & Comp Sci Ottawa ON Canada; Bradley Univ Dept Elect & Comp Engn Peoria IL USA; Univ Ottawa Dept Mech Engn Ottawa ON Canada
Underactuated systems like sea vessels have degrees of motion that are insufficiently matched by a set of independent actuation forces. In addition, the underlying trajectory-tracking control problems grow in complexi...

Programming and Reinforcement Learning
IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)
Welcome to ADPRL 2007 - the very first IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning. The area of approximate dynamic programming and reinforcement learning is a fusion of ...

Optimal Tracking Control of the Boiler-turbine System Based on Adaptive Dynamic Programming
International Joint Conference on Neural Networks (IJCNN)
Authors: Liu, Yujia; Gao, Aiguo; Wei, Qinglai
Affiliations: Chinese Acad Sci Inst Automat Beijing Peoples R China; North China Elect Power Res Inst Co Ltd Beijing Peoples R China
To guarantee the efficient performance of the power plant, an adaptive tracking controller for the nonlinear boiler-turbine system based on offline policy iteration adaptive dynamic programming (ADP) method is proposed...

Incorporating Approximate Dynamic Programming-Based Parameter Tuning into PD-type Virtual Inertia Control of DFIGs
International Joint Conference on Neural Networks (IJCNN)
Authors: Guo, Wentao; Liu, Feng; Si, Jennie; Mei, Shengwei
Affiliations: Tsinghua Univ Dept Elect Engn State Key Lab Power Syst Beijing 100084 Peoples R China; Arizona State Univ Dept Elect Engn Tempe AZ 85287 USA
Doubly fed induction generators (DFIGs) are widely used in wind power generation. For controlling DFIGs to maintain network frequency within a safety range, the proportional-derivative (PD) type virtual inertia contro...