咨询与建议

限定检索结果

文献类型

  • 299 篇 会议
  • 8 篇 期刊文献

馆藏范围

  • 307 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 180 篇 工学
    • 158 篇 计算机科学与技术...
    • 56 篇 电气工程
    • 48 篇 软件工程
    • 47 篇 控制科学与工程
    • 13 篇 信息与通信工程
    • 10 篇 机械工程
    • 6 篇 仪器科学与技术
    • 4 篇 力学(可授工学、理...
    • 4 篇 生物工程
    • 3 篇 动力工程及工程热...
    • 2 篇 交通运输工程
    • 2 篇 核科学与技术
    • 2 篇 生物医学工程(可授...
    • 1 篇 建筑学
    • 1 篇 化学工程与技术
    • 1 篇 航空宇航科学与技...
    • 1 篇 食品科学与工程(可...
  • 40 篇 理学
    • 35 篇 数学
    • 9 篇 系统科学
    • 8 篇 统计学(可授理学、...
    • 4 篇 物理学
    • 4 篇 生物学
    • 1 篇 化学
    • 1 篇 天文学
    • 1 篇 大气科学
    • 1 篇 地球物理学
    • 1 篇 地质学
  • 18 篇 管理学
    • 17 篇 管理科学与工程(可...
    • 7 篇 工商管理
  • 4 篇 经济学
    • 4 篇 应用经济学
  • 1 篇 医学

主题

  • 115 篇 dynamic programm...
  • 76 篇 reinforcement le...
  • 67 篇 learning
  • 47 篇 optimal control
  • 30 篇 neural networks
  • 27 篇 control systems
  • 21 篇 approximate dyna...
  • 21 篇 approximation al...
  • 20 篇 function approxi...
  • 20 篇 equations
  • 17 篇 convergence
  • 16 篇 adaptive dynamic...
  • 16 篇 state-space meth...
  • 16 篇 heuristic algori...
  • 14 篇 mathematical mod...
  • 13 篇 stochastic proce...
  • 12 篇 learning (artifi...
  • 12 篇 adaptive control
  • 12 篇 cost function
  • 11 篇 algorithm design...

机构

  • 5 篇 arizona state un...
  • 4 篇 department of el...
  • 4 篇 school of inform...
  • 4 篇 department of in...
  • 4 篇 univ sci & techn...
  • 4 篇 chinese acad sci...
  • 4 篇 department of el...
  • 3 篇 princeton univ d...
  • 3 篇 northeastern uni...
  • 3 篇 national science...
  • 3 篇 robotics institu...
  • 3 篇 univ illinois de...
  • 3 篇 univ utrecht dep...
  • 2 篇 univ groningen i...
  • 2 篇 sharif univ tech...
  • 2 篇 univ texas autom...
  • 2 篇 pengcheng labora...
  • 2 篇 guangxi univ sch...
  • 2 篇 chinese acad sci...
  • 2 篇 cemagref lisc au...

作者

  • 14 篇 liu derong
  • 9 篇 wei qinglai
  • 8 篇 si jennie
  • 7 篇 xu xin
  • 5 篇 derong liu
  • 4 篇 lewis frank l.
  • 4 篇 martin riedmille...
  • 4 篇 huaguang zhang
  • 4 篇 jennie si
  • 4 篇 marco a. wiering
  • 4 篇 xin xu
  • 4 篇 zhang huaguang
  • 4 篇 dongbin zhao
  • 4 篇 lei yang
  • 4 篇 powell warren b.
  • 4 篇 riedmiller marti...
  • 3 篇 hado van hasselt
  • 3 篇 van hasselt hado
  • 3 篇 jagannathan s.
  • 3 篇 munos remi

语言

  • 305 篇 英文
  • 1 篇 其他
  • 1 篇 中文
检索条件"任意字段=IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning"
307 条 记 录,以下是131-140 订阅
排序:
Adaptive dynamic programming for optimal control of unknown nonlinear discrete-time systems
Adaptive dynamic programming for optimal control of unknown ...
收藏 引用
ieee symposium on Adaptive dynamic programming and reinforcement learning
作者: Liu, Derong Wang, Ding Zhao, Dongbin Key Laboratory of Complex Systems and Intelligence Science Institute of Automation Chinese Academy of Sciences Beijing 100190 China
An intelligent optimal control scheme for unknown nonlinear discrete-time systems with discount factor in the cost function is proposed in this paper. An iterative adaptive dynamic programming (ADP) algorithm via glob... 详细信息
来源: 评论
ieee SSCI 2011: symposium Series on Computational Intelligence - ADPRL 2011: 2011 ieee symposium on Adaptive dynamic programming and reinforcement learning
IEEE SSCI 2011: Symposium Series on Computational Intelligen...
收藏 引用
symposium Series on Computational Intelligence, ieee SSCI2011 - 2011 ieee symposium on Adaptive dynamic programming and reinforcement learning, ADPRL 2011
The proceedings contain 45 papers. The topics discussed include: active learning for personalizing treatment;active exploration by searching for experiments that falsify the computed control policy;optimistic planning...
来源: 评论
dynamic lead time promising
Dynamic lead time promising
收藏 引用
ieee symposium on Adaptive dynamic programming and reinforcement learning
作者: Reindorp, Matthew J. Fu, Michael C. Department of Industrial Engineering and Innovation Sciences Eindhoven University of Technology Netherlands Robert H. Smith School of Business Institute for Systems Research University of Maryland United States
We consider a make-to-order business that serves customers in multiple priority classes. Orders from customers in higher classes bring greater revenue, but they expect shorter lead times than customers in lower classe... 详细信息
来源: 评论
High-order local dynamic programming
High-order local dynamic programming
收藏 引用
作者: Tassa, Yuval Todorov, Emanuel Interdisciplinary Center for Neural Computation Hebrew University Jerusalem Israel Applied Mathematics and Computer Science and Engineering University of Washington Seattle United States
We describe a new local dynamic programming algorithm for solving stochastic continuous Optimal Control problems. We use cubature integration to both propagate the state distribution and perform the Bellman backup. Th... 详细信息
来源: 评论
A non-parametric approach to approximate dynamic programming
A non-parametric approach to approximate dynamic programming
收藏 引用
10th international Conference on Machine learning and Applications, ICMLA 2011
作者: Glaude, Hadrien Akrimi, Fadi Geist, Matthieu Pietquin, Olivier 57070 Metz France 2 rue Edouard Belin 57070 Metz France
approximate dynamic programming (ADP) is a machine learning method aiming at learning an optimal control policy for a dynamic and stochastic system from a logged set of observed interactions between the system and one... 详细信息
来源: 评论
Optimal Control for a Class of Unknown Nonlinear Systems via the Iterative GDHP Algorithm
Optimal Control for a Class of Unknown Nonlinear Systems via...
收藏 引用
8th international symposium on Neural Networks
作者: Wang, Ding Liu, Derong Chinese Acad Sci Inst Automat Beijing 100190 Peoples R China
Using the neural-network-based iterative adaptive dynamic programming (ADP) algorithm, an optimal control scheme for a class of unknown discrete-time nonlinear systems with discount factor in the cost function is prop... 详细信息
来源: 评论
symposium on adaptive dynamic programming and reinforcement learning (ieee ADPRL 2011)
Symposium on adaptive dynamic programming and reinforcement ...
收藏 引用
ieee symposium on Adaptive dynamic programming and reinforcement learning, (ADPRL)
ADPRL 2011 is the third ieee international symposium on approximate dynamic programming and reinforcement learning. The area of approximate dynamic programming and reinforcement learning is a fusion of a number of res...
来源: 评论
A new approach for power management in sensor node based on reinforcement learning
A new approach for power management in sensor node based on ...
收藏 引用
international symposium on Computer Networks and Distributed Systems
作者: Kianpisheh, Somayeh Charkari, Nasrolah Moghadam Faculty of Electrical and Computer Engineering Tarbiat Modares University Tehran Iran
Wireless sensor networks are composed of small nodes with limited battery life and computational ability. Energy reduction in these networks is an important issue to extend network lifetime. dynamic power management i... 详细信息
来源: 评论
Adaptive Dual Heuristic programming Based on Delta-Bar-Delta learning Rule
Adaptive Dual Heuristic Programming Based on Delta-Bar-Delta...
收藏 引用
8th international symposium on Neural Networks
作者: Wu, Jun Xu, Xin Lian, Chuanqiang Huang, Yan Natl Univ Def Technol Coll Mechatron & Automat Inst Automat Changsha 410073 Hunan Peoples R China
Dual Heuristic programming (DHP) is a class of approximate dynamic programming methods using neural networks. Although there have been some successful applications of DHP, its performance and convergence are greatly i... 详细信息
来源: 评论
Optimization Control of Rectifier in HVDC System with ADHDP
Optimization Control of Rectifier in HVDC System with ADHDP
收藏 引用
8th international symposium on Neural Networks
作者: Song, Chunning Zhou, Xiaohua Lin, Xiaofeng Song, Shaojian Guangxi Univ Coll Elect Engn Guangxi Nanning 530004 Peoples R China
A novel nonlinear optimal controller for a rectifier in HVDC transmission system, using artificial neural networks, is presented in this paper. The action dependent heuristic dynamic programming(ADHDP), a member of th... 详细信息
来源: 评论