咨询与建议

限定检索结果

文献类型

  • 61 篇 期刊文献
  • 21 篇 会议

馆藏范围

  • 82 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 74 篇 工学
    • 47 篇 计算机科学与技术...
    • 37 篇 控制科学与工程
    • 31 篇 电气工程
    • 6 篇 软件工程
    • 5 篇 机械工程
    • 3 篇 信息与通信工程
    • 2 篇 仪器科学与技术
    • 2 篇 航空宇航科学与技...
    • 1 篇 电子科学与技术(可...
    • 1 篇 化学工程与技术
    • 1 篇 交通运输工程
    • 1 篇 环境科学与工程(可...
  • 15 篇 理学
    • 6 篇 数学
    • 6 篇 系统科学
    • 3 篇 物理学
    • 2 篇 化学
    • 1 篇 生物学
    • 1 篇 生态学
  • 10 篇 管理学
    • 10 篇 管理科学与工程(可...
    • 2 篇 工商管理
  • 2 篇 经济学
    • 2 篇 应用经济学
  • 1 篇 法学
    • 1 篇 法学
  • 1 篇 教育学
    • 1 篇 教育学
  • 1 篇 军事学

主题

  • 82 篇 neuro-dynamic pr...
  • 28 篇 optimal control
  • 24 篇 reinforcement le...
  • 20 篇 approximate dyna...
  • 19 篇 adaptive critic ...
  • 18 篇 neural networks
  • 15 篇 adaptive dynamic...
  • 12 篇 nonlinear system...
  • 11 篇 dynamic programm...
  • 9 篇 adaptive dynamic...
  • 6 篇 function approxi...
  • 6 篇 policy iteration
  • 5 篇 scheduling
  • 4 篇 markov chains
  • 4 篇 generalized poli...
  • 3 篇 value iteration
  • 3 篇 temporal-differe...
  • 3 篇 q-learning
  • 2 篇 plug-in hybrid e...
  • 2 篇 differential gam...

机构

  • 21 篇 chinese acad sci...
  • 10 篇 univ sci & techn...
  • 8 篇 guangdong univ t...
  • 4 篇 beijing normal u...
  • 3 篇 alphatech inc bu...
  • 2 篇 guangdong univ t...
  • 2 篇 mit informat & d...
  • 2 篇 georgia inst tec...
  • 2 篇 school of automa...
  • 2 篇 mit dept elect e...
  • 2 篇 northeastern uni...
  • 2 篇 univ texas arlin...
  • 2 篇 southern univ sc...
  • 2 篇 univ illinois de...
  • 2 篇 changchun univ t...
  • 2 篇 rzeszow univ tec...
  • 1 篇 univ sci & techn...
  • 1 篇 princeton univ d...
  • 1 篇 univ chinese aca...
  • 1 篇 chinese acad sci...

作者

  • 19 篇 liu derong
  • 18 篇 wei qinglai
  • 7 篇 song ruizhuo
  • 5 篇 zhao bo
  • 5 篇 wang ding
  • 3 篇 bertsekas dp
  • 3 篇 tsitsiklis jn
  • 3 篇 jay h. lee
  • 3 篇 yang xiong
  • 3 篇 lee jh
  • 3 篇 lee jm
  • 3 篇 yan pengfei
  • 2 篇 burghardt andrze...
  • 2 篇 lewis frank l.
  • 2 篇 li yuanchun
  • 2 篇 an tianjiao
  • 2 篇 niket s. kaisare
  • 2 篇 vanroy b
  • 2 篇 szuster marcin
  • 2 篇 lin hanquan

语言

  • 74 篇 英文
  • 4 篇 其他
  • 4 篇 中文
检索条件"主题词=neuro-dynamic programming"
82 条 记 录,以下是61-70 订阅
排序:
On the convergence of temporal-difference learning with linear function approximation
收藏 引用
MACHINE LEARNING 2001年 第3期42卷 241-267页
作者: Tadic, V Univ Melbourne Dept Elect & Elect Engn Parkville Vic 3010 Australia
The asymptotic properties of temporal-difference learning algorithms with linear function approximation are analyzed in this paper. The analysis is carried out in the context of the approximation of a discounted cost-... 详细信息
来源: 评论
Parallel dynamic water supply scheduling in a cluster of computers
收藏 引用
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE 2001年 第15期13卷 1281-1302页
作者: Damas, A Salmerón, M Ortega, J Olivares, G Pomares, H Univ Granada Fac Ciencias Dept Comp Architecture & Comp Technol E-18071 Granada Spain
The parallelization of complex planning and control problems arising in diverse application areas in the industrial, services and commercial environments not only allows the determination of control variables in the r... 详细信息
来源: 评论
Simulation-based learning of cost-to-go for control of nonlinear processes
收藏 引用
KOREAN JOURNAL OF CHEMICAL ENGINEERING 2004年 第2期21卷 338-344页
作者: Lee, JM Lee, JH Georgia Inst Technol Sch Chem & Biomol Engn Atlanta GA 30332 USA
In this paper, we present a simulation-based dynamic programming method that learns the 'cost-to-go' function in an iterative manner. The method is intended to combat two important drawbacks of the conventiona... 详细信息
来源: 评论
Finite-Approximation-Error-Based Discrete-Time Iterative Adaptive dynamic programming
收藏 引用
IEEE TRANSACTIONS ON CYBERNETICS 2014年 第12期44卷 2820-2833页
作者: Wei, Qinglai Wang, Fei-Yue Liu, Derong Yang, Xiong Chinese Acad Sci Inst Automat State Key Lab Management & Control Complex Syst Beijing 100190 Peoples R China
In this paper, a new iterative adaptive dynamic programming (ADP) algorithm is developed to solve optimal control problems for infinite horizon discrete-time nonlinear systems with finite approximation errors. First, ... 详细信息
来源: 评论
Asynchronous Stochastic Approximations With Asymptotically Biased Errors and Deep Multiagent Learning
收藏 引用
IEEE TRANSACTIONS ON AUTOMATIC CONTROL 2021年 第9期66卷 3969-3983页
作者: Ramaswamy, Arunselvan Bhatnagar, Shalabh Quevedo, Daniel E. Paderborn Univ Heinz Nixdorf Inst D-33098 Paderborn Germany Paderborn Univ Dept Comp Sci D-33098 Paderborn Germany Indian Inst Sci Dept Comp Sci & Automat Bangalore 560012 Karnataka India Indian Inst Sci Robert Bosch Ctr Cyber Phys Syst Bangalore 560012 Karnataka India Queensland Univ Technol Sch Elect Engn & Robot Brisbane Qld 4001 Australia
Asynchronous stochastic approximations (SAs) are an important class of model-free algorithms, tools, and techniques that are popular in multiagent and distributed control scenarios. To counter Bellman's curse of d... 详细信息
来源: 评论
State of the Art of Adaptive dynamic programming and Reinforcement Learning
收藏 引用
CAAI Artificial Intelligence Research 2022年 第2期1卷 93-110页
作者: Derong Liu Mingming Ha Shan Xue Department of Mechanical and Energy Engineering Southern University of Science and TechnologyShenzhen 518055China Department of Electrical and Computer Engineering University of Illinois at ChicagoIL 606071USA School of Automation and Electrical Engineering University of Science and Technology BeijingBeijing 100083China School of Computer Science and Engineering South China University of TechnologyGuangzhou 510006China
This article introduces the state-of-the-art development of adaptive dynamic programming and reinforcement learning(ADPRL).First,algorithms in reinforcement learning(RL)are introduced and their roots in dynamic progra... 详细信息
来源: 评论
Model research and features simplification for Scheduling of wafer fabrication system
Model research and features simplification for Scheduling of...
收藏 引用
4th International Conference on Computer Science and Education
作者: Wang, Ying Lin, Zhixian Li, Maoqing Xiamen Univ Dept Automat Xiamen 361005 Peoples R China
Scheduling of wafer fabrication system with machine failures and repair time is studied by neuro-dynmamic programming in this paper. States set and scheduling set are constructed, state transition probability is deduc... 详细信息
来源: 评论
Discrete-Time Optimal Control Scheme Based on Q-Learning Algorithm  7
Discrete-Time Optimal Control Scheme Based on <i>Q</i>-Learn...
收藏 引用
7th International Conference on Intelligent Control and Information Processing (ICICIP)
作者: Wei, Qinglai Liu, Derong Song, Ruizhuo Chinese Acad Sci Inst Automat State Key Lab Management & Control Complex Syst Beijing 100190 Peoples R China Univ Sci & Technol Beijing Sch Automat & Elect Engn Beijing 100083 Peoples R China
This paper is concerned with optimal control problems of discrete-time nonlinear systems via a novel Q-learning algorithm. In the newly developed Q-learning algorithm, the iterative Q function in each iteration is req... 详细信息
来源: 评论
Discrete-Time Generalized Policy Iteration ADP Algorithm With Approximation Errors
Discrete-Time Generalized Policy Iteration ADP Algorithm Wit...
收藏 引用
IEEE Symposium Series on Computational Intelligence (IEEE SSCI)
作者: Wei, Qinglai Li, Benkai Song, Ruizhuo Chinese Acad Sci Inst Automat State Key Lab Management & Control Complex Syst Beijing Peoples R China Univ Sci & Technol Beijing Sch Automat & Elect Engn Beijing Peoples R China
This paper concerns with a novel generalized policy iteration (GPI) algorithm with approximation errors. Approximation errors are explicitly considered in the GPI algorithm. The properties of the stable GPI algorithm ... 详细信息
来源: 评论
Fair Energy Scheduling in Vehicle-to-Grid Networks in the Smart Grid
Fair Energy Scheduling in Vehicle-to-Grid Networks in the Sm...
收藏 引用
IEEE International Conference on Communications (ICC)
作者: Zhong, Weifeng Yu, Rong Zhang, Yan Guangdong Univ Technol Guangzhou Guangdong Peoples R China Simula Res Lab Trondheim Norway
Plug-in hybrid electric vehicles (PHEVs) are receiving growing attention to achieve a sustainable transport system and society. Due to the limited vehicle battery capacity, PHEVs perform charging and re-charging from ... 详细信息
来源: 评论