Search query: subject term = "neuro-dynamic programming"
82 records; showing results 31-40
Approximate dynamic programming strategies and their applicability for process control: A review and future directions
INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2004, vol. 2, no. 3, pp. 263-278
Authors: Lee, JM; Lee, JH
Affiliations: Georgia Inst Technol Sch Chem & Biomol Engn Atlanta GA 30332 USA
This paper reviews dynamic programming (DP), surveys approximate solution methods for it, and considers their applicability to process control problems. Reinforcement Learning (RL) and neuro-dynamic programming (NDP),...

Generalized Policy Iteration Adaptive Dynamic Programming for Discrete-Time Nonlinear Systems
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2015, vol. 45, no. 12, pp. 1577-1591
Authors: Liu, Derong; Wei, Qinglai; Yan, Pengfei
Affiliations: Chinese Acad Sci Inst Automat State Key Lab Management & Control Complex Syst Beijing 100190 Peoples R China
This paper is concerned with a novel generalized policy iteration algorithm for solving optimal control problems for discrete-time nonlinear systems. The idea is to use an iterative adaptive dynamic programming algori...

Discrete-Time Optimal Control via Local Policy Iteration Adaptive Dynamic Programming
IEEE TRANSACTIONS ON CYBERNETICS, 2017, vol. 47, no. 10, pp. 3367-3379
Authors: Wei, Qinglai; Liu, Derong; Lin, Qiao; Song, Ruizhuo
Affiliations: Chinese Acad Sci Inst Automat State Key Lab Management & Control Complex Syst Beijing 100190 Peoples R China; Univ Sci & Technol Beijing Sch Automat & Elect Engn Beijing 100083 Peoples R China
In this paper, a discrete-time optimal control scheme is developed via a novel local policy iteration adaptive dynamic programming algorithm. In the discrete-time local policy iteration algorithm, the iterative value ...

What You Should Know About Approximate Dynamic Programming
NAVAL RESEARCH LOGISTICS, 2009, vol. 56, no. 3, pp. 239-249
Authors: Powell, Warren B.
Affiliations: Princeton Univ Dept Operat Res & Financial Engn Princeton NJ 08544 USA
Approximate dynamic programming (ADP) is a broad umbrella for a modeling and algorithmic strategy for solving problems that are sometimes large and complex, and are usually (but not always) stochastic. It is most ofte...

A Novel Iterative θ-Adaptive Dynamic Programming for Discrete-Time Nonlinear Systems
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2014, vol. 11, no. 4, pp. 1176-1190
Authors: Wei, Qinglai; Liu, Derong
Affiliations: Chinese Acad Sci Inst Automat State Key Lab Management & Control Complex Syst Beijing 100190 Peoples R China
This paper is concerned with a new iterative theta-adaptive dynamic programming (ADP) technique to solve optimal control problems of infinite horizon discrete-time nonlinear systems. The idea is to use an iterative AD...

Discrete-Time Two-Player Zero-Sum Games for Nonlinear Systems Using Iterative Adaptive Dynamic Programming
13th International Symposium on Neural Networks (ISNN)
Authors: Wei, Qinglai; Liu, Derong
Affiliations: Chinese Acad Sci Inst Automat State Key Lab Management & Control Complex Syst Beijing 100190 Peoples R China; Univ Sci & Technol Beijing Sch Automat & Elect Engn Beijing 100083 Peoples R China
This paper is concerned with a discrete-time two-player zero-sum game of nonlinear systems, which is solved by a new iterative adaptive dynamic programming (ADP) method. In the present iterative ADP algorithm, two ite...

Learning Nursery Rhymes Using Adaptive Parameter Neurodynamic Programming
1st Australasian Conference on Artificial Life and Computational Intelligence (ACALCI)
Authors: Walker, Josiah; Chalup, Stephan K.
Affiliations: Univ Newcastle Sch Elect Engn & Comp Sci Callaghan NSW 2308 Australia
In this study on music learning, we develop an average reward based adaptive parameterisation for reinforcement learning meta-parameters. These are tested using an approximation of user feedback based on the goal of l...

Optimal Learning Control for Discrete-Time Nonlinear Systems Using Generalized Policy Iteration Based Adaptive Dynamic Programming
11th World Congress on Intelligent Control and Automation
Authors: Wei, Qinglai; Liu, Derong
Affiliations: Chinese Acad Sci Inst Automat State Key Lab Management & Control Complex Syst Beijing 100190 Peoples R China
In this paper, a novel generalized policy iteration algorithm is investigated to solve infinite horizon optimal control problems for discrete-time nonlinear systems. Two iteration indices are introduced in the general...

Multiagent Reinforcement Learning: Rollout and Policy Iteration
IEEE/CAA Journal of Automatica Sinica, 2021, vol. 8, no. 2, pp. 249-272
Authors: Dimitri Bertsekas
Affiliations: Arizona State University (ASU), Tempe, AZ 85281, USA; also with Massachusetts Institute of Technology (MIT), Cambridge, MA 02139
We discuss the solution of complex multistage decision problems using methods that are based on the idea of policy iteration (PI), i.e., start from some base policy and generate an improved policy. Rollout is the simplest method of ...

Simulation based strategy for nonlinear optimal control: application to a microbial cell reactor
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2003, vol. 13, no. 3-4, pp. 347-363
Authors: Kaisare, NS; Lee, JM; Lee, JH
Affiliations: Georgia Inst Technol Sch Chem Engn Atlanta GA 30332 USA
Optimal control of systems with complex nonlinear behaviour such as steady state multiplicity results in a nonlinear optimization problem that needs to be solved online at each sample time. We present an approach base...