Refine search results

Document type

  • 743 conference papers
  • 265 journal articles
  • 4 books

Collection

  • 1,012 electronic documents
  • 1 print holding

Date distribution

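Subject classification

  • 704 records: Engineering
    • 517 records: Computer Science and Technology...
    • 376 records: Electrical Engineering
    • 275 records: Control Science and Engineering
    • 152 records: Software Engineering
    • 79 records: Information and Communication Engineering
    • 39 records: Transportation Engineering
    • 23 records: Instrument Science and Technology
    • 20 records: Mechanical Engineering
    • 9 records: Bioengineering
    • 8 records: Electronic Science and Technology (may confer...
    • 7 records: Mechanics (may confer Engineering, Sci...
    • 6 records: Power Engineering and Engineering Thermo...
    • 6 records: Petroleum and Natural Gas Engineering
    • 5 records: Civil Engineering
    • 4 records: Aeronautical and Astronautical Science and Tech...
    • 4 records: Biomedical Engineering (may confer...
    • 3 records: Materials Science and Engineering (may confer...
    • 3 records: Chemical Engineering and Technology
    • 3 records: Safety Science and Engineering
  • 119 records: Science
    • 99 records: Mathematics
    • 33 records: Systems Science
    • 22 records: Statistics (may confer Science, ...
    • 10 records: Biology
    • 8 records: Physics
    • 4 records: Chemistry
  • 65 records: Management
    • 62 records: Management Science and Engineering (may confer...
    • 15 records: Business Administration
    • 5 records: Library, Information and Archives Manage...
  • 5 records: Economics
    • 4 records: Applied Economics
  • 3 records: Law
    • 3 records: Sociology
  • 2 records: Education
  • 2 records: Medicine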

Topic

  • 308 records: reinforcement le...
  • 213 records: dynamic programm...
  • 202 records: optimal control
  • 105 records: adaptive dynamic...
  • 104 records: adaptive dynamic...
  • 97 records: learning
  • 87 records: neural networks
  • 73 records: heuristic algori...
  • 68 records: reinforcement le...
  • 58 records: learning (artifi...
  • 54 records: nonlinear system...
  • 52 records: control systems
  • 51 records: convergence
  • 51 records: mathematical mod...
  • 48 records: approximate dyna...
  • 44 records: approximation al...
  • 43 records: equations
  • 43 records: adaptive control
  • 40 records: artificial neura...
  • 40 records: cost function

Institution

  • 41 records: Chinese Acad Sci...
  • 27 records: Univ Rhode Isl D...
  • 17 records: Tianjin Univ Sch...
  • 16 records: Univ Sci & Techn...
  • 16 records: Univ Illinois De...
  • 15 records: Northeastern Uni...
  • 14 records: Beijing Normal U...
  • 13 records: Northeastern Uni...
  • 12 records: Northeastern Uni...
  • 12 records: Guangdong Univ T...
  • 9 records: Natl Univ Def Te...
  • 8 records: IEEE
  • 8 records: Univ Chinese Aca...
  • 7 records: Univ Chinese Aca...
  • 7 records: Cent South Univ ...
  • 7 records: Southern Univ Sc...
  • 6 records: Chinese Acad Sci...
  • 6 records: Missouri Univ Sc...
  • 6 records: Beijing Univ Tec...
  • 5 records: Nanjing Univ Pos...

Author

  • 54 records: Liu Derong
  • 37 records: Wei Qinglai
  • 29 records: He Haibo
  • 21 records: Xu Xin
  • 21 records: Wang Ding
  • 19 records: Jiang Zhong-Ping
  • 17 records: Yang Xiong
  • 17 records: Zhang Huaguang
  • 17 records: Ni Zhen
  • 16 records: Lewis Frank L.
  • 16 records: Zhao Bo
  • 15 records: Gao Weinan
  • 14 records: Zhao Dongbin
  • 13 records: Zhong Xiangnan
  • 12 records: Si Jennie
  • 11 records: Derong Liu
  • 10 records: Jagannathan S.
  • 10 records: Dongbin Zhao
  • 9 records: Song Ruizhuo
  • 9 records: Abouheaf Mohamme...

Language

  • 986 records: English
  • 20 records: Other
  • 6 records: Chinese

Search query: "Any field = IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning"
1,012 records; results 71-80 are listed below.

Discrete-Time Stable Generalized Self-Learning Optimal Control With Approximation Errors
IEEE Transactions on Neural Networks and Learning Systems, 2018, vol. 29, no. 4, pp. 1226-1238
Authors: Wei, Qinglai; Li, Benkai; Song, Ruizhuo
Affiliations: Chinese Acad Sci Inst Automat State Key Lab Management & Control Complex Syst Beijing 100190 Peoples R China; Univ Chinese Acad Sci Beijing 100049 Peoples R China; Univ Sci & Technol Beijing Sch Automat & Elect Engn Beijing 100083 Peoples R China
In this paper, a generalized policy iteration (GPI) algorithm with approximation errors is developed for solving infinite horizon optimal control problems for nonlinear systems. The developed stable GPI algorithm prov...

Adaptive, Optimal, Virtual Synchronous Generator Control of Three-Phase Grid-Connected Inverters Under Different Grid Conditions-An Adaptive Dynamic Programming Approach
IEEE Transactions on Industrial Informatics, 2022, vol. 18, no. 11, pp. 7388-7399
Authors: Wang, Zhongyang; Yu, Yunjun; Gao, Weinan; Davari, Masoud; Deng, Chao
Affiliations: Fuzhou Inst Technol Sch Appl Sci & Engn Fuzhou 350506 Peoples R China; Nanchang Univ Dept Automat Informat Engn Nanchang 330031 Jiangxi Peoples R China; Florida Inst Technol Florida Tech Coll Engn & Sci Dept Mech & Civil Engn Melbourne FL 32901 USA; Georgia Southern Univ Dept Elect & Comp Engn Statesboro Campus Statesboro GA 30460 USA; Nanjing Univ Posts & Telecommun Inst Adv Technol Nanjing 210023 Peoples R China
This article proposes an adaptive, optimal, data-driven control approach based on reinforcement learning and adaptive dynamic programming to the three-phase grid-connected inverter employed in virtual synchronous gene...

Integrating Sporadic Imitation in Reinforcement Learning Robots
IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning
Authors: Richert, Willi; Scheller, Ulrich; Koch, Markus; Kleinjohann, Bernd; Stern, Claudius
Affiliation: Univ Gesamthsch Paderborn Fac Comp Sci Elect Engn & Math D-33102 Paderborn Germany
Although the combination of reinforcement learning and imitation has been already considered in recent research, it always revolved around fixed settings where demonstrator and imitator are fixed and the imitation pro...

Neural-Network-Based Reinforcement Learning Controller for Nonlinear Systems with Non-symmetric Dead-zone Inputs
IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning
Authors: Zhang, Xin; Zhang, Huaguang; Liu, Derong; Kim, Yongsu
Affiliations: Northeastern Univ Sch Informat Sci & Engn Shenyang 110004 Liaoning Peoples R China; Univ Illinois Dept Elect & Comp Engn Chicago IL 60607 USA
A novel adaptive-critic-based NN controller using reinforcement learning is developed for a class of nonlinear systems with non-symmetric dead-zone inputs. The adaptive critic NN controller uses two NNs: the critic NN...

Structure search of probabilistic models and data correction for EDA-RL
IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning
Author: Handa, Hisashi
Affiliation: Graduate School of Natural Science and Technology Okayama University Tsushima-naka 3-1-1 Okayama 700-8530 Japan
We have proposed a novel Estimation of Distribution Algorithm for solving reinforcement learning problems: EDA-RL. The EDA-RL can perform well if the complexity of the structure of the probabilistic model is adapted t...

Neural-Network-Based Robust Control Schemes for Nonlinear Multiplayer Systems With Uncertainties via Adaptive Dynamic Programming
IEEE Transactions on Systems, Man, and Cybernetics: Systems, 2019, vol. 49, no. 3, pp. 579-588
Authors: Jiang, He; Zhang, Huaguang; Luo, Yanhong; Han, Ji
Affiliation: Northeastern Univ Coll Informat Sci & Engn Shenyang 110819 Liaoning Peoples R China
This paper investigates the robust control issues of nonlinear multiplayer systems by utilizing adaptive dynamic programming (ADP) methods and fills a gap in the ADP field, where actuator uncertainties for multiplayer...

A novel approach for constructing basis functions in approximate dynamic programming for feedback control
2013 4th IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, ADPRL 2013
Authors: Wang, Jian; Huang, Zhenhua; Xu, Xin
Affiliations: College of Mechatronics and Automation National University of Defense Tech Changsha 410073 China; Xi'an Air Force Military Representative Office Xi'an China
This paper presents a novel approach for constructing basis functions in approximate dynamic programming (ADP) through the locally linear embedding (LLE) process. It considers the experience (sample) data as a high-di...

Theoretical Analysis of a Reinforcement Learning based Switching Scheme
IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)
Author: Heydari, Ali
Affiliation: South Dakota Sch Mines & Technol Dept Mech Engn Rapid City SD 57701 USA
A reinforcement learning based scheme for optimal switching with an infinite-horizon cost function is briefly proposed in this paper. Several theoretical questions are shown to arise regarding its convergence, optimal...

Neural-Network-Based Adaptive Dynamic Surface Control for MIMO Systems with Unknown Hysteresis
IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)
Authors: Liu, Lei; Wang, Zhanshan; Shen, Zhengwei
Affiliation: Northeastern Univ Coll Informat Sci & Engn Shenyang Liaoning Peoples R China
This paper focuses on the composite adaptive tracking control for a class of nonlinear multiple-input-multiple-output (MIMO) systems with unknown backlash-like hysteresis nonlinearities. A dynamic surface control meth...

Convergent Reinforcement Learning Control with Neural Networks and Continuous Action Search
IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)
Authors: Lee, Minwoo; Anderson, Charles W.
Affiliation: Colorado State Univ Dept Comp Sci Ft Collins CO 80523 USA
We combine a convergent TD-learning method and direct continuous action search with neural networks for function approximation to obtain both stability and generalization over inexperienced state-action pairs. We exte...