咨询与建议

限定检索结果

文献类型

  • 745 篇 会议
  • 266 篇 期刊文献
  • 4 册 图书

馆藏范围

  • 1,015 篇 电子文献
  • 1 种 纸本馆藏

日期分布

学科分类号

  • 708 篇 工学
    • 521 篇 计算机科学与技术...
    • 377 篇 电气工程
    • 277 篇 控制科学与工程
    • 155 篇 软件工程
    • 79 篇 信息与通信工程
    • 39 篇 交通运输工程
    • 23 篇 仪器科学与技术
    • 20 篇 机械工程
    • 9 篇 生物工程
    • 8 篇 电子科学与技术(可...
    • 7 篇 力学(可授工学、理...
    • 6 篇 动力工程及工程热...
    • 6 篇 石油与天然气工程
    • 5 篇 土木工程
    • 4 篇 航空宇航科学与技...
    • 4 篇 生物医学工程(可授...
    • 3 篇 材料科学与工程(可...
    • 3 篇 化学工程与技术
    • 3 篇 安全科学与工程
  • 119 篇 理学
    • 99 篇 数学
    • 33 篇 系统科学
    • 22 篇 统计学(可授理学、...
    • 10 篇 生物学
    • 8 篇 物理学
    • 4 篇 化学
  • 67 篇 管理学
    • 64 篇 管理科学与工程(可...
    • 15 篇 工商管理
    • 5 篇 图书情报与档案管...
  • 5 篇 经济学
    • 4 篇 应用经济学
  • 3 篇 法学
    • 3 篇 社会学
  • 2 篇 教育学
  • 2 篇 医学

主题

  • 309 篇 reinforcement le...
  • 214 篇 dynamic programm...
  • 203 篇 optimal control
  • 105 篇 adaptive dynamic...
  • 104 篇 adaptive dynamic...
  • 97 篇 learning
  • 87 篇 neural networks
  • 74 篇 heuristic algori...
  • 68 篇 reinforcement le...
  • 58 篇 learning (artifi...
  • 54 篇 nonlinear system...
  • 53 篇 convergence
  • 51 篇 control systems
  • 51 篇 mathematical mod...
  • 48 篇 approximate dyna...
  • 44 篇 approximation al...
  • 43 篇 equations
  • 42 篇 adaptive control
  • 41 篇 artificial neura...
  • 40 篇 cost function

机构

  • 41 篇 chinese acad sci...
  • 27 篇 univ rhode isl d...
  • 17 篇 tianjin univ sch...
  • 16 篇 univ sci & techn...
  • 16 篇 univ illinois de...
  • 15 篇 northeastern uni...
  • 14 篇 beijing normal u...
  • 13 篇 northeastern uni...
  • 12 篇 northeastern uni...
  • 12 篇 guangdong univ t...
  • 9 篇 natl univ def te...
  • 8 篇 ieee
  • 8 篇 univ chinese aca...
  • 7 篇 univ chinese aca...
  • 7 篇 cent south univ ...
  • 7 篇 southern univ sc...
  • 6 篇 chinese acad sci...
  • 6 篇 missouri univ sc...
  • 6 篇 beijing univ tec...
  • 5 篇 nanjing univ pos...

作者

  • 54 篇 liu derong
  • 37 篇 wei qinglai
  • 29 篇 he haibo
  • 21 篇 xu xin
  • 21 篇 wang ding
  • 19 篇 jiang zhong-ping
  • 17 篇 yang xiong
  • 17 篇 zhang huaguang
  • 17 篇 ni zhen
  • 16 篇 lewis frank l.
  • 16 篇 zhao bo
  • 15 篇 gao weinan
  • 14 篇 zhao dongbin
  • 13 篇 zhong xiangnan
  • 12 篇 si jennie
  • 11 篇 derong liu
  • 10 篇 jagannathan s.
  • 10 篇 dongbin zhao
  • 9 篇 song ruizhuo
  • 9 篇 abouheaf mohamme...

语言

  • 989 篇 英文
  • 20 篇 其他
  • 6 篇 中文
检索条件"任意字段=IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning"
1015 条 记 录,以下是311-320 订阅
排序:
Guaranteed cost neural tracking control for a class of uncertain nonlinear systems using adaptive dynamic programming
收藏 引用
NEUROCOMPUTING 2016年 198卷 80-90页
作者: Yang, Xiong Liu, Derong Wei, Qinglai Wang, Ding Chinese Acad Sci Complex Syst Inst Automat State Key Lab Management & Control Beijing 100190 Peoples R China Univ Sci & Technol Sch Automat & Elect Engn Beijing 100083 Peoples R China
This paper presents an adaptive dynamic programming-based guaranteed cost neural tracking control algorithm for a class of continuous-time matched uncertain nonlinear systems. By introducing an augmented system and em... 详细信息
来源: 评论
A Lyapunov function based optimal hybrid power system controller for improved transient stability
收藏 引用
ELECTRIC POWER SYSTEMS RESEARCH 2016年 137卷 6-15页
作者: Yousefian, R. Kamalasadan, S. Univ N Carolina Dept Elect & Comp Engn Charlotte NC 28223 USA
In this paper, an intelligent power system stabilizer based on a stable and optimal hybrid learning-based adaptive control architecture is proposed which is evolved from approximate dynamic programming technique. The ... 详细信息
来源: 评论
GrHDP Solution for Optimal Consensus Control of Multiagent Discrete-Time Systems
收藏 引用
ieee TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS 2020年 第7期50卷 2362-2374页
作者: Zhong, Xiangnan He, Haibo Univ North Texas Dept Elect Engn Denton TX 76207 USA Univ Rhode Isl Dept Elect Comp & Biomed Engn Kingston RI 02881 USA
This paper develops a new online learning consensus control scheme for multiagent discrete-time systems by goal representation heuristic dynamic programming (GrHDP) techniques. The agents in the whole system are inter... 详细信息
来源: 评论
learning-Based Attitude Tracking Control With High-Performance Parameter Estimation
收藏 引用
ieee TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS 2022年 第3期58卷 2218-2230页
作者: Dong, Hongyang Zhao, Xiaowei Hu, Qinglei Yang, Haoyang Qi, Pengyuan Univ Warwick Intelligent Control & Smart Energy Res Grp Sch Engn Coventry CV4 7AL W Midlands England Beihang Univ Sch Automat Sci & Elect Engn Beijing 100191 Peoples R China Beihang Univ Res Inst Frontier Sci Beijing 100191 Peoples R China
This article aims to handle the optimal attitude tracking control tasks for rigid bodies via a reinforcement-learning-based control scheme, in which a constrained parameter estimator is designed to compensate system u... 详细信息
来源: 评论
adaptive reinforcement learning Strategy-Based Sliding Mode Control of Uncertain Euler-Lagrange Systems With Prescribed Performance Guarantees: Autonomous Underwater Vehicles-Based Verification
收藏 引用
ieee TRANSACTIONS ON FUZZY SYSTEMS 2024年 第11期32卷 6160-6171页
作者: Wu, Yang Wang, Yue-Ying Xie, Xiang-Peng Wu, Zheng-Guang Yan, Huai-Cheng Qilu Univ Technol Shandong Acad Sci Inst Automat Jinan 250014 Peoples R China Shanghai Univ Sch Mechatron Engn & Automat Shanghai 200444 Peoples R China Nanjing Univ Posts & Telecommun Sch Internet Things Nanjing 210023 Peoples R China Zhejiang Univ Inst Cyber Syst & Control State Key Lab Ind Control Technol Hangzhou 310027 Peoples R China East China Univ Sci & Technol Sch Informat Sci & Engn Shanghai 200237 Peoples R China
This article studies the tracking control problem of uncertain Euler-Lagrange systems. Despite receiving widespread attention in recent years, the problem remains unresolved to a large content when considering respons... 详细信息
来源: 评论
Policy Gradient Approaches for Multi-Objective Sequential Decision Making: A Comparison
Policy Gradient Approaches for Multi-Objective Sequential De...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning (ADPRL)
作者: Parisi, Simone Pirotta, Matteo Smacchia, Nicola Bascetta, Luca Restelli, Marcello Politecn Milan Dept Elect Informat & Bioengn Piazza Leonardo da Vinci 32 I-20133 Milan Italy
This paper investigates the use of policy gradient techniques to approximate the Pareto frontier in Multi-Objective Markov Decision Processes (MOMDPs). Despite the popularity of policy-gradient algorithms and the fact... 详细信息
来源: 评论
Manifold-Based reinforcement learning via Locally Linear Reconstruction
收藏 引用
ieee TRANSACTIONS ON NEURAL NETWORKS AND learning SYSTEMS 2017年 第4期28卷 934-947页
作者: Xu, Xin Huang, Zhenhua Zuo, Lei He, Haibo Natl Univ Def Technol Coll Mechatron & Automat Changsha 410073 Hunan Peoples R China Univ Rhode Isl Dept Elect Comp & Biomed Engn Kingston RI 02881 USA
Feature representation is critical not only for pattern recognition tasks but also for reinforcement learning (RL) methods to solve learning control problems under uncertainties. In this paper, a manifold-based RL app... 详细信息
来源: 评论
dynamic Event-Triggered Control for Hierarchical Differential Games
收藏 引用
ieee TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS 2024年
作者: Xue, Shan Luo, Biao Zhang, Weidong Liu, Derong Hainan Univ Sch Informat & Commun Engn Haikou 570228 Hainan Peoples R China Southern Univ Sci & Technol Sch Automat & Intelligent Mfg Shenzhen 518055 Peoples R China Cent South Univ Sch Automat Changsha 410083 Peoples R China Shanghai Jiao Tong Univ Dept Automat Shanghai 200240 Peoples R China Univ Illinois Dept Elect & Comp Engn Chicago IL 60607 USA
This paper proposes a novel dynamic event-triggered control method for a class of completely unknown nonaffine hierarchical differential games, incorporating asymmetric boundaries in both system states and control str... 详细信息
来源: 评论
Opposition-based reinforcement learning in the management of water resources
Opposition-based reinforcement learning in the management of...
收藏 引用
ieee International symposium on Approximate dynamic programming and reinforcement learning
作者: Mahootchi, M. Tizhoosh, H. R. Ponnambalam, K. Univ Waterloo Sysr Design Engn 200 Univ Ave W Waterloo ON N2L 3G1 Canada
Opposition-Based learning (OBL) is a new scheme in machine intelligence. In this paper, an OBL version Q-learning which exploits opposite quantities to accelerate the learning is used for management of single reservoi... 详细信息
来源: 评论
An adaptive Hierarchical Energy Management Strategy for Hybrid Electric Vehicles Combining Heuristic Domain Knowledge and Data-Driven Deep reinforcement learning
收藏 引用
ieee TRANSACTIONS ON TRANSPORTATION ELECTRIFICATION 2022年 第3期8卷 3275-3288页
作者: Hu, Bo Li, Jiaxi Chongqing Univ Technol Key Lab Adv Mfg Technol Automobile Parts Minist Educ Chongqing 400054 Peoples R China Ningbo Yinzhou DLT Technol Co Ltd Ningbo 315000 Peoples R China
With the development of artificial intelligence, there has been a growing interest in machine learning-based control strategy, among which reinforcement learning (RL) has opened up a new direction in the field of hybr... 详细信息
来源: 评论