咨询与建议

限定检索结果

文献类型

  • 748 篇 会议
  • 271 篇 期刊文献
  • 4 册 图书

馆藏范围

  • 1,023 篇 电子文献
  • 1 种 纸本馆藏

日期分布

学科分类号

  • 712 篇 工学
    • 520 篇 计算机科学与技术...
    • 381 篇 电气工程
    • 278 篇 控制科学与工程
    • 153 篇 软件工程
    • 79 篇 信息与通信工程
    • 40 篇 交通运输工程
    • 23 篇 仪器科学与技术
    • 20 篇 机械工程
    • 9 篇 生物工程
    • 8 篇 电子科学与技术(可...
    • 7 篇 力学(可授工学、理...
    • 7 篇 土木工程
    • 6 篇 动力工程及工程热...
    • 6 篇 石油与天然气工程
    • 4 篇 生物医学工程(可授...
    • 3 篇 材料科学与工程(可...
    • 3 篇 化学工程与技术
    • 3 篇 航空宇航科学与技...
    • 3 篇 安全科学与工程
  • 118 篇 理学
    • 98 篇 数学
    • 32 篇 系统科学
    • 22 篇 统计学(可授理学、...
    • 10 篇 生物学
    • 8 篇 物理学
    • 4 篇 化学
  • 66 篇 管理学
    • 63 篇 管理科学与工程(可...
    • 14 篇 工商管理
    • 5 篇 图书情报与档案管...
  • 5 篇 经济学
    • 4 篇 应用经济学
  • 3 篇 法学
    • 3 篇 社会学
  • 2 篇 医学
  • 1 篇 教育学

主题

  • 313 篇 reinforcement le...
  • 216 篇 dynamic programm...
  • 206 篇 optimal control
  • 107 篇 adaptive dynamic...
  • 104 篇 adaptive dynamic...
  • 97 篇 learning
  • 88 篇 neural networks
  • 78 篇 heuristic algori...
  • 68 篇 reinforcement le...
  • 58 篇 learning (artifi...
  • 54 篇 nonlinear system...
  • 53 篇 convergence
  • 51 篇 control systems
  • 51 篇 mathematical mod...
  • 48 篇 approximate dyna...
  • 44 篇 approximation al...
  • 43 篇 equations
  • 42 篇 adaptive control
  • 41 篇 artificial neura...
  • 41 篇 cost function

机构

  • 41 篇 chinese acad sci...
  • 27 篇 univ rhode isl d...
  • 17 篇 tianjin univ sch...
  • 16 篇 univ sci & techn...
  • 16 篇 univ illinois de...
  • 15 篇 northeastern uni...
  • 14 篇 beijing normal u...
  • 13 篇 northeastern uni...
  • 13 篇 guangdong univ t...
  • 12 篇 northeastern uni...
  • 9 篇 natl univ def te...
  • 8 篇 ieee
  • 8 篇 univ chinese aca...
  • 7 篇 univ chinese aca...
  • 7 篇 cent south univ ...
  • 7 篇 southern univ sc...
  • 7 篇 beijing univ tec...
  • 6 篇 chinese acad sci...
  • 6 篇 missouri univ sc...
  • 5 篇 nanjing univ pos...

作者

  • 54 篇 liu derong
  • 37 篇 wei qinglai
  • 29 篇 he haibo
  • 22 篇 wang ding
  • 21 篇 xu xin
  • 19 篇 jiang zhong-ping
  • 17 篇 lewis frank l.
  • 17 篇 yang xiong
  • 17 篇 zhang huaguang
  • 17 篇 ni zhen
  • 16 篇 zhao bo
  • 15 篇 gao weinan
  • 14 篇 zhao dongbin
  • 13 篇 derong liu
  • 13 篇 zhong xiangnan
  • 12 篇 si jennie
  • 10 篇 jagannathan s.
  • 10 篇 dongbin zhao
  • 10 篇 song ruizhuo
  • 9 篇 abouheaf mohamme...

语言

  • 992 篇 英文
  • 25 篇 其他
  • 6 篇 中文
检索条件"任意字段=IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning"
1023 条 记 录,以下是21-30 订阅
排序:
adaptive Optimal Control via Q-learning for Ito Fuzzy Stochastic Nonlinear Continuous-Time Systems With Stackelberg Game
收藏 引用
ieee TRANSACTIONS ON FUZZY SYSTEMS 2024年 第4期32卷 2029-2038页
作者: Ming, Zhongyang Zhang, Huaguang Yan, Ying Yang, Liu Northeastern Univ Sch Informat Sci & Engn Shenyang 110004 Peoples R China
In order to solve the two-player Stackelberg game for the continuous-time nonlinear stochastic system, using the Takagi-Sugeno (T-S) fuzzy stochastic model, this paper defines the novel Q-functions and suggests an ada... 详细信息
来源: 评论
adaptive Multi-Step Evaluation Design With Stability Guarantee for Discrete-Time Optimal learning Control
收藏 引用
ieee/CAA Journal of Automatica Sinica 2023年 第9期10卷 1797-1809页
作者: Ding Wang Jiangyu Wang Mingming Zhao Peng Xin Junfei Qiao IEEE Faculty of Information Technology the Beijing Key Laboratory of Computational Intelligence and Intelligent Systemthe Beijing Laboratory of Smart Environmental Protectionand the Beijing Institute of Artificial IntelligenceBeijing University of TechnologyBeijing 100124China
This paper is concerned with a novel integrated multi-step heuristic dynamic programming(MsHDP)algorithm for solving optimal control *** is shown that,initialized by the zero cost function,MsHDP can converge to the op... 详细信息
来源: 评论
reinforcement learning-Based 3D Trajectory Tracking Control of Hypersonic Gliding Vehicles With Time-Varying Uncertainties
收藏 引用
ieee TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING 2025年 22卷 8187-8199页
作者: Luo, Biao Sun, Jingyi Tang, Rui Xu, Xiaodong Cent South Univ Sch Automat Changsha 410083 Peoples R China
In this paper, a robust three-dimensional trajectory tracking control scheme based on reinforcement learning is proposed for the glide phase of a hypersonic gliding vehicle (HGV) with time-varying uncertainties. First... 详细信息
来源: 评论
Integral reinforcement learning-Based dynamic Event-Triggered Nonzero-Sum Games of USVs
收藏 引用
ieee TRANSACTIONS ON CYBERNETICS 2025年 第4期55卷 1706-1716页
作者: Xue, Shan Zhang, Weidong Luo, Biao Liu, Derong Hainan Univ Sch Informat & Commun Engn Haikou 570228 Peoples R China Shanghai Jiao Tong Univ Dept Automat Shanghai 200240 Peoples R China Cent South Univ Sch Automat Changsha 410083 Peoples R China Southern Univ Sci & Technol Sch Automat & Intelligent Mfg Shenzhen 518055 Peoples R China Univ Illinois Dept Elect & Comp Engn Chicago IL 60607 USA
In this article, an integral reinforcement learning (IRL) method is developed for dynamic event-triggered nonzero-sum (NZS) games to achieve the Nash equilibrium of unmanned surface vehicles (USVs) with state and inpu... 详细信息
来源: 评论
Approximate dynamic programming for Constrained Piecewise Affine Systems With Stability and Safety Guarantees
收藏 引用
ieee TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS 2025年 第3期55卷 1722-1734页
作者: He, Kanghui Shi, Shengling van den Boom, Ton de Schutter, Bart Delft Univ Technol Delft Ctr Syst & Control NL-2628 CD Delft Netherlands MIT Dept Chem Engn Cambridge MA 02139 USA
Infinite-horizon optimal control of constrained piecewise affine (PWA) systems has been approximately addressed by hybrid model predictive control (MPC), which, however, has computational limitations, both in offline ... 详细信息
来源: 评论
Proceedings of the 2013 ieee symposium on adaptive dynamic programming and reinforcement learning, ADPRL 2013 - 2013 ieee symposium Series on Computational Intelligence, SSCI 2013
Proceedings of the 2013 IEEE Symposium on Adaptive Dynamic P...
收藏 引用
2013 4th ieee symposium on adaptive dynamic programming and reinforcement learning, ADPRL 2013
The proceedings contain 28 papers. The topics discussed include: local stability analysis of high-order recurrent neural networks with multi-step piecewise linear activation functions;finite-horizon optimal control de...
来源: 评论
Safe reinforcement learning and adaptive Optimal Control With Applications to Obstacle Avoidance Problem
收藏 引用
ieee TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING 2024年 第3期21卷 4599-4612页
作者: Wang, Ke Mu, Chaoxu Ni, Zhen Liu, Derong Tianjin Univ Sch Elect & Informat Engn Tianjin 300072 Peoples R China Florida Atlantic Univ Dept Elect Engn & Comp Sci Boca Raton FL 33431 USA Southern Univ Sci & Technol Sch Syst Design & Intelligent Mfg Shenzhen 518055 Peoples R China Univ Illinois Dept Elect & Comp Engn Chicago IL 60607 USA
This paper presents a novel composite obstacle avoidance control method to generate safe motion trajectories for autonomous systems in an adaptive manner. First, system safety is described using forward invariance, an... 详细信息
来源: 评论
Self-Triggered Approximate Optimal Neuro-Control for Nonlinear Systems Through adaptive dynamic programming
收藏 引用
ieee TRANSACTIONS ON NEURAL NETWORKS AND learning SYSTEMS 2024年 第3期36卷 4713-4723页
作者: Zhao, Bo Zhang, Shunchao Liu, Derong Beijing Normal Univ Sch Syst Sci Beijing 100875 Peoples R China Chongqing Univ Posts & Telecommun Key Lab Ind Internet Things & Networked Control Minist Educ Chongqing 400065 Peoples R China Guangdong Univ Finance Sch Internet Finance & Informat Engn Guangzhou 510521 Peoples R China Southern Univ Sci & Technol Sch Syst Design & Intelligent Mfg Shenzhen 518055 Peoples R China Univ Illinois Dept Elect & Comp Engn Chicago IL 60607 USA
In this article, a novel self-triggered approximate optimal neuro-control scheme is presented for nonlinear systems by utilizing adaptive dynamic programming (ADP). According to the Bellman principle of optimality, th... 详细信息
来源: 评论
Optimal Control of Nonlinear Systems Using Experience Inference Human-Behavior learning
收藏 引用
ieee/CAA Journal of Automatica Sinica 2023年 第1期10卷 90-102页
作者: Adolfo Perrusquía Weisi Guo IEEE the School of Aerospace Transport and ManufacturingCranfield UniversityBedfordUK
Safety critical control is often trained in a simulated environment to mitigate *** migration of the biased controller requires further *** this paper,an experience inference human-behavior learning is proposed to sol... 详细信息
来源: 评论
Novel Discounted adaptive Critic Control Designs With Accelerated learning Formulation
收藏 引用
ieee TRANSACTIONS ON CYBERNETICS 2024年 第5期54卷 3003-3016页
作者: Ha, Mingming Wang, Ding Liu, Derong Ant Grp MYbank Beijing 100020 Peoples R China Univ Sci & Technol Beijing Sch Automation & Elect Engn Beijing 100083 Peoples R China Beijing Univ Technol Fac Informat Technol Beijing Key Lab Computat Intelligence & Intelligen Beijing 100124 Peoples R China Southern Univ Sci & Technol Sch Syst Design & Intelligent Mfg Shenzhen 518055 Peoples R China Univ Illinois Dept Elect & Comp Engn Chicago IL 60607 USA
Inspired by the successive relaxation method, a novel discounted iterative adaptive dynamic programming framework is developed, in which the iterative value function sequence possesses an adjustable convergence rate. ... 详细信息
来源: 评论