Search query: "Any field = IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning"
1,015 records; showing 11-20
Adaptive Critic Control With Knowledge Transfer for Uncertain Nonlinear Dynamical Systems: A Reinforcement Learning Approach
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2025, vol. 22, pp. 6752-6761
Authors: Zhang, Liangju; Zhang, Kun; Xie, Xiang Peng; Chadli, Mohammed
Affiliations: Nanjing Univ Posts & Telecommun Coll Automat Nanjing 210023 Jiangsu Peoples R China; Nanjing Univ Posts & Telecommun Coll Artificial Intelligence Nanjing 210023 Jiangsu Peoples R China; Beihang Univ Sch Astronaut Beijing Peoples R China; Nanjing Univ Posts & Telecommun Sch Internet Things Nanjing Peoples R China; Univ Paris Saclay IBISC Lab F-91000 Evry France
This paper presents an online transfer heuristic dynamic programming (THDP) control approach for a class of nonlinear discrete systems. The proposed approach integrates transfer learning with adaptive critic control. ...
Parallel Control for Nonzero-Sum Games With Completely Unknown Nonlinear Dynamics via Reinforcement Learning
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2025, vol. 55, no. 4, pp. 2884-2896
Authors: Lu, Jingwei; Wei, Qinglai; Wang, Fei-Yue
Affiliations: Tsinghua Univ Dept Ind Engn Beijing 100084 Peoples R China; Chinese Acad Sci Inst Automat State Key Lab Multimodal Artificial Intelligence S Beijing 100190 Peoples R China; Macau Univ Sci & Technol Inst Syst Engn Macau Peoples R China; Chinese Acad Sci Inst Automat State Key Lab Management & Control Complex Syst Beijing 100190 Peoples R China; Macau Univ Sci & Technol Fac Innovat Engn Macau Peoples R China
This article utilizes parallel control to investigate the problem of continuous-time (CT) nonzero-sum games (NZSGs) for completely unknown nonlinear systems via reinforcement learning (RL), and a parallel control-base...
Data-Driven Combined Longitudinal and Lateral Control for the Car Following Problem
IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2025, vol. 33, no. 3, pp. 991-1005
Authors: Cui, Leilei; Chakraborty, Sayan; Ozbay, Kaan; Jiang, Zhong-Ping
Affiliations: MIT Cambridge MA 02139 USA; NYU Tandon Sch Engn Dept Elect & Comp Engn Control & Networks Lab Brooklyn NY 11201 USA; NYU C2SMARTER Ctr Tandon Sch Engn Dept Civil & Urban Engn Brooklyn NY 11201 USA; NYU Dept Elect & Comp Engn Dept Civil & Urban Engn Control & Networks Lab Tandon Sch Engn Brooklyn NY 11201 USA
This article studies the problem of data-driven combined longitudinal and lateral control of autonomous vehicles (AVs) such that the AV can stay within a safe but minimum distance from its leading vehicle and, at the ...
Reinforcement Learning to Stabilize Singularly Perturbed DC-Side Dynamics of Grid-Connected Voltage-Source Converters in Modern AC-DC Grids Using Singular Perturbation Theory and Adaptive Dynamic Programming
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2025, vol. 72, no. 3, pp. 2914-2926
Authors: Davari, Masoud; Zhao, Jianguo; Yang, Chunyu; Gao, Weinan; Chai, Tianyou
Affiliations: Georgia Southern Univ Dept Elect & Comp Engn Statesboro Campus Statesboro GA 30460 USA; China Univ Min & Technol Sch Informat & Control Engn Xuzhou 221116 Peoples R China; Northeastern Univ State Key Lab Synthet Automat Proc Ind Shenyang 110819 Peoples R China
The stability and performance of ac-dc systems in grid modernization heavily rely on the rectification mode of grid-connected voltage-source converters (GC-VSCs). Being considered as the heart of the system, its impac...
An Unknown Multiplayer Nonzero-Sum Game: Prescribed-Time Dynamic Event-Triggered Control via Adaptive Dynamic Programming
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2025, vol. 22, pp. 8317-8328
Authors: Zhang, Kun; Zhang, Zhi-Xuan; Xie, Xiang Peng; Rubio, Jose de Jesus
Affiliations: Beihang Univ Sch Astronaut Beijing 100191 Peoples R China; Nanjing Univ Posts & Telecommun Coll Automat Nanjing 210023 Peoples R China; Nanjing Univ Posts & Telecommun Coll Artificial Intelligence Nanjing 210023 Peoples R China; Nanjing Univ Posts & Telecommun Sch Internet Things Nanjing 210023 Peoples R China; Inst Politcn Nacl Seccin Estudios Posgrad Invest Esime Azcapotzalco Ciudad De Mexico 02250 Mexico
In this paper, the novel prescribed-time dynamic event-triggered control method of an unknown multiplayer nonzero-sum game (MP-NZSG) is designed by using adaptive dynamic programming (ADP). Firstly, a neural network-b...
Data-Model Hybrid-Driven Safe Reinforcement Learning for Adaptive Avoidance Control Against Unsafe Moving Zones
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, vol. PP, pp. PP
Authors: Wang, Ke; Mu, Chaoxu; Zhang, Anguo; Sun, Changyin
Affiliations: Tianjin Univ Sch Elect & Informat Engn Tianjin 300072 Peoples R China; Anhui Univ Sch Artificial Intelligence Hefei 230026 Peoples R China; Southeast Univ Sch Automat Nanjing 210096 Peoples R China
With the gradual application of reinforcement learning (RL), safety has emerged as a paramount concern. This article presents a novel data-model hybrid-driven safe RL (SRL) scheme to address the challenge of avoidance...
Adaptive Robust Stochastic Configuration Networks for Near-Infrared Multivariate Analysis
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, vol. PP, pp. PP
Authors: Li, Yuqiang; Du, Wenli; Wang, Xinjie; Yang, Minglei; Zhao, Yunmeng
Affiliations: East China Univ Sci & Technol Minist Educ Key Lab Smart Mfg Energy Chem Proc Shanghai 200237 Peoples R China; Qingyuan Innovat Lab Quanzhou 362801 Peoples R China; Hangzhou Normal Univ Sch Informat Sci & Technol Hangzhou 311121 Peoples R China
Near-infrared (NIR) technology has gained wide acceptance in practical processes and is now the measurement of choice in many sectors. However, with increasing spectral dimensionality, it is challenging to establish a...
Approximate Dynamic Programming for Constrained Piecewise Affine Systems With Stability and Safety Guarantees
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2025, vol. 55, no. 3, pp. 1722-1734
Authors: He, Kanghui; Shi, Shengling; van den Boom, Ton; de Schutter, Bart
Affiliations: Delft Univ Technol Delft Ctr Syst & Control NL-2628 CD Delft Netherlands; MIT Dept Chem Engn Cambridge MA 02139 USA
Infinite-horizon optimal control of constrained piecewise affine (PWA) systems has been approximately addressed by hybrid model predictive control (MPC), which, however, has computational limitations, both in offline ...
Integral Reinforcement Learning-Based Dynamic Event-Triggered Nonzero-Sum Games of USVs
IEEE TRANSACTIONS ON CYBERNETICS, 2025, vol. 55, no. 4, pp. 1706-1716
Authors: Xue, Shan; Zhang, Weidong; Luo, Biao; Liu, Derong
Affiliations: Hainan Univ Sch Informat & Commun Engn Haikou 570228 Peoples R China; Shanghai Jiao Tong Univ Dept Automat Shanghai 200240 Peoples R China; Cent South Univ Sch Automat Changsha 410083 Peoples R China; Southern Univ Sci & Technol Sch Automat & Intelligent Mfg Shenzhen 518055 Peoples R China; Univ Illinois Dept Elect & Comp Engn Chicago IL 60607 USA
In this article, an integral reinforcement learning (IRL) method is developed for dynamic event-triggered nonzero-sum (NZS) games to achieve the Nash equilibrium of unmanned surface vehicles (USVs) with state and inpu...
Reinforcement Learning-Based 3D Trajectory Tracking Control of Hypersonic Gliding Vehicles With Time-Varying Uncertainties
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2025, vol. 22, pp. 8187-8199
Authors: Luo, Biao; Sun, Jingyi; Tang, Rui; Xu, Xiaodong
Affiliations: Cent South Univ Sch Automat Changsha 410083 Peoples R China
In this paper, a robust three-dimensional trajectory tracking control scheme based on reinforcement learning is proposed for the glide phase of a hypersonic gliding vehicle (HGV) with time-varying uncertainties. First...