咨询与建议

限定检索结果

文献类型

  • 745 篇 会议
  • 266 篇 期刊文献
  • 4 册 图书

馆藏范围

  • 1,015 篇 电子文献
  • 1 种 纸本馆藏

日期分布

学科分类号

  • 708 篇 工学
    • 521 篇 计算机科学与技术...
    • 377 篇 电气工程
    • 277 篇 控制科学与工程
    • 155 篇 软件工程
    • 79 篇 信息与通信工程
    • 39 篇 交通运输工程
    • 23 篇 仪器科学与技术
    • 20 篇 机械工程
    • 9 篇 生物工程
    • 8 篇 电子科学与技术(可...
    • 7 篇 力学(可授工学、理...
    • 6 篇 动力工程及工程热...
    • 6 篇 石油与天然气工程
    • 5 篇 土木工程
    • 4 篇 航空宇航科学与技...
    • 4 篇 生物医学工程(可授...
    • 3 篇 材料科学与工程(可...
    • 3 篇 化学工程与技术
    • 3 篇 安全科学与工程
  • 119 篇 理学
    • 99 篇 数学
    • 33 篇 系统科学
    • 22 篇 统计学(可授理学、...
    • 10 篇 生物学
    • 8 篇 物理学
    • 4 篇 化学
  • 67 篇 管理学
    • 64 篇 管理科学与工程(可...
    • 15 篇 工商管理
    • 5 篇 图书情报与档案管...
  • 5 篇 经济学
    • 4 篇 应用经济学
  • 3 篇 法学
    • 3 篇 社会学
  • 2 篇 教育学
  • 2 篇 医学

主题

  • 309 篇 reinforcement le...
  • 214 篇 dynamic programm...
  • 203 篇 optimal control
  • 105 篇 adaptive dynamic...
  • 104 篇 adaptive dynamic...
  • 97 篇 learning
  • 87 篇 neural networks
  • 74 篇 heuristic algori...
  • 68 篇 reinforcement le...
  • 58 篇 learning (artifi...
  • 54 篇 nonlinear system...
  • 53 篇 convergence
  • 51 篇 control systems
  • 51 篇 mathematical mod...
  • 48 篇 approximate dyna...
  • 44 篇 approximation al...
  • 43 篇 equations
  • 42 篇 adaptive control
  • 41 篇 artificial neura...
  • 40 篇 cost function

机构

  • 41 篇 chinese acad sci...
  • 27 篇 univ rhode isl d...
  • 17 篇 tianjin univ sch...
  • 16 篇 univ sci & techn...
  • 16 篇 univ illinois de...
  • 15 篇 northeastern uni...
  • 14 篇 beijing normal u...
  • 13 篇 northeastern uni...
  • 12 篇 northeastern uni...
  • 12 篇 guangdong univ t...
  • 9 篇 natl univ def te...
  • 8 篇 ieee
  • 8 篇 univ chinese aca...
  • 7 篇 univ chinese aca...
  • 7 篇 cent south univ ...
  • 7 篇 southern univ sc...
  • 6 篇 chinese acad sci...
  • 6 篇 missouri univ sc...
  • 6 篇 beijing univ tec...
  • 5 篇 nanjing univ pos...

作者

  • 54 篇 liu derong
  • 37 篇 wei qinglai
  • 29 篇 he haibo
  • 21 篇 xu xin
  • 21 篇 wang ding
  • 19 篇 jiang zhong-ping
  • 17 篇 yang xiong
  • 17 篇 zhang huaguang
  • 17 篇 ni zhen
  • 16 篇 lewis frank l.
  • 16 篇 zhao bo
  • 15 篇 gao weinan
  • 14 篇 zhao dongbin
  • 13 篇 zhong xiangnan
  • 12 篇 si jennie
  • 11 篇 derong liu
  • 10 篇 jagannathan s.
  • 10 篇 dongbin zhao
  • 9 篇 song ruizhuo
  • 9 篇 abouheaf mohamme...

语言

  • 989 篇 英文
  • 20 篇 其他
  • 6 篇 中文
检索条件"任意字段=IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning"
1015 条 记 录,以下是291-300 订阅
排序:
A Two Stage learning Technique for Dual learning in the Pursuit-Evasion Differential Game
A Two Stage Learning Technique for Dual Learning in the Purs...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning (ADPRL)
作者: Al-Talabi, Ahmad A. Schwartz, Howard M. Carleton Univ Dept Syst & Comp Engn 1125 Colonel By Dr Ottawa ON K1S 5B6 Canada Univ Baghdad Al Khwarizmi Coll Engn Mechatron Engn Dept Baghdad Iraq
This paper addresses the case of dual learning in the pursuit-evasion (PE) differential game and examines how fast the players can learn their default control strategies. The players should learn their default control... 详细信息
来源: 评论
adaptive Critic Control Design with Knowledge Transfer for Wastewater Treatment Applications
收藏 引用
ieee TRANSACTIONS ON INDUSTRIAL INFORMATICS 2024年 第2期20卷 1488-1497页
作者: Wang, Ding Li, Xin Zhao, Mingming Qiao, Junfei Beijing Univ Technol Fac Informat Technol Beijing Key Lab Computat Intelligence & Intellige Beijing 100124 Peoples R China Beijing Univ Technol Beijing Inst Artificial Intelligence Beijing 100124 Peoples R China
The wastewater treatment process (WWTP) is of great significance to environmental protection. To improve the efficiency of the WWTP, it is crucial to ensure that the dissolved oxygen (DO) concentration tracks the set ... 详细信息
来源: 评论
Event-Driven H-Constrained Control Using adaptive Critic learning
收藏 引用
ieee TRANSACTIONS ON CYBERNETICS 2021年 第10期51卷 4860-4872页
作者: Yang, Xiong He, Haibo Tianjin Univ Sch Elect & Informat Engn Tianjin 300072 Peoples R China Univ Rhode Isl Dept Elect Comp & Biomed Engn Kingston RI 02881 USA
This article considers an event-driven H-infinity control problem of continuous-time nonlinear systems with asymmetric input constraints. Initially, the H-infinity-constrained control problem is converted into a two-p... 详细信息
来源: 评论
Computing and Communication Cost-Aware Service Migration Enabled by Transfer reinforcement learning for dynamic Vehicular Edge Computing Networks
收藏 引用
ieee TRANSACTIONS ON MOBILE COMPUTING 2024年 第1期23卷 257-269页
作者: Peng, Yan Tang, Xiaogang Zhou, Yiqing Li, Jintao Qi, Yanli Liu, Ling Lin, Hai Inst Comp Technol State Key Lab Processors Beijing 100190 Peoples R China Beijing Key Lab Mobile Comp & Pervas Device Beijing 100190 Peoples R China Space Engn Univ Sch Aerosp Informat Beijing 100015 Peoples R China
Due to the high mobility of vehicles, service migration is inevitable in vehicular edge computing (VEC) networks. Frequent service migrations incur prohibitive migration cost including the computing cost (e.g., increa... 详细信息
来源: 评论
Event-Triggered ADP for Nonzero-Sum Games of Unknown Nonlinear Systems
收藏 引用
ieee TRANSACTIONS ON NEURAL NETWORKS AND learning SYSTEMS 2022年 第5期33卷 1905-1913页
作者: Zhao, Qingtao Sun, Jian Wang, Gang Chen, Jie Beijing Inst Technol Key Lab Intelligent Control & Decis Complex Syst Beijing 100081 Peoples R China Chongqing Innovat Ctr Beijing Inst Technol Chongqing 401120 Peoples R China Tongji Univ Sch Elect & Informat Engn Shanghai 200092 Peoples R China Beijing Inst Technol State Key Lab Intelligent Control & Decis Complex Beijing 100081 Peoples R China
For nonzero-sum (NZS) games of nonlinear systems, reinforcement learning (RL) or adaptive dynamic programming (ADP) has shown its capability of approximating the desired index performance and the optimal input policy ... 详细信息
来源: 评论
adaptive dynamic programming based Control Scheme for Uncertain Two-Wheel Robots
Adaptive Dynamic Programming based Control Scheme for Uncert...
收藏 引用
ieee International Conference on Autonomous Robot Systems and Competitions (ICARSC)
作者: Thien Van Nguyen Hai Xuan Le Hoang Viet Tran Duc Anh Nguyen Minh Ngoc Nguyen Linh Nguyen Hanoi Univ Ind Hanoi Vietnam Hanoi Univ Sci & Technol Hanoi Vietnam Federat Univ Australia Melbourne Vic Australia
The paper addresses the problem of effectively controlling a two-wheel robot given its inherent non-linearity and parameter uncertainties. In order to deal with the unknown and uncertain dynamics of the robot, it is p... 详细信息
来源: 评论
reinforcement learning Control of Robotic Knee With Human-in-the-Loop by Flexible Policy Iteration
收藏 引用
ieee TRANSACTIONS ON NEURAL NETWORKS AND learning SYSTEMS 2022年 第10期33卷 5873-5887页
作者: Gao, Xiang Si, Jennie Wen, Yue Li, Minhan Huang, He Arizona State Univ Dept Elect Comp & Energy Engn Tempe AZ 85287 USA North Carolina State Univ Dept Biomed Engn Raleigh NC 27695 USA Univ N Carolina Chapel Hill NC 27599 USA
We are motivated by the real challenges presented in a human-robot system to develop new designs that are efficient at data level and with performance guarantees, such as stability and optimality at system level. Exis... 详细信息
来源: 评论
Control of nonaffine nonlinear discrete-time systems using reinforcement-learning-based linearly parameterized neural networks
收藏 引用
ieee TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS 2008年 第4期38卷 994-1001页
作者: Yang, Qinmin Vance, Jonathan Blake Jagannathan, S. Missouri Univ Sci & Technol Dept Elect & Comp Engn Rolla MO 65409 USA
A nonaffine discrete-time system represented by the nonlinear autoregressive moving average with eXogenous input (NARMAX) representation with unknown nonlinear system dynamics is considered. An equivalent affinelike r... 详细信息
来源: 评论
Online adaptive learning of optimal control solutions using integral reinforcement learning
Online adaptive learning of optimal control solutions using ...
收藏 引用
作者: Vamvoudakis, Kyriakos G. Vrabie, Draguna Lewis, Frank L. Automation and Robotics Research Institute University of Texas at Arlington Fort Worth TX 76118 United States
In this paper we introduce an online algorithm that uses integral reinforcement knowledge for learning the continuous-time optimal control solution for nonlinear systems with infinite horizon costs and partial knowled... 详细信息
来源: 评论
Discrete-Time Non-Zero-Sum Games With Completely Unknown dynamics
收藏 引用
ieee TRANSACTIONS ON CYBERNETICS 2021年 第6期51卷 2929-2943页
作者: Song, Ruizhuo Wei, Qinglai Zhang, Huaguang Lewis, Frank L. Univ Sci & Technol Beijing Sch Automat & Elect Engn Beijing 100083 Peoples R China Chinese Acad Sci Inst Automat State Key Lab Management & Control Complex Syst Beijing 100190 Peoples R China Northeastern Univ Coll Informat Sci & Engn Shenyang 110819 Peoples R China Univ Texas Arlington UTA Res Inst Ft Worth TX 76118 USA
In this article, off-policy reinforcement learning (RL) algorithm is established to solve the discrete-time N-player nonzero-sum (NZS) games with completely unknown dynamics. The N-coupled generalized algebraic Riccat... 详细信息
来源: 评论