咨询与建议

限定检索结果

文献类型

  • 745 篇 会议
  • 269 篇 期刊文献
  • 4 册 图书

馆藏范围

  • 1,018 篇 电子文献
  • 1 种 纸本馆藏

日期分布

学科分类号

  • 711 篇 工学
    • 520 篇 计算机科学与技术...
    • 380 篇 电气工程
    • 278 篇 控制科学与工程
    • 153 篇 软件工程
    • 79 篇 信息与通信工程
    • 40 篇 交通运输工程
    • 23 篇 仪器科学与技术
    • 20 篇 机械工程
    • 9 篇 生物工程
    • 8 篇 电子科学与技术(可...
    • 7 篇 力学(可授工学、理...
    • 7 篇 土木工程
    • 6 篇 动力工程及工程热...
    • 6 篇 石油与天然气工程
    • 4 篇 生物医学工程(可授...
    • 3 篇 材料科学与工程(可...
    • 3 篇 化学工程与技术
    • 3 篇 航空宇航科学与技...
    • 3 篇 安全科学与工程
  • 118 篇 理学
    • 98 篇 数学
    • 32 篇 系统科学
    • 22 篇 统计学(可授理学、...
    • 10 篇 生物学
    • 8 篇 物理学
    • 4 篇 化学
  • 66 篇 管理学
    • 63 篇 管理科学与工程(可...
    • 14 篇 工商管理
    • 5 篇 图书情报与档案管...
  • 5 篇 经济学
    • 4 篇 应用经济学
  • 3 篇 法学
    • 3 篇 社会学
  • 2 篇 医学
  • 1 篇 教育学

主题

  • 311 篇 reinforcement le...
  • 215 篇 dynamic programm...
  • 206 篇 optimal control
  • 107 篇 adaptive dynamic...
  • 104 篇 adaptive dynamic...
  • 97 篇 learning
  • 88 篇 neural networks
  • 77 篇 heuristic algori...
  • 68 篇 reinforcement le...
  • 58 篇 learning (artifi...
  • 54 篇 nonlinear system...
  • 53 篇 convergence
  • 51 篇 control systems
  • 51 篇 mathematical mod...
  • 48 篇 approximate dyna...
  • 44 篇 approximation al...
  • 43 篇 equations
  • 42 篇 adaptive control
  • 41 篇 artificial neura...
  • 41 篇 cost function

机构

  • 41 篇 chinese acad sci...
  • 27 篇 univ rhode isl d...
  • 17 篇 tianjin univ sch...
  • 16 篇 univ sci & techn...
  • 16 篇 univ illinois de...
  • 15 篇 northeastern uni...
  • 14 篇 beijing normal u...
  • 13 篇 northeastern uni...
  • 13 篇 guangdong univ t...
  • 12 篇 northeastern uni...
  • 9 篇 natl univ def te...
  • 8 篇 ieee
  • 8 篇 univ chinese aca...
  • 7 篇 univ chinese aca...
  • 7 篇 cent south univ ...
  • 7 篇 southern univ sc...
  • 7 篇 beijing univ tec...
  • 6 篇 chinese acad sci...
  • 6 篇 missouri univ sc...
  • 5 篇 nanjing univ pos...

作者

  • 54 篇 liu derong
  • 37 篇 wei qinglai
  • 29 篇 he haibo
  • 22 篇 wang ding
  • 21 篇 xu xin
  • 19 篇 jiang zhong-ping
  • 17 篇 lewis frank l.
  • 17 篇 yang xiong
  • 17 篇 zhang huaguang
  • 17 篇 ni zhen
  • 16 篇 zhao bo
  • 15 篇 gao weinan
  • 14 篇 zhao dongbin
  • 13 篇 zhong xiangnan
  • 12 篇 si jennie
  • 11 篇 derong liu
  • 10 篇 jagannathan s.
  • 10 篇 dongbin zhao
  • 10 篇 song ruizhuo
  • 9 篇 abouheaf mohamme...

语言

  • 992 篇 英文
  • 20 篇 其他
  • 6 篇 中文
检索条件"任意字段=IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning"
1018 条 记 录,以下是371-380 订阅
排序:
Fractional-Order Systems Optimal Control via Actor-Critic reinforcement learning and Its Validation for Chaotic MFET
收藏 引用
ieee TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING 2024年 22卷 1173-1182页
作者: Li, Dongdong Dong, Jiuxiang Northeastern Univ Coll Informat Sci & Engn State Key Lab Synthet Automat Proc Ind Shenyang 110819 Peoples R China Northeastern Univ Key Lab Vibrat & Control Aeroprop Syst Minist Educ Shenyang 110819 Peoples R China
Since the existence of fractional order dynamics, it is difficult to obtain an optimality equation to solve for fractional-order optimal control. In this paper, a fractional Hamilton-Jacobi-Bellman (HJB) equation base... 详细信息
来源: 评论
Off-Policy Risk-Sensitive reinforcement learning-Based Constrained Robust Optimal Control
收藏 引用
ieee TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS 2023年 第4期53卷 2478-2491页
作者: Li, Cong Liu, Qingchen Zhou, Zhehua Buss, Martin Liu, Fangzhou Tech Univ Munich Chair Automat Control Engn D-80333 Munich Germany Univ Sci & Technol China Dept Automat Hefei 230027 Peoples R China Harbin Inst Technol Res Ctr Intelligent Control & Syst Harbin 150001 Peoples R China
This article proposes an off-policy risk-sensitive reinforcement learning (RL)-based control framework to jointly optimize the task performance and constraint satisfaction in a disturbed environment. The risk-aware va... 详细信息
来源: 评论
Event-Triggered Local Control for Nonlinear Interconnected Systems Through Particle Swarm Optimization-Based adaptive dynamic programming
收藏 引用
ieee TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS 2023年 第12期53卷 7342-7353页
作者: Zhao, Bo Shi, Guang Liu, Derong Beijing Normal Univ Sch Syst Sci Beijing 100875 Peoples R China Chongqing Univ Posts & Telecommun Key Lab Ind Internet Things & Networked Control Minist Educ Chongqing 400065 Peoples R China Coordinat Ctr China Dept Engn Natl Comp Network Emergency Response Tech Team Beijing 100029 Peoples R China Southern Univ Sci & Technol Sch Syst Design & Intelligent Mfg Shenzhen 518055 Peoples R China Univ Illinois Dept Elect & Comp Engn Chicago IL 60607 USA
This article investigates local control problems for nonlinear interconnected systems by using adaptive dynamic programming (ADP) with particle swarm optimization (PSO). Through constructing a proper local value funct... 详细信息
来源: 评论
Approximate dynamic programming Solutions of Multi-Agent Graphical Games Using Actor-Critic Network Structures
Approximate Dynamic Programming Solutions of Multi-Agent Gra...
收藏 引用
International Joint Conference on Neural Networks (IJCNN)
作者: Abouheaf, Mohammed I. Lewis, Frank L. Univ Texas Arlington Res Inst Ft Worth TX 76118 USA
This paper studies a new class of multi-agent discrete-time dynamical graphical games, where interactions between agents are restricted by a communication graph structure. The paper brings together discrete Hamiltonia... 详细信息
来源: 评论
Evaluation of policy gradient methods and variants on the cart-pole benchmark
Evaluation of policy gradient methods and variants on the ca...
收藏 引用
ieee International symposium on Approximate dynamic programming and reinforcement learning
作者: Riedmiller, Martin Peters, Jan Schaal, Stefan Univ Osnabruck Neuroinformat Grp D-4500 Osnabruck Germany Univ Southern Calif Computat Learning & Motor Control Los Angeles CA 90007 USA
In this paper, we evaluate different versions from the three main kinds of model-free policy gradient methods, i.e., finite difference gradients, 'vanilla' policy gradients and natural policy gradients. Each o... 详细信息
来源: 评论
A Hybrid Controller for Musculoskeletal Robots Targeting Lifting Tasks in Industrial Metaverse
收藏 引用
ieee TRANSACTIONS ON CYBERNETICS 2024年 第5期54卷 2708-2719页
作者: Qin, Shijie Li, Houcheng Cheng, Long Chinese Acad Sci Inst Automat State Key Lab Multimodal Artificial Intelligence S Beijing 100190 Peoples R China Univ Chinese Acad Sci Sch Artificial Intelligence Beijing 100049 Peoples R China
In manufacturing, musculoskeletal robots have gained more attention with the potential advantages of flexibility, robustness, and adaptability over conventional serial-link rigid robots. Focusing on the fundamental li... 详细信息
来源: 评论
adaptive Online Distributed Optimal Control of Very-Large-Scale Robotic Systems
收藏 引用
ieee TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS 2021年 第2期8卷 678-689页
作者: Zhu, Pingping Liu, Chang Ferrari, Silvia Marshall Univ Dept Comp Sci & Elect Engn Huntington WV 25705 USA Cornell Univ Sibley Sch Mech & Aerosp Engn Ithaca NY 14853 USA
Autonomous systems comprised of many cooperative agents have the potential for enabling long-duration tasks and data collection critical to the understanding of a wide range of phenomena in spatially and temporally va... 详细信息
来源: 评论
learning to Regulate Rolling Ball Motion
Learning to Regulate Rolling Ball Motion
收藏 引用
ieee symposium Series on Computational Intelligence (ieee SSCI)
作者: Jha, Devesh K. Yerazunis, William Nikovski, Daniel Farahmand, Amir-massoud Mitsubishi Elect Res Labs Cambridge MA 02139 USA
In this paper, we present a problem of regulating the motion of a rolling ball in a one-dimensional space in the presence of non-linear effects of friction and contact. The regulation problem is solved using a model-b... 详细信息
来源: 评论
learning with binary-valued utility using derivative adaptive critic methods
Learning with binary-valued utility using derivative adaptiv...
收藏 引用
2004 ieee International Joint Conference on Neural Networks - Proceedings
作者: Matzner, Shari A. Shannon, Thaddeus T. Lendaris, George G. NW Compl. Intelligence Lab Systems Science Ph.D. Program Portland State University P.O. Box 751 Portland OR 97207 United States
adaptive Critic methods for reinforcement learning are known to provide consistent solutions to optimal control problems, and are also considered plausible models for cognitive learning processes. This paper discusses... 详细信息
来源: 评论
A Low-Power Circuit for adaptive dynamic programming  31
A Low-Power Circuit for Adaptive Dynamic Programming
收藏 引用
31st International Conference on VLSI Design / 17th International Conference on Embedded Systems (VLSID & ES))
作者: Zheng, Nan Mazumder, Pinaki Univ Michigan Elect Engn & Comp Sci Dept Ann Arbor MI 48109 USA
This paper presents a low-power CMOS design for accelerating an adaptive dynamic programming algorithm, called action-dependent heuristic dynamic programming, which is widely employed in many real-life control problem... 详细信息
来源: 评论