Search query: "Any field = IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning"
1,015 records; showing results 221-230

Integral Reinforcement Learning for Linear Continuous-Time Zero-Sum Games With Completely Unknown Dynamics
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2014, vol. 11, no. 3, pp. 706-714
Authors: Li, Hongliang; Liu, Derong; Wang, Ding
Affiliations: Chinese Acad Sci Inst Automat State Key Lab Management & Control Complex Syst Beijing 100190 Peoples R China
In this paper, we develop an integral reinforcement learning algorithm based on policy iteration to learn online the Nash equilibrium solution for a two-player zero-sum differential game with completely unknown linear...

Learning-Based Adaptive Optimal Control for Connected Vehicles in Mixed Traffic: Robustness to Driver Reaction Time
IEEE TRANSACTIONS ON CYBERNETICS, 2022, vol. 52, no. 6, pp. 5267-5277
Authors: Huang, Mengzhe; Jiang, Zhong-Ping; Ozbay, Kaan
Affiliations: NYU Tandon Sch Engn Dept Elect & Comp Engn Control & Networks Lab Brooklyn NY 11201 USA; NYU Tandon Sch Engn C2SMART Ctr Brooklyn NY 11201 USA
Through vehicle-to-vehicle (V2V) communication, both human-driven and autonomous vehicles can actively exchange data, such as velocities and bumper-to-bumper distances. Employing the shared data, control laws with imp...

An Effective PQ-Decoupling Control Scheme Using Adaptive Dynamic Programming Approach to Reducing Oscillations of Virtual Synchronous Generators for Grid Connection With Different Impedance Types
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2024, vol. 71, no. 4, pp. 3763-3775
Authors: Wang, Zhongyang; Wang, Youqing; Davari, Masoud; Blaabjerg, Frede
Affiliations: Beijing Univ Chem Technol Coll Informat Sci & Technol Beijing 100029 Peoples R China; Georgia Southern Univ Statesboro Dept Elect & Comp Engn Statesboro GA 30460 USA; Aalborg Univ AAU Energy Dept DK-9220 Aalborg Denmark
The power coupling of the virtual synchronous generator (VSG) in the grid-connected mode may aggravate power oscillation because of a resistance-inductive line. In order to deal with this issue, this research study pr...

Reinforcement learning in multidimensional continuous action spaces
IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning
Authors: Pazis, Jason; Lagoudakis, Michail G.
Affiliations: Department of Computer Science Duke University Durham NC 27708-0129 United States; Department of Electronic and Computer Engineering Technical University of Crete Chania Crete 73100 Greece
The majority of learning algorithms available today focus on approximating the state (V) or state-action (Q) value function and efficient action selection comes as an afterthought. On the other hand, real-world probl...

The knowledge gradient policy for offline learning with independent normal rewards
IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning
Authors: Frazier, Peter; Powell, Warren
Affiliations: Princeton Univ Dept Operat Res & Financial Engn Princeton NJ 08544 USA
We define a new type of policy, the knowledge gradient policy, in the context of an offline learning problem. We show how to compute the knowledge gradient policy efficiently and demonstrate through Monte Carlo simula...

GrDHP: A General Utility Function Representation for Dual Heuristic Dynamic Programming
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, vol. 26, no. 3, pp. 614-627
Authors: Ni, Zhen; He, Haibo; Zhao, Dongbin; Xu, Xin; Prokhorov, Danil V.
Affiliations: Univ Rhode Isl Dept Elect Comp & Biomed Engn Kingston RI 02881 USA; Chinese Acad Sci Inst Automat State Key Lab Management & Control Complex Syst Beijing 100190 Peoples R China; Natl Univ Def Technol Coll Mechatron & Automat Changsha 410073 Hunan Peoples R China; Toyota Res Inst NA Toyota Tech Ctr Ann Arbor MI 48105 USA
A general utility function representation is proposed to provide the required derivable and adjustable utility function for the dual heuristic dynamic programming (DHP) design. Goal representation DHP (GrDHP) is prese...

Multi-Objective Reinforcement Learning for AUV Thruster Failure Recovery
IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)
Authors: Ahmadzadeh, Seyed Reza; Kormushev, Petar; Caldwell, Darwin G.
Affiliations: Ist Italiano Tecnol Dept Adv Robot Via Morego 30 I-16163 Genoa Italy
This paper investigates learning approaches for discovering fault-tolerant control policies to overcome thruster failures in Autonomous Underwater Vehicles (AUV). The proposed approach is a model-based direct policy s...

Active exploration for robot parameter selection in episodic reinforcement learning
IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning
Authors: Kroemer, Oliver; Peters, Jan
Affiliations: Max Planck Institute 38 Spemannstr. Tuebingen 72012 Germany
As the complexity of robots and other autonomous systems increases, it becomes more important that these systems can adapt and optimize their settings actively. However, such optimization is rarely trivial. Sampling f...

Asymptotically Stable Adaptive-Optimal Control Algorithm With Saturating Actuators and Relaxed Persistence of Excitation
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2016, vol. 27, no. 11, pp. 2386-2398
Authors: Vamvoudakis, Kyriakos G.; Miranda, Marcio Fantini; Hespanha, Joao P.
Affiliations: Univ Calif Santa Barbara Ctr Control Dynam Syst & Computat Santa Barbara CA 93106 USA; Univ Fed Minas Gerais Colegio Tecn BR-31270901 Belo Horizonte MG Brazil
This paper proposes a control algorithm based on adaptive dynamic programming to solve the infinite-horizon optimal control problem for known deterministic nonlinear systems with saturating actuators and nonquadratic ...

Closed-Loop Control of Anesthesia and Mean Arterial Pressure Using Reinforcement Learning
IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)
Authors: Padmanabhan, Regina; Meskin, Nader; Haddad, Wassim M.
Affiliations: Qatar Univ Dept Elect Engn Doha Qatar; Georgia Inst Technol Sch Aerosp Engn Atlanta GA 30332 USA
General anesthesia is required for patients undergoing surgery as well as for some patients in the intensive care units with acute respiratory distress syndrome. However, most anesthetics affect cardiac and respirator...