咨询与建议

限定检索结果

文献类型

  • 745 篇 会议
  • 265 篇 期刊文献
  • 4 册 图书

馆藏范围

  • 1,014 篇 电子文献
  • 1 种 纸本馆藏

日期分布

学科分类号

  • 707 篇 工学
    • 520 篇 计算机科学与技术...
    • 376 篇 电气工程
    • 275 篇 控制科学与工程
    • 154 篇 软件工程
    • 79 篇 信息与通信工程
    • 39 篇 交通运输工程
    • 23 篇 仪器科学与技术
    • 20 篇 机械工程
    • 9 篇 生物工程
    • 8 篇 电子科学与技术(可...
    • 7 篇 力学(可授工学、理...
    • 6 篇 动力工程及工程热...
    • 6 篇 石油与天然气工程
    • 5 篇 土木工程
    • 4 篇 航空宇航科学与技...
    • 4 篇 生物医学工程(可授...
    • 3 篇 材料科学与工程(可...
    • 3 篇 化学工程与技术
    • 3 篇 安全科学与工程
  • 119 篇 理学
    • 99 篇 数学
    • 33 篇 系统科学
    • 22 篇 统计学(可授理学、...
    • 10 篇 生物学
    • 8 篇 物理学
    • 4 篇 化学
  • 63 篇 管理学
    • 60 篇 管理科学与工程(可...
    • 15 篇 工商管理
    • 5 篇 图书情报与档案管...
  • 5 篇 经济学
    • 4 篇 应用经济学
  • 3 篇 法学
    • 3 篇 社会学
  • 2 篇 教育学
  • 2 篇 医学

主题

  • 309 篇 reinforcement le...
  • 214 篇 dynamic programm...
  • 202 篇 optimal control
  • 105 篇 adaptive dynamic...
  • 104 篇 adaptive dynamic...
  • 96 篇 learning
  • 87 篇 neural networks
  • 72 篇 heuristic algori...
  • 67 篇 reinforcement le...
  • 58 篇 learning (artifi...
  • 54 篇 nonlinear system...
  • 52 篇 convergence
  • 51 篇 control systems
  • 51 篇 mathematical mod...
  • 48 篇 approximate dyna...
  • 44 篇 approximation al...
  • 43 篇 equations
  • 41 篇 adaptive control
  • 40 篇 artificial neura...
  • 39 篇 cost function

机构

  • 41 篇 chinese acad sci...
  • 27 篇 univ rhode isl d...
  • 17 篇 tianjin univ sch...
  • 16 篇 univ sci & techn...
  • 16 篇 univ illinois de...
  • 15 篇 northeastern uni...
  • 14 篇 beijing normal u...
  • 13 篇 northeastern uni...
  • 12 篇 northeastern uni...
  • 12 篇 guangdong univ t...
  • 9 篇 natl univ def te...
  • 8 篇 ieee
  • 8 篇 univ chinese aca...
  • 7 篇 univ chinese aca...
  • 7 篇 cent south univ ...
  • 7 篇 southern univ sc...
  • 6 篇 chinese acad sci...
  • 6 篇 missouri univ sc...
  • 6 篇 beijing univ tec...
  • 5 篇 nanjing univ pos...

作者

  • 54 篇 liu derong
  • 37 篇 wei qinglai
  • 29 篇 he haibo
  • 21 篇 xu xin
  • 21 篇 wang ding
  • 19 篇 jiang zhong-ping
  • 17 篇 yang xiong
  • 17 篇 zhang huaguang
  • 17 篇 ni zhen
  • 16 篇 lewis frank l.
  • 16 篇 zhao bo
  • 15 篇 gao weinan
  • 14 篇 zhao dongbin
  • 13 篇 zhong xiangnan
  • 12 篇 si jennie
  • 11 篇 derong liu
  • 10 篇 jagannathan s.
  • 10 篇 dongbin zhao
  • 9 篇 song ruizhuo
  • 9 篇 abouheaf mohamme...

语言

  • 988 篇 英文
  • 20 篇 其他
  • 6 篇 中文
检索条件"任意字段=IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning"
1014 条 记 录,以下是111-120 订阅
排序:
Using Approximate dynamic programming for Estimating the Revenues of a Hydrogen-based High-Capacity Storage Device
Using Approximate Dynamic Programming for Estimating the Rev...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning (ADPRL)
作者: Francois-Lavet, Vincent Fonteneau, Raphael Ernst, Damien Univ Liege Dept Elect Engn & Comp Sci B-4000 Liege Belgium
This paper proposes a methodology to estimate the maximum revenue that can be generated by a company that operates a high-capacity storage device to buy or sell electricity on the day-ahead electricity market. The met... 详细信息
来源: 评论
Offline Data-Driven adaptive Critic Design With Variational Inference for Wastewater Treatment Process Control
收藏 引用
ieee TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING 2024年 第4期21卷 4987-4998页
作者: Qiao, Junfei Yang, Ruyue Wang, Ding Beijing Univ Technol Fac Informat Technol Beijing Key Lab Computat Intelligence & Intelligen Beijing Lab Smart Environm Protect Beijing 100124 Peoples R China Beijing Univ Technol Beijing Inst Artificial Intelligence Beijing 100124 Peoples R China
Wastewater treatment is indispensable to the functioning of urban society, and its optimal control has enormous social benefits. However, precise modelling of the unstable and complex treatment process is challenging ... 详细信息
来源: 评论
adaptive dynamic programming with balanced weights seeking strategy
Adaptive dynamic programming with balanced weights seeking s...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning
作者: Fu, Jian He, Haibo Ni, Zhen School of Automation Wuhan University of Technology Wuhan Hubei 430070 China Department of Electrical Computerand Biomedical Engineering University of Rhode Island Kingston RI 02881 United States
In this paper we propose to integrate the recursive Levenberg-Marquardt method into the adaptive dynamic programming (ADP) design for improved learning and adaptive control performance. Our key motivation is to consid... 详细信息
来源: 评论
reinforcement learning-Based Linear Quadratic Regulation of Continuous-Time Systems Using dynamic Output Feedback
收藏 引用
ieee TRANSACTIONS ON CYBERNETICS 2020年 第11期50卷 4670-4679页
作者: Rizvi, Syed Ali Asad Lin, Zongli Univ Virginia Charles L Brown Dept Elect & Comp Engn Charlottesville VA 22904 USA
In this paper, we propose a model-free solution to the linear quadratic regulation (LQR) problem of continuous-time systems based on reinforcement learning using dynamic output feedback. The design objective is to lea... 详细信息
来源: 评论
A Novel Iterative θ-adaptive dynamic programming for Discrete-Time Nonlinear Systems
收藏 引用
ieee TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING 2014年 第4期11卷 1176-1190页
作者: Wei, Qinglai Liu, Derong Chinese Acad Sci Inst Automat State Key Lab Management & Control Complex Syst Beijing 100190 Peoples R China
This paper is concerned with a new iterative theta-adaptive dynamic programming (ADP) technique to solve optimal control problems of infinite horizon discrete-time nonlinear systems. The idea is to use an iterative AD... 详细信息
来源: 评论
Using reward-weighted regression for reinforcement learning of task space control
Using reward-weighted regression for reinforcement learning ...
收藏 引用
ieee International symposium on Approximate dynamic programming and reinforcement learning
作者: Peters, Jan Schaal, Stefan Univ So Calif Los Angeles CA 90089 USA
Many robot control problems of practical importance, including task or operational space control, can be reformulated as immediate reward reinforcement learning problems. However, few of the known optimization or rein... 详细信息
来源: 评论
adaptive railway traffic control using approximate dynamic programming
收藏 引用
TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES 2020年 113卷 91-107页
作者: Ghasempour, Taha Heydecker, Benjamin UCL Ctr Transport Studies London WC1E 6BT England
This study presents an adaptive railway traffic controller for real-time operations based on approximate dynamic programming (ADP). By assessing requirements and opportunities, the controller aims to limit consecutive... 详细信息
来源: 评论
An approximate dynamic programming based controller for an underactuated 6DoF quadrotor
An approximate Dynamic Programming based controller for an u...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning
作者: Stingu, Emanuel Lewis, Frank L. Automation and Robotics Research Institute University of Texas at Arlington Arlington TX United States
This paper discusses how the principles of adaptive dynamic programming (ADP) can be applied to the control of a quadrotor helicopter platform flying in an uncontrolled environment and subjected to various disturbance... 详细信息
来源: 评论
Online adaptive Integral reinforcement learning for Nonlinear Multi-Input System
收藏 引用
ieee TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS 2023年 第11期70卷 4176-4180页
作者: Lv, Yongfeng Chang, Huimin Zhao, Jun Taiyuan Univ Technol Coll Elect & Power Engn Taiyuan 030024 Peoples R China Shanxi Univ Sch Math Sci Taiyuan 030006 Peoples R China Shandong Univ Sci & Technol Coll Transportat Qingdao 266590 Peoples R China
In this brief article, a novel adaptive integral reinforcement learning (AIRL) scheme is proposed for the continuous-time (CT) system. Moreover, it is used to learn the optimal controls of the partially unknown multi-... 详细信息
来源: 评论
learning-Based adaptive Optimal Control of Linear Time-Delay Systems: A Policy Iteration Approach
收藏 引用
ieee TRANSACTIONS ON AUTOMATIC CONTROL 2024年 第1期69卷 629-636页
作者: Cui, Leilei Pang, Bo Jiang, Zhong-Ping NYU Tandon Sch Engn Dept Elect & Comp Engn Control & Networks Lab Brooklyn NY 11201 USA
This article studies the adaptive optimal control problem for a class of linear time-delay systems described by delay differential equations. A crucial strategy is to take advantage of recent developments in reinforce... 详细信息
来源: 评论