Search query: "Any field = IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning"
1,015 records; showing results 151-160

Active exploration by searching for experiments that falsify the computed control policy
IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning
Authors: Fonteneau, Raphael; Murphy, Susan A.; Wehenkel, Louis; Ernst, Damien
Affiliations: Department of Electrical Engineering and Computer Science, University of Liège, Belgium; Department of Statistics, University of Michigan, United States
We propose a strategy for experiment selection - in the context of reinforcement learning - based on the idea that the most interesting experiments to carry out at some stage are those that are the most liable to fals...

Data-Driven Neuro-Optimal Temperature Control of Water-Gas Shift Reaction Using Stable Iterative Adaptive Dynamic Programming
IEEE Transactions on Industrial Electronics, 2014, vol. 61, no. 11, pp. 6399-6408
Authors: Wei, Qinglai; Liu, Derong
Affiliation: Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
In this paper, a novel data-driven stable iterative adaptive dynamic programming (ADP) algorithm is developed to solve optimal temperature control problems for water-gas shift (WGS) reaction systems. According to the ...

Protecting against evaluation overfitting in empirical reinforcement learning
IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning
Authors: Whiteson, Shimon; Tanner, Brian; Taylor, Matthew E.; Stone, Peter
Affiliations: Informatics Institute, University of Amsterdam, Netherlands; Department of Computing Science, University of Alberta, Canada; Department of Computer Science, Lafayette College, United States; Department of Computer Science, University of Texas, Austin, United States
Empirical evaluations play an important role in machine learning. However, the usefulness of any evaluation depends on the empirical methodology employed. Designing good empirical methodologies is difficult in part be...

Heuristics for Multiagent Reinforcement Learning in Decentralized Decision Problems
IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)
Authors: Allen, Martin W.; Hahn, David; MacFarland, Douglas C.
Affiliation: Univ Wisconsin, Dept Comp Sci, La Crosse, WI 54601, USA
Decentralized partially observable Markov decision processes (Dec-POMDPs) model cooperative multiagent scenarios, providing a powerful general framework for team-based artificial intelligence. While optimal algorithms...

Reinforcement Learning-Based Structural Control of Floating Wind Turbines
IEEE Transactions on Systems, Man, and Cybernetics: Systems, 2022, vol. 52, no. 3, pp. 1603-1613
Authors: Zhang, Jincheng; Zhao, Xiaowei; Wei, Xing
Affiliation: Univ Warwick, Sch Engn, Coventry CV4 7AL, W Midlands, England
The structural control of floating wind turbines using active tuned mass damper is investigated in this article. To our knowledge, this is for the first time that reinforcement learning-based control approach is emplo...

On learning with imperfect representations
IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning
Authors: Kalyanakrishnan, Shivaram; Stone, Peter
Affiliation: Department of Computer Science, University of Texas at Austin, 1616 Guadalupe St, Austin, TX 78701, United States
In this paper we present a perspective on the relationship between learning and representation in sequential decision making tasks. We undertake a brief survey of existing real-world applications, which demonstrates t...

Adaptive Dynamic Programming for Control: A Survey and Recent Advances
IEEE Transactions on Systems, Man, and Cybernetics: Systems, 2021, vol. 51, no. 1, pp. 142-160
Authors: Liu, Derong; Xue, Shan; Zhao, Bo; Luo, Biao; Wei, Qinglai
Affiliations: Guangdong Univ Technol, Sch Automat, Guangzhou 510006, Peoples R China; South China Univ Technol, Sch Comp Sci & Engn, Guangzhou 510006, Peoples R China; Beijing Normal Univ, Sch Syst Sci, Beijing 100875, Peoples R China; Cent South Univ, Sch Automat, Changsha 410083, Peoples R China; Peng Cheng Lab, Shenzhen 518000, Peoples R China; Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China; Univ Chinese Acad Sci, Beijing 100049, Peoples R China
This article reviews the recent development of adaptive dynamic programming (ADP) with applications in control. First, its applications in optimal regulation are introduced, and some skilled and efficient algorithms a...

Higher order Q-Learning
IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning
Authors: Edwards, Ashley; Pottenger, William M.
Affiliations: Department of Computer Science, University of Georgia, Athens, GA 30606, United States; Department of Computer Science and DIMACS, Rutgers University, Piscataway, NJ 08854, United States
Higher order learning is a statistical relational learning framework in which relationships between different instances of the same class are leveraged (Ganiz, Lytkin and Pottenger, 2009). Learning can be supervised o...

Adaptive critic designs for discrete-time zero-sum games with application to H∞ control
IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics, 2007, vol. 37, no. 1, pp. 240-247
Authors: Al-Tamimi, Asma; Abu-Khalaf, Murad; Lewis, Frank L.
Affiliation: Univ Texas, Automat & Robot Res Inst, Ft Worth, TX 76118, USA
In this correspondence, adaptive critic approximate dynamic programming designs are derived to solve the discrete-time zero-sum game in which the state and action spaces are continuous. This results in a forward-in-ti...

Neuro-controller of Cement Rotary Kiln Temperature with Adaptive Critic Designs
IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning
Authors: Lin, Xiaofeng; Liu, Tangbo; Song, Shaojian; Song, Chunning
Affiliation: Guangxi Univ, Coll Elect Engn, Nanning 530004, Peoples R China
The production process of the cement rotary kiln is a typical engineering thermodynamics with large inertia, lagging and nonlinearity. So it is very difficult to control this process accurately using traditional contr...