咨询与建议

限定检索结果

文献类型

  • 745 篇 会议
  • 266 篇 期刊文献
  • 4 册 图书

馆藏范围

  • 1,015 篇 电子文献
  • 1 种 纸本馆藏

日期分布

学科分类号

  • 708 篇 工学
    • 521 篇 计算机科学与技术...
    • 377 篇 电气工程
    • 277 篇 控制科学与工程
    • 155 篇 软件工程
    • 79 篇 信息与通信工程
    • 39 篇 交通运输工程
    • 23 篇 仪器科学与技术
    • 20 篇 机械工程
    • 9 篇 生物工程
    • 8 篇 电子科学与技术(可...
    • 7 篇 力学(可授工学、理...
    • 6 篇 动力工程及工程热...
    • 6 篇 石油与天然气工程
    • 5 篇 土木工程
    • 4 篇 航空宇航科学与技...
    • 4 篇 生物医学工程(可授...
    • 3 篇 材料科学与工程(可...
    • 3 篇 化学工程与技术
    • 3 篇 安全科学与工程
  • 119 篇 理学
    • 99 篇 数学
    • 33 篇 系统科学
    • 22 篇 统计学(可授理学、...
    • 10 篇 生物学
    • 8 篇 物理学
    • 4 篇 化学
  • 67 篇 管理学
    • 64 篇 管理科学与工程(可...
    • 15 篇 工商管理
    • 5 篇 图书情报与档案管...
  • 5 篇 经济学
    • 4 篇 应用经济学
  • 3 篇 法学
    • 3 篇 社会学
  • 2 篇 教育学
  • 2 篇 医学

主题

  • 309 篇 reinforcement le...
  • 214 篇 dynamic programm...
  • 203 篇 optimal control
  • 105 篇 adaptive dynamic...
  • 104 篇 adaptive dynamic...
  • 97 篇 learning
  • 87 篇 neural networks
  • 74 篇 heuristic algori...
  • 68 篇 reinforcement le...
  • 58 篇 learning (artifi...
  • 54 篇 nonlinear system...
  • 53 篇 convergence
  • 51 篇 control systems
  • 51 篇 mathematical mod...
  • 48 篇 approximate dyna...
  • 44 篇 approximation al...
  • 43 篇 equations
  • 42 篇 adaptive control
  • 41 篇 artificial neura...
  • 40 篇 cost function

机构

  • 41 篇 chinese acad sci...
  • 27 篇 univ rhode isl d...
  • 17 篇 tianjin univ sch...
  • 16 篇 univ sci & techn...
  • 16 篇 univ illinois de...
  • 15 篇 northeastern uni...
  • 14 篇 beijing normal u...
  • 13 篇 northeastern uni...
  • 12 篇 northeastern uni...
  • 12 篇 guangdong univ t...
  • 9 篇 natl univ def te...
  • 8 篇 ieee
  • 8 篇 univ chinese aca...
  • 7 篇 univ chinese aca...
  • 7 篇 cent south univ ...
  • 7 篇 southern univ sc...
  • 6 篇 chinese acad sci...
  • 6 篇 missouri univ sc...
  • 6 篇 beijing univ tec...
  • 5 篇 nanjing univ pos...

作者

  • 54 篇 liu derong
  • 37 篇 wei qinglai
  • 29 篇 he haibo
  • 21 篇 xu xin
  • 21 篇 wang ding
  • 19 篇 jiang zhong-ping
  • 17 篇 yang xiong
  • 17 篇 zhang huaguang
  • 17 篇 ni zhen
  • 16 篇 lewis frank l.
  • 16 篇 zhao bo
  • 15 篇 gao weinan
  • 14 篇 zhao dongbin
  • 13 篇 zhong xiangnan
  • 12 篇 si jennie
  • 11 篇 derong liu
  • 10 篇 jagannathan s.
  • 10 篇 dongbin zhao
  • 9 篇 song ruizhuo
  • 9 篇 abouheaf mohamme...

语言

  • 989 篇 英文
  • 20 篇 其他
  • 6 篇 中文
检索条件"任意字段=IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning"
1015 条 记 录,以下是361-370 订阅
排序:
Pareto Upper Confidence Bounds algorithms: an empirical study
Pareto Upper Confidence Bounds algorithms: an empirical stud...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning (ADPRL)
作者: Drugan, Madalina M. Nowe, Ann Manderick, Bernard Vrije Univ Brussel Artificial Intelligence Lab Ixelles Belgium
Many real-world stochastic environments are inherently multi-objective environments with conflicting objectives. The multi-objective multi-armed bandits (MOMAB) are extensions of the classical, i.e. single objective, ... 详细信息
来源: 评论
Joining Force of Human Muscular Task Planning With Robot Robust and Delicate Manipulation for programming by Demonstration
收藏 引用
ieee-ASME TRANSACTIONS ON MECHATRONICS 2020年 第5期25卷 2574-2584页
作者: Wang, Fei Zhou, Xingqun Wang, Jianhui Zhang, Xing He, Zhenquan Song, Bo Northeastern Univ Fac Robot Sci & Engn Shenyang 110169 Peoples R China Northeastern Univ Coll Informat Sci & Engn Shenyang 110819 Peoples R China Chinese Acad Sci Inst Intelligent Machines Hefei 230031 Peoples R China
Recently, programing by demonstration (PbD) received much attention for its capacity of fast programming with increasing demands in the robot manipulation area, especially in industrial applications. However, one of t... 详细信息
来源: 评论
Enhancing supervisory training signals with environmental reinforcement learning using adaptive dynamic programming and artificial neural networks  15
Enhancing supervisory training signals with environmental re...
收藏 引用
15th ieee International Conference on Cognitive Informatics and Cognitive Computing, ICCI*CC 2016
作者: Melton, Niklas Wunsch, Donald C. Applied Computational Intelligence Laboratory Department of Electrical and Computer Engineering Missouri University of Science and Technology RollaMO United States
A method for hybridizing supervised learning with adaptive dynamic programming was developed to increase the speed, quality, and robustness of on-line neural network learning from an imperfect teacher. reinforcement l... 详细信息
来源: 评论
On Model-Free reinforcement learning of Reduced-order Optimal Control for Singularly Perturbed Systems  57
On Model-Free Reinforcement Learning of Reduced-order Optima...
收藏 引用
57th ieee Conference on Decision and Control (CDC)
作者: Mukherjee, Sayak Bai, He Chakrabortty, Aranya North Carolina State Univ Dept Elect & Comp Engn Raleigh NC 27695 USA Oklahoma State Univ Sch Mech & Aerosp Engn Stillwater OK 74078 USA
We propose a model-free reduced-order optimal control design for linear time-invariant singularly perturbed (SP) systems using reinforcement learning (RL). Both the state and input matrices of the plant model are assu... 详细信息
来源: 评论
A Neural Architecture to Address reinforcement learning Problems
A Neural Architecture to Address Reinforcement Learning Prob...
收藏 引用
International Joint Conference on Neural Networks (IJCNN)
作者: de Arruda, Rodrigo L. S. Von Zuben, Fernando J. Univ Campinas UNICAMP Sch Elect & Comp Engn FEEC Dept Comp Engn & Ind Automat DCA Lab Bioinformat & Bioinspired Comp LBiC Campinas SP Brazil
In this paper, the reinforcement learning problem is formulated equivalently to a Markov Decision Process. We address the solution of such problem using a novel adaptive dynamic programming algorithm which is based on... 详细信息
来源: 评论
Longitudinal Control of Hypersonic Vehicles Based on Direct Heuristic dynamic programming Using ANFIS
Longitudinal Control of Hypersonic Vehicles Based on Direct ...
收藏 引用
International Joint Conference on Neural Networks (IJCNN)
作者: Luo, Xiong Chen, Yi Si, Jennie Liu, Feng USTB Sch Comp & Commun Engn Beijing 100083 Peoples R China Arizona State Univ Sch Elect Comp & Energy Engn Tempe AZ 85287 USA
Since the launch of the scramjet, recent years have witnessed a growing interest in the study of airbreathing hypersonic vehicles. Due to its strong coupling characteristics, high nonlinearity, and uncertain parameter... 详细信息
来源: 评论
adaptive Data Replication Optimization Based on reinforcement learning
Adaptive Data Replication Optimization Based on Reinforcemen...
收藏 引用
ieee symposium Series on Computational Intelligence (ieee SSCI)
作者: Wee, Chee Keong Nayak, Richi eHlth Queensland Business Applicat Technol Serv Digital Applicat Serv Brisbane Qld Australia Queensland Univ Technol Sci & Engn Fac Sch Elect Engn & Comp Sci Brisbane Qld Australia
Data replication plays an important role in enterprise IT landscapes, where data is shared among multiple IT systems. IT administrators need to tune the replicating software's configuration setting for it to perfo... 详细信息
来源: 评论
A Boundedness Theoretical Analysis for GrADP Design: A Case Study on Maze Navigation
A Boundedness Theoretical Analysis for GrADP Design: A Case ...
收藏 引用
International Joint Conference on Neural Networks (IJCNN)
作者: Ni, Zhen Zhong, Xiangnan He, Haibo Univ Rhode Isl Dept Elect Comp & Biomed Engn Kingston RI 02881 USA
A new theoretical analysis towards the goal representation adaptive dynamic programming (GrADP) design proposed in [1], [2] is investigated in this paper. Unlike the proofs of convergence for adaptive dynamic programm... 详细信息
来源: 评论
reinforcement learning-Based Optimal Stabilization for Unknown Nonlinear Systems Subject to Inputs With Uncertain Constraints
收藏 引用
ieee TRANSACTIONS ON NEURAL NETWORKS AND learning SYSTEMS 2020年 第10期31卷 4330-4340页
作者: Zhao, Bo Liu, Derong Luo, Chaomin Beijing Normal Univ Sch Syst Sci Beijing 100875 Peoples R China Chinese Acad Sci State Key Lab Management & Control Complex Syst Inst Automat Beijing 100190 Peoples R China Guangdong Univ Technol Sch Automat Guangzhou 510006 Peoples R China Mississippi State Univ Dept Elect & Comp Engn Mississippi State MS 39762 USA
This article presents a novel reinforcement learning strategy that addresses an optimal stabilizing problem for unknown nonlinear systems subject to uncertain input constraints. The control algorithm is composed of tw... 详细信息
来源: 评论
adaptive Alert Management for Balancing Optimal Performance among Distributed CSOCs using reinforcement learning
收藏 引用
ieee TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS 2020年 第1期31卷 16-33页
作者: Shah, Ankit Ganesan, Rajesh Jajodia, Sushil Samarati, Pierangela Cam, Hasan George Mason Univ Ctr Secure Informat Syst Fairfax VA 20030 USA Univ Milan Comp Sci Dept I-20133 Milan Italy US Army Res Lab Adelphi MD 20783 USA
Large organizations typically have Cybersecurity Operations Centers (CSOCs) distributed at multiple locations that are independently managed, and they have their own cybersecurity analyst workforce. Under normal opera... 详细信息
来源: 评论