咨询与建议

限定检索结果

文献类型

  • 748 篇 会议
  • 271 篇 期刊文献
  • 4 册 图书

馆藏范围

  • 1,023 篇 电子文献
  • 1 种 纸本馆藏

日期分布

学科分类号

  • 712 篇 工学
    • 520 篇 计算机科学与技术...
    • 381 篇 电气工程
    • 278 篇 控制科学与工程
    • 153 篇 软件工程
    • 79 篇 信息与通信工程
    • 40 篇 交通运输工程
    • 23 篇 仪器科学与技术
    • 20 篇 机械工程
    • 9 篇 生物工程
    • 8 篇 电子科学与技术(可...
    • 7 篇 力学(可授工学、理...
    • 7 篇 土木工程
    • 6 篇 动力工程及工程热...
    • 6 篇 石油与天然气工程
    • 4 篇 生物医学工程(可授...
    • 3 篇 材料科学与工程(可...
    • 3 篇 化学工程与技术
    • 3 篇 航空宇航科学与技...
    • 3 篇 安全科学与工程
  • 118 篇 理学
    • 98 篇 数学
    • 32 篇 系统科学
    • 22 篇 统计学(可授理学、...
    • 10 篇 生物学
    • 8 篇 物理学
    • 4 篇 化学
  • 66 篇 管理学
    • 63 篇 管理科学与工程(可...
    • 14 篇 工商管理
    • 5 篇 图书情报与档案管...
  • 5 篇 经济学
    • 4 篇 应用经济学
  • 3 篇 法学
    • 3 篇 社会学
  • 2 篇 医学
  • 1 篇 教育学

主题

  • 313 篇 reinforcement le...
  • 216 篇 dynamic programm...
  • 206 篇 optimal control
  • 107 篇 adaptive dynamic...
  • 104 篇 adaptive dynamic...
  • 97 篇 learning
  • 88 篇 neural networks
  • 78 篇 heuristic algori...
  • 68 篇 reinforcement le...
  • 58 篇 learning (artifi...
  • 54 篇 nonlinear system...
  • 53 篇 convergence
  • 51 篇 control systems
  • 51 篇 mathematical mod...
  • 48 篇 approximate dyna...
  • 44 篇 approximation al...
  • 43 篇 equations
  • 42 篇 adaptive control
  • 41 篇 artificial neura...
  • 41 篇 cost function

机构

  • 41 篇 chinese acad sci...
  • 27 篇 univ rhode isl d...
  • 17 篇 tianjin univ sch...
  • 16 篇 univ sci & techn...
  • 16 篇 univ illinois de...
  • 15 篇 northeastern uni...
  • 14 篇 beijing normal u...
  • 13 篇 northeastern uni...
  • 13 篇 guangdong univ t...
  • 12 篇 northeastern uni...
  • 9 篇 natl univ def te...
  • 8 篇 ieee
  • 8 篇 univ chinese aca...
  • 7 篇 univ chinese aca...
  • 7 篇 cent south univ ...
  • 7 篇 southern univ sc...
  • 7 篇 beijing univ tec...
  • 6 篇 chinese acad sci...
  • 6 篇 missouri univ sc...
  • 5 篇 nanjing univ pos...

作者

  • 54 篇 liu derong
  • 37 篇 wei qinglai
  • 29 篇 he haibo
  • 22 篇 wang ding
  • 21 篇 xu xin
  • 19 篇 jiang zhong-ping
  • 17 篇 lewis frank l.
  • 17 篇 yang xiong
  • 17 篇 zhang huaguang
  • 17 篇 ni zhen
  • 16 篇 zhao bo
  • 15 篇 gao weinan
  • 14 篇 zhao dongbin
  • 13 篇 derong liu
  • 13 篇 zhong xiangnan
  • 12 篇 si jennie
  • 10 篇 jagannathan s.
  • 10 篇 dongbin zhao
  • 10 篇 song ruizhuo
  • 9 篇 abouheaf mohamme...

语言

  • 992 篇 英文
  • 25 篇 其他
  • 6 篇 中文
检索条件"任意字段=IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning"
1023 条 记 录,以下是521-530 订阅
Model-Free Dual Heuristic dynamic programming
收藏 引用
ieee TRANSACTIONS ON NEURAL NETWORKS AND learning SYSTEMS 2015年 第8期26卷 1834-1839页
作者: Ni, Zhen He, Haibo Zhong, Xiangnan Prokhorov, Danil V. Univ Rhode Isl Dept Elect Comp & Biomed Engn Kingston RI 02881 USA Toyota Tech Ctr Toyota Res Inst North Amer Ann Arbor MI 48105 USA
Model-based dual heuristic dynamic programming (MB-DHP) is a popular approach in approximating optimal solutions in control problems. Yet, it usually requires offline training for the model network, and thus resulting... 详细信息
来源: 评论
Intelligent Control of Grid-Connected Microgrids: An adaptive Critic-Based Approach
收藏 引用
ieee JOURNAL OF EMERGING AND SELECTED TOPICS IN POWER ELECTRONICS 2015年 第2期3卷 493-504页
作者: Seidi, Sima Bakhshai, Alireza Queens Univ Queens Ctr Energy & Power Elect Res Kingston ON K7L 3N6 Canada Queens Univ Dept Elect & Comp Engn Kingston ON K7L 3N6 Canada
This paper presents an adaptive and intelligent power control approach for microgrid systems in the gridconnected operation mode. The proposed critic-based adaptive control system contains a neuro-fuzzy controller and... 详细信息
来源: 评论
adaptive learning solution of the nonzero-sum differential game with unknown dynamics using adaptive dynamic programming
Adaptive learning solution of the nonzero-sum differential g...
收藏 引用
第28届中国控制与决策会议
作者: Chunbin Qin Hongfei Sun Xianxing Liu Jiaqi Chen The School of Computer and Information Engineering Henan University The College of Environment and Planning Henan University The School of Software Henan University
In this paper,a novel partially model-free adaptive dynamic programming(ADP) algorithm is presented to solve online the nonzero-sum differential games of continuous-time linear systems with unknown drift ***,by using ... 详细信息
来源: 评论
GrDHP: A General Utility Function Representation for Dual Heuristic dynamic programming
收藏 引用
ieee TRANSACTIONS ON NEURAL NETWORKS AND learning SYSTEMS 2015年 第3期26卷 614-627页
作者: Ni, Zhen He, Haibo Zhao, Dongbin Xu, Xin Prokhorov, Danil V. Univ Rhode Isl Dept Elect Comp & Biomed Engn Kingston RI 02881 USA Chinese Acad Sci Inst Automat State Key Lab Management & Control Complex Syst Beijing 100190 Peoples R China Natl Univ Def Technol Coll Mechatron & Automat Changsha 410073 Hunan Peoples R China Toyota Res Inst NA Toyota Tech Ctr Ann Arbor MI 48105 USA
A general utility function representation is proposed to provide the required derivable and adjustable utility function for the dual heuristic dynamic programming (DHP) design. Goal representation DHP (GrDHP) is prese... 详细信息
来源: 评论
reinforcement-learning-Based Robust Controller Design for Continuous-Time Uncertain Nonlinear Systems Subject to Input Constraints
收藏 引用
ieee TRANSACTIONS ON CYBERNETICS 2015年 第7期45卷 1372-1385页
作者: Liu, Derong Yang, Xiong Wang, Ding Wei, Qinglai Chinese Acad Sci Inst Automat State Key Lab Management & Control Complex Syst Beijing 100190 Peoples R China
The design of stabilizing controller for uncertain nonlinear systems with control constraints is a challenging problem. The constrained-input coupled with the inability to identify accurately the uncertainties motivat... 详细信息
来源: 评论
ADP with MCTS algorithm for Gomoku
ADP with MCTS algorithm for Gomoku
收藏 引用
ieee symposium Series on Computational Intelligence (SSCI)
作者: Zhentao Tang Dongbin Zhao Kun Shao Le L.V. The State Key Laboratory of Management and Control for Complex Systems Chinese Academy of Sciences Beijing China Institute of Automation Chinese Academy of Sciences Beijing Beijing CN
Inspired by the core idea of AlphaGo, we combine a neural network, which is trained by adaptive dynamic programming (ADP), with Monte Carlo Tree Search (MCTS) algorithm for Gomoku. MCTS algorithm is based on Monte Car... 详细信息
来源: 评论
A Boundedness Theoretical Analysis for GrADP Design: A Case Study on Maze Navigation
A Boundedness Theoretical Analysis for GrADP Design: A Case ...
收藏 引用
International Joint Conference on Neural Networks (IJCNN)
作者: Ni, Zhen Zhong, Xiangnan He, Haibo Univ Rhode Isl Dept Elect Comp & Biomed Engn Kingston RI 02881 USA
A new theoretical analysis towards the goal representation adaptive dynamic programming (GrADP) design proposed in [1], [2] is investigated in this paper. Unlike the proofs of convergence for adaptive dynamic programm... 详细信息
来源: 评论
A New Discrete-Time Iterative adaptive dynamic programming Algorithm Based on Q-learning  12th
收藏 引用
12th International symposium on Neural Networks (ISNN)
作者: Wei, Qinglai Liu, Derong Chinese Acad Sci Inst Automat State Key Lab Management & Control Complex Syst Beijing 100190 Peoples R China Univ Sci & Technol Beijing Sch Automat & Elect Engn Beijing 100083 Peoples R China
In this paper, a novel Q-learning based policy iteration adaptive dynamic programming (ADP) algorithm is developed to solve the optimal control problems for discrete-time nonlinear systems. The idea is to use a policy... 详细信息
来源: 评论
A reinforcement learning approach for cost- and energy-aware mobile data offloading
A reinforcement learning approach for cost- and energy-aware...
收藏 引用
Asia-Pacific Network Operations and Management symposium (APNOMS)
作者: Cheng Zhang Bo Gu Zhi Liu Kyoko Yamori Yoshiaki Tanaka Department of Computer Science and Communications Engineering Waseda University Tokyo Japan Department of Information and Communications Engineering Kogakuin University Tokyo Japan Global Information and Telecommunication Institute Waseda University Tokyo Japan Department of Management Information Asahi University Mizuho-shi Japan Department of Communications and Computer Engineering Waseda University Tokyo Japan
With rapid increases in demand for mobile data, mobile network operators are trying to expand wireless network capacity by deploying WiFi hotspots to offload their mobile traffic. However, these network-centric method... 详细信息
来源: 评论
adaptive dynamic programming Boundary Control of Uncertain Coupled Semi-Linear Parabolic PDE
Adaptive Dynamic Programming Boundary Control of Uncertain C...
收藏 引用
ieee International symposium on Intelligent Control (ISIC)
作者: Talaei, B. Jagannathan, S. Singler, J. Univ Sci & Technol Dept Elect & Comp Engn Rolla MO 65409 USA Univ Sci & Technol Dept Math & Stat Rolla MO 65409 USA
This paper develops an adaptive dynamic programming (ADP) based near optimal boundary control of distributed parameter systems (DPS) governed by uncertain coupled semi-linear parabolic partial differential equations (... 详细信息
来源: 评论