咨询与建议

限定检索结果

文献类型

  • 745 篇 会议
  • 269 篇 期刊文献
  • 4 册 图书

馆藏范围

  • 1,018 篇 电子文献
  • 1 种 纸本馆藏

日期分布

学科分类号

  • 711 篇 工学
    • 520 篇 计算机科学与技术...
    • 380 篇 电气工程
    • 278 篇 控制科学与工程
    • 153 篇 软件工程
    • 79 篇 信息与通信工程
    • 40 篇 交通运输工程
    • 23 篇 仪器科学与技术
    • 20 篇 机械工程
    • 9 篇 生物工程
    • 8 篇 电子科学与技术(可...
    • 7 篇 力学(可授工学、理...
    • 7 篇 土木工程
    • 6 篇 动力工程及工程热...
    • 6 篇 石油与天然气工程
    • 4 篇 生物医学工程(可授...
    • 3 篇 材料科学与工程(可...
    • 3 篇 化学工程与技术
    • 3 篇 航空宇航科学与技...
    • 3 篇 安全科学与工程
  • 118 篇 理学
    • 98 篇 数学
    • 32 篇 系统科学
    • 22 篇 统计学(可授理学、...
    • 10 篇 生物学
    • 8 篇 物理学
    • 4 篇 化学
  • 66 篇 管理学
    • 63 篇 管理科学与工程(可...
    • 14 篇 工商管理
    • 5 篇 图书情报与档案管...
  • 5 篇 经济学
    • 4 篇 应用经济学
  • 3 篇 法学
    • 3 篇 社会学
  • 2 篇 医学
  • 1 篇 教育学

主题

  • 311 篇 reinforcement le...
  • 215 篇 dynamic programm...
  • 206 篇 optimal control
  • 107 篇 adaptive dynamic...
  • 104 篇 adaptive dynamic...
  • 97 篇 learning
  • 88 篇 neural networks
  • 77 篇 heuristic algori...
  • 68 篇 reinforcement le...
  • 58 篇 learning (artifi...
  • 54 篇 nonlinear system...
  • 53 篇 convergence
  • 51 篇 control systems
  • 51 篇 mathematical mod...
  • 48 篇 approximate dyna...
  • 44 篇 approximation al...
  • 43 篇 equations
  • 42 篇 adaptive control
  • 41 篇 artificial neura...
  • 41 篇 cost function

机构

  • 41 篇 chinese acad sci...
  • 27 篇 univ rhode isl d...
  • 17 篇 tianjin univ sch...
  • 16 篇 univ sci & techn...
  • 16 篇 univ illinois de...
  • 15 篇 northeastern uni...
  • 14 篇 beijing normal u...
  • 13 篇 northeastern uni...
  • 13 篇 guangdong univ t...
  • 12 篇 northeastern uni...
  • 9 篇 natl univ def te...
  • 8 篇 ieee
  • 8 篇 univ chinese aca...
  • 7 篇 univ chinese aca...
  • 7 篇 cent south univ ...
  • 7 篇 southern univ sc...
  • 7 篇 beijing univ tec...
  • 6 篇 chinese acad sci...
  • 6 篇 missouri univ sc...
  • 5 篇 nanjing univ pos...

作者

  • 54 篇 liu derong
  • 37 篇 wei qinglai
  • 29 篇 he haibo
  • 22 篇 wang ding
  • 21 篇 xu xin
  • 19 篇 jiang zhong-ping
  • 17 篇 lewis frank l.
  • 17 篇 yang xiong
  • 17 篇 zhang huaguang
  • 17 篇 ni zhen
  • 16 篇 zhao bo
  • 15 篇 gao weinan
  • 14 篇 zhao dongbin
  • 13 篇 zhong xiangnan
  • 12 篇 si jennie
  • 11 篇 derong liu
  • 10 篇 jagannathan s.
  • 10 篇 dongbin zhao
  • 10 篇 song ruizhuo
  • 9 篇 abouheaf mohamme...

语言

  • 992 篇 英文
  • 20 篇 其他
  • 6 篇 中文
检索条件"任意字段=IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning"
1018 条 记 录,以下是501-510 订阅
排序:
Off-Policy Actor-Critic Structure for Optimal Control of Unknown Systems With Disturbances
收藏 引用
ieee TRANSACTIONS ON CYBERNETICS 2016年 第5期46卷 1041-1050页
作者: Song, Ruizhuo Lewis, Frank L. Wei, Qinglai Zhang, Huaguang Univ Sci & Technol Beijing Sch Automat & Elect Engn Beijing 100083 Peoples R China Univ Texas Arlington UTA Res Inst Ft Worth TX 76118 USA Northeastern Univ State Key Lab Synthet Automat Proc Ind Shenyang 110004 Peoples R China Chinese Acad Sci Inst Automat State Key Lab Management & Control Complex Syst Beijing 100190 Peoples R China Northeastern Univ Sch Informat Sci & Engn Shenyang 110004 Peoples R China
An optimal control method is developed for unknown continuous-time systems with unknown disturbances in this paper. The integral reinforcement learning (IRL) algorithm is presented to obtain the iterative control. Off... 详细信息
来源: 评论
adaptive learning solution of the nonzero-sum differential game with unknown dynamics using adaptive dynamic programming  28
Adaptive learning solution of the nonzero-sum differential g...
收藏 引用
28th Chinese Control and Decision Conference
作者: Qin, Chunbin Sun, Hongfei Liu, Xianxing Chen, Jiaqi Henan Univ Sch Comp & Informat Engn Kaifeng 475004 Peoples R China Henan Univ Coll Environm & Planning Kaifeng 475004 Peoples R China Henan Univ Sch Software Kaifeng 475004 Peoples R China
In this paper, a novel partially model-free adaptive dynamic programming (ADP) algorithm is presented to solve online the nonzero-sum differential games of continuous-time linear systems with unknown drift dynamics. F... 详细信息
来源: 评论
Discrete-Time Generalized Policy Iteration ADP Algorithm With Approximation Errors
Discrete-Time Generalized Policy Iteration ADP Algorithm Wit...
收藏 引用
ieee symposium Series on Computational Intelligence
作者: Qinglai Wei Benkai Li Ruizhuo Song The State Key Laboratory of Management and Control for Complex Systems Chinese Academy of Sciences Beijing China School of Automation and Electrical Engineering University of Science and Technology Beijing Beijing China
This paper concerns with a novel generalized policy iteration (GPI) algorithm with approximation errors. Approximation errors are explicitly considered in the GPI algorithm. The properties of the stable GPI algorithm ... 详细信息
来源: 评论
Towards An Integrated learning Framework for Behavior Modeling of adaptive CGFs  9
Towards An Integrated Learning Framework for Behavior Modeli...
收藏 引用
9th International symposium on Computational Intelligence and Design (ISCID)
作者: Zhang, Qi Yin, Quanjun Xu, Kai Natl Univ Def Technol Coll Informat Syst & Management Changsha Hunan Peoples R China
Computer generated forces (CGFs) are autonomous or semi-autonomous actors within military, simulation based, training and analyzing applications. Rapid, realistic and adaptive behavior modeling for CGFs is imperative ... 详细信息
来源: 评论
Countering Improvised Explosive Devices With adaptive Sensor Networks
Countering Improvised Explosive Devices With Adaptive Sensor...
收藏 引用
ieee symposium on Technologies for Homeland Security (HST)
作者: Buenfil, Jorge R. Ramirez-Marquez, Jose US Army ARDEC Picatinny Arsenal NJ USA Stevens Inst Technol Sch Syst & Enterprises Hoboken NJ USA
The design and architecture of a system for automatic Improvised Explosive Devices detection to protect sensitive areas with minimal human interaction is presented. The system, called ACE for "Army Counter IED En... 详细信息
来源: 评论
ATM: Approximate Task Memoization in the Runtime System
ATM: Approximate Task Memoization in the Runtime System
收藏 引用
International symposium on Parallel and Distributed Processing (IPDPS)
作者: Iulian Brumar Marc Casas Miquel Moreto Mateo Valero Gurindar S. Sohi Barcelona Supercomputing Center (BSC) Barcelona Spain University of Wisconsin-Madison USA
Redundant computations appear during the execution of real programs. Multiple factors contribute to these unnecessary computations, such as repetitive inputs and patterns, calling functions with the same parameters or... 详细信息
来源: 评论
A reinforcement learning Approach for Cost- and Energy-Aware Mobile Data Offloading  18
A Reinforcement Learning Approach for Cost- and Energy-Aware...
收藏 引用
18th Asia-Pacific Network Operations and Management symposium (APNOMS)
作者: Zhang, Cheng Gu, Bo Liu, Zhi Yamori, Kyoko Tanaka, Yoshiaki Waseda Univ Dept Comp Sci & Commun Engn Tokyo 1690072 Japan Kogakuin Univ Dept Informat & Commun Engn Tokyo 1920015 Japan Waseda Univ Global Informat & Telecommun Inst Tokyo 1698555 Japan Asahi Univ Dept Management Informat Mizuho 5010296 Japan Waseda Univ Dept Commun & Comp Engn Tokyo 1698555 Japan
\With rapid increases in demand for mobile data, mobile network operators are trying to expand wireless network capacity by deploying WiFi hotspots to offload their mobile traffic. However, these network-centric metho... 详细信息
来源: 评论
Proceedings - 11th International symposium on Software Engineering for adaptive and Self-Managing Systems, SEAMS 2016
Proceedings - 11th International Symposium on Software Engin...
收藏 引用
11th International symposium on Software Engineering for adaptive and Self-Managing Systems, SEAMS 2016
The proceedings contain 19 papers. The topics discussed include: reusable self-adaptation through bidirectional programming;automatically hardening a self-adaptive system against uncertainty;data-driven continuous evo...
来源: 评论
Analytical Greedy Control and Q-learning for Optimal Power Management of Plug-in Hybrid Electric Vehicles
Analytical Greedy Control and Q-Learning for Optimal Power M...
收藏 引用
ieee symposium Series on Computational Intelligence
作者: Chang Liu Yi Lu Murphey Department of Electrical and Computer Engineering University of Michigan - Dearborn Dearborn MI USA University of Michigan Dearborn Dearborn MI US
In this paper, we present two solutions for achieving the optimal control of PHEVs on short trips. We prove, mathematically, that a greedy control policy is optimal for those short trips where the battery State-of-Cha... 详细信息
来源: 评论
ieee TRANSACTIONS ON NEURAL NETWORKS AND learning SYSTEMS Special Section on Deep reinforcement learning and adaptive dynamic programming
收藏 引用
ieee Transactions on Neural Networks and learning Systems 2016年 第12期27卷 2776-2776页
Prospective authors are requested to submit new, unpublished manuscripts for inclusion in the upcoming event described in this call for papers.
来源: 评论