咨询与建议

限定检索结果

文献类型

  • 749 篇 期刊文献
  • 209 篇 会议
  • 23 篇 学位论文
  • 1 册 图书

馆藏范围

  • 982 篇 电子文献
  • 1 种 纸本馆藏

日期分布

学科分类号

  • 749 篇 工学
    • 307 篇 计算机科学与技术...
    • 271 篇 电气工程
    • 251 篇 控制科学与工程
    • 86 篇 交通运输工程
    • 51 篇 机械工程
    • 42 篇 石油与天然气工程
    • 40 篇 土木工程
    • 38 篇 软件工程
    • 31 篇 信息与通信工程
    • 26 篇 化学工程与技术
    • 25 篇 动力工程及工程热...
    • 16 篇 仪器科学与技术
    • 8 篇 环境科学与工程(可...
    • 5 篇 力学(可授工学、理...
    • 5 篇 航空宇航科学与技...
    • 4 篇 电子科学与技术(可...
  • 357 篇 管理学
    • 340 篇 管理科学与工程(可...
    • 52 篇 工商管理
    • 6 篇 公共管理
  • 231 篇 理学
    • 196 篇 数学
    • 65 篇 系统科学
    • 11 篇 统计学(可授理学、...
    • 9 篇 物理学
    • 7 篇 生物学
    • 4 篇 生态学
  • 79 篇 经济学
    • 55 篇 应用经济学
    • 25 篇 理论经济学
  • 18 篇 医学
    • 11 篇 基础医学(可授医学...
    • 10 篇 临床医学
    • 7 篇 公共卫生与预防医...
  • 8 篇 军事学
  • 7 篇 农学
  • 3 篇 法学

主题

  • 982 篇 approximate dyna...
  • 142 篇 reinforcement le...
  • 142 篇 optimal control
  • 83 篇 adaptive dynamic...
  • 77 篇 neural networks
  • 64 篇 adaptive critic ...
  • 62 篇 markov decision ...
  • 59 篇 dynamic programm...
  • 50 篇 markov decision ...
  • 36 篇 nonlinear system...
  • 29 篇 adaptive dynamic...
  • 22 篇 neural network
  • 22 篇 uncertainty
  • 22 篇 adaptive control
  • 21 篇 policy iteration
  • 20 篇 neuro-dynamic pr...
  • 19 篇 linear programmi...
  • 18 篇 value function a...
  • 17 篇 value iteration
  • 17 篇 optimization

机构

  • 63 篇 chinese acad sci...
  • 33 篇 univ sci & techn...
  • 18 篇 princeton univ d...
  • 12 篇 georgia inst tec...
  • 11 篇 tsinghua univ de...
  • 10 篇 school of automa...
  • 9 篇 northeastern uni...
  • 9 篇 cornell univ sch...
  • 9 篇 univ rhode isl d...
  • 8 篇 air force instit...
  • 7 篇 the state key la...
  • 7 篇 south china univ...
  • 7 篇 univ illinois de...
  • 6 篇 univ chicago boo...
  • 6 篇 tsinghua univ sc...
  • 6 篇 univ chinese aca...
  • 6 篇 chinese acad sci...
  • 6 篇 univ chinese aca...
  • 5 篇 natl univ singap...
  • 5 篇 univ illinois de...

作者

  • 65 篇 wei qinglai
  • 58 篇 liu derong
  • 29 篇 song ruizhuo
  • 22 篇 powell warren b.
  • 21 篇 wang ding
  • 16 篇 lee jay h.
  • 15 篇 ulmer marlin w.
  • 13 篇 lee jong min
  • 12 篇 lewis frank l.
  • 12 篇 zhang huaguang
  • 11 篇 li hongliang
  • 10 篇 robbins matthew ...
  • 9 篇 lygeros john
  • 9 篇 derong liu
  • 8 篇 xu xin
  • 8 篇 lunday brian j.
  • 8 篇 topaloglu huseyi...
  • 8 篇 thomas barrett w...
  • 8 篇 huang zhijian
  • 8 篇 mattfeld dirk c.

语言

  • 926 篇 英文
  • 49 篇 其他
  • 4 篇 中文
  • 2 篇 西班牙文
检索条件"主题词=Approximate dynamic Programming"
982 条 记 录,以下是311-320 订阅
排序:
Reinforcement learning and approximate dynamic programming for feedback control /
收藏 引用
2013年
作者: edited by Frank L. Lewis Derong Liu.
来源: 内蒙古大学图书馆图书 评论
Voronoi Progressive Widening for Cognitive Radar Tracking with Large Waveform Libraries
Voronoi Progressive Widening for Cognitive Radar Tracking wi...
收藏 引用
IEEE Radar Conference (RadarConf)
作者: Rybicki, Brian W. Nelson, Jill K. George Mason Univ Dept Elect & Comp Engn Fairfax VA USA US Naval Res Lab Washington DC USA
We apply an improved variant of Monte Carlo Tree Search (MCTS), MCTS with Voronoi Progressive Widening (VPW), to cognitive radar tracking. Because cognitive radar systems have unparalleled waveform agility across an i... 详细信息
来源: 评论
On-policy and Off-policy Value Iteration Algorithms for Stochastic Zero-Sum Games  14
On-policy and Off-policy Value Iteration Algorithms for Stoc...
收藏 引用
14th Asian Control Conference (ASCC)
作者: Guo, Liangyuan Wang, Bing-Chang Sun, Bo Shandong Univ Sch Control Sci & Engn Jinan Peoples R China
This paper considers the value iteration algorithms of stochastic zero-sum linear quadratic games with unkown dynamics. The model-free on-policy and off-policy learning algorithms are developed, where the system dynam... 详细信息
来源: 评论
Real-Time Learning for Suboptimal Control of Unknown Systems
Real-Time Learning for Suboptimal Control of Unknown Systems
收藏 引用
作者: Makumi, Wanjiku Aprile University of Florida
学位级别:Ph.D., Doctor of Philosophy
approximate dynamic programming (ADP) has emerged as a leading method for solving optimal control problems using reinforcement learning (RL) with many benefits and also many open research problems. Model-based methods... 详细信息
来源: 评论
A New Approach to Finite-Horizon Optimal Control for Discrete-Time Affine Nonlinear Systems via a Pseudolinear Method
收藏 引用
IEEE TRANSACTIONS ON AUTOMATIC CONTROL 2022年 第5期67卷 2610-2617页
作者: Wei, Qinglai Zhu, Liao Li, Tao Liu, Derong Chinese Acad Sci State Key Lab Management & Control Complex Syst Inst Automat Beijing 100190 Peoples R China Univ Chinese Acad Sci Sch Artificial Intelligence Beijing 100049 Peoples R China Macau Univ Sci & Technol Inst Syst Engn Macau 999078 Peoples R China Guangdong Univ Technol Sch Automat Guangzhou 510006 Peoples R China
In this article, a new time-varying adaptivedynamic programming (ADP) algorithm is developed to solve finite-horizon optimal control problems for a class of discrete-time affine nonlinear systems. Inspired by the pseu... 详细信息
来源: 评论
Adaptive dynamic programming for Energy-Efficient Base Station Cell Switching  59
Adaptive Dynamic Programming for Energy-Efficient Base Stati...
收藏 引用
59th Annual IEEE International Conference on Communications (IEEE ICC)
作者: Luo, Junliang Xu, Yi Tian Wu, Di Jenkin, Michael Liu, Xue Dudek, Gregory Samsung AI Ctr Montreal Montreal PQ Canada
Energy saving in wireless networks is growing in importance due to increasing demand for evolving new-gen cellular networks, environmental and regulatory concerns, and potential energy crises arising from geopolitical... 详细信息
来源: 评论
Discounted Iterative Adaptive Critic Designs With Novel Stability Analysis for Tracking Control
收藏 引用
IEEE/CAA Journal of Automatica Sinica 2022年 第7期9卷 1262-1272页
作者: Mingming Ha Ding Wang Derong Liu School of Automation and Electrical Engineering University of Science and Technology BeijingBeijing 100083China Faculty of Information Technology the Beijing Key Laboratory of Computational Intelligence and Intelligent Systemthe Beijing Laboratory of Smart Environmental Protectionand the Beijing Institute of Artificial IntelligenceBeijing University of TechnologyBeijing 100124China Department of Electrical and Computer Engineering University of Illinois at ChicagoChicago IL 60607 USA IEEE
The core task of tracking control is to make the controlled plant track a desired *** traditional performance index used in previous studies cannot eliminate completely the tracking error as the number of time steps *... 详细信息
来源: 评论
Indirect Shared Control Through Non-Zero Sum Differential Game for Cooperative Automated Driving
收藏 引用
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS 2022年 第9期23卷 15980-15992页
作者: Li, Wenyu Li, Qingkun Li, Shengbo Eben Li, Renjie Ren, Yangang Wang, Wenjun Nankai Univ Coll Artificial Intelligence Tianjin 300350 Peoples R China Tsinghua Univ Sch Vehicle & Mobil Beijing 100084 Peoples R China
Cooperative driving of human driver and automated system can effectively reduce the necessity of extremely accurate environment perception of highly automated vehicles, and enhance the robustness of decision-making an... 详细信息
来源: 评论
Tactical UAV path optimization under radar threat using deep reinforcement learning
收藏 引用
NEURAL COMPUTING & APPLICATIONS 2022年 第7期34卷 5649-5664页
作者: Alpdemir, M. Nedim TUBITAK Informat & Informat Secur Res Ctr BILGEM Gebze Turkey
The majority of the research efforts that aim to solve UAV path optimization problems in a Reinforcement Learning (RL) setting focus on closed spaces or urban areas as the operating environment. The problem of Tactica... 详细信息
来源: 评论
Sampling-Based Linear approximate Planning for Underwater Space-Time Fair Scheduling  57
Sampling-Based Linear Approximate Planning for Underwater Sp...
收藏 引用
57th Asilomar Conference on Signals, Systems and Computers
作者: Peng, Chen Mitra, Urbashi Univ Southern Calif Dept Elect & Comp Engn Los Angeles CA 90007 USA
This paper investigates scheduling in space and time domains for multi-user underwater acoustic networks under fairness considerations. The problem is formulated as a sequential decision-making problem under the Marko... 详细信息
来源: 评论