Refine search results

Document type

  • 749 journal articles
  • 209 conference papers
  • 23 theses
  • 1 book

Collection

  • 982 electronic documents
  • 1 print holding

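Subject classification

  • 749 Engineering
    • 307 Computer Science and Technology...
    • 271 Electrical Engineering
    • 251 Control Science and Engineering
    • 86 Transportation Engineering
    • 51 Mechanical Engineering
    • 42 Oil and Natural Gas Engineering
    • 40 Civil Engineering
    • 38 Software Engineering
    • 31 Information and Communication Engineering
    • 26 Chemical Engineering and Technology
    • 25 Power Engineering and Engineering Thermo...
    • 16 Instrument Science and Technology
    • 8 Environmental Science and Engineering (may confer...
    • 5 Mechanics (may confer degrees in Engineering, ...
    • 5 Aeronautical and Astronautical Science and Tech...
    • 4 Electronic Science and Technology (may confer...
  • 357 Management
    • 340 Management Science and Engineering (may confer...
    • 52 Business Administration
    • 6 Public Administration
  • 231 Science
    • 196 Mathematics
    • 65 Systems Science
    • 11 Statistics (may confer degrees in Science, ...
    • 9 Physics
    • 7 Biology
    • 4 Ecology
  • 79 Economics
    • 55 Applied Economics
    • 25 Theoretical Economics
  • 18 Medicine
    • 11 Basic Medicine (may confer degrees in Medicine...
    • 10 Clinical Medicine
    • 7 Public Health and Preventive Medi...
  • 8 Military Science
  • 7 Agriculture
  • 3 Law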

Topic

  • 982 approximate dyna...
  • 142 reinforcement le...
  • 142 optimal control
  • 83 adaptive dynamic...
  • 77 neural networks
  • 64 adaptive critic ...
  • 62 markov decision ...
  • 59 dynamic programm...
  • 50 markov decision ...
  • 36 nonlinear system...
  • 29 adaptive dynamic...
  • 22 neural network
  • 22 uncertainty
  • 22 adaptive control
  • 21 policy iteration
  • 20 neuro-dynamic pr...
  • 19 linear programmi...
  • 18 value function a...
  • 17 value iteration
  • 17 optimization

Institution

  • 63 chinese acad sci...
  • 33 univ sci & techn...
  • 18 princeton univ d...
  • 12 georgia inst tec...
  • 11 tsinghua univ de...
  • 10 school of automa...
  • 9 northeastern uni...
  • 9 cornell univ sch...
  • 9 univ rhode isl d...
  • 8 air force instit...
  • 7 the state key la...
  • 7 south china univ...
  • 7 univ illinois de...
  • 6 univ chicago boo...
  • 6 tsinghua univ sc...
  • 6 univ chinese aca...
  • 6 chinese acad sci...
  • 6 univ chinese aca...
  • 5 natl univ singap...
  • 5 univ illinois de...

Author

  • 65 wei qinglai
  • 58 liu derong
  • 29 song ruizhuo
  • 22 powell warren b.
  • 21 wang ding
  • 16 lee jay h.
  • 15 ulmer marlin w.
  • 13 lee jong min
  • 12 lewis frank l.
  • 12 zhang huaguang
  • 11 li hongliang
  • 10 robbins matthew ...
  • 9 lygeros john
  • 9 derong liu
  • 8 xu xin
  • 8 lunday brian j.
  • 8 topaloglu huseyi...
  • 8 thomas barrett w...
  • 8 huang zhijian
  • 8 mattfeld dirk c.

Language

  • 926 English
  • 49 Other
  • 4 Chinese
  • 2 Spanish

Search query: "Subject term = Approximate dynamic Programming"
982 records; showing 381-390

Direct and indirect reinforcement learning
INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2021, Vol. 36, No. 8, pp. 4439-4467
Authors: Guan, Yang; Li, Shengbo Eben; Duan, Jingliang; Li, Jie; Ren, Yangang; Sun, Qi; Cheng, Bo
Affiliation: Tsinghua Univ Sch Vehicle & Mobil Beijing 100084 Peoples R China
Reinforcement learning (RL) algorithms have been successfully applied to a range of challenging sequential decision-making and control tasks. In this paper, we classify RL into direct and indirect RL according to how ...

Robust Reinforcement Learning with Diffusion Wavelets
Author: Seyedmazloom, Ali
Affiliation: George Mason University
Degree: Ph.D., Doctor of Philosophy
Reinforcement Learning is a method of learning from the environment by constantly observing it and evaluating its response to a set of actions. Long-term learning of a dynamic system for aforementioned interactions wh...

A Stochastic Spatiotemporal Decomposition Decision-Making Approach for Real-Time Dynamic Energy Management of Multi-Microgrids
IEEE TRANSACTIONS ON SUSTAINABLE ENERGY, 2021, Vol. 12, No. 2, pp. 821-833
Authors: Mo, Xiemin; Zhu, Jianquan; Chen, Jiajun; Guo, Ye; Xia, Yunrui; Liu, Mingbo
Affiliation: South China Univ Technol Sch Elect Power Engn Guangzhou 510640 Peoples R China
This paper studies the real-time dynamic energy management (DEM) of multi-microgrids (MMGs) considering active and reactive power flow constraints, voltage constraints, battery operational characters, and uncertaintie...

Adaptive Dynamic Programming for Control: A Survey and Recent Advances
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2021, Vol. 51, No. 1, pp. 142-160
Authors: Liu, Derong; Xue, Shan; Zhao, Bo; Luo, Biao; Wei, Qinglai
Affiliations: Guangdong Univ Technol Sch Automat Guangzhou 510006 Peoples R China; South China Univ Technol Sch Comp Sci & Engn Guangzhou 510006 Peoples R China; Beijing Normal Univ Sch Syst Sci Beijing 100875 Peoples R China; Cent South Univ Sch Automat Changsha 410083 Peoples R China; Peng Cheng Lab Shenzhen 518000 Peoples R China; Chinese Acad Sci Inst Automat State Key Lab Management & Control Complex Syst Beijing 100190 Peoples R China; Univ Chinese Acad Sci Beijing 100049 Peoples R China
This article reviews the recent development of adaptive dynamic programming (ADP) with applications in control. First, its applications in optimal regulation are introduced, and some skilled and efficient algorithms a...

Electric Vehicle Routing with Public Charging Stations
TRANSPORTATION SCIENCE, 2021, Vol. 55, No. 3, pp. 637-659
Authors: Kullman, Nicholas D.; Goodson, Justin C.; Mendoza, Jorge E.
Affiliations: Univ Tours CNRS LIFAT EA 6300 ROOT ERL CNRS 7002 F-37200 Tours France; St Louis Univ Richard A Chaifetz Sch Business St Louis MO 63103 USA; HEC Montreal Montreal PQ H3T 2A7 Canada; Ctr Interuniv Rech Reseaux Entreprise Logist & Tr Montreal PQ H3T 1J4 Canada
We introduce the electric vehicle routing problem with public-private recharging strategy in which vehicles may recharge en route at public charging infrastructure as well as at a privately-owned depot. To hedge again...

Dynamic Repair Scheduling for Transmission Systems Based on Look-Ahead Strategy Approximation
IEEE TRANSACTIONS ON POWER SYSTEMS, 2021, Vol. 36, No. 4, pp. 2918-2933
Authors: Yan, Jiahao; Hu, Bo; Xie, Kaigui; Niu, Tao; Li, Chunyan; Tai, Heng-Ming
Affiliations: Chongqing Univ State Key Lab Power Transmiss Equipment & Syst Se Chongqing 400030 Peoples R China; Univ Tulsa Dept Elect & Comp Engn Tulsa OK 74104 USA
This paper intends to address the dynamic repair scheduling of electric power transmission systems based on look-ahead strategy approximation. The objective is to minimize system functionality loss during the restorat...

Continuous-Time Distributed Policy Iteration for Multicontroller Nonlinear Systems
IEEE TRANSACTIONS ON CYBERNETICS, 2021, Vol. 51, No. 5, pp. 2372-2383
Authors: Wei, Qinglai; Li, Hongyang; Yang, Xiong; He, Haibo
Affiliations: Chinese Acad Sci Inst Automat State Key Lab Management & Control Complex Syst Beijing 100190 Peoples R China; Univ Chinese Acad Sci Sch Artificial Intelligence Beijing 100049 Peoples R China; Qingdao Acad Intelligent Ind Qingdao 266109 Peoples R China; Tianjin Univ Sch Elect & Informat Engn Tianjin 300072 Peoples R China; Univ Rhode Isl Dept Elect Comp & Biomed Engn Kingston RI 02881 USA
In this article, a novel distributed policy iteration algorithm is established for infinite horizon optimal control problems of continuous-time nonlinear systems. In each iteration of the developed distributed policy ...

Allocating resources via price management systems: a dynamic programming-based approach
INTERNATIONAL JOURNAL OF CONTROL, 2021, Vol. 94, No. 8, pp. 2123-2143
Authors: Forootani, Ali; Liuzza, Davide; Tipaldi, Massimo; Glielmo, Luigi
Affiliations: Univ Sannio Dept Engn Piazza Roma 21 I-82100 Benevento Italy; ENEA Fus & Nucl Safety Dept Frascati Rome Italy
In this paper, a novel model for price management systems in resource allocation problems is proposed. Stochastic customer requests for resource allocations and releases are modelled as constrained parallel Birth-Deat...

Modified value-function-approximation for synchronous policy iteration with single-critic configuration for nonlinear optimal control
INTERNATIONAL JOURNAL OF CONTROL, 2021, Vol. 94, No. 5, pp. 1321-1333
Authors: Tang, Difan; Chen, Lei; Tian, Zhao Feng; Hu, Eric
Affiliation: Univ Adelaide Sch Mech Engn Adelaide SA Australia
This study proposes a modified value-function-approximation (MVFA) and investigates its use under a single-critic configuration based on neural networks (NNs) for synchronous policy iteration (SPI) to deliver compact ...

Nonsmooth Data-Based Reinforcement Learning for Online Approximate Optimal Control
Author: Greene, Max Lewis
Affiliation: University of Florida
Degree: Ph.D., Doctor of Philosophy
Autonomous systems are often constrained by time-critical mission constraints and limited power. Such constraints motivate optimality in mission execution. Reinforcement learning (RL) has become a tool to facilitate l...