Refine search results

Document type

  • 751 journal articles
  • 209 conference papers
  • 21 theses
  • 1 book

Collection

  • 982 electronic documents
  • 1 print holding

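Subject classification

  • 743 Engineering
    • 306 Computer Science and Technology...
    • 272 Electrical Engineering
    • 249 Control Science and Engineering
    • 86 Transportation Engineering
    • 50 Mechanical Engineering
    • 41 Oil and Natural Gas Engineering
    • 40 Civil Engineering
    • 36 Software Engineering
    • 29 Information and Communication Engineering
    • 26 Chemical Engineering and Technology
    • 24 Power Engineering and Engineering Thermo...
    • 17 Instrument Science and Technology
    • 8 Environmental Science and Engineering (may...
    • 5 Mechanics (may confer Engineering, Sc...
    • 5 Aeronautical and Astronautical Science and Tech...
    • 4 Architecture
  • 356 Management
    • 339 Management Science and Engineering (may...
    • 52 Business Administration
    • 6 Public Administration
  • 232 Science
    • 198 Mathematics
    • 65 Systems Science
    • 11 Statistics (may confer Science, ...
    • 9 Physics
    • 7 Biology
    • 4 Ecology
  • 79 Economics
    • 55 Applied Economics
    • 25 Theoretical Economics
  • 18 Medicine
    • 11 Basic Medicine (may confer Medicine...
    • 10 Clinical Medicine
    • 7 Public Health and Preventive Medi...
  • 8 Military Science
  • 7 Agriculture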

Topic

  • 982 approximate dyna...
  • 142 optimal control
  • 141 reinforcement le...
  • 83 adaptive dynamic...
  • 77 neural networks
  • 64 adaptive critic ...
  • 61 markov decision ...
  • 59 dynamic programm...
  • 51 markov decision ...
  • 36 nonlinear system...
  • 28 adaptive dynamic...
  • 23 adaptive control
  • 22 uncertainty
  • 22 policy iteration
  • 21 linear programmi...
  • 21 neural network
  • 20 neuro-dynamic pr...
  • 18 value function a...
  • 17 value iteration
  • 17 optimization

Institution

  • 63 chinese acad sci...
  • 33 univ sci & techn...
  • 18 princeton univ d...
  • 12 georgia inst tec...
  • 11 tsinghua univ de...
  • 10 cornell univ sch...
  • 10 school of automa...
  • 9 northeastern uni...
  • 9 univ rhode isl d...
  • 8 air force instit...
  • 7 the state key la...
  • 7 south china univ...
  • 7 univ illinois de...
  • 6 univ chicago boo...
  • 6 tsinghua univ sc...
  • 6 univ chinese aca...
  • 6 chinese acad sci...
  • 6 univ chinese aca...
  • 5 natl univ singap...
  • 5 univ illinois de...

Author

  • 65 wei qinglai
  • 58 liu derong
  • 29 song ruizhuo
  • 22 powell warren b.
  • 21 wang ding
  • 16 lee jay h.
  • 15 ulmer marlin w.
  • 13 lee jong min
  • 12 lewis frank l.
  • 12 zhang huaguang
  • 11 li hongliang
  • 10 robbins matthew ...
  • 9 lygeros john
  • 9 derong liu
  • 8 xu xin
  • 8 lunday brian j.
  • 8 topaloglu huseyi...
  • 8 thomas barrett w...
  • 8 huang zhijian
  • 8 mattfeld dirk c.

Language

  • 927 English
  • 44 Other
  • 4 Chinese
  • 2 Spanish

Search query: "Subject term = Approximate Dynamic Programming"
982 records; showing 431-440

Online Nash-optimization tracking control of multi-motor driven load system with simplified RL scheme
ISA TRANSACTIONS, 2020, vol. 98, pp. 251-262
Authors: Lv, Yongfeng; Ren, Xuemei; Na, Jing
Affiliations: Beijing Inst Technol Sch Automat Beijing 100081 Peoples R China; Kunming Univ Sci & Technol Fac Mech & Elect Engn Kunming 650500 Yunnan Peoples R China
Although the optimal tracking control problem (OTCP) has been addressed recently, only the single-input system is considered in the recent literature. In this paper, the OTCP of unknown multi-motor driven load systems...

Dynamic Optimization for Airline Maintenance Operations
TRANSPORTATION SCIENCE, 2020, vol. 54, no. 4, pp. 998-1015
Authors: Lagos, Carlos; Delgado, Felipe; Klapp, Mathias A.
Affiliations: Pontificia Univ Catolica Chile Sch Engn Santiago 9999 Chile
The occurrence of unexpected aircraft maintenance tasks can produce expensive changes in an airline's operation. When it comes to critical tasks, it might even cancel programmed flights. Despite this, the challeng...

Actor-critic learning for optimal building energy management with phase change materials
ELECTRIC POWER SYSTEMS RESEARCH, 2020, vol. 188
Authors: Rahimpour, Zahra; Verbic, Gregor; Chapman, Archie C.
Affiliations: Univ Sydney Sch Elect & Informat Engn Sydney NSW Australia; Univ Queensland Sch Informat Technol & Elect Engn Brisbane Qld Australia
Energy management in buildings using phase change materials (PCM) to improve thermal performance is challenging due to the nonlinear thermal capacity of the PCM. To address this problem, this paper adopts a model-free...

Differential-game for resource aware approximate optimal control of large-scale nonlinear systems with multiple players
NEURAL NETWORKS, 2020, vol. 124, pp. 95-108
Authors: Sahoo, Avimanyu; Narayanan, Vignesh
Affiliations: Oklahoma State Univ Div Engn Technol 555 Engn North Stillwater OK 74078 USA; Washington Univ St Louis MO 63110 USA
In this paper, we propose a novel differential-game based neural network (NN) control architecture to solve an optimal control problem for a class of large-scale nonlinear systems involving N-players. We focus on opti...

Approximate Dynamic Programming via a Smoothed Linear Program
OPERATIONS RESEARCH, 2012, vol. 60, no. 3, pp. 655-674
Authors: Desai, Vijay V.; Farias, Vivek F.; Moallemi, Ciamac C.
Affiliations: Columbia Univ Dept Ind Engn & Operat Res New York NY 10027 USA; MIT Sloan Sch Management Cambridge MA 02139 USA; Columbia Univ Grad Sch Business New York NY 10027 USA
We present a novel linear program for the approximation of the dynamic programming cost-to-go function in high-dimensional stochastic control problems. LP approaches to approximate DP have typically relied on a natura...

Revisiting Approximate Linear Programming: Constraint-Violation Learning with Applications to Inventory Control and Energy Storage
MANAGEMENT SCIENCE, 2020, vol. 66, no. 4, pp. 1544-1562
Authors: Lin, Qihang; Nadarajah, Selvaprabu; Soheili, Negar
Affiliations: Univ Iowa Tippie Coll Business Iowa City IA 52242 USA; Univ Illinois Coll Business Adm Chicago IL 60607 USA
Approximate linear programs (ALPs) are well-known models for computing value function approximations (VFAs) of intractable Markov decision processes (MDPs). VFAs from ALPs have desirable theoretical properties, define...

A Distributed Iterative Learning Framework for DC Microgrids: Current Sharing and Voltage Regulation
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2020, vol. 4, no. 2, pp. 119-129
Authors: Liu, Xiao-Kang; Jiang, He; Wang, Yan-Wu; He, Haibo
Affiliations: Huazhong Univ Sci & Technol Sch Automat Wuhan 430074 Peoples R China; Huazhong Univ Sci & Technol Minist Educ Key Lab Image Proc & Intelligent Control Wuhan 430074 Peoples R China; Univ Rhode Isl Dept Elect Comp & Biomed Engn Kingston RI 02881 USA
With the penetration of computation intelligence, an increasing number of learning methods are developed into power engineering, such as dc microgrid applications. This paper establishes a distributed iterative learni...

Context-Aware Dynamic Asset Allocation for Maritime Interdiction Operations
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, vol. 50, no. 3, pp. 1055-1073
Authors: Sidoti, David; Han, Xu; Zhang, Lingyi; Avvari, Gopi Vinod; Ayala, Diego Fernando Martinez; Mishra, Manisha; Sankavaram, Muni Sravanth; Kellmeyer, David L.; Hansen, James A.; Pattipati, Krishna R.
Affiliations: Univ Connecticut Dept Elect & Comp Engn Storrs CT 06269 USA; Two Sigma Modeling Dept New York NY 10013 USA; Argus Informat & Advisory Serv LLC Data & Applicat Solut White Plains NY 10601 USA; SPAWAR Syst Ctr Pacific Command & Control Dept San Diego CA 92152 USA; US Naval Res Lab Marine Meteorol Div Monterey CA 93943 USA
This paper validates two approximate dynamic programming approaches on a maritime interdiction problem involving the allocation of multiple heterogeneous assets over a large area of responsibility to interdict multipl...

A model-based deep reinforcement learning method applied to finite-horizon optimal control of nonlinear control-affine system
JOURNAL OF PROCESS CONTROL, 2020, vol. 87, pp. 166-178
Authors: Kim, Jong Woo; Park, Byung Jun; Yoo, Haeun; Oh, Tae Hoon; Lee, Jay H.; Lee, Jong Min
Affiliations: Seoul Natl Univ Sch Chem & Biol Engn Inst Chem Proc 1 Gwanak Ro Seoul 08826 South Korea; Korea Adv Inst Sci & Technol Dept Chem & Biomol Engn 291 Daehak Ro Daejeon 34141 South Korea
The Hamilton-Jacobi-Bellman (HJB) equation can be solved to obtain optimal closed-loop control policies for general nonlinear systems. As it is seldom possible to solve the HJB equation exactly for nonlinear systems, ...

Value Iteration-Based H∞ Controller Design for Continuous-Time Nonlinear Systems Subject to Input Constraints
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, vol. 50, no. 11, pp. 3986-3995
Authors: Zhang, Huaguang; Xiao, Geyang; Liu, Yang; Liu, Lei
Affiliations: Northeastern Univ State Key Lab Synthet Automat Proc Ind Shenyang 110004 Peoples R China; Northeastern Univ Sch Informat Sci & Engn Shenyang 110004 Peoples R China; Liaoning Univ Technol Coll Sci Jinzhou 121001 Peoples R China
In this paper, a novel integral reinforcement learning method is proposed based on value iteration (VI) to design the H-infinity controller for continuous-time nonlinear systems subject to input constraints. To confro...