Search criteria: "Subject = Approximate dynamic programming"
980 records; showing 971-980
Virtual Generators: Simplified Online Power System Representations for Wide-Area Damping Control
IEEE Power and Energy Society General Meeting
Authors: Diogenes Molina; Jiaqi Liang; Ronald G. Harley; Ganesh Kumar Venayagamoorthy
Affiliations: Intelligent Power Infrastructure Consortium, Department of Electrical and Computer Engineering, Georgia Institute of Technology, Atlanta, GA 30332, USA; Holcombe Department of Electrical and Computer Engineering, Clemson University, Clemson, SC 29634, USA
This paper introduces a new concept called a Virtual Generator (VG). VGs are simplified representations of groups of coherent synchronous generators in a power system. They resemble commonly used power system dynamic ...
A Least-Squares Temporal Difference based method for solving resource allocation problems
IFAC JOURNAL OF SYSTEMS AND CONTROL, 2020, Vol. 13
Authors: Forootani, Ali; Tipaldi, Massimo; Zarch, Majid Ghaniee; Liuzza, Davide; Glielmo, Luigi
Affiliations: Univ Sannio, Dept Engn, Piazza Roma, I-82100 Benevento, Italy; Bu Ali Sina Univ, Dept Elect Engn, Hamadan, Hamadan, Iran; ENEA, Fus & Nucl Safety Dept, Rome, Italy
Value function approximation has a central role in approximate dynamic programming (ADP) to overcome the so-called curse of dimensionality associated to real stochastic processes. In this regard, we propose a novel Le...
Multiperiod Stochastic Resource Planning in Professional Services Organizations
DECISION SCIENCES, 2019, Vol. 50, No. 6, pp. 1281-1318
Authors: Solomon, Stanislaus; Li, Haitao; Womer, Keith; Santos, Cipriano
Affiliations: Southern Illinois Univ Edwardsville, Management & Mkt Dept, Sch Business, Edwardsville, IL 62025, USA; Univ Missouri St Louis, Supply Chain & Analyt Dept, Coll Business Adm, St Louis, MO 63121, USA; Gurobi Optimizat, Beaverton, OR 97008, USA
Resource planning (RP) in a professional service organization matches workforce resources with project tasks while considering a myriad of factors such as skill requirements, service delivery role, skill type, workfor...
HIGH-DIMENSIONAL PORTFOLIO OPTIMIZATION WITH TRANSACTION COSTS
INTERNATIONAL JOURNAL OF THEORETICAL AND APPLIED FINANCE, 2016, Vol. 19, No. 4
Authors: Broadie, Mark; Shen, Weiwei
Affiliations: Columbia Univ, Grad Sch Business, New York, NY 10027, USA; Columbia Univ, Appl Phys & Appl Math, New York, NY 10027, USA
This paper studies Merton's portfolio optimization problem with proportional transaction costs in a discrete-time finite horizon. Facing short-sale and borrowing constraints, investors have access to a risk-free a...
Optimal and approximate algorithms for sequential clinical scheduling with no-shows
IIE Transactions on Healthcare Systems Engineering, 2011, Vol. 1, No. 1, pp. 20-36
Authors: Lin, J.; Muthuraman, Kumar; Lawley, Mark
Affiliations: Weldon School of Biomedical Engineering, Purdue University, 206 S. Martin Jischke Drive, West Lafayette, IN 47907-2032, United States; McCombs School of Business, University of Texas, Austin, TX, United States
The accessibility and efficiency of outpatient clinic operations are largely affected by appointment schedules. Clinical scheduling is a process of assigning physician appointment times to sequentially calling patient...
Deep reinforcement learning based finite-horizon optimal control for a discrete-time affine nonlinear system
SICE Annual Conference
Authors: Jong Woo Kim; Byung Jun Park; Haeun Yoo; Jay H. Lee; Jong Min Lee
Affiliations: School of Chemical and Biological Engineering, Seoul National University, Seoul, Republic of Korea; Chemical and Biomolecular Engineering Department, Korea Advanced Institute of Science and Technology, Daejeon, Republic of Korea
Approximate dynamic programming (ADP) aims to obtain an approximate numerical solution to the discrete-time Hamilton-Jacobi-Bellman (HJB) equation. Heuristic dynamic programming (HDP) is a two-stage iterative scheme o...
Adaptive Traffic Signal Control for Multi-intersection Based on Microscopic Model
International Conference on Tools with Artificial Intelligence
Authors: Biao Yin; Mahjoub Dridi; Abdellah El Moudni
Affiliations: Laboratoire IRTES-SeT, Université de Technologie de Belfort-Montbéliard (UTBM), Belfort, France
In this paper, we mainly propose an online learning method for adaptive traffic signal control in a multi-intersection system. The method uses approximate dynamic programming (ADP) to achieve a near-optimal solution o...
Discrete-Time Generalized Policy Iteration ADP Algorithm With Approximation Errors
IEEE Symposium Series on Computational Intelligence
Authors: Qinglai Wei; Benkai Li; Ruizhuo Song
Affiliations: The State Key Laboratory of Management and Control for Complex Systems, Chinese Academy of Sciences, Beijing, China; School of Automation and Electrical Engineering, University of Science and Technology Beijing, Beijing, China
This paper is concerned with a novel generalized policy iteration (GPI) algorithm with approximation errors. Approximation errors are explicitly considered in the GPI algorithm. The properties of the stable GPI algorithm ...
Approximate modified policy iteration and its application to the game of Tetris
The Journal of Machine Learning Research, 2015, Vol. 16, No. 1
Authors: Bruno Scherrer; Mohammad Ghavamzadeh; Victor Gabillon; Boris Lesner; Matthieu Geist
Affiliations: INRIA Nancy-Grand Est, Team Maia, Vandœuvre-lès-Nancy, France; Adobe Research & INRIA Lille, San Jose, CA; INRIA Lille-Nord Europe, Team SequeL, Villeneuve d'Ascq, France; CentraleSupélec, IMS-MaLIS Research Group & UMI (GeorgiaTech-CNRS), Metz, France
Modified policy iteration (MPI) is a dynamic programming (DP) algorithm that contains the two celebrated policy and value iteration methods. Despite its generality, MPI has not been thoroughly studied, especially its ...
Dynamic policy programming
The Journal of Machine Learning Research, 2012, Vol. 13, No. 1
Authors: Mohammad Gheshlaghi Azar; Vicenç Gómez; Hilbert J. Kappen
Affiliations: Department of Biophysics, Radboud University Nijmegen, Nijmegen, The Netherlands
In this paper, we propose a novel policy iteration method, called dynamic policy programming (DPP), to estimate the optimal policy in the infinite-horizon Markov decision processes. DPP is an incremental algorithm tha...