咨询与建议

限定检索结果

文献类型

  • 747 篇 期刊文献
  • 208 篇 会议
  • 23 篇 学位论文
  • 1 册 图书

馆藏范围

  • 979 篇 电子文献
  • 1 种 纸本馆藏

日期分布

学科分类号

  • 746 篇 工学
    • 307 篇 计算机科学与技术...
    • 271 篇 电气工程
    • 249 篇 控制科学与工程
    • 86 篇 交通运输工程
    • 50 篇 机械工程
    • 42 篇 石油与天然气工程
    • 40 篇 土木工程
    • 38 篇 软件工程
    • 31 篇 信息与通信工程
    • 26 篇 化学工程与技术
    • 25 篇 动力工程及工程热...
    • 16 篇 仪器科学与技术
    • 8 篇 环境科学与工程(可...
    • 4 篇 力学(可授工学、理...
    • 4 篇 电子科学与技术(可...
    • 4 篇 建筑学
  • 356 篇 管理学
    • 339 篇 管理科学与工程(可...
    • 52 篇 工商管理
    • 6 篇 公共管理
  • 231 篇 理学
    • 196 篇 数学
    • 65 篇 系统科学
    • 11 篇 统计学(可授理学、...
    • 9 篇 物理学
    • 7 篇 生物学
    • 4 篇 生态学
  • 79 篇 经济学
    • 55 篇 应用经济学
    • 25 篇 理论经济学
  • 18 篇 医学
    • 11 篇 基础医学(可授医学...
    • 10 篇 临床医学
    • 7 篇 公共卫生与预防医...
  • 8 篇 军事学
  • 7 篇 农学
  • 3 篇 法学

主题

  • 979 篇 approximate dyna...
  • 142 篇 reinforcement le...
  • 141 篇 optimal control
  • 83 篇 adaptive dynamic...
  • 77 篇 neural networks
  • 64 篇 adaptive critic ...
  • 62 篇 markov decision ...
  • 59 篇 dynamic programm...
  • 50 篇 markov decision ...
  • 36 篇 nonlinear system...
  • 29 篇 adaptive dynamic...
  • 22 篇 uncertainty
  • 22 篇 adaptive control
  • 21 篇 neural network
  • 21 篇 policy iteration
  • 20 篇 neuro-dynamic pr...
  • 19 篇 linear programmi...
  • 18 篇 value function a...
  • 17 篇 value iteration
  • 17 篇 optimization

机构

  • 63 篇 chinese acad sci...
  • 33 篇 univ sci & techn...
  • 18 篇 princeton univ d...
  • 12 篇 georgia inst tec...
  • 11 篇 tsinghua univ de...
  • 10 篇 school of automa...
  • 9 篇 northeastern uni...
  • 9 篇 cornell univ sch...
  • 9 篇 univ rhode isl d...
  • 8 篇 air force instit...
  • 7 篇 the state key la...
  • 7 篇 south china univ...
  • 7 篇 univ illinois de...
  • 6 篇 univ chicago boo...
  • 6 篇 tsinghua univ sc...
  • 6 篇 univ chinese aca...
  • 6 篇 chinese acad sci...
  • 6 篇 univ chinese aca...
  • 5 篇 natl univ singap...
  • 5 篇 univ illinois de...

作者

  • 65 篇 wei qinglai
  • 58 篇 liu derong
  • 29 篇 song ruizhuo
  • 22 篇 powell warren b.
  • 21 篇 wang ding
  • 16 篇 lee jay h.
  • 15 篇 ulmer marlin w.
  • 13 篇 lee jong min
  • 12 篇 lewis frank l.
  • 12 篇 zhang huaguang
  • 11 篇 li hongliang
  • 10 篇 robbins matthew ...
  • 9 篇 lygeros john
  • 9 篇 derong liu
  • 8 篇 xu xin
  • 8 篇 lunday brian j.
  • 8 篇 topaloglu huseyi...
  • 8 篇 thomas barrett w...
  • 8 篇 huang zhijian
  • 8 篇 mattfeld dirk c.

语言

  • 923 篇 英文
  • 49 篇 其他
  • 4 篇 中文
  • 2 篇 西班牙文
  • 1 篇 德文
  • 1 篇 法文
  • 1 篇 俄文
检索条件"主题词=Approximate Dynamic Programming"
979 条 记 录,以下是441-450 订阅
排序:
A model-based deep reinforcement learning method applied to finite-horizon optimal control of nonlinear control-affine system
收藏 引用
JOURNAL OF PROCESS CONTROL 2020年 87卷 166-178页
作者: Kim, Jong Woo Park, Byung Jun Yoo, Haeun Oh, Tae Hoon Lee, Jay H. Lee, Jong Min Seoul Natl Univ Sch Chem & Biol Engn Inst Chem Proc 1 Gwanak Ro Seoul 08826 South Korea Korea Adv Inst Sci & Technol Dept Chem & Biomol Engn 291 Daehak Ro Daejeon 34141 South Korea
The Hamilton-Jacobi-Bellman (HJB) equation can be solved to obtain optimal closed-loop control policies for general nonlinear systems. As it is seldom possible to solve the HJB equation exactly for nonlinear systems, ... 详细信息
来源: 评论
A dynamic mobile production capacity and inventory control problem
收藏 引用
IISE TRANSACTIONS 2020年 第8期52卷 926-943页
作者: Malladi, Satya S. Erera, Alan L. White, Chelsea C., III Georgia Inst Technol Atlanta GA 30332 USA Tech Univ Denmark Lyngby Denmark
We analyze a problem of dynamic logistics planning given uncertain demands for a multi-location production-inventory system with transportable modular production capacity. In such systems, production modules provide c... 详细信息
来源: 评论
An Approximation Algorithm for Network Revenue Management Under Nonstationary Arrivals
收藏 引用
OPERATIONS RESEARCH 2020年 第3期68卷 834-855页
作者: Ma, Yuhang Rusmevichientong, Paat Sumida, Mika Topaloglu, Huseyin Cornell Tech Sch Operat Res & Informat Engn New York NY 10044 USA Univ Southern Calif Marshall Sch Business Los Angeles CA 90089 USA
We provide an approximation algorithm for network revenue management problems. In our approximation algorithm, we construct an approximate policy using value function approximations that are expressed as linear combin... 详细信息
来源: 评论
Meso-parametric value function approximation for dynamic customer acceptances in delivery routing
收藏 引用
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH 2020年 第1期285卷 183-195页
作者: Ulmer, Marlin W. Thomas, Barrett W. Tech Univ Carolo Wilhelmina Braunschweig Carl Friedrich Gauss Fak Muhlenpfordtstr 23 D-38106 Braunschweig Germany Univ Iowa Tippie Coll Business 108 John Pappajohn Business Bldg Iowa City IA 52242 USA
The rise of mobile communication, ample computing power, and Amazon's training of customers has led to last-mile delivery challenges and created struggles for companies seeking to budget their limited delivery res... 详细信息
来源: 评论
An Approximation Approach for Response-Adaptive Clinical Trial Design
收藏 引用
INFORMS JOURNAL ON COMPUTING 2020年 第4期32卷 877-894页
作者: Ahuja, Vishal Birge, John R. Southern Methodist Univ Cox Sch Business Dallas TX 75275 USA Univ Chicago Booth Sch Business Chicago IL 60637 USA
Multiarmed bandit (MAB) problems, typically modeled as Markov decision processes (MDPs), exemplify the learning versus earning trade-off. An area that has motivated theoretical research in MAB designs is the study of ... 详细信息
来源: 评论
A Multi-Critic Reinforcement Learning Method: An Application to Multi-Tank Water Systems
收藏 引用
IEEE ACCESS 2020年 8卷 173227-173238页
作者: Martinez-Piazuelo, Juan Ochoa, Daniel E. Quijano, Nicanor Giraldo, Luis Felipe Univ los Andes Dept Ingn Elect & Elect Bogota 111711 Colombia Univ Colorado Dept Elect Comp & Energy Engn Boulder CO 80309 USA
This paper investigates the combination of reinforcement learning and neural networks applied to the data-driven control of dynamical systems. In particular, we propose a multi-critic actor-critic architecture that ea... 详细信息
来源: 评论
Robust optimal control for a class of nonlinear systems with unknown disturbances based on disturbance observer and policy iteration
收藏 引用
NEUROCOMPUTING 2020年 390卷 185-195页
作者: Song, Ruizhuo Lewis, Frank L. Univ Sci & Technol Beijing Sch Automat & Elect Engn Beijing 100083 Peoples R China Univ Texas Arlington UTA Res Inst Ft Worth TX 76118 USA
A robust optimal control method for a class of nonlinear systems with unknown disturbances is addressed in this paper. In this framework, adaptive dynamic programming (ADP) is presented to obtain the optimal control. ... 详细信息
来源: 评论
Coordinating Pricing and Empty Container Repositioning in Two-Depot Shipping Systems
收藏 引用
TRANSPORTATION SCIENCE 2020年 第6期54卷 1697-1713页
作者: Lu, Tao Lee, Chung-Yee Lee, Loo-Hay Univ Connecticut Sch Business Storrs CT 06269 USA Hong Kong Univ Sci & Technol Dept Ind Engn & Decis Analyt Kowloon Clear Water Bay Hong Kong Peoples R China Natl Univ Singapore Dept Ind Syst Engn & Management Singapore 119077 Singapore
This paper studies joint decisions on pricing and empty container repositioning in two-depot shipping services with stochastic shipping demand. We formulate the problem as a stochastic dynamic programming model. The e... 详细信息
来源: 评论
Least squares policy iteration with instrumental variables vs. direct policy search: comparison against optimal benchmarks using energy storage
收藏 引用
INFOR 2020年 第1期58卷 141-166页
作者: Moazehi, Somayeh Scott, Warren R. Powell, Warren B. Stevens Inst Technol Sch Business Hoboken NJ 07030 USA Princeton Univ Dept Operat Res & Financial Engn Princeton NJ 08544 USA
This article studies least-squares approximate policy iteration (API) methods with parametrized value-function approximation. We study several variations of the policy evaluation phase, namely, Bellman error minimizat... 详细信息
来源: 评论
dynamic multi-priority, multi-class patient scheduling with stochastic service times
收藏 引用
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH 2020年 第1期280卷 254-265页
作者: Saure, Antoine Begen, Mehmet A. Patrick, Jonathan Univ Ottawa Telfer Sch Management 55 Laurier Ave East Ottawa ON K1N 6N5 Canada Western Univ Ivey Sch Business 1255 Western Rd London ON N6G 0N1 Canada
Efficient patient scheduling has significant operational, clinical and economical benefits on health care systems by not only increasing the timely access of patients to care but also reducing costs. However, patient ... 详细信息
来源: 评论