Search criteria: "Subject term = Approximate dynamic Programming"
982 records; showing 391-400

Application of machine learning to assess the value of information in polymer flooding
Petroleum Research, 2021, Vol. 6, No. 4, pp. 309-320
Authors: Amine Tadjer; Reidar B. Bratvold; Aojie Hong; Remus Hanea
Affiliations: University of Stavanger Norway Equinor Norway
In this work, we provide a more consistent alternative for performing value of information (VOI) analyses to address sequential decision problems in reservoir management and generate insights on the process of reservoir ...

Real-time dispatch of integrated electricity and thermal system incorporating storages via a stochastic dynamic programming with imitation learning
INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2023, Vol. 153
Authors: Pan, Zhenning; Yu, Tao; Huang, Wenqi; Wu, Yufeng; Chen, Junbin; Zhu, Kedong; Lu, Jidong
Affiliations: South China Univ Technol Sch Elect Power Engn Guangzhou 510640 Peoples R China CSG Digital Grid Res Inst Co Ltd Guangzhou 510670 Peoples R China
Coordinated dispatch of integrated electricity and thermal system (IETS) provides extra operation flexibility which is further improved by integration of electrical and thermal storages. However, the problem non-conve...

An exposition of least square Monte Carlo approach for real options valuation
GEOENERGY SCIENCE AND ENGINEERING, 2023, Vol. 222
Authors: Ahmadi, Rouholah; Bratvold, Reidar Brumer
Affiliations: Univ Stavanger Fac Sci & Technol Dept Energy Resources Stavanger Norway Natl IOR Centre Norway Bergen Norway
The least square Monte Carlo simulation (LSM) approach is a state-of-the-art approach built upon approximate dynamic programming for the selection of single or multiple exercise options, and it has been extensively us...

Temporal logic guided safe model-based reinforcement learning: A hybrid systems approach
NONLINEAR ANALYSIS-HYBRID SYSTEMS, 2023, Vol. 47
Authors: Cohen, Max H.; Serlin, Zachary; Leahy, Kevin; Belta, Calin
Affiliations: Boston Univ Dept Mech Engn 110 Cummington Mall Boston MA 02215 USA MIT Lincoln Lab Lexington MA USA
This paper studies the problem of synthesizing control policies for uncertain continuous-time nonlinear systems from linear temporal logic (LTL) specifications using model-based reinforcement learning (MBRL). Rather ...

Value-gradient iteration with quadratic approximate value functions
ANNUAL REVIEWS IN CONTROL, 2023, Vol. 56
Authors: Yang, Alan; Boyd, Stephen
Affiliations: Stanford Univ Dept Elect Engn Stanford CA 94305 USA
We propose a method for designing policies for convex stochastic control problems characterized by random linear dynamics and convex stage cost. We consider policies that employ quadratic approximate value functions a...

Optimizing Trading Decisions for Hydro Storage Systems Using Approximate Dual Dynamic Programming
OPERATIONS RESEARCH, 2013, Vol. 61, No. 4, pp. 810-823
Authors: Loehndorf, Nils; Wozabal, David; Minner, Stefan
Affiliations: Vienna Univ Econ & Business A-1020 Vienna Austria Tech Univ Munich D-80333 Munich Germany
We propose a new approach to optimize operations of hydro storage systems with multiple connected reservoirs whose operators participate in wholesale electricity markets. Our formulation integrates short-term intraday...

Approximated multi-agent fitted Q iteration
SYSTEMS & CONTROL LETTERS, 2023, Vol. 177
Authors: Lesage-Landry, Antoine; Callaway, Duncan S.
Affiliations: Polytech Montreal Dept Elect Engn Mila & GERAD 2500 Polytech Rd Montreal PQ H3T 1J4 Canada Univ Calif Berkeley Energy & Resources Grp 337 Giannini Hall Berkeley CA 94720 USA
We formulate an efficient approximation for multi-agent batch reinforcement learning, the approximated multi-agent fitted Q iteration (AMAFQI). We present a detailed derivation of our approach. We propose an iterativ...

Opportunities for reinforcement learning in stochastic dynamic vehicle routing
COMPUTERS & OPERATIONS RESEARCH, 2023, Vol. 150
Authors: Hildebrandt, Florentin D.; Thomas, Barrett W.; Ulmer, Marlin W.
Affiliations: Otto von Guericke Univ Dept Management Sci Magdeburg Germany Univ Iowa Dept Business Analyt Iowa City IA USA
There has been a paradigm-shift in urban logistic services in the last years; demand for real-time, instant mobility and delivery services grows. This poses new challenges to logistic service providers as the underlyin...

Dynamic multistage scheduling for patient-centered care plans
HEALTH CARE MANAGEMENT SCIENCE, 2021, Vol. 24, No. 4, pp. 827-844
Authors: Diamant, Adam
Affiliations: York Univ Schulich Sch Business 111 Ian Macdonald Blvd Toronto ON M3J 1P3 Canada
We investigate the scheduling practices of multistage outpatient health programs that offer care plans customized to the needs of their patients. We formulate the scheduling problem as a Markov decision process (MDP) ...

Optimization of cyclic air braking strategy for heavy haul trains: an ADP approach
IEEE Intelligent Transportation Systems Conference (ITSC)
Authors: Su, S.; Liu, W.; Huang, Y.; Tang, T.
Affiliations: Beijing Jiaotong Univ State Key Lab Traff Control & Safety Frontiers Sci Ctr Smart High Speed Railway Syst Beijing Peoples R China
The cyclic air braking strategy on the long steep downward slopes is one of the main challenges to the heavy haul train control in China. To overcome this dilemma, this paper proposes an optimization method of cyclic ...