咨询与建议

限定检索结果

文献类型

  • 747 篇 期刊文献
  • 208 篇 会议
  • 23 篇 学位论文
  • 1 册 图书

馆藏范围

  • 979 篇 电子文献
  • 1 种 纸本馆藏

日期分布

学科分类号

  • 746 篇 工学
    • 307 篇 计算机科学与技术...
    • 271 篇 电气工程
    • 249 篇 控制科学与工程
    • 86 篇 交通运输工程
    • 50 篇 机械工程
    • 42 篇 石油与天然气工程
    • 40 篇 土木工程
    • 38 篇 软件工程
    • 31 篇 信息与通信工程
    • 26 篇 化学工程与技术
    • 25 篇 动力工程及工程热...
    • 16 篇 仪器科学与技术
    • 8 篇 环境科学与工程(可...
    • 4 篇 力学(可授工学、理...
    • 4 篇 电子科学与技术(可...
    • 4 篇 建筑学
  • 356 篇 管理学
    • 339 篇 管理科学与工程(可...
    • 52 篇 工商管理
    • 6 篇 公共管理
  • 231 篇 理学
    • 196 篇 数学
    • 65 篇 系统科学
    • 11 篇 统计学(可授理学、...
    • 9 篇 物理学
    • 7 篇 生物学
    • 4 篇 生态学
  • 79 篇 经济学
    • 55 篇 应用经济学
    • 25 篇 理论经济学
  • 18 篇 医学
    • 11 篇 基础医学(可授医学...
    • 10 篇 临床医学
    • 7 篇 公共卫生与预防医...
  • 8 篇 军事学
  • 7 篇 农学
  • 3 篇 法学

主题

  • 979 篇 approximate dyna...
  • 142 篇 reinforcement le...
  • 141 篇 optimal control
  • 83 篇 adaptive dynamic...
  • 77 篇 neural networks
  • 64 篇 adaptive critic ...
  • 62 篇 markov decision ...
  • 59 篇 dynamic programm...
  • 50 篇 markov decision ...
  • 36 篇 nonlinear system...
  • 29 篇 adaptive dynamic...
  • 22 篇 uncertainty
  • 22 篇 adaptive control
  • 21 篇 neural network
  • 21 篇 policy iteration
  • 20 篇 neuro-dynamic pr...
  • 19 篇 linear programmi...
  • 18 篇 value function a...
  • 17 篇 value iteration
  • 17 篇 optimization

机构

  • 63 篇 chinese acad sci...
  • 33 篇 univ sci & techn...
  • 18 篇 princeton univ d...
  • 12 篇 georgia inst tec...
  • 11 篇 tsinghua univ de...
  • 10 篇 school of automa...
  • 9 篇 northeastern uni...
  • 9 篇 cornell univ sch...
  • 9 篇 univ rhode isl d...
  • 8 篇 air force instit...
  • 7 篇 the state key la...
  • 7 篇 south china univ...
  • 7 篇 univ illinois de...
  • 6 篇 univ chicago boo...
  • 6 篇 tsinghua univ sc...
  • 6 篇 univ chinese aca...
  • 6 篇 chinese acad sci...
  • 6 篇 univ chinese aca...
  • 5 篇 natl univ singap...
  • 5 篇 univ illinois de...

作者

  • 65 篇 wei qinglai
  • 58 篇 liu derong
  • 29 篇 song ruizhuo
  • 22 篇 powell warren b.
  • 21 篇 wang ding
  • 16 篇 lee jay h.
  • 15 篇 ulmer marlin w.
  • 13 篇 lee jong min
  • 12 篇 lewis frank l.
  • 12 篇 zhang huaguang
  • 11 篇 li hongliang
  • 10 篇 robbins matthew ...
  • 9 篇 lygeros john
  • 9 篇 derong liu
  • 8 篇 xu xin
  • 8 篇 lunday brian j.
  • 8 篇 topaloglu huseyi...
  • 8 篇 thomas barrett w...
  • 8 篇 huang zhijian
  • 8 篇 mattfeld dirk c.

语言

  • 923 篇 英文
  • 49 篇 其他
  • 4 篇 中文
  • 2 篇 西班牙文
  • 1 篇 德文
  • 1 篇 法文
  • 1 篇 俄文
检索条件"主题词=Approximate Dynamic Programming"
979 条 记 录,以下是381-390 订阅
排序:
Modified value-function-approximation for synchronous policy iteration with single-critic configuration for nonlinear optimal control
收藏 引用
INTERNATIONAL JOURNAL OF CONTROL 2021年 第5期94卷 1321-1333页
作者: Tang, Difan Chen, Lei Tian, Zhao Feng Hu, Eric Univ Adelaide Sch Mech Engn Adelaide SA Australia
This study proposes a modified value-function-approximation (MVFA) and investigates its use under a single-critic configuration based on neural networks (NNs) for synchronous policy iteration (SPI) to deliver compact ... 详细信息
来源: 评论
Nonsmooth Data-Based Reinforcement Learning for Online approximate Optimal Control
Nonsmooth Data-Based Reinforcement Learning for Online Appro...
收藏 引用
作者: Greene, Max Lewis University of Florida
学位级别:Ph.D., Doctor of Philosophy
Autonomous systems are often constrained by time-critical mission constraints and limited power. Such constraints motivate optimality in mission execution. Reinforcement learning (RL) has become a tool to facilitate l... 详细信息
来源: 评论
Application of machine learning to assess the value of information in polymer flooding
收藏 引用
Petroleum Research 2021年 第4期6卷 309-320页
作者: Amine Tadjer Reidar B.Bratvold Aojie Hong Remus Hanea University of Stavanger Norway Equinor Norway
In this work,we provide a more consistent alternative for performing value of information(VOI)analyses to address sequential decision problems in reservoir management and generate insights on the process of reservoir ... 详细信息
来源: 评论
Temporal logic guided safe model-based reinforcement learning: A hybrid systems approach
收藏 引用
NONLINEAR ANALYSIS-HYBRID SYSTEMS 2023年 47卷
作者: Cohen, Max H. Serlin, Zachary Leahy, Kevin Belta, Calin Boston Univ Dept Mech Engn 110 Cummington Mall Boston MA 02215 USA MIT Lincoln Lab Lexington MA USA
This paper studies the problem of synthesizing control policies for uncertain continuous -time nonlinear systems from linear temporal logic (LTL) specifications using model-based reinforcement learning (MBRL). Rather ... 详细信息
来源: 评论
Managing a Hybrid RDC-DC Inventory System
收藏 引用
PRODUCTION AND OPERATIONS MANAGEMENT 2021年 第10期30卷 3679-3697页
作者: Wang, Tong Yan, Xiaoyue Yang, Chaolin Shanghai Jiao Tong Univ Antai Coll Econ & Management Shanghai Peoples R China Cornell Univ Samuel Curtis Johnson Grad Sch Management Ithaca NY 14853 USA Shanghai Univ Finance & Econ Sch Informat Management & Engn Res Inst Interdisciplinary Sci Shanghai Peoples R China
In this study, we study a hybrid RDC-DC serial inventory system where the regional distribution center (RDC) replenishes its stock from an outside supplier (OS), while the distribution center (DC) faces random demand ... 详细信息
来源: 评论
Sequential learning based re-optimization approaches for less model-based dynamic pick-up routing problem
收藏 引用
INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE-OPERATIONS & LOGISTICS 2024年 第1期11卷
作者: Yu, Wu Southwest Jiaotong Univ Sch Transportat & Logist Chengdu 611756 Sichuan Peoples R China Natl United Engn Lab Integrated & Intelligent Tran Chengdu Peoples R China Natl Engn Lab Big Data Applicat Integrated Transpo Chengdu Peoples R China
We address a lessmodel-based dynamic routing problem arising from home parcel pick-up service, where lessmodel-based means existing customers who dynamically request services independently following Poisson process wi... 详细信息
来源: 评论
Real-time dispatch of integrated electricity and thermal system incorporating storages via a stochastic dynamic programming with imitation learning
收藏 引用
INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS 2023年 第1期153卷
作者: Pan, Zhenning Yu, Tao Huang, Wenqi Wu, Yufeng Chen, Junbin Zhu, Kedong Lu, Jidong South China Univ Technol Sch Elect Power Engn Guangzhou 510640 Peoples R China CSG Digital Grid Res Inst Co Ltd Guangzhou 510670 Peoples R China
Coordinated dispatch of integrated electricity and thermal system (IETS) provides extra operation flexibility which is further improved by integration of electrical and thermal storages. However, the problem non-conve... 详细信息
来源: 评论
An exposition of least square Monte Carlo approach for real options valuation
收藏 引用
GEOENERGY SCIENCE AND ENGINEERING 2023年 222卷
作者: Ahmadi, Rouholah Bratvold, Reidar Brumer Univ Stavanger Fac Sci & Technol Dept Energy Resources Stavanger Norway Natl IOR Centre Norway Bergen Norway
The least square Monte Carlo simulation (LSM) approach is a state-of-the-art approach built upon approximate dynamic programming for the selection of single or multiple exercise options, and it has been extensively us... 详细信息
来源: 评论
Value-gradient iteration with quadratic approximate value functions
收藏 引用
ANNUAL REVIEWS IN CONTROL 2023年 56卷
作者: Yang, Alan Boyd, Stephen Stanford Univ Dept Elect Engn Stanford CA 94305 USA
We propose a method for designing policies for convex stochastic control problems characterized by random linear dynamics and convex stage cost. We consider policies that employ quadratic approximate value functions a... 详细信息
来源: 评论
Critical chain based Proactive-Reactive scheduling for Resource-Constrained project scheduling under uncertainty
收藏 引用
EXPERT SYSTEMS WITH APPLICATIONS 2023年 214卷
作者: Peng, Wuliang Lin, Xuejun Li, Haitao Yantai Univ Sch Econ & Management Yantai Peoples R China Univ Missouri St Louis Coll Business Adm St Louis MO USA
Project scheduling problems under both resource constraints and uncertainty have been widely studied due to their real world relevance. In this paper, we design and implement a new integrated proactive-reactive soluti... 详细信息
来源: 评论