咨询与建议

限定检索结果

文献类型

  • 749 篇 期刊文献
  • 209 篇 会议
  • 23 篇 学位论文
  • 1 册 图书

馆藏范围

  • 982 篇 电子文献
  • 1 种 纸本馆藏

日期分布

学科分类号

  • 749 篇 工学
    • 307 篇 计算机科学与技术...
    • 271 篇 电气工程
    • 251 篇 控制科学与工程
    • 86 篇 交通运输工程
    • 51 篇 机械工程
    • 42 篇 石油与天然气工程
    • 40 篇 土木工程
    • 38 篇 软件工程
    • 31 篇 信息与通信工程
    • 26 篇 化学工程与技术
    • 25 篇 动力工程及工程热...
    • 16 篇 仪器科学与技术
    • 8 篇 环境科学与工程(可...
    • 5 篇 力学(可授工学、理...
    • 5 篇 航空宇航科学与技...
    • 4 篇 电子科学与技术(可...
  • 357 篇 管理学
    • 340 篇 管理科学与工程(可...
    • 52 篇 工商管理
    • 6 篇 公共管理
  • 231 篇 理学
    • 196 篇 数学
    • 65 篇 系统科学
    • 11 篇 统计学(可授理学、...
    • 9 篇 物理学
    • 7 篇 生物学
    • 4 篇 生态学
  • 79 篇 经济学
    • 55 篇 应用经济学
    • 25 篇 理论经济学
  • 18 篇 医学
    • 11 篇 基础医学(可授医学...
    • 10 篇 临床医学
    • 7 篇 公共卫生与预防医...
  • 8 篇 军事学
  • 7 篇 农学
  • 3 篇 法学

主题

  • 982 篇 approximate dyna...
  • 142 篇 reinforcement le...
  • 142 篇 optimal control
  • 83 篇 adaptive dynamic...
  • 77 篇 neural networks
  • 64 篇 adaptive critic ...
  • 62 篇 markov decision ...
  • 59 篇 dynamic programm...
  • 50 篇 markov decision ...
  • 36 篇 nonlinear system...
  • 29 篇 adaptive dynamic...
  • 22 篇 neural network
  • 22 篇 uncertainty
  • 22 篇 adaptive control
  • 21 篇 policy iteration
  • 20 篇 neuro-dynamic pr...
  • 19 篇 linear programmi...
  • 18 篇 value function a...
  • 17 篇 value iteration
  • 17 篇 optimization

机构

  • 63 篇 chinese acad sci...
  • 33 篇 univ sci & techn...
  • 18 篇 princeton univ d...
  • 12 篇 georgia inst tec...
  • 11 篇 tsinghua univ de...
  • 10 篇 school of automa...
  • 9 篇 northeastern uni...
  • 9 篇 cornell univ sch...
  • 9 篇 univ rhode isl d...
  • 8 篇 air force instit...
  • 7 篇 the state key la...
  • 7 篇 south china univ...
  • 7 篇 univ illinois de...
  • 6 篇 univ chicago boo...
  • 6 篇 tsinghua univ sc...
  • 6 篇 univ chinese aca...
  • 6 篇 chinese acad sci...
  • 6 篇 univ chinese aca...
  • 5 篇 natl univ singap...
  • 5 篇 univ illinois de...

作者

  • 65 篇 wei qinglai
  • 58 篇 liu derong
  • 29 篇 song ruizhuo
  • 22 篇 powell warren b.
  • 21 篇 wang ding
  • 16 篇 lee jay h.
  • 15 篇 ulmer marlin w.
  • 13 篇 lee jong min
  • 12 篇 lewis frank l.
  • 12 篇 zhang huaguang
  • 11 篇 li hongliang
  • 10 篇 robbins matthew ...
  • 9 篇 lygeros john
  • 9 篇 derong liu
  • 8 篇 xu xin
  • 8 篇 lunday brian j.
  • 8 篇 topaloglu huseyi...
  • 8 篇 thomas barrett w...
  • 8 篇 huang zhijian
  • 8 篇 mattfeld dirk c.

语言

  • 926 篇 英文
  • 49 篇 其他
  • 4 篇 中文
  • 2 篇 西班牙文
检索条件"主题词=Approximate dynamic Programming"
982 条 记 录,以下是271-280 订阅
排序:
A Low-Rank Approximation for MDPs via Moment Coupling
收藏 引用
OPERATIONS RESEARCH 2024年 第3期72卷 1255-1277页
作者: Zhang, Amy B. Z. Gurvich, Itai Cornell Univ Sch Operat Res & Informat Engn Ithaca NY 14853 USA Northwestern Univ Kellogg Sch Management Evanston IL 60208 USA
We introduce a framework to approximate Markov decision processes (MDPs) that stands on two pillars: (i) state aggregation, as the algorithmic infrastructure, and (ii) central-limit-theorem-type approximations, as the... 详细信息
来源: 评论
A tutorial on value function approximation for stochastic and dynamic transportation
收藏 引用
4OR-A QUARTERLY JOURNAL OF OPERATIONS RESEARCH 2024年 第1期22卷 145-173页
作者: Heinold, Arne Univ Kiel Sch Econ & Business Kiel Germany
This paper provides an introductory tutorial on Value Function Approximation (VFA), a solution class from approximate dynamic programming. VFA describes a heuristic way for solving sequential decision processes like a... 详细信息
来源: 评论
Online accelerated data-driven learning for optimal feedback control of discrete-time partially uncertain systems
收藏 引用
INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING 2024年 第3期38卷 848-876页
作者: Somers, Luke Haddad, Wassim M. Kokolakis, Nick-Marios T. Vamvoudakis, Kyriakos G. Georgia Inst Technol Sch Aerosp Engn Atlanta GA USA Georgia Inst Technol Sch Aerosp Engn Atlanta GA 30332 USA
In this paper, we develop an online learning algorithm for solving the Bellman equation for affine in the control discrete-time nonlinear uncertain dynamical systems. To ensure accelerated learning of our algorithm in... 详细信息
来源: 评论
A stabilizing reinforcement learning approach for sampled systems with partially unknown models
收藏 引用
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL 2024年 第18期34卷 12389-12412页
作者: Beckenbach, Lukas Osinenko, Pavel Streif, Stefan Tech Univ Chemnitz Automatic Control & Dynam Syst Lab Chemnitz Germany Skolkovo Inst Sci & Technol Digital Engn Ctr Moscow Russia
Reinforcement learning is commonly associated with training of reward-maximizing (or cost-minimizing) agents, in other words, controllers. It can be applied in model-free or model-based fashion, using a priori or onli... 详细信息
来源: 评论
Off-Policy Model-Free Learning for Multi-Player Non-Zero-Sum Games With Constrained Inputs
收藏 引用
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS 2023年 第2期70卷 910-920页
作者: Huo, Yu Wang, Ding Qiao, Junfei Li, Menghua Beijing Univ Technol Beijing Inst Artificial Intelligence Fac Informat Technol Beijing Key Lab Computat Intelligence & Intelligen Beijing 100124 Peoples R China Beijing Univ Technol Beijing Inst Artificial Intelligence Fac Informat Technol Beijing Lab Smart Environm Protect Beijing 100124 Peoples R China
In this paper, multi-player non-zero-sum games with control constraints are studied by utilizing a novel model-free approach based on adaptive dynamic programming framework. First, the model-based policy iteration (PI... 详细信息
来源: 评论
Fourier-Hermite dynamic programming for Optimal Control
收藏 引用
IEEE TRANSACTIONS ON AUTOMATIC CONTROL 2023年 第10期68卷 6377-6384页
作者: Hassan, Syeda Sakira Sarkka, Simo Aalto Univ Dept Elect Engn & Automat Espoo 02150 Finland
In this article, we propose a novel computational method for solving nonlinear optimal control problems. The method is based on the use of Fourier-Hermite series for approximating the action-value function arising in ... 详细信息
来源: 评论
Combined Use of dynamic Inversion and Reinforcement Learning for Motion Control of an Supersonic Transport Aircraft
收藏 引用
OPTICAL MEMORY AND NEURAL NETWORKS 2024年 第SUPPL3期33卷 S399-S413页
作者: Dhiman, Gaurav Tiumentsev, Yu. V. Tskhai, R. A. Natl Res Univ Moscow Aviat Inst Moscow 125080 Russia
The task of aircraft motion control has to be solved under conditions of numerous heterogeneous uncertainties both in the aircraft motion model and in the environment in which the aircraft is flying. These uncertainti... 详细信息
来源: 评论
Self-Guided approximate Linear Programs: Randomized Multi-Shot Approximation of Discounted Cost Markov Decision Processes
收藏 引用
MANAGEMENT SCIENCE 2025年 第4期71卷 iv-vi, 2751-3636页
作者: Pakiman, Parshan Nadarajah, Selvaprabu Soheili, Negar Lin, Qihang Univ Illinois Coll Business Adm Chicago IL 60607 USA Univ Iowa Tippie Coll Business Iowa City IA 52242 USA
approximate linear programs (ALPs) are well-known models based on value function approximations (VFAs) to obtain policies and lower bounds on the optimal policy cost of discounted-cost Markov decision processes (MDPs)... 详细信息
来源: 评论
A Bayesian learning and pricing model with multiple unknown demand parameters
收藏 引用
ANNALS OF OPERATIONS RESEARCH 2024年 第1期343卷 493-513页
作者: Xiao, Baichun Yang, Wei Long Isl Univ Coll Management CW Post Brookville NY 11548 USA
This article presents a Bayesian learning model for demand estimation in revenue management. Different from most existing models in the literature, our discussion centers on demand functions with an arbitrary number o... 详细信息
来源: 评论
Balancing resources for dynamic vehicle routing with stochastic customer requests
收藏 引用
OR SPECTRUM 2024年 第2期46卷 331-373页
作者: Soeffker, Ninja Ulmer, Marlin W. Mattfeld, Dirk C. Univ Vienna Dept Business Decis & Analyt Vienna Austria Otto von Guericke Univ Chair Management Sci Magdeburg Germany Tech Univ Carolo Wilhelmina Braunschweig Decis Support Grp Braunschweig Germany
We consider a service provider performing pre-planned service for initially known customers with a fleet of vehicles, e.g., parcel delivery. During execution, new dynamic service requests occur, e.g., for parcel picku... 详细信息
来源: 评论