咨询与建议

限定检索结果

文献类型

  • 754 篇 期刊文献
  • 209 篇 会议
  • 21 篇 学位论文
  • 1 册 图书

馆藏范围

  • 985 篇 电子文献
  • 1 种 纸本馆藏

日期分布

学科分类号

  • 744 篇 工学
    • 306 篇 计算机科学与技术...
    • 272 篇 电气工程
    • 249 篇 控制科学与工程
    • 86 篇 交通运输工程
    • 50 篇 机械工程
    • 41 篇 石油与天然气工程
    • 40 篇 土木工程
    • 36 篇 软件工程
    • 30 篇 信息与通信工程
    • 26 篇 化学工程与技术
    • 24 篇 动力工程及工程热...
    • 17 篇 仪器科学与技术
    • 8 篇 环境科学与工程(可...
    • 5 篇 力学(可授工学、理...
    • 5 篇 航空宇航科学与技...
    • 4 篇 建筑学
  • 358 篇 管理学
    • 341 篇 管理科学与工程(可...
    • 52 篇 工商管理
    • 6 篇 公共管理
  • 235 篇 理学
    • 200 篇 数学
    • 65 篇 系统科学
    • 11 篇 统计学(可授理学、...
    • 9 篇 物理学
    • 7 篇 生物学
    • 4 篇 生态学
  • 79 篇 经济学
    • 55 篇 应用经济学
    • 25 篇 理论经济学
  • 18 篇 医学
    • 11 篇 基础医学(可授医学...
    • 10 篇 临床医学
    • 7 篇 公共卫生与预防医...
  • 8 篇 军事学
  • 7 篇 农学
  • 1 篇 法学

主题

  • 985 篇 approximate dyna...
  • 143 篇 optimal control
  • 141 篇 reinforcement le...
  • 83 篇 adaptive dynamic...
  • 77 篇 neural networks
  • 64 篇 adaptive critic ...
  • 61 篇 markov decision ...
  • 60 篇 dynamic programm...
  • 51 篇 markov decision ...
  • 36 篇 nonlinear system...
  • 28 篇 adaptive dynamic...
  • 23 篇 adaptive control
  • 22 篇 neural network
  • 22 篇 uncertainty
  • 22 篇 policy iteration
  • 21 篇 linear programmi...
  • 20 篇 neuro-dynamic pr...
  • 18 篇 value function a...
  • 18 篇 dynamic pricing
  • 17 篇 value iteration

机构

  • 63 篇 chinese acad sci...
  • 33 篇 univ sci & techn...
  • 18 篇 princeton univ d...
  • 12 篇 georgia inst tec...
  • 11 篇 tsinghua univ de...
  • 10 篇 cornell univ sch...
  • 10 篇 school of automa...
  • 9 篇 northeastern uni...
  • 9 篇 univ rhode isl d...
  • 8 篇 air force instit...
  • 7 篇 the state key la...
  • 7 篇 south china univ...
  • 7 篇 univ illinois de...
  • 6 篇 univ chicago boo...
  • 6 篇 tsinghua univ sc...
  • 6 篇 univ chinese aca...
  • 6 篇 chinese acad sci...
  • 6 篇 univ chinese aca...
  • 5 篇 natl univ singap...
  • 5 篇 univ illinois de...

作者

  • 65 篇 wei qinglai
  • 58 篇 liu derong
  • 29 篇 song ruizhuo
  • 22 篇 powell warren b.
  • 21 篇 wang ding
  • 16 篇 lee jay h.
  • 15 篇 ulmer marlin w.
  • 13 篇 lee jong min
  • 12 篇 lewis frank l.
  • 12 篇 zhang huaguang
  • 11 篇 li hongliang
  • 10 篇 robbins matthew ...
  • 9 篇 lygeros john
  • 9 篇 derong liu
  • 8 篇 xu xin
  • 8 篇 lunday brian j.
  • 8 篇 topaloglu huseyi...
  • 8 篇 thomas barrett w...
  • 8 篇 huang zhijian
  • 8 篇 mattfeld dirk c.

语言

  • 930 篇 英文
  • 44 篇 其他
  • 4 篇 中文
  • 2 篇 西班牙文
检索条件"主题词=Approximate dynamic Programming"
985 条 记 录,以下是611-620 订阅
排序:
dynamic Pricing for Network Revenue Management: A New Approach and Application in the Hotel Industry
收藏 引用
INFORMS JOURNAL ON COMPUTING 2017年 第1期29卷 18-35页
作者: Zhang, Dan Weatherford, Larry Univ Colorado Leeds Sch Business Boulder CO 80309 USA Univ Wyoming Coll Business Laramie WY 82071 USA
dynamic pricing for network revenue management has received considerable attention in research and practice. Based on data obtained from a major hotel, we use a large-scale numerical study to compare the performance o... 详细信息
来源: 评论
approximate policy iteration for dynamic resource-constrained project scheduling
收藏 引用
OPERATIONS RESEARCH LETTERS 2017年 第5期45卷 442-447页
作者: Parizi, Mahshid Salemi Gocgun, Yasin Ghate, Archis Univ Washington Ind & Syst Engn BOX 352650 Seattle WA 98195 USA Altinbas Univ Dept Ind Engn Istanbul Turkey
We study non-preemptive scheduling problems where heterogeneous projects stochastically arrive over time. The projects include precedence-constrained tasks that require multiple resources. Incomplete projects are held... 详细信息
来源: 评论
Discrete-Time Local Value Iteration Adaptive dynamic programming: Admissibility and Termination Analysis
收藏 引用
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2017年 第11期28卷 2490-2502页
作者: Wei, Qinglai Liu, Derong Lin, Qiao Chinese Acad Sci Inst Automat State Key Lab Management & Control Complex Syst Beijing 100190 Peoples R China Univ Sci & Technol Beijing Sch Automat & Elect Engn Beijing 100083 Peoples R China
In this paper, a novel local value iteration adaptive dynamic programming (ADP) algorithm is developed to solve infinite horizon optimal control problems for discrete-time nonlinear systems. The focuses of this paper ... 详细信息
来源: 评论
Data mining for state space orthogonalization in adaptive dynamic programming
收藏 引用
EXPERT SYSTEMS WITH APPLICATIONS 2017年 76卷 49-58页
作者: Ariyajunya, Bancha Chen, Ying Chen, Victoria C. P. Kim, Seoung Bum Burapha Univ Fac Engn Chon Buri Thailand Univ Texas Arlington Dept Ind Mfg & Syst Engn Arlington TX 76019 USA Korea Univ Sch Ind Management Engn Seoul South Korea
dynamic programming (DP) is a mathematical programming approach for optimizing a system that changes over time and is a common approach for developing intelligent systems. Expert systems that are intelligent must be a... 详细信息
来源: 评论
A rollout algorithm framework for heuristic solutions to finite-horizon stochastic dynamic programs
收藏 引用
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH 2017年 第1期258卷 216-229页
作者: Goodson, Justin C. Thomas, Barrett W. Ohlmann, Jeffrey W. St Louis Univ John Cook Sch Business Dept Operat & Informat Technol Management 3674 Lindell Blvd St Louis MO 63108 USA Univ Iowa Tippie Coll Business Dept Management Sci Iowa City IA 52242 USA
Rollout algorithms have enjoyed success across a variety of domains as heuristic solution procedures for stochastic dynamic programs (SDPs). However, because most rollout implementations are closely tied to specific p... 详细信息
来源: 评论
A Multiobjective Path-Planning Algorithm With Time Windows for Asset Routing in a dynamic Weather-Impacted Environment
收藏 引用
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS 2017年 第12期47卷 3256-3271页
作者: Sidoti, David Avvari, Gopi Vinod Mishra, Manisha Zhang, Lingyi Nadella, Bala Kishore Peak, James E. Hansen, James A. Pattipati, Krishna R. Univ Connecticut Dept Elect & Comp Engn Storrs CT 06269 USA Doran Jones Bronx NY 10454 USA US Naval Res Lab NRL MRY Monterey CA 93943 USA
This paper presents a mixed-initiative tool for multiobjective planning and asset routing (TMPLAR) in dynamic and uncertain environments. TMPLAR is built upon multiobjective dynamic programming algorithms to route ass... 详细信息
来源: 评论
Anticipatory freight selection in intermodal long-haul round-trips
收藏 引用
TRANSPORTATION RESEARCH PART E-LOGISTICS AND TRANSPORTATION REVIEW 2017年 105卷 176-194页
作者: Rivera, Arturo E. Perez Mes, Martijn R. K. Univ Twente Dept Ind Engn & Business Informat Syst POB 217 NL-7500 AE Enschede Netherlands
We consider the planning problem faced by Logistic Service Providers (LSPs) transporting freights periodically, using long-haul round-trips. In each round-trip, freights are delivered and picked up at different locati... 详细信息
来源: 评论
Efficient Reinforcement Learning in Deterministic Systems with Value Function Generalization
收藏 引用
MATHEMATICS OF OPERATIONS RESEARCH 2017年 第3期42卷 762-782页
作者: Wen, Zheng Van Roy, Benjamin Adobe Res San Jose CA 95110 USA Stanford Univ Stanford CA 94305 USA
We consider the problem of reinforcement learning over episodes of a finite-horizon deterministic system and as a solution propose optimistic constraint propagation (OCP), an algorithm designed to synthesize efficient... 详细信息
来源: 评论
Error-Tolerant Iterative Adaptive dynamic programming for Optimal Renewable Home Energy Scheduling and Battery Management
收藏 引用
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS 2017年 第12期64卷 9527-9537页
作者: Wei, Qinglai Lewis, Frank L. Shi, Guang Song, Ruizhuo Chinese Acad Sci State Key Lab Management & Control Complex Syst Inst Automat Beijing 100190 Peoples R China Univ Chinese Acad Sci Beijing 100049 Peoples R China Univ Texas Arlington Res Inst Arlington TX 76118 USA Northeastern Univ Shenyang 110036 Liaoning Peoples R China Univ Sci & Technol Beijing Sch Automat & Elect Engn Beijing 100083 Peoples R China
In this paper, a novel error-tolerant iterative adaptive dynamic programming (ADP) algorithm is developed to solve optimal battery control and management problems in smart home environments with renewable energy. A ma... 详细信息
来源: 评论
Off-Policy Integral Reinforcement Learning Method to Solve Nonlinear Continuous-Time Multiplayer Nonzero-Sum Games
收藏 引用
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2017年 第3期28卷 704-713页
作者: Song, Ruizhuo Lewis, Frank L. Wei, Qinglai Univ Sci & Technol Beijing Sch Automat & Elect Engn Beijing 100083 Peoples R China Univ Texas Arlington UTA Res Inst Arlington TX 76019 USA Northeastern Univ State Key Lab Synthet Automat Proc Ind Shenyang 110819 Peoples R China Chinese Acad Sci Inst Automat State Key Lab Management & Control Complex Syst Beijing 100190 Peoples R China
This paper establishes an off-policy integral reinforcement learning (IRL) method to solve nonlinear continuous-time (CT) nonzero-sum (NZS) games with unknown system dynamics. The IRL algorithm is presented to obtain ... 详细信息
来源: 评论