咨询与建议

限定检索结果

文献类型

  • 747 篇 期刊文献
  • 208 篇 会议
  • 23 篇 学位论文
  • 1 册 图书

馆藏范围

  • 979 篇 电子文献
  • 1 种 纸本馆藏

日期分布

学科分类号

  • 746 篇 工学
    • 307 篇 计算机科学与技术...
    • 271 篇 电气工程
    • 249 篇 控制科学与工程
    • 86 篇 交通运输工程
    • 50 篇 机械工程
    • 42 篇 石油与天然气工程
    • 40 篇 土木工程
    • 38 篇 软件工程
    • 31 篇 信息与通信工程
    • 26 篇 化学工程与技术
    • 25 篇 动力工程及工程热...
    • 16 篇 仪器科学与技术
    • 8 篇 环境科学与工程(可...
    • 4 篇 力学(可授工学、理...
    • 4 篇 电子科学与技术(可...
    • 4 篇 建筑学
  • 356 篇 管理学
    • 339 篇 管理科学与工程(可...
    • 52 篇 工商管理
    • 6 篇 公共管理
  • 231 篇 理学
    • 196 篇 数学
    • 65 篇 系统科学
    • 11 篇 统计学(可授理学、...
    • 9 篇 物理学
    • 7 篇 生物学
    • 4 篇 生态学
  • 79 篇 经济学
    • 55 篇 应用经济学
    • 25 篇 理论经济学
  • 18 篇 医学
    • 11 篇 基础医学(可授医学...
    • 10 篇 临床医学
    • 7 篇 公共卫生与预防医...
  • 8 篇 军事学
  • 7 篇 农学
  • 3 篇 法学

主题

  • 979 篇 approximate dyna...
  • 142 篇 reinforcement le...
  • 141 篇 optimal control
  • 83 篇 adaptive dynamic...
  • 77 篇 neural networks
  • 64 篇 adaptive critic ...
  • 62 篇 markov decision ...
  • 59 篇 dynamic programm...
  • 50 篇 markov decision ...
  • 36 篇 nonlinear system...
  • 29 篇 adaptive dynamic...
  • 22 篇 uncertainty
  • 22 篇 adaptive control
  • 21 篇 neural network
  • 21 篇 policy iteration
  • 20 篇 neuro-dynamic pr...
  • 19 篇 linear programmi...
  • 18 篇 value function a...
  • 17 篇 value iteration
  • 17 篇 optimization

机构

  • 63 篇 chinese acad sci...
  • 33 篇 univ sci & techn...
  • 18 篇 princeton univ d...
  • 12 篇 georgia inst tec...
  • 11 篇 tsinghua univ de...
  • 10 篇 school of automa...
  • 9 篇 northeastern uni...
  • 9 篇 cornell univ sch...
  • 9 篇 univ rhode isl d...
  • 8 篇 air force instit...
  • 7 篇 the state key la...
  • 7 篇 south china univ...
  • 7 篇 univ illinois de...
  • 6 篇 univ chicago boo...
  • 6 篇 tsinghua univ sc...
  • 6 篇 univ chinese aca...
  • 6 篇 chinese acad sci...
  • 6 篇 univ chinese aca...
  • 5 篇 natl univ singap...
  • 5 篇 univ illinois de...

作者

  • 65 篇 wei qinglai
  • 58 篇 liu derong
  • 29 篇 song ruizhuo
  • 22 篇 powell warren b.
  • 21 篇 wang ding
  • 16 篇 lee jay h.
  • 15 篇 ulmer marlin w.
  • 13 篇 lee jong min
  • 12 篇 lewis frank l.
  • 12 篇 zhang huaguang
  • 11 篇 li hongliang
  • 10 篇 robbins matthew ...
  • 9 篇 lygeros john
  • 9 篇 derong liu
  • 8 篇 xu xin
  • 8 篇 lunday brian j.
  • 8 篇 topaloglu huseyi...
  • 8 篇 thomas barrett w...
  • 8 篇 huang zhijian
  • 8 篇 mattfeld dirk c.

语言

  • 923 篇 英文
  • 49 篇 其他
  • 4 篇 中文
  • 2 篇 西班牙文
  • 1 篇 德文
  • 1 篇 法文
  • 1 篇 俄文
检索条件"主题词=Approximate Dynamic Programming"
979 条 记 录,以下是861-870 订阅
排序:
Online solution of nonlinear two-player zero-sum games using synchronous policy iteration
收藏 引用
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL 2012年 第13期22卷 1460-1483页
作者: Vamvoudakis, Kyriakos G. Lewis, F. L. Univ Texas Arlington Automat & Robot Res Inst Ft Worth TX 76118 USA
The two-player zero-sum (ZS) game problem provides the solution to the bounded L2-gain problem and so is important for robust control. However, its solution depends on solving a design HamiltonJacobiIsaacs (HJI) equat... 详细信息
来源: 评论
Multi-rate control policies for elastic traffic in CDMA networks
收藏 引用
PERFORMANCE EVALUATION 2012年 第10期69卷 510-523页
作者: Papadaki, Katerina Friderikos, Vasilis London Sch Econ Dept Management Management Sci Grp London WC2A 2AE England Kings Coll London Ctr Telecommun Res Div Engn London WC2R 2LS England
In this paper a rate control scheme for downlink packet transmission in CDMA networks is proposed based on both the queue lengths and the channel states of mobile users. We are interested in optimal rate allocation po... 详细信息
来源: 评论
Optimal control of unknown nonaffine nonlinear discrete-time systems based on adaptive dynamic programming
收藏 引用
AUTOMATICA 2012年 第8期48卷 1825-1832页
作者: Wang, Ding Liu, Derong Wei, Qinglai Zhao, Dongbin Jin, Ning Chinese Acad Sci Inst Automat State Key Lab Management & Control Complex Syst Beijing 100190 Peoples R China Univ Illinois Dept Elect & Comp Engn Chicago IL 60607 USA
An intelligent-optimal control scheme for unknown nonaffine nonlinear discrete-time systems with discount factor in the cost function is developed in this paper. The iterative adaptive dynamic programming algorithm is... 详细信息
来源: 评论
An iterative ∈-optimal control scheme for a class of discrete-time nonlinear systems with unfixed initial state
收藏 引用
NEURAL NETWORKS 2012年 32卷 236-244页
作者: Wei, Qinglai Liu, Derong Chinese Acad Sci Inst Automat State Key Lab Management & Control Complex Syst Beijing 100190 Peoples R China
In this paper, a finite horizon iterative adaptive dynamic programming (ADP) algorithm is proposed to solve the optimal control problem for a class of discrete-time nonlinear systems with unfixed initial state. A new ... 详细信息
来源: 评论
Neural-Network-Based Optimal Control for a Class of Unknown Discrete-Time Nonlinear Systems Using Globalized Dual Heuristic programming
收藏 引用
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING 2012年 第3期9卷 628-634页
作者: Liu, Derong Wang, Ding Zhao, Dongbin Wei, Qinglai Jin, Ning Chinese Acad Sci Inst Automat State Key Lab Management & Control Complex Syst Beijing 100190 Peoples R China Univ Illinois Dept Elect & Comp Engn Chicago IL 60607 USA
In this paper, a neuro-optimal control scheme for a class of unknown discrete-time nonlinear systems with discount factor in the cost function is developed. The iterative adaptive dynamic programming algorithm using g... 详细信息
来源: 评论
Developing green fleet management strategies: Repair/retrofit/replacement decisions under environmental regulation
收藏 引用
TRANSPORTATION RESEARCH PART A-POLICY AND PRACTICE 2012年 第8期46卷 1216-1226页
作者: Stasko, Timon H. Gao, H. Oliver Cornell Univ Ithaca NY 14853 USA
The considerable cost of maintaining large fleets has generated interest in cost minimization strategies. With many related decisions, numerous constraints, and significant sources of uncertainty (e.g. vehicle breakdo... 详细信息
来源: 评论
LEAST SQUARES TEMPORAL DIFFERENCE METHODS: AN ANALYSIS UNDER GENERAL CONDITIONS
收藏 引用
SIAM JOURNAL ON CONTROL AND OPTIMIZATION 2012年 第6期50卷 3310-3343页
作者: Yu, Huizhen MIT LIDS Cambridge MA 02139 USA
We consider approximate policy evaluation for finite state and action Markov decision processes (MDP) with the least squares temporal difference (LSTD) algorithm, LSTD(lambda), in an exploration-enhanced learning cont... 详细信息
来源: 评论
dynamic multi-appointment patient scheduling for radiation therapy
收藏 引用
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH 2012年 第2期223卷 573-584页
作者: Saure, Antoine Patrick, Jonathan Tyldesley, Scott Puterman, Martin L. Univ British Columbia Sauder Sch Business Vancouver BC V6T 1Z2 Canada Univ Ottawa Telfer Sch Management Ottawa ON K1N 6N5 Canada British Columbia Canc Agcy Vancouver BC V5Z 4E6 Canada
Seeking to reduce the potential impact of delays on radiation therapy cancer patients such as psychological distress, deterioration in quality of life and decreased cancer control and survival, and motivated by ineffi... 详细信息
来源: 评论
Computing Near-Optimal Policies in Generalized Joint Replenishment
收藏 引用
INFORMS JOURNAL ON COMPUTING 2012年 第1期24卷 148-164页
作者: Adelman, Daniel Klabjan, Diego Univ Chicago Booth Sch Business Chicago IL 60637 USA Northwestern Univ Dept Ind Engn & Management Sci Evanston IL 60208 USA
We provide a practical methodology for solving the generalized joint replenishment (GJR) problem, based on a mathematical programming approach to approximate dynamic programming. We show how to automatically generate ... 详细信息
来源: 评论
Optimal Controller Design Algorithm For Non-Affine in Input Discrete-Time Nonlinear System
收藏 引用
JORDAN JOURNAL OF MECHANICAL AND INDUSTRIAL ENGINEERING 2012年 第2期6卷 155-161页
作者: Al-Tamimi, A. Hashemite Univ Dept Mech Engn Zarqa Jordan
Convergence is proven of the value-iteration-based algorithm to find the optimal controller in the case of general non-affine in input nonlinear systems. That is, it is shown that algorithm converges to the optimal co... 详细信息
来源: 评论