咨询与建议

限定检索结果

文献类型

  • 754 篇 期刊文献
  • 209 篇 会议
  • 21 篇 学位论文
  • 1 册 图书

馆藏范围

  • 985 篇 电子文献
  • 1 种 纸本馆藏

日期分布

学科分类号

  • 744 篇 工学
    • 306 篇 计算机科学与技术...
    • 272 篇 电气工程
    • 249 篇 控制科学与工程
    • 86 篇 交通运输工程
    • 50 篇 机械工程
    • 41 篇 石油与天然气工程
    • 40 篇 土木工程
    • 36 篇 软件工程
    • 30 篇 信息与通信工程
    • 26 篇 化学工程与技术
    • 24 篇 动力工程及工程热...
    • 17 篇 仪器科学与技术
    • 8 篇 环境科学与工程(可...
    • 5 篇 力学(可授工学、理...
    • 5 篇 航空宇航科学与技...
    • 4 篇 建筑学
  • 358 篇 管理学
    • 341 篇 管理科学与工程(可...
    • 52 篇 工商管理
    • 6 篇 公共管理
  • 235 篇 理学
    • 200 篇 数学
    • 65 篇 系统科学
    • 11 篇 统计学(可授理学、...
    • 9 篇 物理学
    • 7 篇 生物学
    • 4 篇 生态学
  • 79 篇 经济学
    • 55 篇 应用经济学
    • 25 篇 理论经济学
  • 18 篇 医学
    • 11 篇 基础医学(可授医学...
    • 10 篇 临床医学
    • 7 篇 公共卫生与预防医...
  • 8 篇 军事学
  • 7 篇 农学
  • 1 篇 法学

主题

  • 985 篇 approximate dyna...
  • 143 篇 optimal control
  • 141 篇 reinforcement le...
  • 83 篇 adaptive dynamic...
  • 77 篇 neural networks
  • 64 篇 adaptive critic ...
  • 61 篇 markov decision ...
  • 60 篇 dynamic programm...
  • 51 篇 markov decision ...
  • 36 篇 nonlinear system...
  • 28 篇 adaptive dynamic...
  • 23 篇 adaptive control
  • 22 篇 neural network
  • 22 篇 uncertainty
  • 22 篇 policy iteration
  • 21 篇 linear programmi...
  • 20 篇 neuro-dynamic pr...
  • 18 篇 value function a...
  • 18 篇 dynamic pricing
  • 17 篇 value iteration

机构

  • 63 篇 chinese acad sci...
  • 33 篇 univ sci & techn...
  • 18 篇 princeton univ d...
  • 12 篇 georgia inst tec...
  • 11 篇 tsinghua univ de...
  • 10 篇 cornell univ sch...
  • 10 篇 school of automa...
  • 9 篇 northeastern uni...
  • 9 篇 univ rhode isl d...
  • 8 篇 air force instit...
  • 7 篇 the state key la...
  • 7 篇 south china univ...
  • 7 篇 univ illinois de...
  • 6 篇 univ chicago boo...
  • 6 篇 tsinghua univ sc...
  • 6 篇 univ chinese aca...
  • 6 篇 chinese acad sci...
  • 6 篇 univ chinese aca...
  • 5 篇 natl univ singap...
  • 5 篇 univ illinois de...

作者

  • 65 篇 wei qinglai
  • 58 篇 liu derong
  • 29 篇 song ruizhuo
  • 22 篇 powell warren b.
  • 21 篇 wang ding
  • 16 篇 lee jay h.
  • 15 篇 ulmer marlin w.
  • 13 篇 lee jong min
  • 12 篇 lewis frank l.
  • 12 篇 zhang huaguang
  • 11 篇 li hongliang
  • 10 篇 robbins matthew ...
  • 9 篇 lygeros john
  • 9 篇 derong liu
  • 8 篇 xu xin
  • 8 篇 lunday brian j.
  • 8 篇 topaloglu huseyi...
  • 8 篇 thomas barrett w...
  • 8 篇 huang zhijian
  • 8 篇 mattfeld dirk c.

语言

  • 930 篇 英文
  • 44 篇 其他
  • 4 篇 中文
  • 2 篇 西班牙文
检索条件"主题词=Approximate dynamic Programming"
985 条 记 录,以下是961-970 订阅
排序:
ADAPTIVE CRITIC MOTION CONTROLLER BASED ON SPARSE RADIAL BASIS FUNCTION NETWORK
ADAPTIVE CRITIC MOTION CONTROLLER BASED ON SPARSE RADIAL BAS...
收藏 引用
World Automation Congress 2008
作者: Lin, Wei-Song Tu, Chia-Hsiang Natl Taiwan Univ Dept Elect Engn Taipei Taiwan
Motion controllers capable of incremental learning and optimization can automatically tune their parameters to pursue optimal control. By implementing reinforcement learning and approximate dynamic programming, an ada... 详细信息
来源: 评论
approximate dynamic programming strategies and their applicability for process control: A review and future directions
收藏 引用
INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS 2004年 第3期2卷 263-278页
作者: Lee, JM Lee, JH Georgia Inst Technol Sch Chem & Biomol Engn Atlanta GA 30332 USA
This paper reviews dynamic programming (DP), surveys approximate solution methods for it, and considers their applicability to process control problems. Reinforcement Learning (RL) and Neuro-dynamic programming (NDP),... 详细信息
来源: 评论
Reinforcement learning neural-network-based controller for nonlinear discrete-time systems with input constraints
收藏 引用
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS 2007年 第2期37卷 425-436页
作者: He, Pingan Jagannathan, S. Univ Missouri Dept Elect & Comp Engn Rolla MO 65409 USA
A novel adaptive-critic-based neural network (NN) controller in discrete time is designed to deliver a desired tracking performance for a class of nonlinear systems in the presence of actuator constraints. The constra... 详细信息
来源: 评论
A Q-Learning-based method applied to stochastic resource constrained project scheduling with new project arrivals
收藏 引用
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL 2007年 第13期17卷 1214-1231页
作者: Choi, Jaein Realff, Matthew J. Lee, Jay H. Georgia Inst Technol Sch Chem & Biochem Engn Atlanta GA 30332 USA
In many resource-constrained project scheduling problems (RCPSP), the set of candidate projects is not fixed a priori but evolves with time. For example, while performing an initial set of projects according to a cert... 详细信息
来源: 评论
An infinite-dimensional linear programming algorithm for deterministic semi-markov decision processes on borel spaces
收藏 引用
MATHEMATICS OF OPERATIONS RESEARCH 2007年 第3期32卷 528-550页
作者: Klabjan, Diego Adelman, Daniel Univ Illinois Dept Civil & Environm Engn Urbana IL 61801 USA Univ Chicago Grad Sch Business Chicago IL 60637 USA
We devise an algorithm for solving the infinite-dimensional linear programs that arise from general deterministic semi-Markov decision processes on Borel spaces. The algorithm constructs a sequence of approximate prim... 详细信息
来源: 评论
Continuous-time adaptive critics
收藏 引用
IEEE TRANSACTIONS ON NEURAL NETWORKS 2007年 第3期18卷 631-647页
作者: Hanselmann, Thomas Noakes, Lyle Zaknich, Anthony Univ Melbourne Dept Elect & Elect Engn Parkville Vic 3010 Australia Univ Western Australia Sch Math & Stat Crawley WA 6009 Australia Murdoch Univ Sch Engn Sci Perth WA 6150 Australia
A continuous-time formulation of an adaptive critic design (ACD) is investigated. Connections to the discrete case are made, where backpropagation through time (BPTT) and real-time recurrent learning (RTRL) are preval... 详细信息
来源: 评论
Model-free Q-learning designs for linear discrete-time zero-sum games with application to H-infinity control
收藏 引用
AUTOMATICA 2007年 第3期43卷 473-481页
作者: Al-Tamimi, Asma Lewis, Frank L. Abu-Khalaf, Murad Univ Texas Automat & Robot Res Inst Arlington TX 76118 USA
In this paper, the optimal strategies for discrete-time linear system quadratic zero-sum games related to the H-infinity optimal control problem are solved in forward time without knowing the system dynamical matrices... 详细信息
来源: 评论
Kernel-based least squares policy iteration for reinforcement learning
收藏 引用
IEEE TRANSACTIONS ON NEURAL NETWORKS 2007年 第4期18卷 973-992页
作者: Xu, Xin Hu, Dewen Lu, Xicheng Natl Univ Def Technol Coll Mechatron & Automat Inst Automat Changsha 410073 Peoples R China Natl Univ Def Technol Coll Mechatron & Automat Dept Automat Control Changsha 410073 Peoples R China Natl Univ Def Technol Sch Comp Changsha 410073 Peoples R China
In this paper, we present a kernel-based least squares policy iteration (KLSPI) algorithm for reinforcement learning (RL) in large or continuous state spaces, which can be used to realize adaptive feedback control of ... 详细信息
来源: 评论
Simulation-based design of dual-mode controller for non-linear processes
收藏 引用
CANADIAN JOURNAL OF CHEMICAL ENGINEERING 2007年 第4期85卷 506-511页
作者: Lee, Jong Min Lee, Jay H. Univ Alberta Dept Chem & Mat Engn Edmonton AB T6G 2G6 Canada Georgia Inst Technol Sch Chem & Biomol Engn Atlanta GA 30332 USA
This paper presents a simulation-based approach for designing a non-linear override control scheme to improve the performance of a local linear controller. The higher-level non-linear controller monitors the dynamic s... 详细信息
来源: 评论
dynamic optimization of the strength ratio during a terrestrial conflict
Dynamic optimization of the strength ratio during a terrestr...
收藏 引用
IEEE International Symposium on approximate dynamic programming and Reinforcement Learning
作者: Sztykgold, Alexandre Coppin, Gilles Hudry, Olivier GET ENST Bretagne LUSSI Dept CNRS TAMCICUMR 2872 Bretagne Germany GET ENST Bretagne Dept Comp Sci CNRS LTCI UMR 5141 Bretagne Germany
The aim of this study is to assist a military decision maker during his decision-making process when applying tactics on the battlefield. For that, we have decided to model the conflict by a game, on which we will see... 详细信息
来源: 评论