Refine search results

Document type

  • 747 journal articles
  • 208 conference papers
  • 23 theses and dissertations
  • 1 book

Collection scope

  • 979 electronic documents
  • 1 print holding

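Subject classification

  • 746 records: Engineering
    • 307 records: Computer science and technology...
    • 271 records: Electrical engineering
    • 249 records: Control science and engineering
    • 86 records: Transportation engineering
    • 50 records: Mechanical engineering
    • 42 records: Petroleum and natural gas engineering
    • 40 records: Civil engineering
    • 38 records: Software engineering
    • 31 records: Information and communication engineering
    • 26 records: Chemical engineering and technology
    • 25 records: Power engineering and engineering thermo...
    • 16 records: Instrument science and technology
    • 8 records: Environmental science and engineering (may...
    • 4 records: Mechanics (may confer engineering, sci...
    • 4 records: Electronic science and technology (may...
    • 4 records: Architecture
  • 356 records: Management
    • 339 records: Management science and engineering (may...
    • 52 records: Business administration
    • 6 records: Public administration
  • 231 records: Science
    • 196 records: Mathematics
    • 65 records: Systems science
    • 11 records: Statistics (may confer science, ...
    • 9 records: Physics
    • 7 records: Biology
    • 4 records: Ecology
  • 79 records: Economics
    • 55 records: Applied economics
    • 25 records: Theoretical economics
  • 18 records: Medicine
    • 11 records: Basic medicine (may confer medicine...
    • 10 records: Clinical medicine
    • 7 records: Public health and preventive medi...
  • 8 records: Military science
  • 7 records: Agriculture
  • 3 records: Law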

Topics

  • 979 records: approximate dyna...
  • 142 records: reinforcement le...
  • 141 records: optimal control
  • 83 records: adaptive dynamic...
  • 77 records: neural networks
  • 64 records: adaptive critic ...
  • 62 records: markov decision ...
  • 59 records: dynamic programm...
  • 50 records: markov decision ...
  • 36 records: nonlinear system...
  • 29 records: adaptive dynamic...
  • 22 records: uncertainty
  • 22 records: adaptive control
  • 21 records: neural network
  • 21 records: policy iteration
  • 20 records: neuro-dynamic pr...
  • 19 records: linear programmi...
  • 18 records: value function a...
  • 17 records: value iteration
  • 17 records: optimization

Institutions

  • 63 records: chinese acad sci...
  • 33 records: univ sci & techn...
  • 18 records: princeton univ d...
  • 12 records: georgia inst tec...
  • 11 records: tsinghua univ de...
  • 10 records: school of automa...
  • 9 records: northeastern uni...
  • 9 records: cornell univ sch...
  • 9 records: univ rhode isl d...
  • 8 records: air force instit...
  • 7 records: the state key la...
  • 7 records: south china univ...
  • 7 records: univ illinois de...
  • 6 records: univ chicago boo...
  • 6 records: tsinghua univ sc...
  • 6 records: univ chinese aca...
  • 6 records: chinese acad sci...
  • 6 records: univ chinese aca...
  • 5 records: natl univ singap...
  • 5 records: univ illinois de...

Authors

  • 65 records: wei qinglai
  • 58 records: liu derong
  • 29 records: song ruizhuo
  • 22 records: powell warren b.
  • 21 records: wang ding
  • 16 records: lee jay h.
  • 15 records: ulmer marlin w.
  • 13 records: lee jong min
  • 12 records: lewis frank l.
  • 12 records: zhang huaguang
  • 11 records: li hongliang
  • 10 records: robbins matthew ...
  • 9 records: lygeros john
  • 9 records: derong liu
  • 8 records: xu xin
  • 8 records: lunday brian j.
  • 8 records: topaloglu huseyi...
  • 8 records: thomas barrett w...
  • 8 records: huang zhijian
  • 8 records: mattfeld dirk c.

Language

  • 923 records: English
  • 49 records: Other
  • 4 records: Chinese
  • 2 records: Spanish
  • 1 record: German
  • 1 record: French
  • 1 record: Russian
Search criteria: "Subject term = Approximate Dynamic Programming"
979 records; showing 961-970
Model-free Q-learning designs for linear discrete-time zero-sum games with application to H-infinity control
AUTOMATICA, 2007, Vol. 43, No. 3, pp. 473-481
Authors: Al-Tamimi, Asma; Lewis, Frank L.; Abu-Khalaf, Murad. Affiliations: Univ Texas Automat & Robot Res Inst Arlington TX 76118 USA
In this paper, the optimal strategies for discrete-time linear system quadratic zero-sum games related to the H-infinity optimal control problem are solved in forward time without knowing the system dynamical matrices...
Kernel-based least squares policy iteration for reinforcement learning
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2007, Vol. 18, No. 4, pp. 973-992
Authors: Xu, Xin; Hu, Dewen; Lu, Xicheng. Affiliations: Natl Univ Def Technol Coll Mechatron & Automat Inst Automat Changsha 410073 Peoples R China; Natl Univ Def Technol Coll Mechatron & Automat Dept Automat Control Changsha 410073 Peoples R China; Natl Univ Def Technol Sch Comp Changsha 410073 Peoples R China
In this paper, we present a kernel-based least squares policy iteration (KLSPI) algorithm for reinforcement learning (RL) in large or continuous state spaces, which can be used to realize adaptive feedback control of ...
Simulation-based design of dual-mode controller for non-linear processes
CANADIAN JOURNAL OF CHEMICAL ENGINEERING, 2007, Vol. 85, No. 4, pp. 506-511
Authors: Lee, Jong Min; Lee, Jay H. Affiliations: Univ Alberta Dept Chem & Mat Engn Edmonton AB T6G 2G6 Canada; Georgia Inst Technol Sch Chem & Biomol Engn Atlanta GA 30332 USA
This paper presents a simulation-based approach for designing a non-linear override control scheme to improve the performance of a local linear controller. The higher-level non-linear controller monitors the dynamic s...
Dynamic optimization of the strength ratio during a terrestrial conflict
IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning
Authors: Sztykgold, Alexandre; Coppin, Gilles; Hudry, Olivier. Affiliations: GET ENST Bretagne LUSSI Dept CNRS TAMCICUMR 2872 Bretagne Germany; GET ENST Bretagne Dept Comp Sci CNRS LTCI UMR 5141 Bretagne Germany
The aim of this study is to assist a military decision maker during his decision-making process when applying tactics on the battlefield. For that, we have decided to model the conflict by a game, on which we will see...
Continuous-time ADP for linear systems with partially unknown dynamics
IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning
Authors: Vrabie, Draguna; Abu-Khalaf, Murad; Lewis, Frank L.; Wang, Youyi. Affiliations: Univ Texas Automat & Robot Res Inst Ft Worth TX 76118 USA; Nanyang Technol Univ Sch Elect & Elect Engn Singapore Singapore
Approximate dynamic programming has been formulated and applied mainly to discrete-time systems. Expressing the ADP concept for continuous-time systems raises difficult issues related to sampling time and system model...
The single-node dynamic service scheduling and dispatching problem
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2006, Vol. 170, No. 1, pp. 1-23
Authors: Dall'Orto, LC; Crainic, TG; Leal, JE; Powell, WB. Affiliations: Univ Quebec Ecole Sci Gest Dept Management & Technol Montreal PQ H3C 3P8 Canada; Pontificia Univ Catolica Rio de Janeiro Dept Ind Engn Rio De Janeiro Brazil; Univ Montreal Ctr Res Transportat Montreal PQ H3C 3J7 Canada; Princeton Univ Dept Operat Res & Financial Engn Princeton NJ 08544 USA
In this paper, we focus on a particular version of the dynamic service network design (DSND) problem, namely the case of a single-terminal that dispatches services to a number of customers and other terminals. We pres...
A single network adaptive critic (SNAC) architecture for optimal control synthesis for a class of nonlinear systems
NEURAL NETWORKS, 2006, Vol. 19, No. 10, pp. 1648-1660
Authors: Padhi, Radhakant; Unnikrishnan, Nishant; Wang, Xiaohua; Balakrishnan, S. N. Affiliations: Univ Missouri Rolla Dept Mech & Aerosp Engn Rolla MO 65409 USA; Indian Inst Sci Dept Aerosp Engn Bangalore 560012 Karnataka India
Even though dynamic programming offers an optimal control solution in a state feedback form, the method is overwhelmed by computational and storage requirements. Approximate dynamic programming implemented with an Ada...
A parallelizable dynamic fleet management model with random travel times
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2006, Vol. 175, No. 2, pp. 782-805
Author: Topaloglu, H. Affiliation: Cornell Univ Sch Operat Res & Ind Engn Ithaca NY 14853 USA
In this paper, we present a stochastic model for the dynamic fleet management problem with random travel times. Our approach decomposes the problem into time-staged subproblems by formulating it as a dynamic program a...
Intelligent optimal control of excitation and turbine systems in power networks
General Meeting of the Power-Engineering-Society
Authors: Venayagamoorthy, G. K.; Harley, R. G. Affiliations: Univ Missouri Dept Elect & Comp Engn Real Time Power & Intelligent Syst Lab Rolla MO 65409 USA; Georgia Inst Technol Sch Elect & Comp Engn Atlanta GA 30332 USA
The increasing complexity of the modern power grid highlights the need for advanced modeling and control techniques for effective control of excitation and turbine systems. The crucial factors affecting the modern pow...
A self-learning call admission control scheme for CDMA cellular networks
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2005, Vol. 16, No. 5, pp. 1219-1228
Authors: Liu, DR; Zhang, Y; Zhang, HG. Affiliations: Univ Illinois Dept Elect & Comp Engn Chicago IL 60607 USA; Northeastern Univ Sch Informat Sci & Engn Liaoning 110004 Peoples R China
In the present paper, a call admission control scheme that can learn from the network environment and user behavior is developed for code division multiple access (CDMA) cellular networks that handle both voice and da...