咨询与建议

限定检索结果

文献类型

  • 229 篇 会议
  • 18 篇 期刊文献

馆藏范围

  • 247 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 113 篇 工学
    • 103 篇 计算机科学与技术...
    • 42 篇 软件工程
    • 38 篇 电气工程
    • 23 篇 控制科学与工程
    • 5 篇 信息与通信工程
    • 3 篇 机械工程
    • 2 篇 力学(可授工学、理...
    • 1 篇 仪器科学与技术
    • 1 篇 建筑学
    • 1 篇 化学工程与技术
    • 1 篇 交通运输工程
  • 27 篇 理学
    • 25 篇 数学
    • 7 篇 系统科学
    • 6 篇 统计学(可授理学、...
    • 1 篇 物理学
    • 1 篇 化学
    • 1 篇 大气科学
  • 10 篇 管理学
    • 8 篇 管理科学与工程(可...
    • 3 篇 工商管理
    • 2 篇 图书情报与档案管...
  • 2 篇 经济学
    • 2 篇 应用经济学
  • 1 篇 法学
    • 1 篇 社会学

主题

  • 95 篇 dynamic programm...
  • 54 篇 optimal control
  • 51 篇 learning
  • 44 篇 reinforcement le...
  • 35 篇 learning (artifi...
  • 27 篇 equations
  • 25 篇 neural networks
  • 22 篇 heuristic algori...
  • 20 篇 convergence
  • 20 篇 control systems
  • 18 篇 function approxi...
  • 18 篇 mathematical mod...
  • 16 篇 approximation al...
  • 15 篇 vectors
  • 15 篇 cost function
  • 14 篇 markov processes
  • 14 篇 nonlinear system...
  • 14 篇 artificial neura...
  • 13 篇 stochastic proce...
  • 12 篇 adaptive dynamic...

机构

  • 10 篇 chinese acad sci...
  • 5 篇 school of inform...
  • 4 篇 northeastern uni...
  • 4 篇 department of el...
  • 4 篇 department of in...
  • 3 篇 department of el...
  • 3 篇 automation and r...
  • 3 篇 department of el...
  • 3 篇 robotics institu...
  • 3 篇 key laboratory o...
  • 3 篇 natl univ def te...
  • 3 篇 univ illinois de...
  • 2 篇 department of ar...
  • 2 篇 school of electr...
  • 2 篇 univ groningen i...
  • 2 篇 univ texas autom...
  • 2 篇 colorado state u...
  • 2 篇 guangxi univ sch...
  • 2 篇 national science...
  • 2 篇 informatics inst...

作者

  • 13 篇 liu derong
  • 7 篇 hado van hasselt
  • 7 篇 marco a. wiering
  • 7 篇 dongbin zhao
  • 6 篇 zhao dongbin
  • 5 篇 xu xin
  • 5 篇 lewis frank l.
  • 5 篇 huaguang zhang
  • 5 篇 wei qinglai
  • 5 篇 derong liu
  • 5 篇 warren b. powell
  • 4 篇 haibo he
  • 4 篇 jagannathan s.
  • 4 篇 frank l. lewis
  • 4 篇 zhang huaguang
  • 4 篇 ni zhen
  • 4 篇 yanhong luo
  • 4 篇 wang ding
  • 4 篇 he haibo
  • 4 篇 damien ernst

语言

  • 246 篇 英文
  • 1 篇 其他
检索条件"任意字段=2014 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, ADPRL 2014"
247 条 记 录,以下是241-250 订阅
排序:
Hybrid Ant Colony Optimization Using Memetic Algorithm for Traveling Salesman Problem
Hybrid Ant Colony Optimization Using Memetic Algorithm for T...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (adprl)
作者: Haibin Duan Xiufen Yu School of Automation Science and Electrical Engineering Beihang University Beijing China Center for Space Science and Applied Research Chinese Academy and Sciences Beijing China
Ant colony optimization was originally presented under the inspiration during collective behavior study results on real ant system, and it has strong robustness and easy to combine with other methods in optimization. ... 详细信息
来源: 评论
Clipping in Neurocontrol by adaptive dynamic programming
收藏 引用
ieee TRANSACTIONS ON NEURAL NETWORKS AND learning SYSTEMS 2014年 第10期25卷 1909-1920页
作者: Fairbank, Michael Prokhorov, Danil Alonso, Eduardo City Univ London Sch Informat Dept Comp Sci London EC1V OHB England Toyota Res Inst NA Ann Arbor MI 48105 USA
In adaptive dynamic programming, neurocontrol, and reinforcement learning, the objective is for an agent to learn to choose actions so as to minimize a total cost function. In this paper, we show that when discretized... 详细信息
来源: 评论
reinforcement-learning-based Magneto-hydrodynamic Control of Hypersonic Flows
Reinforcement-Learning-based Magneto-hydrodynamic Control of...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (adprl)
作者: Nilesh V. Kulkarni Minh Q. Phan NASA Ames Research Center QSS Group Inc. Moffett Field CA USA Dartmouth College Hanover NH USA
In this work, we design a policy-iteration-based Q-learning approach for on-line optimal control of ionized hypersonic flow at the inlet of a scramjet engine. Magneto-hydrodynamics (MHD) has been recently proposed as ... 详细信息
来源: 评论
Finite-Approximation-Error-Based Discrete-Time Iterative adaptive dynamic programming
收藏 引用
ieee TRANSACTIONS ON CYBERNETICS 2014年 第12期44卷 2820-2833页
作者: Wei, Qinglai Wang, Fei-Yue Liu, Derong Yang, Xiong Chinese Acad Sci Inst Automat State Key Lab Management & Control Complex Syst Beijing 100190 Peoples R China
In this paper, a new iterative adaptive dynamic programming (ADP) algorithm is developed to solve optimal control problems for infinite horizon discrete-time nonlinear systems with finite approximation errors. First, ... 详细信息
来源: 评论
adaptive dynamic programming for terminally constrained finite-horizon optimal control problems
Adaptive dynamic programming for terminally constrained fini...
收藏 引用
ieee Annual Conference on Decision and Control
作者: L. Andrews J. R. Klotz R. Kamalapurkar W. E. Dixon Department of Mechanical and Aerospace Engineering University of Florida Gainesville FL USA
adaptive dynamic programming is applied to control-affine nonlinear systems with uncertain drift dynamics to obtain a near-optimal solution to a finite-horizon optimal control problem with hard terminal constraints. A... 详细信息
来源: 评论
Impact of signal transmission delays on power system damping control using heuristic dynamic programming
Impact of signal transmission delays on power system damping...
收藏 引用
ieee symposium on Computational Intelligence Applications In Smart Grid (CIASG)
作者: Yufei Tang Xiangnan Zhong Zhen Ni Jun Yan Haibo He Department of Electrical University of Rhode Island Kingston RI USA
In this paper, the impact of signal transmission delays on static VAR compensator (SVC) based power system damping control using reinforcement learning is investigated. The SVC is used to damp low-frequency oscillatio... 详细信息
来源: 评论
reinforcement learning Output Feedback NN Control Using Deterministic learning Technique
收藏 引用
ieee TRANSACTIONS ON NEURAL NETWORKS AND learning SYSTEMS 2014年 第3期25卷 635-641页
作者: Xu, Bin Yang, Chenguang Shi, Zhongke Northwestern Polytech Univ Sch Automat Xian 710072 Peoples R China Univ Plymouth Sch Comp & Math Plymouth PL4 8AA Devon England Beijing Inst Technol Sch Automat Beijing 100086 Peoples R China
In this brief, a novel adaptive-critic-based neural network (NN) controller is investigated for nonlinear pure-feedback systems. The controller design is based on the transformed predictor form, and the actor-critic N... 详细信息
来源: 评论