Search query: "Any field = IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning"
1,023 records; showing results 851-860
Ramp metering based on on-line ADHDP (λ) controller
International Joint Conference on Neural Networks (IJCNN)
Authors: Xuerui Bai; Dongbin Zhao; Jianqiang Yi; Jing Xu
Affiliations: Laboratory of Complex Systems and Intelligence Science, Institute of Automation, Chinese Academy of Sciences, Beijing, China; University of Arizona, Tucson, USA
Increasing dependence on car-based travel has led to the daily occurrence of freeway congestions around the world. In order to improve the worse and worse traffic congestion situation and solve the problems brought wi...
A biologically-inspired computational model for transformation invariant target recognition
International Joint Conference on Neural Networks (IJCNN)
Authors: Khan M. Iftekharuddin; Yaqin Li
Affiliation: Intelligence System and Image Processing Lab, Department of Electrical and Computer Engineering, University of Memphis, Memphis, TN, USA
Transformation invariant image recognition has been an active research area due to its widespread applications in a variety of fields such as military operations, robotics, medical practices, geographic scene analysis...
An approximate dynamic programming strategy for responsive traffic signal control
IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning
Author: Cai, Chen
Affiliation: Univ Coll London, Ctr Transport Studies, London WC1E 6BT, England
This paper proposes an approximate dynamic programming strategy for responsive traffic signal control. It is the first attempt that optimizes signal control objective dynamically through adaptive approximation of valu...
Particle swarm optimized adaptive dynamic programming
IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning
Authors: Dongbin Zhao; Jianqiang Yi; Liu, Derong
Affiliations: Chinese Acad Sci, Inst Automat, Key Lab Complex Syst & Intelligence Sci, Beijing 100080, Peoples R China; Univ Illinois, Dept Elect & Comp Engn, Chicago, IL 60607, USA
Particle swarm optimization is used for the training of the action network and critic network of the adaptive dynamic programming approach. The typical structures of the adaptive dynamic programming and particle swarm...
Using ADP to understand and replicate brain intelligence: the next level design
IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning
Author: Werbos, Paul J.
Affiliation: Natl Sci Fdn, Arlington, VA 22203, USA
Since the 1960's I proposed that we could understand and replicate the highest level of intelligence seen in the brain, by building ever more capable and general systems for adaptive dynamic programming (ADP) - li...
Discrete-time adaptive dynamic programming using wavelet basis function neural networks
IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning
Authors: Jin, Ning; Liu, Derong; Huang, Ting; Pang, Zhongyu
Affiliation: Univ Illinois, Dept Elect & Comp Engn, Chicago, IL 60607, USA
Dynamic programming for discrete time systems is difficult due to the "curse of dimensionality": one has to find a series of control actions that must be taken in sequence, hoping that this sequence will lea...
Using reward-weighted regression for reinforcement learning of task space control
IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning
Authors: Peters, Jan; Schaal, Stefan
Affiliation: Univ So Calif, Los Angeles, CA 90089, USA
Many robot control problems of practical importance, including task or operational space control, can be reformulated as immediate reward reinforcement learning problems. However, few of the known optimization or rein...
Reinforcement learning by backpropagation through an LSTM model/critic
IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning
Author: Bakker, Bram
Affiliation: Univ Amsterdam, Inst Informat, Intelligent Syst Lab Amsterdam, NL-1098 SJ Amsterdam, Netherlands
This paper describes backpropagation through an LSTM recurrent neural network model/critic, for reinforcement learning tasks in partially observable domains. This combines the advantage of LSTM's strength at learn...
Online reinforcement learning neural network controller design for nanomanipulation
IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning
Authors: Yang, Qinmin; Jagannathan, S.
Affiliation: Univ Missouri, Dept Elect & Comp Engn, Rolla, MO 65401, USA
In this paper, a novel reinforcement learning neural network (NN)-based controller, referred to adaptive critic controller, is proposed for affine nonlinear discrete-time systems with applications to nanomanipulation....
Continuous-time ADP for linear systems with partially unknown dynamics
IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning
Authors: Vrabie, Draguna; Abu-Khalaf, Murad; Lewis, Frank L.; Wang, Youyi
Affiliations: Univ Texas, Automat & Robot Res Inst, Ft Worth, TX 76118, USA; Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore, Singapore
Approximate dynamic programming has been formulated and applied mainly to discrete-time systems. Expressing the ADP concept for continuous-time systems raises difficult issues related to sampling time and system model...