咨询与建议

限定检索结果

文献类型

  • 299 篇 会议
  • 8 篇 期刊文献

馆藏范围

  • 307 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 180 篇 工学
    • 158 篇 计算机科学与技术...
    • 56 篇 电气工程
    • 48 篇 软件工程
    • 47 篇 控制科学与工程
    • 13 篇 信息与通信工程
    • 10 篇 机械工程
    • 6 篇 仪器科学与技术
    • 4 篇 力学(可授工学、理...
    • 4 篇 生物工程
    • 3 篇 动力工程及工程热...
    • 2 篇 交通运输工程
    • 2 篇 核科学与技术
    • 2 篇 生物医学工程(可授...
    • 1 篇 建筑学
    • 1 篇 化学工程与技术
    • 1 篇 航空宇航科学与技...
    • 1 篇 食品科学与工程(可...
  • 40 篇 理学
    • 35 篇 数学
    • 9 篇 系统科学
    • 8 篇 统计学(可授理学、...
    • 4 篇 物理学
    • 4 篇 生物学
    • 1 篇 化学
    • 1 篇 天文学
    • 1 篇 大气科学
    • 1 篇 地球物理学
    • 1 篇 地质学
  • 18 篇 管理学
    • 17 篇 管理科学与工程(可...
    • 7 篇 工商管理
  • 4 篇 经济学
    • 4 篇 应用经济学
  • 1 篇 医学

主题

  • 115 篇 dynamic programm...
  • 76 篇 reinforcement le...
  • 67 篇 learning
  • 47 篇 optimal control
  • 30 篇 neural networks
  • 27 篇 control systems
  • 21 篇 approximate dyna...
  • 21 篇 approximation al...
  • 20 篇 function approxi...
  • 20 篇 equations
  • 17 篇 convergence
  • 16 篇 adaptive dynamic...
  • 16 篇 state-space meth...
  • 16 篇 heuristic algori...
  • 14 篇 mathematical mod...
  • 13 篇 stochastic proce...
  • 12 篇 learning (artifi...
  • 12 篇 adaptive control
  • 12 篇 cost function
  • 11 篇 algorithm design...

机构

  • 5 篇 arizona state un...
  • 4 篇 department of el...
  • 4 篇 school of inform...
  • 4 篇 department of in...
  • 4 篇 univ sci & techn...
  • 4 篇 chinese acad sci...
  • 4 篇 department of el...
  • 3 篇 princeton univ d...
  • 3 篇 northeastern uni...
  • 3 篇 national science...
  • 3 篇 robotics institu...
  • 3 篇 univ illinois de...
  • 3 篇 univ utrecht dep...
  • 2 篇 univ groningen i...
  • 2 篇 sharif univ tech...
  • 2 篇 univ texas autom...
  • 2 篇 pengcheng labora...
  • 2 篇 guangxi univ sch...
  • 2 篇 chinese acad sci...
  • 2 篇 cemagref lisc au...

作者

  • 14 篇 liu derong
  • 9 篇 wei qinglai
  • 8 篇 si jennie
  • 7 篇 xu xin
  • 5 篇 derong liu
  • 4 篇 lewis frank l.
  • 4 篇 martin riedmille...
  • 4 篇 huaguang zhang
  • 4 篇 jennie si
  • 4 篇 marco a. wiering
  • 4 篇 xin xu
  • 4 篇 zhang huaguang
  • 4 篇 dongbin zhao
  • 4 篇 lei yang
  • 4 篇 powell warren b.
  • 4 篇 riedmiller marti...
  • 3 篇 hado van hasselt
  • 3 篇 van hasselt hado
  • 3 篇 jagannathan s.
  • 3 篇 munos remi

语言

  • 305 篇 英文
  • 1 篇 其他
  • 1 篇 中文
检索条件"任意字段=IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning"
307 条 记 录,以下是281-290 订阅
排序:
Self-learning path-tracking control of autonomous vehicles using kernel-based approximate dynamic programming
Self-learning path-tracking control of autonomous vehicles u...
收藏 引用
international Joint Conference on Neural Networks (IJCNN)
作者: Xin Xu Hongyu Zhang Bin Dai Han-gen He Institute of Automation College of Mechatronics and Automation National University of Defense Technology Changsha China
With the fast development of robotics and intelligent vehicles, there has been much research work on modeling and motion control of autonomous vehicles. However, due to model complexity, and unknown disturbances from ... 详细信息
来源: 评论
reinforcement learning of LQR control policy by a double inverted-pendulum biomechanical model
Reinforcement learning of LQR control policy by a double inv...
收藏 引用
ieee international Conference on Industrial Technology (ICIT)
作者: Kamran Iqbal Muhammad Haras University of Arkansas at Little Rock Little Rock Arkansas
Optimal LQR feedback gains can be learned using reinforcement learning (RL) framework for systems with unknown dynamics using policy iteration methods. However, policy iteration in the case of inherently unstable syst...
来源: 评论
Ramp metering based on on-line ADHDP (λ) controller
Ramp metering based on on-line ADHDP (λ) controller
收藏 引用
international Joint Conference on Neural Networks (IJCNN)
作者: Xuerui Bai Dongbin Zhao Jianqiang Yi Jing Xu Laboratory of Complex Systems and Intelligence Science Institute of Automation Chinese Academy and Sciences Beijing China University of Arizona Tucson USA
Increasing dependence on car-based travel has led to the daily occurrence of freeway congestions around the world. In order to improve the worse and worse traffic congestion situation and solve the problems brought wi... 详细信息
来源: 评论
Optimal Control Applied to Wheeled Mobile Vehicles
Optimal Control Applied to Wheeled Mobile Vehicles
收藏 引用
ieee international symposium on Intelligent Signal Processing
作者: M. Gomez T. Martinez S. Sanchez D. Meziat Departamento de Automática Universidad de Alcalá Spain Departamento de Física Ingeniería de Sistemas y Teoría de la Señal Universidad de Alcalá Spain
The goal of the work described in this paper is to develop a particular optimal control technique based on a Cell-Mapping technique in combination with the Q-learning reinforcement learning method to control wheeled m... 详细信息
来源: 评论
A reinforcement learning approach to support setup decisions in distributed manufacturing systems
A reinforcement learning approach to support setup decisions...
收藏 引用
international Conference on Emerging Technologies and Factory Automation (ETFA)
作者: P. McDonnell S. Joshi Industrial and Manufacturing Engineering Pennsylvania State University University Park PA USA
A reinforcement learning approach to specifying payoffs for setup games is presented. Setup games are normal form, noncooperative games used by heterarchical machine controllers to evaluate reconfiguration decisions. ... 详细信息
来源: 评论
Adaptive critic based design of a fuzzy motor speed controller
Adaptive critic based design of a fuzzy motor speed controll...
收藏 引用
ieee international symposium on Intelligent Control (ISIC)
作者: T.T. Shannon G.G. Lendaris Northwest Computational Intelligence Laboratory and Systems Science Portland State University USA
We show the applicability of the dual heuristic programming (DHP) method of approximate dynamic programming to the design of a fuzzy control system. The DHP and related techniques have been developed in the neurocontr... 详细信息
来源: 评论
An analysis of gradient-based policy iteration
An analysis of gradient-based policy iteration
收藏 引用
international Joint Conference on Neural Networks (IJCNN)
作者: J. Dankert Lei Yang J. Si Department of Electrical Engineering Arizona State University Tempe AZ USA
Recently, a system theoretic framework for learning and optimization has been developed that shows how many approximate dynamic programming paradigms such as perturbation analysis, Markov decision processes, and reinf... 详细信息
来源: 评论
An adaptive clustering method for model-free reinforcement learning
An adaptive clustering method for model-free reinforcement l...
收藏 引用
ieee international Conference on Multi Topic
作者: A. Matt G. Regensburger Institute of Mathematics University of Innsbruck Austria
Machine learning for real world applications is a complex task due to the huge state and action sets they deal with and the a priori unknown dynamics of the environment involved. reinforcement learning offers very eff... 详细信息
来源: 评论
A hybrid model for learning sequential navigation
A hybrid model for learning sequential navigation
收藏 引用
ieee international symposium on Computational Intelligence in Robotics and Automation (CIRA)
作者: R. Sun T. Peterson University of Alabama Tuscaloosa AL USA
To deal with reactive sequential decision tasks, we present a learning model CLARION, which is a hybrid connectionist model consisting of both localist and distributed representations, based on the two-level approach ... 详细信息
来源: 评论
Nonparametric decentralized sequential detection
Nonparametric decentralized sequential detection
收藏 引用
ieee international symposium on Information Theory
作者: A. Kuh Department of Electrical Engineering University of Hawai Honolulu HI USA
We consider a decentralized sequential detection problem with a set of sensors and a fusion center. Each sensor receives information from inputs and possibly other sensors at discrete times and transmits summary infor... 详细信息
来源: 评论