咨询与建议

限定检索结果

文献类型

  • 299 篇 会议
  • 8 篇 期刊文献

馆藏范围

  • 307 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 180 篇 工学
    • 158 篇 计算机科学与技术...
    • 56 篇 电气工程
    • 48 篇 软件工程
    • 47 篇 控制科学与工程
    • 13 篇 信息与通信工程
    • 10 篇 机械工程
    • 6 篇 仪器科学与技术
    • 4 篇 力学(可授工学、理...
    • 4 篇 生物工程
    • 3 篇 动力工程及工程热...
    • 2 篇 交通运输工程
    • 2 篇 核科学与技术
    • 2 篇 生物医学工程(可授...
    • 1 篇 建筑学
    • 1 篇 化学工程与技术
    • 1 篇 航空宇航科学与技...
    • 1 篇 食品科学与工程(可...
  • 40 篇 理学
    • 35 篇 数学
    • 9 篇 系统科学
    • 8 篇 统计学(可授理学、...
    • 4 篇 物理学
    • 4 篇 生物学
    • 1 篇 化学
    • 1 篇 天文学
    • 1 篇 大气科学
    • 1 篇 地球物理学
    • 1 篇 地质学
  • 18 篇 管理学
    • 17 篇 管理科学与工程(可...
    • 7 篇 工商管理
  • 4 篇 经济学
    • 4 篇 应用经济学
  • 1 篇 医学

主题

  • 115 篇 dynamic programm...
  • 76 篇 reinforcement le...
  • 67 篇 learning
  • 47 篇 optimal control
  • 30 篇 neural networks
  • 27 篇 control systems
  • 21 篇 approximate dyna...
  • 21 篇 approximation al...
  • 20 篇 function approxi...
  • 20 篇 equations
  • 17 篇 convergence
  • 16 篇 adaptive dynamic...
  • 16 篇 state-space meth...
  • 16 篇 heuristic algori...
  • 14 篇 mathematical mod...
  • 13 篇 stochastic proce...
  • 12 篇 learning (artifi...
  • 12 篇 adaptive control
  • 12 篇 cost function
  • 11 篇 algorithm design...

机构

  • 5 篇 arizona state un...
  • 4 篇 department of el...
  • 4 篇 school of inform...
  • 4 篇 department of in...
  • 4 篇 univ sci & techn...
  • 4 篇 chinese acad sci...
  • 4 篇 department of el...
  • 3 篇 princeton univ d...
  • 3 篇 northeastern uni...
  • 3 篇 national science...
  • 3 篇 robotics institu...
  • 3 篇 univ illinois de...
  • 3 篇 univ utrecht dep...
  • 2 篇 univ groningen i...
  • 2 篇 sharif univ tech...
  • 2 篇 univ texas autom...
  • 2 篇 pengcheng labora...
  • 2 篇 guangxi univ sch...
  • 2 篇 chinese acad sci...
  • 2 篇 cemagref lisc au...

作者

  • 14 篇 liu derong
  • 9 篇 wei qinglai
  • 8 篇 si jennie
  • 7 篇 xu xin
  • 5 篇 derong liu
  • 4 篇 lewis frank l.
  • 4 篇 martin riedmille...
  • 4 篇 huaguang zhang
  • 4 篇 jennie si
  • 4 篇 marco a. wiering
  • 4 篇 xin xu
  • 4 篇 zhang huaguang
  • 4 篇 dongbin zhao
  • 4 篇 lei yang
  • 4 篇 powell warren b.
  • 4 篇 riedmiller marti...
  • 3 篇 hado van hasselt
  • 3 篇 van hasselt hado
  • 3 篇 jagannathan s.
  • 3 篇 munos remi

语言

  • 305 篇 英文
  • 1 篇 其他
  • 1 篇 中文
检索条件"任意字段=IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning"
307 条 记 录,以下是241-250 订阅
排序:
N-step optimal time-invariant trajectory tracking control for a class of nonlinear systems
N-step optimal time-invariant trajectory tracking control fo...
收藏 引用
ieee symposium on Adaptive dynamic programming and reinforcement learning, (ADPRL)
作者: Ruizhuo Song Huaguang Zhang School of Information Science and Engineering Northeastern University Shenyang China
In this paper, the time-invariant trajectory tracking control problem under N-step control is solved by finite horizon approximate dynamic programming (ADP) algorithms. At first, we convert the tracking control proble... 详细信息
来源: 评论
Proceedings - ieee 14th international symposium on Parallel and Distributed Computing, ISPDC 2015
Proceedings - IEEE 14th International Symposium on Parallel ...
收藏 引用
14th ieee international symposium on Parallel and Distributed Computing, ISPDC 2015
The proceedings contain 26 papers. The topics discussed include: getting ready for approximate computing: trading parallelism for accuracy for DSS workloads;dataClay: the integration of persistent data, parallel progr... 详细信息
来源: 评论
Adaptive dynamic programming for optimal control of unknown nonlinear discrete-time systems
Adaptive dynamic programming for optimal control of unknown ...
收藏 引用
ieee symposium on Adaptive dynamic programming and reinforcement learning, (ADPRL)
作者: Derong Liu Ding Wang Dongbin Zhao Key Laboratory of Complex Systems and Intelligence Science Institute of Automation Chinese Academy and Sciences Beijing China
An intelligent optimal control scheme for unknown nonlinear discrete-time systems with discount factor in the cost function is proposed in this paper. An iterative adaptive dynamic programming (ADP) algorithm via glob... 详细信息
来源: 评论
Multi-agent Deep reinforcement learning based Information-Energy Collaboration in Vehicle Edge Computing Networks  35
Multi-agent Deep Reinforcement Learning based Information-En...
收藏 引用
35th ieee international symposium on Personal, Indoor and Mobile Radio Communications, PIMRC 2024
作者: Feng, Yaoyu Zhang, Biling Yu, Jung-Lang Beijing University of Posts and Telecommunications School of Network Education China Fu Jen Catholic University Department of Electrical Engineering New Taipei City24205 Taiwan
In the vehicle edge computing network (VECN), how to deal with the computation resources and energy resources shortage problem the roadside units (RSUs) encounter when they are performing delay sensitive computation t... 详细信息
来源: 评论
Using approximate dynamic programming for estimating the revenues of a hydrogen-based high-capacity storage device
Using approximate dynamic programming for estimating the rev...
收藏 引用
ieee symposium on Adaptive dynamic programming and reinforcement learning, (ADPRL)
作者: Vincent François-Lavet Raphael Fonteneau Damien Ernst Department of Electrical Engineering and Computer Science University of Liège Belgium
This paper proposes a methodology to estimate the maximum revenue that can be generated by a company that operates a high-capacity storage device to buy or sell electricity on the day-ahead electricity market. The met... 详细信息
来源: 评论
Adaptive dynamic programming-based optimal tracking control for nonlinear systems using general value iteration
Adaptive dynamic programming-based optimal tracking control ...
收藏 引用
ieee symposium on Adaptive dynamic programming and reinforcement learning, (ADPRL)
作者: Xiaofeng Lin Qiang Ding Weikai Kong Chunning Song Qingbao Huang School of Electrical Engineering Guangxi University Nanning China
For the optimal tracking control problem of affine nonlinear systems, a general value iteration algorithm based on adaptive dynamic programming is proposed in this paper. By system transformation, the optimal tracking... 详细信息
来源: 评论
Iterative local dynamic programming
Iterative local dynamic programming
收藏 引用
ieee symposium on Adaptive dynamic programming and reinforcement learning, (ADPRL)
作者: Emanuel Todorov Yuval Tassa Department of Cognitive Science University of California San Diego USA Center of Neural Computation Hebrew University of Jerusalem Israel
We develop an iterative local dynamic programming method (iLDP) applicable to stochastic optimal control problems in continuous high-dimensional state and action spaces. Such problems are common in the control of biol... 详细信息
来源: 评论
approximate real-time optimal control based on sparse Gaussian process models
Approximate real-time optimal control based on sparse Gaussi...
收藏 引用
ieee symposium on Adaptive dynamic programming and reinforcement learning, (ADPRL)
作者: Joschka Boedecker Jost Tobias Springenberg Jan Wülfing Martin Riedmiller Department of Computer Science University of Freiburg Freiburg Germany
In this paper we present a fully automated approach to (approximate) optimal control of non-linear systems. Our algorithm jointly learns a non-parametric model of the system dynamics - based on Gaussian Process Regres... 详细信息
来源: 评论
An integrated design for intensified direct heuristic dynamic programming
An integrated design for intensified direct heuristic dynami...
收藏 引用
ieee symposium on Adaptive dynamic programming and reinforcement learning, (ADPRL)
作者: Xiong Luo Jennie Si Yuchao Zhou School of Computer and Communication Engineering University of Science and Technology Beijing (USTB) Beijing China Arizona State University Tempe AZ US
There has been a growing interest in the study of adaptive/approximate dynamic programming (ADP) in recent years. The ADP technique provides a powerful tool to understand and improve the principled technologies of mac... 详细信息
来源: 评论
An approximate dynamic programming based controller for an underactuated 6DoF quadrotor
An approximate Dynamic Programming based controller for an u...
收藏 引用
ieee symposium on Adaptive dynamic programming and reinforcement learning, (ADPRL)
作者: Emanuel Stingu Frank L. Lewis Automation & Robotics Research Institute University of Texas Arlington Arlington TX USA
This paper discusses how the principles of Adaptive dynamic programming (ADP) can be applied to the control of a quadrotor helicopter platform flying in an uncontrolled environment and subjected to various disturbance... 详细信息
来源: 评论