咨询与建议

限定检索结果

文献类型

  • 746 篇 会议
  • 270 篇 期刊文献
  • 4 册 图书

馆藏范围

  • 1,020 篇 电子文献
  • 1 种 纸本馆藏

日期分布

学科分类号

  • 711 篇 工学
    • 520 篇 计算机科学与技术...
    • 380 篇 电气工程
    • 278 篇 控制科学与工程
    • 153 篇 软件工程
    • 79 篇 信息与通信工程
    • 40 篇 交通运输工程
    • 23 篇 仪器科学与技术
    • 20 篇 机械工程
    • 9 篇 生物工程
    • 8 篇 电子科学与技术(可...
    • 7 篇 力学(可授工学、理...
    • 7 篇 土木工程
    • 6 篇 动力工程及工程热...
    • 6 篇 石油与天然气工程
    • 4 篇 生物医学工程(可授...
    • 3 篇 材料科学与工程(可...
    • 3 篇 化学工程与技术
    • 3 篇 航空宇航科学与技...
    • 3 篇 安全科学与工程
  • 118 篇 理学
    • 98 篇 数学
    • 32 篇 系统科学
    • 22 篇 统计学(可授理学、...
    • 10 篇 生物学
    • 8 篇 物理学
    • 4 篇 化学
  • 66 篇 管理学
    • 63 篇 管理科学与工程(可...
    • 14 篇 工商管理
    • 5 篇 图书情报与档案管...
  • 5 篇 经济学
    • 4 篇 应用经济学
  • 3 篇 法学
    • 3 篇 社会学
  • 2 篇 医学
  • 1 篇 教育学

主题

  • 312 篇 reinforcement le...
  • 216 篇 dynamic programm...
  • 206 篇 optimal control
  • 107 篇 adaptive dynamic...
  • 104 篇 adaptive dynamic...
  • 97 篇 learning
  • 88 篇 neural networks
  • 78 篇 heuristic algori...
  • 68 篇 reinforcement le...
  • 58 篇 learning (artifi...
  • 54 篇 nonlinear system...
  • 53 篇 convergence
  • 51 篇 control systems
  • 51 篇 mathematical mod...
  • 48 篇 approximate dyna...
  • 44 篇 approximation al...
  • 43 篇 equations
  • 42 篇 adaptive control
  • 41 篇 artificial neura...
  • 41 篇 cost function

机构

  • 41 篇 chinese acad sci...
  • 27 篇 univ rhode isl d...
  • 17 篇 tianjin univ sch...
  • 16 篇 univ sci & techn...
  • 16 篇 univ illinois de...
  • 15 篇 northeastern uni...
  • 14 篇 beijing normal u...
  • 13 篇 northeastern uni...
  • 13 篇 guangdong univ t...
  • 12 篇 northeastern uni...
  • 9 篇 natl univ def te...
  • 8 篇 ieee
  • 8 篇 univ chinese aca...
  • 7 篇 univ chinese aca...
  • 7 篇 cent south univ ...
  • 7 篇 southern univ sc...
  • 7 篇 beijing univ tec...
  • 6 篇 chinese acad sci...
  • 6 篇 missouri univ sc...
  • 5 篇 nanjing univ pos...

作者

  • 54 篇 liu derong
  • 37 篇 wei qinglai
  • 29 篇 he haibo
  • 22 篇 wang ding
  • 21 篇 xu xin
  • 19 篇 jiang zhong-ping
  • 17 篇 lewis frank l.
  • 17 篇 yang xiong
  • 17 篇 zhang huaguang
  • 17 篇 ni zhen
  • 16 篇 zhao bo
  • 15 篇 gao weinan
  • 14 篇 zhao dongbin
  • 13 篇 zhong xiangnan
  • 12 篇 si jennie
  • 12 篇 derong liu
  • 10 篇 jagannathan s.
  • 10 篇 dongbin zhao
  • 10 篇 song ruizhuo
  • 9 篇 abouheaf mohamme...

语言

  • 994 篇 英文
  • 20 篇 其他
  • 6 篇 中文
检索条件"任意字段=IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning"
1020 条 记 录,以下是651-660 订阅
排序:
Stable Iterative Optimal Control for Discrete-Time Nonlinear Systems Using Numerical Controller
Stable Iterative Optimal Control for Discrete-Time Nonlinear...
收藏 引用
ieee International Conference on Vehicular Electronics and Safety (ICVES)
作者: Wei, Qinglai Liu, Derong Chinese Acad Sci State Key Lab Management & Control Complex Syst Inst Automat Beijing 100190 Peoples R China
This paper is concerned with a new iterative adaptive dynamic programming (ADP) algorithm to solve optimal control problems for infinite horizon discrete-time nonlinear systems using a numerical controller. The conver... 详细信息
来源: 评论
Hierarchical dynamic Power Management Using Model-Free reinforcement learning
Hierarchical Dynamic Power Management Using Model-Free Reinf...
收藏 引用
14th International symposium on Quality Electronic Design (ISQED)
作者: Wang, Yanzhi Triki, Maryam Lin, Xue Ammari, Ahmed C. Pedram, Massoud Univ So Calif Dept Elect Engn Los Angeles CA 90089 USA
Model-free reinforcement learning (RL) has become a promising technigue for designing a robust dynamic power management (DPM) framework that can cope with variations and uncertainties that emanate from hardware and ap... 详细信息
来源: 评论
A novel adaptive call admission control scheme for distributed reinforcement learning based dynamic spectrum access in cellular networks
A novel adaptive call admission control scheme for distribut...
收藏 引用
10th ieee International symposium on Wireless Communication Systems 2013, ISWCS 2013
作者: Morozs, Nils Clarke, Tim Grace, David Department of Electronics University of York Heslington York YO10 5DD United Kingdom
This paper introduces a novel Q-value based adaptive call admission control scheme (Q-CAC) for distributed reinforcement learning (RL) based dynamic spectrum access (DSA) in mobile cellular networks, which provides a ... 详细信息
来源: 评论
adaptive Control for an HVDC Transmission Link with FACTS and a Wind Farm
Adaptive Control for an HVDC Transmission Link with FACTS an...
收藏 引用
Conference of the ieee PES on Innovative Smart Grid Technologies (ISGT)
作者: Tang, Yufei He, Haibo Wen, Jinyu Univ Rhode Isl Dept Elect Comp & Biomed Engn Kingston RI 02881 USA Huazhong Univ Sci & Technol Coll Elect Elect Engn Wuhan 430074 Peoples R China
Due to the nonlinearity, uncertainty and complexity of the power system, it is a challenging task to design an effective control approach based on the exact model using traditional methods. In this paper, we investiga... 详细信息
来源: 评论
Exploring the relationship of reward and punishment in reinforcement learning
Exploring the relationship of reward and punishment in reinf...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (ADPRL)
作者: Robert Lowe Tom Ziemke Interaction Lab University of Skövde Skövde Sweden
We present a reinforcement learning algorithm based on Dyna-Sarsa that utilizes separate representations of reward and punishment when guiding state-action value learning and action selection. The adoption of policy m... 详细信息
来源: 评论
Optimal control for a class of nonlinear systems with state delay based on adaptive dynamic programming with ε-error bound
Optimal control for a class of nonlinear systems with state ...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (ADPRL)
作者: Xiaofeng Lin Nuyun Cao Yuzhang Lin School of Electrical Engineering Guangxi University Nanning China Department of Electrical Engineering Tsinghua University Beijing China
In this paper, a finite-horizon ε-optimal control for a class of nonlinear systems with state delay is proposed by adaptive dynamic programming (ADP) algorithm. First of all, the performance index function is defined... 详细信息
来源: 评论
Finite-horizon optimal control design for uncertain linear discrete-time systems
Finite-horizon optimal control design for uncertain linear d...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (ADPRL)
作者: Qiming Zhao Hao Xu S. Jagannathan Department of Electrical and Computer Engineering Missouri University of Science and Technology Rolla MO USA
In this paper, the finite-horizon optimal adaptive control design for linear discrete-time systems with unknown system dynamics by using adaptive dynamic programming (ADP) is presented. In the presence of full state f... 详细信息
来源: 评论
Exponential moving average Q-learning algorithm
Exponential moving average Q-learning algorithm
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (ADPRL)
作者: Mostafa D. Awheda Howard M. Schwartz Department of Systems and Computer Engineering Carleton University Ottawa Canada
A multi-agent policy iteration learning algorithm is proposed in this work. The Exponential Moving Average (EMA) mechanism is used to update the policy for a Q-learning agent so that it converges to an optimal policy ... 详细信息
来源: 评论
An integrated design for intensified direct heuristic dynamic programming
An integrated design for intensified direct heuristic dynami...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (ADPRL)
作者: Xiong Luo Jennie Si Yuchao Zhou School of Computer and Communication Engineering University of Science and Technology Beijing (USTB) Beijing China Arizona State University Tempe AZ US
There has been a growing interest in the study of adaptive/approximate dynamic programming (ADP) in recent years. The ADP technique provides a powerful tool to understand and improve the principled technologies of mac... 详细信息
来源: 评论
A novel approach for constructing basis functions in approximate dynamic programming for feedback control
A novel approach for constructing basis functions in approxi...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning, (ADPRL)
作者: Jian Wang Zhenhua Huang Xin Xu College of Mechatronics and Automation National University of Defense Tech Changsha P. R. China
This paper presents a novel approach for constructing basis functions in approximate dynamic programming (ADP) through the locally linear embedding (LLE) process. It considers the experience (sample) data as a high-di... 详细信息
来源: 评论