咨询与建议

限定检索结果

文献类型

  • 745 篇 会议
  • 269 篇 期刊文献
  • 4 册 图书

馆藏范围

  • 1,018 篇 电子文献
  • 1 种 纸本馆藏

日期分布

学科分类号

  • 711 篇 工学
    • 520 篇 计算机科学与技术...
    • 380 篇 电气工程
    • 278 篇 控制科学与工程
    • 153 篇 软件工程
    • 79 篇 信息与通信工程
    • 40 篇 交通运输工程
    • 23 篇 仪器科学与技术
    • 20 篇 机械工程
    • 9 篇 生物工程
    • 8 篇 电子科学与技术(可...
    • 7 篇 力学(可授工学、理...
    • 7 篇 土木工程
    • 6 篇 动力工程及工程热...
    • 6 篇 石油与天然气工程
    • 4 篇 生物医学工程(可授...
    • 3 篇 材料科学与工程(可...
    • 3 篇 化学工程与技术
    • 3 篇 航空宇航科学与技...
    • 3 篇 安全科学与工程
  • 118 篇 理学
    • 98 篇 数学
    • 32 篇 系统科学
    • 22 篇 统计学(可授理学、...
    • 10 篇 生物学
    • 8 篇 物理学
    • 4 篇 化学
  • 66 篇 管理学
    • 63 篇 管理科学与工程(可...
    • 14 篇 工商管理
    • 5 篇 图书情报与档案管...
  • 5 篇 经济学
    • 4 篇 应用经济学
  • 3 篇 法学
    • 3 篇 社会学
  • 2 篇 医学
  • 1 篇 教育学

主题

  • 311 篇 reinforcement le...
  • 215 篇 dynamic programm...
  • 206 篇 optimal control
  • 107 篇 adaptive dynamic...
  • 104 篇 adaptive dynamic...
  • 97 篇 learning
  • 88 篇 neural networks
  • 77 篇 heuristic algori...
  • 68 篇 reinforcement le...
  • 58 篇 learning (artifi...
  • 54 篇 nonlinear system...
  • 53 篇 convergence
  • 51 篇 control systems
  • 51 篇 mathematical mod...
  • 48 篇 approximate dyna...
  • 44 篇 approximation al...
  • 43 篇 equations
  • 42 篇 adaptive control
  • 41 篇 artificial neura...
  • 41 篇 cost function

机构

  • 41 篇 chinese acad sci...
  • 27 篇 univ rhode isl d...
  • 17 篇 tianjin univ sch...
  • 16 篇 univ sci & techn...
  • 16 篇 univ illinois de...
  • 15 篇 northeastern uni...
  • 14 篇 beijing normal u...
  • 13 篇 northeastern uni...
  • 13 篇 guangdong univ t...
  • 12 篇 northeastern uni...
  • 9 篇 natl univ def te...
  • 8 篇 ieee
  • 8 篇 univ chinese aca...
  • 7 篇 univ chinese aca...
  • 7 篇 cent south univ ...
  • 7 篇 southern univ sc...
  • 7 篇 beijing univ tec...
  • 6 篇 chinese acad sci...
  • 6 篇 missouri univ sc...
  • 5 篇 nanjing univ pos...

作者

  • 54 篇 liu derong
  • 37 篇 wei qinglai
  • 29 篇 he haibo
  • 22 篇 wang ding
  • 21 篇 xu xin
  • 19 篇 jiang zhong-ping
  • 17 篇 lewis frank l.
  • 17 篇 yang xiong
  • 17 篇 zhang huaguang
  • 17 篇 ni zhen
  • 16 篇 zhao bo
  • 15 篇 gao weinan
  • 14 篇 zhao dongbin
  • 13 篇 zhong xiangnan
  • 12 篇 si jennie
  • 11 篇 derong liu
  • 10 篇 jagannathan s.
  • 10 篇 dongbin zhao
  • 10 篇 song ruizhuo
  • 9 篇 abouheaf mohamme...

语言

  • 992 篇 英文
  • 20 篇 其他
  • 6 篇 中文
检索条件"任意字段=IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning"
1018 条 记 录,以下是411-420 订阅
排序:
Stable Iterative Optimal Control for Discrete-Time Nonlinear Systems Using Numerical Controller
Stable Iterative Optimal Control for Discrete-Time Nonlinear...
收藏 引用
ieee International Conference on Vehicular Electronics and Safety (ICVES)
作者: Wei, Qinglai Liu, Derong Chinese Acad Sci State Key Lab Management & Control Complex Syst Inst Automat Beijing 100190 Peoples R China
This paper is concerned with a new iterative adaptive dynamic programming (ADP) algorithm to solve optimal control problems for infinite horizon discrete-time nonlinear systems using a numerical controller. The conver... 详细信息
来源: 评论
Data-Driven learning and Control with Multiple Critic Networks
Data-Driven Learning and Control with Multiple Critic Networ...
收藏 引用
10th World Congress on Intelligent Control and Automation (WCICA)
作者: He, Haibo Ni, Zhen Zhao, Dongbin Univ Rhode Isl Dept Elect Comp & Biomed Engn Kingston RI 02881 USA Chinese Acad Sci Inst Automat State Key Lab Management & Control Complex Syst Beijing 100190 Peoples R China
In this paper, we extend our previous work of a three-network adaptive dynamic programming design [1] to be a multiple critic networks design for online learning and control. The key idea of this approach is to develo... 详细信息
来源: 评论
An Enhanced reinforcement learning Approach for dynamic Placement of Virtual Network Functions  31
An Enhanced Reinforcement Learning Approach for Dynamic Plac...
收藏 引用
31st Annual ieee International symposium on Personal, Indoor and Mobile Radio Communications (ieee PIMRC)
作者: Houidi, Omar Soualah, Oussama Louati, Wajdi Zeghlache, Djamal Inst Polytech Paris Telecom SudParis Samovar UMR 5157 CNRS Paris France Univ Sfax ReDCAD Lab Sfax Tunisia
This paper addresses Virtualized Network Function Forwarding Graph (VNF-FG) embedding with the objective of realizing long term reward compared to placement algorithms that aim at instantaneous optimal placement. The ... 详细信息
来源: 评论
Distributed Optimal Consensus Control for Coupled Linear Systems Based on Off-Policy Integral reinforcement learning  39
Distributed Optimal Consensus Control for Coupled Linear Sys...
收藏 引用
39th Youth Academic Annual Conference of Chinese-Association-of-Automation (YAC)
作者: Zhao, Wenyan Liu, Zhongchang Yang, Xin Dalian Maritime Univ Coll Marine Elect Engn Dalian Peoples R China
This article studies the consensus problem for coupled linear multi-agent systems with completely unknown system dynamics. These coupled individual agent systems aim to achieve consensus for their system states while ... 详细信息
来源: 评论
adaptive dynamic programming for terminally constrained finite-horizon optimal control problems  53
Adaptive dynamic programming for terminally constrained fini...
收藏 引用
53rd ieee Annual Conference on Decision and Control (CDC)
作者: Andrews, L. Klotz, J. R. Kamalapurkar, R. Dixon, W. E. Univ Florida Dept Mech & Aerosp Engn Gainesville FL USA
adaptive dynamic programming is applied to control-affine nonlinear systems with uncertain drift dynamics to obtain a near-optimal solution to a finite-horizon optimal control problem with hard terminal constraints. A... 详细信息
来源: 评论
Observer-Based adaptive Synchronization Control of Unknown Discrete-Time Nonlinear Heterogeneous Systems
收藏 引用
ieee TRANSACTIONS ON NEURAL NETWORKS AND learning SYSTEMS 2022年 第2期33卷 681-693页
作者: Fu, Hao Chen, Xin Wang, Wei Wu, Min China Univ Geosci Sch Automat Wuhan 430074 Peoples R China Hubei Key Lab Adv Control & Intelligent Automat C Wuhan 430074 Peoples R China WISDRI Engn & Res Inc Ltd Wuhan 430223 Peoples R China
This article is concerned with the optimal synchronization problem for discrete-time nonlinear heterogeneous multiagent systems (MASs) with an active leader. To overcome the difficulty in the derivation of the optimal... 详细信息
来源: 评论
Event-trigger-based robust control for nonlinear constrained-input systems using reinforcement learning method
收藏 引用
NEUROCOMPUTING 2019年 340卷 158-170页
作者: Yang, Dongsheng Li, Ting Zhang, Huaguang Xie, Xiangpeng Northeastern Univ Coll Informat Sci & Engn Shenyang 110819 Liaoning Peoples R China Nanjing Univ Posts & Telecommun Inst Adv Technol Nanjing 210023 Jiangsu Peoples R China
In this paper, an online integral reinforcement learning strategy is proposed to deal with robust constrained control problems using event-triggered mechanism for nonlinear Continuous-Time (C-T) systems with external ... 详细信息
来源: 评论
Event-Triggered Robust Stabilization of Nonlinear Input-Constrained Systems Using Single Network adaptive Critic Designs
收藏 引用
ieee TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS 2020年 第9期50卷 3145-3157页
作者: Yang, Xiong He, Haibo Tianjin Univ Sch Elect & Informat Engn Tianjin 300072 Peoples R China Univ Rhode Isl Dept Elect Comp & Biomed Engn Kingston RI 02881 USA
In this paper, we study the event-triggered robust stabilization problem of nonlinear systems subject to mismatched perturbations and input constraints. First, with the introduction of an infinite-horizon cost functio... 详细信息
来源: 评论
Optimal Tracking Control of the Boiler-turbine System Based on adaptive dynamic programming
Optimal Tracking Control of the Boiler-turbine System Based ...
收藏 引用
International Joint Conference on Neural Networks (IJCNN)
作者: Liu, Yujia Gao, Aiguo Wei, Qinglai Chinese Acad Sci Inst Automat Beijing Peoples R China North China Elect Power Res Inst Co Ltd Beijing Peoples R China
To guarantee the efficient performance of the power plant, an adaptive tacking controller for the nonlinear boiler-turbine system based on offline policy iteration adaptive dynamic prorgamming (ADP) method is proposed... 详细信息
来源: 评论
Smooth and Stopping Interval Aware Driving Behavior Prediction at Un-signalized Intersection with Inverse reinforcement learning on Sequential MDPs  32
Smooth and Stopping Interval Aware Driving Behavior Predicti...
收藏 引用
32nd ieee Intelligent Vehicles symposium (IV)
作者: Yang, Shaoyu Yoshitake, Hiroshi Shino, Motoki Shimosaka, Masamichi Tokyo Inst Technol Dept Comp Sci Tokyo Japan Univ Tokyo Dept Human & Engn Environm Studies Chiba Japan
Driving behavior modeling (DBM) is widely used in the intelligent vehicle field to prevent accidents, which predicts actions that vehicles should take to optimize safe driving behaviors. According to some statistics, ... 详细信息
来源: 评论