咨询与建议

限定检索结果

文献类型

  • 745 篇 会议
  • 266 篇 期刊文献
  • 4 册 图书

馆藏范围

  • 1,015 篇 电子文献
  • 1 种 纸本馆藏

日期分布

学科分类号

  • 708 篇 工学
    • 521 篇 计算机科学与技术...
    • 377 篇 电气工程
    • 277 篇 控制科学与工程
    • 155 篇 软件工程
    • 79 篇 信息与通信工程
    • 39 篇 交通运输工程
    • 23 篇 仪器科学与技术
    • 20 篇 机械工程
    • 9 篇 生物工程
    • 8 篇 电子科学与技术(可...
    • 7 篇 力学(可授工学、理...
    • 6 篇 动力工程及工程热...
    • 6 篇 石油与天然气工程
    • 5 篇 土木工程
    • 4 篇 航空宇航科学与技...
    • 4 篇 生物医学工程(可授...
    • 3 篇 材料科学与工程(可...
    • 3 篇 化学工程与技术
    • 3 篇 安全科学与工程
  • 119 篇 理学
    • 99 篇 数学
    • 33 篇 系统科学
    • 22 篇 统计学(可授理学、...
    • 10 篇 生物学
    • 8 篇 物理学
    • 4 篇 化学
  • 67 篇 管理学
    • 64 篇 管理科学与工程(可...
    • 15 篇 工商管理
    • 5 篇 图书情报与档案管...
  • 5 篇 经济学
    • 4 篇 应用经济学
  • 3 篇 法学
    • 3 篇 社会学
  • 2 篇 教育学
  • 2 篇 医学

主题

  • 309 篇 reinforcement le...
  • 214 篇 dynamic programm...
  • 203 篇 optimal control
  • 105 篇 adaptive dynamic...
  • 104 篇 adaptive dynamic...
  • 97 篇 learning
  • 87 篇 neural networks
  • 74 篇 heuristic algori...
  • 68 篇 reinforcement le...
  • 58 篇 learning (artifi...
  • 54 篇 nonlinear system...
  • 53 篇 convergence
  • 51 篇 control systems
  • 51 篇 mathematical mod...
  • 48 篇 approximate dyna...
  • 44 篇 approximation al...
  • 43 篇 equations
  • 42 篇 adaptive control
  • 41 篇 artificial neura...
  • 40 篇 cost function

机构

  • 41 篇 chinese acad sci...
  • 27 篇 univ rhode isl d...
  • 17 篇 tianjin univ sch...
  • 16 篇 univ sci & techn...
  • 16 篇 univ illinois de...
  • 15 篇 northeastern uni...
  • 14 篇 beijing normal u...
  • 13 篇 northeastern uni...
  • 12 篇 northeastern uni...
  • 12 篇 guangdong univ t...
  • 9 篇 natl univ def te...
  • 8 篇 ieee
  • 8 篇 univ chinese aca...
  • 7 篇 univ chinese aca...
  • 7 篇 cent south univ ...
  • 7 篇 southern univ sc...
  • 6 篇 chinese acad sci...
  • 6 篇 missouri univ sc...
  • 6 篇 beijing univ tec...
  • 5 篇 nanjing univ pos...

作者

  • 54 篇 liu derong
  • 37 篇 wei qinglai
  • 29 篇 he haibo
  • 21 篇 xu xin
  • 21 篇 wang ding
  • 19 篇 jiang zhong-ping
  • 17 篇 yang xiong
  • 17 篇 zhang huaguang
  • 17 篇 ni zhen
  • 16 篇 lewis frank l.
  • 16 篇 zhao bo
  • 15 篇 gao weinan
  • 14 篇 zhao dongbin
  • 13 篇 zhong xiangnan
  • 12 篇 si jennie
  • 11 篇 derong liu
  • 10 篇 jagannathan s.
  • 10 篇 dongbin zhao
  • 9 篇 song ruizhuo
  • 9 篇 abouheaf mohamme...

语言

  • 989 篇 英文
  • 20 篇 其他
  • 6 篇 中文
检索条件"任意字段=IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning"
1015 条 记 录,以下是181-190 订阅
排序:
Decentralized Optimal Neurocontroller Design for Mismatched Interconnected Systems via Integral Policy Iteration
收藏 引用
ieee TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS 2024年 第2期71卷 687-691页
作者: Wang, Ding Fan, Wenqian Liu, Ao Qiao, Junfei Beijing Univ Technol Fac Informat Technol Beijing 100124 Peoples R China Beijing Key Lab Computat Intelligence & Intelligen Beijing Univ Technol Beijing 100124 Peoples R China Beijing Univ Technol Beijing Lab Smart Environm Protect Beijing 100124 Peoples R China Beijing Univ Technol Beijing Inst Artificial Intelligence Beijing 100124 Peoples R China
In this brief, the decentralized optimal control problem of continuous-time input-affine nonlinear systems with mismatched interconnections is investigated by utilizing data-based integral policy iteration. Initially,... 详细信息
来源: 评论
Convergence of Value Iterations for Total-Cost MDPs and POMDPs with General State and Action Sets
Convergence of Value Iterations for Total-Cost MDPs and POMD...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning (ADPRL)
作者: Feinberg, Eugene A. Kasyanov, Pavlo O. Zgurovsky, Michael Z. SUNY Stony Brook Dept Appl Math & Stat Stony Brook NY 11794 USA Natl Tech Univ Ukraine Kyiv Polytech Inst Inst Appl Syst Anal UA-03056 Kiev Ukraine Natl Tech Univ Ukraine Kyiv Polytech Inst UA-03056 Kiev Ukraine
This paper describes conditions for convergence to optimal values of the dynamic programming algorithm applied to total-cost Markov Decision Processes (MDPSs) with Borel state and action sets and with possibly unbound... 详细信息
来源: 评论
Discrete-Time Self-learning Parallel Control
收藏 引用
ieee TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS 2022年 第1期52卷 192-204页
作者: Wei, Qinglai Wang, Lingxiao Lu, Jingwei Wang, Fei-Yue Chinese Acad Sci Inst Automat State Key Lab Management & Control Complex Syst Beijing 100190 Peoples R China Univ Chinese Acad Sci Sch Artificial Intelligence Beijing 100049 Peoples R China Qingdao Acad Intelligent Ind Parallel Intelligence Innovat Ctr Qingdao 266109 Peoples R China
In this article, a new self-learning parallel control method, which is based on adaptive dynamic programming (ADP) technique, is developed for solving the optimal control problem of discrete- time time-varying nonline... 详细信息
来源: 评论
reinforcement learning-based Optimal Control Considering L Computation Time Delay of Linear Discrete-time Systems
Reinforcement Learning-based Optimal Control Considering <i>...
收藏 引用
ieee symposium on adaptive dynamic programming and reinforcement learning (ADPRL)
作者: Fujita, Taishi Ushio, Toshimitsu
In embedded control systems, the control input is computed based on sensing data of a plant in a processor and there is a delay, called the computation time delay, due to the computation and the data transmission. Whe... 详细信息
来源: 评论
Off-Policy Model-Free learning for Multi-Player Non-Zero-Sum Games With Constrained Inputs
收藏 引用
ieee TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS 2023年 第2期70卷 910-920页
作者: Huo, Yu Wang, Ding Qiao, Junfei Li, Menghua Beijing Univ Technol Beijing Inst Artificial Intelligence Fac Informat Technol Beijing Key Lab Computat Intelligence & Intelligen Beijing 100124 Peoples R China Beijing Univ Technol Beijing Inst Artificial Intelligence Fac Informat Technol Beijing Lab Smart Environm Protect Beijing 100124 Peoples R China
In this paper, multi-player non-zero-sum games with control constraints are studied by utilizing a novel model-free approach based on adaptive dynamic programming framework. First, the model-based policy iteration (PI... 详细信息
来源: 评论
learning Without External Reward
收藏 引用
ieee COMPUTATIONAL INTELLIGENCE MAGAZINE 2018年 第3期13卷 48-54页
作者: He, Haibo Zhong, Xiangnan Univ Rhode Isl Dept Elect Comp & Biomed Engn Kingston RI 02881 USA Univ North Texas Dept Elect Engn Denton TX 76203 USA
In the traditional reinforcement learning paradigm, a reward signal is applied to define the goal of the task. Usually, the reward signal is a "hand-crafted" numerical value or a pre-defined function: it tel... 详细信息
来源: 评论
adaptive Critic learning and Experience Replay for Decentralized Event-Triggered Control of Nonlinear Interconnected Systems
收藏 引用
ieee TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS 2020年 第11期50卷 4043-4055页
作者: Yang, Xiong He, Haibo Tianjin Univ Sch Elect & Informat Engn Tianjin 300072 Peoples R China Univ Rhode Isl Dept Elect Comp & Biomed Engn Kingston RI 02881 USA
In this paper, we develop a decentralized event-triggered control (ETC) strategy for a class of nonlinear systems with uncertain interconnections. To begin with, we show that the decentralized ETC policy for the whole... 详细信息
来源: 评论
Policy Optimization adaptive dynamic programming for Optimal Control of Input-Affine Discrete-Time Nonlinear Systems
收藏 引用
ieee TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS 2023年 第7期53卷 4339-4350页
作者: Lin, Mingduo Zhao, Bo Beijing Normal Univ Sch Syst Sci Beijing 100875 Peoples R China Chongqing Univ Posts & Telecommun Key Lab Ind Internet Things & Networked Control Minist Educ Chongqing 400065 Peoples R China
In this article, a policy optimization adaptive dynamic programming (POADP) method is developed for optimal control of discrete-time unknown nonlinear systems, where the iterative control policy is parameterized to op... 详细信息
来源: 评论
learning continuous-action control policies
Learning continuous-action control policies
收藏 引用
2009 ieee symposium on adaptive dynamic programming and reinforcement learning, ADPRL 2009
作者: Pazis, Jason G. Lagoudakis, Michail Department of Electronic and Computer Engineering Technical University of Crete Chania Crete Greece
reinforcement learning for control in stochastic processes has received significant attention in the last few years. Several data-efficient methods, even for continuous state spaces, have been proposed, however most o... 详细信息
来源: 评论
Event-Triggered Decentralized Tracking Control of Modular Reconfigurable Robots Through adaptive dynamic programming
收藏 引用
ieee TRANSACTIONS ON INDUSTRIAL ELECTRONICS 2020年 第4期67卷 3054-3064页
作者: Zhao, Bo Liu, Derong Beijing Normal Univ Sch Syst Sci Beijing 100875 Peoples R China Chinese Acad Sci Inst Automat State Key Lab Management & Control Complex Syst Beijing 100190 Peoples R China Guangdong Univ Technol Sch Automat Guangzhou 510006 Peoples R China
This paper develops an event-triggered decentralized tracking control (DTC) approach for modular reconfigurable robots (MRRs) by using adaptive dynamic programming. By establishing a decentralized neural network (NN) ... 详细信息
来源: 评论