咨询与建议

限定检索结果

文献类型

  • 748 篇 会议
  • 271 篇 期刊文献
  • 4 册 图书

馆藏范围

  • 1,023 篇 电子文献
  • 1 种 纸本馆藏

日期分布

学科分类号

  • 712 篇 工学
    • 520 篇 计算机科学与技术...
    • 381 篇 电气工程
    • 278 篇 控制科学与工程
    • 153 篇 软件工程
    • 79 篇 信息与通信工程
    • 40 篇 交通运输工程
    • 23 篇 仪器科学与技术
    • 20 篇 机械工程
    • 9 篇 生物工程
    • 8 篇 电子科学与技术(可...
    • 7 篇 力学(可授工学、理...
    • 7 篇 土木工程
    • 6 篇 动力工程及工程热...
    • 6 篇 石油与天然气工程
    • 4 篇 生物医学工程(可授...
    • 3 篇 材料科学与工程(可...
    • 3 篇 化学工程与技术
    • 3 篇 航空宇航科学与技...
    • 3 篇 安全科学与工程
  • 118 篇 理学
    • 98 篇 数学
    • 32 篇 系统科学
    • 22 篇 统计学(可授理学、...
    • 10 篇 生物学
    • 8 篇 物理学
    • 4 篇 化学
  • 66 篇 管理学
    • 63 篇 管理科学与工程(可...
    • 14 篇 工商管理
    • 5 篇 图书情报与档案管...
  • 5 篇 经济学
    • 4 篇 应用经济学
  • 3 篇 法学
    • 3 篇 社会学
  • 2 篇 医学
  • 1 篇 教育学

主题

  • 313 篇 reinforcement le...
  • 216 篇 dynamic programm...
  • 206 篇 optimal control
  • 107 篇 adaptive dynamic...
  • 104 篇 adaptive dynamic...
  • 97 篇 learning
  • 88 篇 neural networks
  • 78 篇 heuristic algori...
  • 68 篇 reinforcement le...
  • 58 篇 learning (artifi...
  • 54 篇 nonlinear system...
  • 53 篇 convergence
  • 51 篇 control systems
  • 51 篇 mathematical mod...
  • 48 篇 approximate dyna...
  • 44 篇 approximation al...
  • 43 篇 equations
  • 42 篇 adaptive control
  • 41 篇 artificial neura...
  • 41 篇 cost function

机构

  • 41 篇 chinese acad sci...
  • 27 篇 univ rhode isl d...
  • 17 篇 tianjin univ sch...
  • 16 篇 univ sci & techn...
  • 16 篇 univ illinois de...
  • 15 篇 northeastern uni...
  • 14 篇 beijing normal u...
  • 13 篇 northeastern uni...
  • 13 篇 guangdong univ t...
  • 12 篇 northeastern uni...
  • 9 篇 natl univ def te...
  • 8 篇 ieee
  • 8 篇 univ chinese aca...
  • 7 篇 univ chinese aca...
  • 7 篇 cent south univ ...
  • 7 篇 southern univ sc...
  • 7 篇 beijing univ tec...
  • 6 篇 chinese acad sci...
  • 6 篇 missouri univ sc...
  • 5 篇 nanjing univ pos...

作者

  • 54 篇 liu derong
  • 37 篇 wei qinglai
  • 29 篇 he haibo
  • 22 篇 wang ding
  • 21 篇 xu xin
  • 19 篇 jiang zhong-ping
  • 17 篇 lewis frank l.
  • 17 篇 yang xiong
  • 17 篇 zhang huaguang
  • 17 篇 ni zhen
  • 16 篇 zhao bo
  • 15 篇 gao weinan
  • 14 篇 zhao dongbin
  • 13 篇 derong liu
  • 13 篇 zhong xiangnan
  • 12 篇 si jennie
  • 10 篇 jagannathan s.
  • 10 篇 dongbin zhao
  • 10 篇 song ruizhuo
  • 9 篇 abouheaf mohamme...

语言

  • 992 篇 英文
  • 25 篇 其他
  • 6 篇 中文
检索条件"任意字段=IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning"
1023 条 记 录,以下是301-310 订阅
A transfer learning approach based on integrated feature extractor for anti-jamming in wireless networks
A transfer learning approach based on integrated feature ext...
收藏 引用
ieee International symposium on Personal, Indoor and Mobile Radio Communications (PIMRC)
作者: Siavash Barqi Janiar Ping Wang York University
One of the security issues in a wireless network is jamming attacks, where the jammer causes congestion and significant decrement in the network throughput by obstructing channels and disrupting user signals. Deep rei...
来源: 评论
MEWA: A Benchmark For Meta-learning in Collaborative Working Agents
MEWA: A Benchmark For Meta-Learning in Collaborative Working...
收藏 引用
ieee symposium Series on Computational Intelligence (SSCI)
作者: Radu Stoican Angelo Cangelosi Thomas H. Weisswange Manchester Centre for Robotics and AI University of Manchester Manchester United Kingdom Honda Research Institute Europe GmbH Offenbach Germany
Meta-reinforcement learning aims to overcome important limitations in reinforcement learning, like low sample efficiency and poor generalization, by creating agents that adapt to new tasks. The development of intellig...
来源: 评论
adaptive dynamic programming based on parallel control theory for underwater vehicles  1
Adaptive dynamic programming based on parallel control theor...
收藏 引用
1st ieee International Conference on Digital Twins and Parallel Intelligence, DTPI 2021
作者: Bo, Peng Tu, Xingbin Qu, Fengzhong Wang, Fei-Yue Zhejiang University Key Laboratory of Ocean Observation-Imaging Testbed of Zhejiang Province Zhoushan China Institute of Automation Chinese Academy of Sciences Beijing China
Parallel control theory can provide an effective solution for the control problem of complex system with unknown models and time-varying characteristics. The adaptive dynamic programming (ADP) method, which combines r... 详细信息
来源: 评论
adaptive Critic learning and Experience Replay for Decentralized Event-Triggered Control of Nonlinear Interconnected Systems
收藏 引用
ieee TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS 2020年 第11期50卷 4043-4055页
作者: Yang, Xiong He, Haibo Tianjin Univ Sch Elect & Informat Engn Tianjin 300072 Peoples R China Univ Rhode Isl Dept Elect Comp & Biomed Engn Kingston RI 02881 USA
In this paper, we develop a decentralized event-triggered control (ETC) strategy for a class of nonlinear systems with uncertain interconnections. To begin with, we show that the decentralized ETC policy for the whole... 详细信息
来源: 评论
Event-Triggered Decentralized Tracking Control of Modular Reconfigurable Robots Through adaptive dynamic programming
收藏 引用
ieee TRANSACTIONS ON INDUSTRIAL ELECTRONICS 2020年 第4期67卷 3054-3064页
作者: Zhao, Bo Liu, Derong Beijing Normal Univ Sch Syst Sci Beijing 100875 Peoples R China Chinese Acad Sci Inst Automat State Key Lab Management & Control Complex Syst Beijing 100190 Peoples R China Guangdong Univ Technol Sch Automat Guangzhou 510006 Peoples R China
This paper develops an event-triggered decentralized tracking control (DTC) approach for modular reconfigurable robots (MRRs) by using adaptive dynamic programming. By establishing a decentralized neural network (NN) ... 详细信息
来源: 评论
A Model-Free Solution for Stackelberg Games Using reinforcement learning and Projection Approaches
A Model-Free Solution for Stackelberg Games Using Reinforcem...
收藏 引用
International Workshop on Robot Sensing (ROSE)
作者: Mohammed Abouheaf Wail Gueaieb Suruz Miah Esam H. Abdelhameed Robotics Engineering Bowling Green State University Bowling Green OH USA School of Electrical Engineering and Computer Science University of Ottawa Ottawa Ontario Canada Department of Electrical and Computer Engineering Bradley University Peoria Illinois USA Faculty of Energy Engineering Aswan University Aswan Egypt
The Stackelberg game is adopted in many robotics applications. It features a dynamic multi-player setup based on a leader-follower structure. The main challenge involves implementing model-free strategies that can eff... 详细信息
来源: 评论
Fault Diagnosis for Underactuated Surface Vessel  40
Fault Diagnosis for Underactuated Surface Vessel
收藏 引用
40th Chinese Control Conference (CCC)
作者: Mao, Ruiqi Cui, Rongin Northwestern Polytech Univ Sch Marine Sci & Technol Xian 710000 Peoples R China
In recent years deep neural networks have achieved state-of-the-art accuracy at classifying the running state of a robot. Yet we propose a composite learning model (CLM) that combines the strength of broad learning an... 详细信息
来源: 评论
Safe adaptive dynamic programming Method for Nonlinear Safety-Critical Systems with Disturbance  6
Safe Adaptive Dynamic Programming Method for Nonlinear Safet...
收藏 引用
6th International Conference on Robotics and Automation Engineering, ICRAE 2021
作者: Wang, Jinguang Zhang, Dehua Zhang, Jishi Zhu, Heyang Hu, Shaolin Qin, Chunbin Henan University School of Artificial Intelligence Kaifeng China Guangdong University of Petrochemical Technology School of Automation Maoming China
In this paper, a safe adaptive dynamic programming (SADP) method based on the barrier function (BF) is proposed for the optimal control problem of nonlinear safety-critical systems with the safety constraints and exte... 详细信息
来源: 评论
***: Power-Aware Traffic Engineering via Deep reinforcement learning  29
***: Power-Aware Traffic Engineering via Deep Reinforcement ...
收藏 引用
29th ieee/ACM International symposium on Quality of Service (IWQOS)
作者: Pan, Tian Peng, Xiaoyu Shi, Qianqian Bian, Zizheng Lin, Xingchen Song, Enge Li, Fuliang Xu, Yang Huang, Tao BUPT State Key Lab Networking & Switching Technol Beijing Peoples R China Sci & Technol Commun Networks Lab Shijiazhuang Hebei Peoples R China Northeastern Univ Shenyang Liaoning Peoples R China Fudan Univ Shanghai Peoples R China
Power-aware traffic engineering via coordinated sleeping is usually formulated into Integer programming problems, which are generally NP-hard with unbounded computation time for large-scale networks. This results in d... 详细信息
来源: 评论
DATE: Disturbance-Aware Traffic Engineering with reinforcement learning in Software-Defined Networks  29
DATE: Disturbance-Aware Traffic Engineering with Reinforceme...
收藏 引用
29th ieee/ACM International symposium on Quality of Service (IWQOS)
作者: Ye, Minghao Zhang, Junjie Guo, Zehua Chao, H. Jonathan NYU Dept Elect & Comp Engn New York NY 11201 USA Fortinet Inc Sunnyvale CA 94086 USA Beijing Inst Technol Beijing 100081 Peoples R China
Traffic Engineering (TE) has been applied to optimize network performance by routing/rerouting flows based on traffic loads and network topologies. To cope with network dynamics from emerging applications, it is essen... 详细信息
来源: 评论