咨询与建议

限定检索结果

文献类型

  • 746 篇 会议
  • 270 篇 期刊文献
  • 4 册 图书

馆藏范围

  • 1,020 篇 电子文献
  • 1 种 纸本馆藏

日期分布

学科分类号

  • 711 篇 工学
    • 520 篇 计算机科学与技术...
    • 380 篇 电气工程
    • 278 篇 控制科学与工程
    • 153 篇 软件工程
    • 79 篇 信息与通信工程
    • 40 篇 交通运输工程
    • 23 篇 仪器科学与技术
    • 20 篇 机械工程
    • 9 篇 生物工程
    • 8 篇 电子科学与技术(可...
    • 7 篇 力学(可授工学、理...
    • 7 篇 土木工程
    • 6 篇 动力工程及工程热...
    • 6 篇 石油与天然气工程
    • 4 篇 生物医学工程(可授...
    • 3 篇 材料科学与工程(可...
    • 3 篇 化学工程与技术
    • 3 篇 航空宇航科学与技...
    • 3 篇 安全科学与工程
  • 118 篇 理学
    • 98 篇 数学
    • 32 篇 系统科学
    • 22 篇 统计学(可授理学、...
    • 10 篇 生物学
    • 8 篇 物理学
    • 4 篇 化学
  • 66 篇 管理学
    • 63 篇 管理科学与工程(可...
    • 14 篇 工商管理
    • 5 篇 图书情报与档案管...
  • 5 篇 经济学
    • 4 篇 应用经济学
  • 3 篇 法学
    • 3 篇 社会学
  • 2 篇 医学
  • 1 篇 教育学

主题

  • 312 篇 reinforcement le...
  • 216 篇 dynamic programm...
  • 206 篇 optimal control
  • 107 篇 adaptive dynamic...
  • 104 篇 adaptive dynamic...
  • 97 篇 learning
  • 88 篇 neural networks
  • 78 篇 heuristic algori...
  • 68 篇 reinforcement le...
  • 58 篇 learning (artifi...
  • 54 篇 nonlinear system...
  • 53 篇 convergence
  • 51 篇 control systems
  • 51 篇 mathematical mod...
  • 48 篇 approximate dyna...
  • 44 篇 approximation al...
  • 43 篇 equations
  • 42 篇 adaptive control
  • 41 篇 artificial neura...
  • 41 篇 cost function

机构

  • 41 篇 chinese acad sci...
  • 27 篇 univ rhode isl d...
  • 17 篇 tianjin univ sch...
  • 16 篇 univ sci & techn...
  • 16 篇 univ illinois de...
  • 15 篇 northeastern uni...
  • 14 篇 beijing normal u...
  • 13 篇 northeastern uni...
  • 13 篇 guangdong univ t...
  • 12 篇 northeastern uni...
  • 9 篇 natl univ def te...
  • 8 篇 ieee
  • 8 篇 univ chinese aca...
  • 7 篇 univ chinese aca...
  • 7 篇 cent south univ ...
  • 7 篇 southern univ sc...
  • 7 篇 beijing univ tec...
  • 6 篇 chinese acad sci...
  • 6 篇 missouri univ sc...
  • 5 篇 nanjing univ pos...

作者

  • 54 篇 liu derong
  • 37 篇 wei qinglai
  • 29 篇 he haibo
  • 22 篇 wang ding
  • 21 篇 xu xin
  • 19 篇 jiang zhong-ping
  • 17 篇 lewis frank l.
  • 17 篇 yang xiong
  • 17 篇 zhang huaguang
  • 17 篇 ni zhen
  • 16 篇 zhao bo
  • 15 篇 gao weinan
  • 14 篇 zhao dongbin
  • 13 篇 zhong xiangnan
  • 12 篇 si jennie
  • 12 篇 derong liu
  • 10 篇 jagannathan s.
  • 10 篇 dongbin zhao
  • 10 篇 song ruizhuo
  • 9 篇 abouheaf mohamme...

语言

  • 994 篇 英文
  • 20 篇 其他
  • 6 篇 中文
检索条件"任意字段=IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning"
1020 条 记 录,以下是311-320 订阅
排序:
Mobile-Aware Online Task Offloading Based on Deep reinforcement learning in Mobile Edge Computing Networks
Mobile-Aware Online Task Offloading Based on Deep Reinforcem...
收藏 引用
ieee International symposium on Personal, Indoor and Mobile Radio Communications (PIMRC)
作者: Yuting Li Yitong Liu Xingcheng Liu Qiang Tu Yi Xie School of Electronics and Information Technology Sun Yat-sen University Guangzhou China School of Computer Science and Engineering Sun Yat-sen University Guangzhou China Jiangsu Viscore Technologies Co. Ltd. Suzhou China
Mobile Edge Computing (MEC) is one of the key enabling technologies for future 6G wireless networks that can provide lower latency service and more efficient resource utilization for future intelligent applications an...
来源: 评论
adaptive dynamic programming for Decentralized Stabilization of Uncertain Nonlinear Large-Scale Systems With Mismatched Interconnections
收藏 引用
ieee TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS 2020年 第8期50卷 2870-2882页
作者: Yang, Xiong He, Haibo Tianjin Univ Sch Elect & Informat Engn Tianjin 300072 Peoples R China Univ Rhode Isl Dept Elect Comp & Biomed Engn Kingston RI 02881 USA
This paper presents a novel decentralized control strategy for a class of uncertain nonlinear large-scale systems with mismatched interconnections. First, it is shown that the decentralized controller for the overall ... 详细信息
来源: 评论
reinforcement-learning-Based Risk-Sensitive Optimal Feedback Mechanisms of Biological Motor Control
Reinforcement-Learning-Based Risk-Sensitive Optimal Feedback...
收藏 引用
ieee Conference on Decision and Control
作者: Leilei Cui Bo Pang Zhong-Ping Jiang Department of Electrical and Computer Engineering Control and Networks Lab Tandon School of Engineering New York University Brooklyn NY USA
Risk sensitivity is a fundamental aspect of biological motor control that accounts for both the expectation and variability of movement cost in the face of uncertainty. However, most computational models of biological...
来源: 评论
MARS: An adaptive Multi-Agent DRL-based Scheduler for Multipath QUIC in dynamic Networks
MARS: An Adaptive Multi-Agent DRL-based Scheduler for Multip...
收藏 引用
International Workshop on Quality of Service
作者: Xueqiang Han Biao Han Ruidong Li Xiaolan Ji College of Computer National University of Defense Technology Changsha China Graduate School of Natural Science and Technology Kanazawa University Kanazawa Japan
The multipath extension of the Quick UDP Internet Connection (QUIC) protocol, also called MPQUIC, is currently attracting increasing attention from both industry and academia. The multipath scheduler of MPQUIC determi...
来源: 评论
Enhancing 5G Network Throughput Using reinforcement learning
Enhancing 5G Network Throughput Using Reinforcement Learning
收藏 引用
Communication, Computing and Signal Processing (IICCCS), ieee International Conference on
作者: Myasar Mundher Adnan Muydinov Firuzjon Farkhodjonovich Rohit Shrivastava Anika Bhandari AR Aravind Sandeep Kumar S Department Of Computers Techniques Engineering College Of Technical Engineering The Islamic University Najaf Iraq Department Of Computers Techniques Engineering College Of Technical Engineering The Islamic University Of Al Diwaniyah Al Diwaniyah Iraq Head Of Educational and Methodological Department Fergana Medical Institute Of Public Health Fergana Uzbekistan Department of Electronics & Communication Engineering IES College of Technology Bhopal M.P. India Department of Computer Application Chandigarh Engineering College Chandigarh Group of Colleges Mohali Punjab India Prince Shri Venkateshwara Padmavathy Engineering College Chennai India Department of Computer and Communication Engineering NMAM Institute of Technology (NITTE Deemed to be University) Udupi Karnataka India
In this research, RL is proposed as a solution towards achieving the dynamic optimization of 5G network throughput hindered by the complexity of Intersystems environments. To this extent, the optimization problem is f... 详细信息
来源: 评论
dynamic Resource Management for Cloud-native Bulk Synchronous Parallel Applications
Dynamic Resource Management for Cloud-native Bulk Synchronou...
收藏 引用
International symposium on Object-Oriented Real-Time Distributed Computing
作者: Evan Wang Yogesh Barve Aniruddha Gokhale Hongyang Sun Dept of CS Vanderbilt University Nashville TN USA Dept of EECS University of Kansas Lawrence KS USA
Many traditional high-performance computing applications including those that follow the Bulk Synchronous Parallel (BSP) communication paradigm are increasingly being deployed in cloud-native virtualized and multi-ten...
来源: 评论
Optimization control of UAVs based on self-learning adaptive dynamic programming  35
Optimization control of UAVs based on self-learning adaptive...
收藏 引用
35th Youth Academic Annual Conference of Chinese-Association-of-Automation (YAC)
作者: Ye, Shuai Zhou, Ying-Jiang Jiang, Guo-Ping Lin, Qiong Nanjing Univ Posts & Telecommun Dept Automat & Artificial Intelligence Nanjing 210023 Peoples R China
In UAVs, optimal control has attracted more and more attention. In this paper, a self-learning adaptive dynamic programming (ADP) architecture based reinforcement learning (RL) is proposed to obtain optimal control fo... 详细信息
来源: 评论
Balancing Value Iteration and Policy Iteration for Discrete-Time Control
收藏 引用
ieee TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS 2020年 第11期50卷 3948-3958页
作者: Luo, Biao Yang, Yin Wu, Huai-Ning Huang, Tingwen Cent South Univ Sch Automat Changsha 410083 Peoples R China Hamad Bin Khalifa Univ Coll Sci & Engn Doha Qatar Beihang Univ Sci & Technol Aircraft Control Lab Beijing 100191 Peoples R China Texas A&M Univ Qatar Dept Sci Doha Qatar
The optimal control problem of discrete-time nonlinear systems depends on the solution of the Bellman equation. In this paper, an adaptive reinforcement learning (RL) method is developed to solve the complex Bellman e... 详细信息
来源: 评论
A Budget-aware Incentive Mechanism for Vehicle-to-Grid via reinforcement learning
A Budget-aware Incentive Mechanism for Vehicle-to-Grid via R...
收藏 引用
International Workshop on Quality of Service
作者: Tianxiang Zhu Xiaoxi Zhang Jingpu Duan Zhi Zhou Xu Chen Sun Yat-sen University Guangzhou China Southern University of Science and Technology Shenzhen China Pengcheng Laboratory Shenzhen China
With the increasing penetration of renewable energy and electric vehicles (EVs), the behavior of EVs' charging and discharging has shown great impact on the Micro Grid power load, motivating the development of Veh...
来源: 评论
Research on Control Strategy of Hybrid Superconducting Energy Storage Based on adaptive dynamic programming
Research on Control Strategy of Hybrid Superconducting Energ...
收藏 引用
International Conference on Applied Superconductivity and Electromagnetic Devices, ASEMD
作者: Yang Liu Xingfan Han Zuoxia Xing Pengtao Li Hengyu Liu Zhanpeng Jiang School of Electrical Engineering Shenyang University of Technology Shenyang China
Frequent charging and discharging of the battery will seriously shorten the battery life, thus increasing the power fluctuation in the distribution network. In this paper, a microgrid energy storage model combining su...
来源: 评论