咨询与建议

限定检索结果

文献类型

  • 748 篇 会议
  • 271 篇 期刊文献
  • 4 册 图书

馆藏范围

  • 1,023 篇 电子文献
  • 1 种 纸本馆藏

日期分布

学科分类号

  • 712 篇 工学
    • 520 篇 计算机科学与技术...
    • 381 篇 电气工程
    • 278 篇 控制科学与工程
    • 153 篇 软件工程
    • 79 篇 信息与通信工程
    • 40 篇 交通运输工程
    • 23 篇 仪器科学与技术
    • 20 篇 机械工程
    • 9 篇 生物工程
    • 8 篇 电子科学与技术(可...
    • 7 篇 力学(可授工学、理...
    • 7 篇 土木工程
    • 6 篇 动力工程及工程热...
    • 6 篇 石油与天然气工程
    • 4 篇 生物医学工程(可授...
    • 3 篇 材料科学与工程(可...
    • 3 篇 化学工程与技术
    • 3 篇 航空宇航科学与技...
    • 3 篇 安全科学与工程
  • 118 篇 理学
    • 98 篇 数学
    • 32 篇 系统科学
    • 22 篇 统计学(可授理学、...
    • 10 篇 生物学
    • 8 篇 物理学
    • 4 篇 化学
  • 66 篇 管理学
    • 63 篇 管理科学与工程(可...
    • 14 篇 工商管理
    • 5 篇 图书情报与档案管...
  • 5 篇 经济学
    • 4 篇 应用经济学
  • 3 篇 法学
    • 3 篇 社会学
  • 2 篇 医学
  • 1 篇 教育学

主题

  • 313 篇 reinforcement le...
  • 216 篇 dynamic programm...
  • 206 篇 optimal control
  • 107 篇 adaptive dynamic...
  • 104 篇 adaptive dynamic...
  • 97 篇 learning
  • 88 篇 neural networks
  • 78 篇 heuristic algori...
  • 68 篇 reinforcement le...
  • 58 篇 learning (artifi...
  • 54 篇 nonlinear system...
  • 53 篇 convergence
  • 51 篇 control systems
  • 51 篇 mathematical mod...
  • 48 篇 approximate dyna...
  • 44 篇 approximation al...
  • 43 篇 equations
  • 42 篇 adaptive control
  • 41 篇 artificial neura...
  • 41 篇 cost function

机构

  • 41 篇 chinese acad sci...
  • 27 篇 univ rhode isl d...
  • 17 篇 tianjin univ sch...
  • 16 篇 univ sci & techn...
  • 16 篇 univ illinois de...
  • 15 篇 northeastern uni...
  • 14 篇 beijing normal u...
  • 13 篇 northeastern uni...
  • 13 篇 guangdong univ t...
  • 12 篇 northeastern uni...
  • 9 篇 natl univ def te...
  • 8 篇 ieee
  • 8 篇 univ chinese aca...
  • 7 篇 univ chinese aca...
  • 7 篇 cent south univ ...
  • 7 篇 southern univ sc...
  • 7 篇 beijing univ tec...
  • 6 篇 chinese acad sci...
  • 6 篇 missouri univ sc...
  • 5 篇 nanjing univ pos...

作者

  • 54 篇 liu derong
  • 37 篇 wei qinglai
  • 29 篇 he haibo
  • 22 篇 wang ding
  • 21 篇 xu xin
  • 19 篇 jiang zhong-ping
  • 17 篇 lewis frank l.
  • 17 篇 yang xiong
  • 17 篇 zhang huaguang
  • 17 篇 ni zhen
  • 16 篇 zhao bo
  • 15 篇 gao weinan
  • 14 篇 zhao dongbin
  • 13 篇 derong liu
  • 13 篇 zhong xiangnan
  • 12 篇 si jennie
  • 10 篇 jagannathan s.
  • 10 篇 dongbin zhao
  • 10 篇 song ruizhuo
  • 9 篇 abouheaf mohamme...

语言

  • 992 篇 英文
  • 25 篇 其他
  • 6 篇 中文
检索条件"任意字段=IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning"
1023 条 记 录,以下是381-390 订阅
reinforcement learning for adaptive Caching With dynamic Storage Pricing
收藏 引用
ieee JOURNAL ON SELECTED AREAS IN COMMUNICATIONS 2019年 第10期37卷 2267-2281页
作者: Sadeghi, Alireza Sheikholeslami, Fatemeh Marques, Antonio G. Giannakis, Georgios B. Univ Minnesota Digital Technol Ctr Minneapolis MN 55455 USA Univ Minnesota Dept Elect & Comp Engn Minneapolis MN 55455 USA King Juan Carlos Univ Dept Signal Theory & Commun Madrid 28943 Spain
Small base stations (SBs) of fifth-generation (SG) cellular networks are envisioned to have storage devices to locally serve requests for reusable and popular contents by caching them at the edge of the network, close... 详细信息
来源: 评论
Power Control for Wireless VBR Video Streaming: From Optimization to reinforcement learning
收藏 引用
ieee TRANSACTIONS ON COMMUNICATIONS 2019年 第8期67卷 5629-5644页
作者: Ye, Chuang Gursoy, M. Cenk Velipasalar, Senem Syracuse Univ Dept Elect Engn & Comp Sci Syracuse NY 13244 USA
In this paper, we investigate the problem of power control for streaming variable bit rate (VBR) videos over wireless links. A system model involving a transmitter (e.g., a base station) that sends VBR video data to a... 详细信息
来源: 评论
Output Feedback Q-learning Control for the Discrete-Time Linear Quadratic Regulator Problem
收藏 引用
ieee TRANSACTIONS ON NEURAL NETWORKS AND learning SYSTEMS 2019年 第5期30卷 1523-1536页
作者: Rizvi, Syed Ali Asad Lin, Zongli Univ Virginia Charles L Brown Dept Elect & Comp Engn Charlottesville VA 22904 USA
Approximate dynamic programming (ADP) and reinforcement learning (RL) have emerged as important tools in the design of optimal and adaptive control systems. Most of the existing RL and ADP methods make use of full-sta... 详细信息
来源: 评论
Novel Scheme for Congestion Control in Cellular Networks Using Deep reinforcement learning and Markov Decision Process Models
Novel Scheme for Congestion Control in Cellular Networks Usi...
收藏 引用
2020 International Conference in Mathematics, Computer Engineering and Computer Science, ICMCECS 2020
作者: Arinze, Uchechukwu Bakpo, Francis Eneh, Agozie Longe, Olumide Department of Computer Science Enugu Nigeria Department of Information Systems Adamawa Yola Nigeria
This research deals with the general issue of quality of service (QoS) provisioning and resource utilization in telecommunication networks. The issue requires that mobile network income be optimized while simultaneous... 详细信息
来源: 评论
UCT-ADP Progressive Bias Algorithm for Solving Gomoku
UCT-ADP Progressive Bias Algorithm for Solving Gomoku
收藏 引用
ieee symposium Series on Computational Intelligence (SSCI)
作者: Cao, Xu Lin, Yanghao Fudan Univ Sch Data Sci Shanghai Peoples R China
We combine adaptive dynamic programming (ADP), a reinforcement learning method and UCB applied to trees (UCT) algorithm with a more powerful heuristic function based on Progressive Bias method and two pruning strategi... 详细信息
来源: 评论
Output Feedback H∞ Control for Linear Discrete-Time Multi-Player Systems With Multi-Source Disturbances Using Off-Policy Q-learning
收藏 引用
ieee ACCESS 2020年 8卷 208938-208951页
作者: Xiao, Zhenfei Li, Jinna Li, Ping Liaoning Shihua Univ Sch Informat & Control Engn Fushun 113001 Liaoning Peoples R China
In this paper, a data-driven optimal control method based on adaptive dynamic programming and game theory is presented for solving the output feedback solutions of the H-infinity control problem for linear discrete-ti... 详细信息
来源: 评论
Approximate Nash Solutions for Multiplayer Mixed-Zero-Sum Game With reinforcement learning
收藏 引用
ieee TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS 2019年 第12期49卷 2739-2750页
作者: Lv, Yongfeng Ren, Xuemei Beijing Inst Technol Sch Automat Beijing 100081 Peoples R China
Inspired by Nash game theory, a multiplayer mixed-zero-sum (MZS) nonlinear game considering both two situations [zero-sum and nonzero-sum (NZS) Nash games] is proposed in this paper. A synchronous reinforcement learni... 详细信息
来源: 评论
Fault Tolerant Tracking Control Through Particle Swarm Optimization Based Policy Iteration  35
Fault Tolerant Tracking Control Through Particle Swarm Optim...
收藏 引用
35th Youth Academic Annual Conference of Chinese-Association-of-Automation (YAC)
作者: Liu, Xi Liu, Derong Zhao, Bo Guangdong Univ Technol Sch Automat Guangzhou Peoples R China Beijing Normal Univ Sch Syst Sci Beijing Peoples R China
This paper focuses on fault tolerant tracking control (FTTC) problems for nonlinear systems with actuator failure. For fault-free system, the tracking control input is derived by the policy iteration. To deal with the... 详细信息
来源: 评论
Model-Free adaptive Control Approach Using Integral reinforcement learning  13
Model-Free Adaptive Control Approach Using Integral Reinforc...
收藏 引用
13th ieee International symposium on Robotic and Sensors Environments (ROSE)
作者: Abouheaf, Mohammed Gueaieb, Wail Univ Ottawa Sch Elect Engn & Comp Sci Ottawa ON Canada Aswan Univ Coll Energy Engn Aswan Egypt
Integral reinforcement learning control approaches with derivative weighting performance indices require full knowledge of dynamic models of the considered systems. These approaches do not provide straightforward solu... 详细信息
来源: 评论
An Online reinforcement learning Wing-Tracking Mechanism for Flexible Wing Aircraft  13
An Online Reinforcement Learning Wing-Tracking Mechanism for...
收藏 引用
13th ieee International symposium on Robotic and Sensors Environments (ROSE)
作者: Abouheaf, Mohammed Mailhot, Nathaniel Gueaieb, Wail Univ Ottawa Sch Elect Engn & Comp Sci Ottawa ON Canada Aswan Univ Coll Energy Engn Aswan Egypt Univ Ottawa Dept Mech Engn Ottawa ON Canada
Flexible wing aircraft are gaining an increasing interest due to their salient features, such as inexpensive market price, low-cost operation, in-flight robustness, multi-purpose use, and their ability to operate with... 详细信息
来源: 评论