咨询与建议

限定检索结果

文献类型

  • 53 篇 期刊文献
  • 24 篇 会议
  • 1 篇 学位论文

馆藏范围

  • 78 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 73 篇 工学
    • 41 篇 计算机科学与技术...
    • 31 篇 电气工程
    • 22 篇 控制科学与工程
    • 18 篇 信息与通信工程
    • 7 篇 交通运输工程
    • 5 篇 软件工程
    • 3 篇 机械工程
    • 3 篇 仪器科学与技术
    • 2 篇 测绘科学与技术
    • 1 篇 土木工程
    • 1 篇 化学工程与技术
  • 18 篇 管理学
    • 14 篇 管理科学与工程(可...
    • 2 篇 工商管理
    • 2 篇 图书情报与档案管...
  • 12 篇 理学
    • 10 篇 数学
    • 6 篇 系统科学
    • 1 篇 地球物理学
    • 1 篇 统计学(可授理学、...
  • 3 篇 医学
    • 3 篇 临床医学
    • 1 篇 基础医学(可授医学...
  • 1 篇 经济学
    • 1 篇 理论经济学
    • 1 篇 应用经济学
  • 1 篇 教育学
    • 1 篇 教育学

主题

  • 78 篇 actor-critic alg...
  • 31 篇 reinforcement le...
  • 18 篇 deep reinforceme...
  • 5 篇 deep learning
  • 5 篇 reinforcement le...
  • 4 篇 input constraint...
  • 3 篇 dynamic path pla...
  • 3 篇 task analysis
  • 3 篇 transfer learnin...
  • 3 篇 differential gam...
  • 3 篇 trajectory
  • 3 篇 active hypothesi...
  • 3 篇 sequential sensi...
  • 3 篇 multi-agent rein...
  • 2 篇 vehicle dynamics
  • 2 篇 quickest state e...
  • 2 篇 sample efficienc...
  • 2 篇 nonzero-sum stoc...
  • 2 篇 transformer
  • 2 篇 industry 4.0

机构

  • 4 篇 syracuse univ de...
  • 4 篇 indian inst sci ...
  • 3 篇 menoufia univ fa...
  • 3 篇 school of contro...
  • 2 篇 lebanese amer un...
  • 2 篇 concordia univ m...
  • 2 篇 harokopio univ a...
  • 2 篇 huazhong univ sc...
  • 2 篇 lakehead univ th...
  • 2 篇 nile univ sesc r...
  • 2 篇 jilin univ key l...
  • 2 篇 univ elect sci &...
  • 2 篇 shenzhen univ co...
  • 2 篇 texas a&m univ d...
  • 2 篇 jilin univ coll ...
  • 1 篇 univ calif berke...
  • 1 篇 zhongguancun lab...
  • 1 篇 univ texas austi...
  • 1 篇 hanoi univ sci &...
  • 1 篇 texas a&m univ c...

作者

  • 3 篇 joseph geethu
  • 3 篇 chronis christos
  • 3 篇 shalaby raafat
  • 3 篇 bhatnagar shalab...
  • 3 篇 varlamis iraklis
  • 3 篇 varshney pramod ...
  • 3 篇 politi elena
  • 3 篇 gursoy m. cenk
  • 3 篇 mahmoud tarek a.
  • 3 篇 abo-zalam belal
  • 3 篇 dimitrakopoulos ...
  • 3 篇 el-hossainy moha...
  • 2 篇 wang bing-chang
  • 2 篇 assi chadi
  • 2 篇 wang yanzhi
  • 2 篇 zhang zhicai
  • 2 篇 lu shuai
  • 2 篇 qiu qinru
  • 2 篇 qu hong
  • 2 篇 parizs richard d...

语言

  • 78 篇 英文
检索条件"主题词=Actor-Critic algorithm"
78 条 记 录,以下是71-80 订阅
排序:
Adaptive fault-tolerant control for affine non-linear systems based on approximate dynamic programming
收藏 引用
IET CONTROL THEORY AND APPLICATIONS 2016年 第6期10卷 655-663页
作者: Fan, Quan-Yong Yang, Guang-Hong Northeastern Univ Coll Informat Sci & Engn Shenyang 110819 Liaoning Peoples R China Northeastern Univ State Key Lab Synthet Automat Proc Ind Shenyang 110819 Liaoning Peoples R China
This study investigates the fault-tolerant control problem for affine nonlinear systems with time-varying actuator gain and bias faults. In order to handle the actuator faults and guarantee the approximate optimal per... 详细信息
来源: 评论
actor-critic-Based Optimal Tracking for Partially Unknown Nonlinear Discrete-Time Systems
收藏 引用
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2015年 第1期26卷 140-151页
作者: Kiumarsi, Bahare Lewis, Frank L. Univ Texas Arlington UTA Res Inst Ft Worth TX 76118 USA
This paper presents a partially model-free adaptive optimal control solution to the deterministic nonlinear discrete-time (DT) tracking control problem in the presence of input constraints. The tracking error dynamics... 详细信息
来源: 评论
An actor-critic algorithm with function approximation for discounted cost constrained Markov decision processes
收藏 引用
SYSTEMS & CONTROL LETTERS 2010年 第12期59卷 760-766页
作者: Bhatnagar, Shalabh Indian Inst Sci Dept Comp Sci & Automat Bangalore 560012 Karnataka India
We develop in this article the first actor-critic reinforcement learning algorithm with function approximation for a problem of control under multiple inequality constraints. We consider the infinite horizon discounte... 详细信息
来源: 评论
Transfer Reinforcement Learning Framework for Energy Saving in Next Generation Wireless Networks
Transfer Reinforcement Learning Framework for Energy Saving ...
收藏 引用
作者: Shreyata Sharma Indraprastha Institute of Information Technology Delhi
学位级别:硕士
Recent upsurge in data intensive applications over wireless communication networks is stimu- lating rapid expansion of such networks and thus presenting new research challenges pertaining to their efficient deployment... 详细信息
来源: 评论
A Transfer Learning Framework for Energy Efficient Wi-Fi Networks and Performance Analysis Using Real Data
A Transfer Learning Framework for Energy Efficient Wi-Fi Net...
收藏 引用
IEEE International Conference on Advanced Networks and Telecommuncations Systems
作者: Shreyata Sharma S. J. Darak Anand Srivastava Honggang Zhang Department of Electronics and Communication Engineering IIIT Delhi College of Information Science & Electronic Engineering (ISEE) Zhejiang University
In the recent past, there has been an exponential increase in data intensive services over the communication networks. This trend would sustain in future communication networks as well, especially in the Wi-Fi network... 详细信息
来源: 评论
TACT: A Transfer actor-critic Learning Framework for Energy Saving in Cellular Radio Access Networks
收藏 引用
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS 2014年 第4期13卷 2000-2011页
作者: Li, Rongpeng Zhao, Zhifeng Chen, Xianfu Palicot, Jacques Zhang, Honggang Zhejiang Univ Dept Informat Sci & Elect Engn Hangzhou 310027 Zhejiang Peoples R China Univ Europeenne Bretagne Rennes France Supelec F-35576 Cesson Sevigne France VTT Tech Res Ctr Finland FI-90571 Oulu Finland
Recent works have validated the possibility of improving energy efficiency in radio access networks (RANs), achieved by dynamically turning on/off some base stations (BSs). In this paper, we extend the research over B... 详细信息
来源: 评论
A Neuro-fuzzy Learning System for Adaptive Swarm Behaviors Dealing with Continuous State Space
收藏 引用
4th International Conference on Intelligent Computing
作者: Kuremoto, Takashi Obayashi, Masanao Kobayashi, Kunikazu Adachi, Hirotaka Yoneda, Kentaro Yamaguchi Univ Grad Sch Sci & Engn Tokiwadai 2-16-1 Yamaguchi 7558611 Japan Fac Sci & Engn Yamaguchi Japan
Swarm intelligence has brought a new paradise for function optimization, structural optimization, multi-agent systems and other study fields. In our previous work, we proposed a neuro-fuzzy system using reinforcement ... 详细信息
来源: 评论
The actor-critic algorithm as multi-time-scale stochastic approximation
收藏 引用
SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES 1997年 第4期22卷 525-543页
作者: Borkar, VS Konda, VR Indian Inst Sci Dept Comp Sci & Automat Bangalore 560012 Karnataka India
The actor-critic algorithm of Barto and others for simulation-based optimization of Markov decision processes is cast as a two time Scale stochastic approximation. Convergence analysis, approximation issues and an exa... 详细信息
来源: 评论