Refine search results

Document type

  • 52 journal articles
  • 24 conference papers
  • 1 thesis

Collection scope

  • 77 electronic documents
  • 0 print holdings

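Subject classification

  • 72 papers: Engineering
    • 41 papers: Computer Science and Technology...
    • 30 papers: Electrical Engineering
    • 22 papers: Control Science and Engineering
    • 18 papers: Information and Communication Engineering
    • 6 papers: Transportation Engineering
    • 5 papers: Software Engineering
    • 3 papers: Mechanical Engineering
    • 3 papers: Instrument Science and Technology
    • 2 papers: Surveying and Mapping Science and Technology
    • 1 paper: Chemical Engineering and Technology
  • 18 papers: Management
    • 14 papers: Management Science and Engineering (may...
    • 2 papers: Business Administration
    • 2 papers: Library, Information and Archives Mana...
  • 12 papers: Science
    • 10 papers: Mathematics
    • 6 papers: Systems Science
    • 1 paper: Geophysics
    • 1 paper: Statistics (may confer Science, ...
  • 3 papers: Medicine
    • 3 papers: Clinical Medicine
    • 1 paper: Basic Medicine (may confer Medicine...
  • 1 paper: Economics
    • 1 paper: Theoretical Economics
    • 1 paper: Applied Economics
  • 1 paper: Education
    • 1 paper: Education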

Topics

  • 77 papers: actor-critic alg...
  • 31 papers: reinforcement le...
  • 17 papers: deep reinforceme...
  • 5 papers: deep learning
  • 5 papers: reinforcement le...
  • 4 papers: input constraint...
  • 3 papers: dynamic path pla...
  • 3 papers: task analysis
  • 3 papers: transfer learnin...
  • 3 papers: differential gam...
  • 3 papers: reinforcement le...
  • 3 papers: trajectory
  • 3 papers: active hypothesi...
  • 3 papers: sequential sensi...
  • 3 papers: multi-agent rein...
  • 2 papers: vehicle dynamics
  • 2 papers: quickest state e...
  • 2 papers: sample efficienc...
  • 2 papers: nonzero-sum stoc...
  • 2 papers: industry 4.0

Institutions

  • 4 papers: syracuse univ de...
  • 4 papers: indian inst sci ...
  • 3 papers: menoufia univ fa...
  • 3 papers: school of contro...
  • 2 papers: lebanese amer un...
  • 2 papers: concordia univ m...
  • 2 papers: harokopio univ a...
  • 2 papers: huazhong univ sc...
  • 2 papers: lakehead univ th...
  • 2 papers: nile univ sesc r...
  • 2 papers: jilin univ key l...
  • 2 papers: univ elect sci &...
  • 2 papers: shenzhen univ co...
  • 2 papers: texas a&m univ d...
  • 2 papers: jilin univ coll ...
  • 1 paper: univ calif berke...
  • 1 paper: zhongguancun lab...
  • 1 paper: univ texas austi...
  • 1 paper: hanoi univ sci &...
  • 1 paper: texas a&m univ c...

Authors

  • 3 papers: joseph geethu
  • 3 papers: chronis christos
  • 3 papers: shalaby raafat
  • 3 papers: bhatnagar shalab...
  • 3 papers: varlamis iraklis
  • 3 papers: varshney pramod ...
  • 3 papers: politi elena
  • 3 papers: gursoy m. cenk
  • 3 papers: mahmoud tarek a.
  • 3 papers: abo-zalam belal
  • 3 papers: dimitrakopoulos ...
  • 3 papers: el-hossainy moha...
  • 2 papers: wang bing-chang
  • 2 papers: assi chadi
  • 2 papers: wang yanzhi
  • 2 papers: zhang zhicai
  • 2 papers: lu shuai
  • 2 papers: qiu qinru
  • 2 papers: qu hong
  • 2 papers: parizs richard d...

Language

  • 77 papers: English
Search criteria: "Subject term = Actor-critic algorithm"
77 records; showing 21-30
Proactive Content Caching Based on Actor-Critic Reinforcement Learning for Mobile Edge Networks
IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2022, vol. 8, no. 2, pp. 1239-1252
Authors: Jiang, Wei; Feng, Daquan; Sun, Yao; Feng, Gang; Wang, Zhenzhong; Xia, Xiang-Gen
Affiliations: Shenzhen Univ Guangdong Prov Engn Lab Digital Creat Technol Shenzhen Key Lab Digital Creat Technol Coll Elect & Informat Engn Guangdong Key Lab Inte Shenzhen 518060 Peoples R China; Univ Glasgow James Watt Sch Engn Glasgow G12 8QQ Lanark Scotland; Univ Elect Sci & Technol China Yangtze Delta Reg Inst Huzhou Huzhou 313001 Scotland; Univ Elect Sci & Technol China Natl Key Lab Sci & Technol Commun Chengdu 611731 Peoples R China; Tech Management Ctr China Media Grp Beijing 100020 Peoples R China; Univ Delaware Dept Elect & Comp Engn Newark DE 19716 USA
Mobile edge caching/computing (MEC) has emerged as a promising approach for addressing the drastic increasing mobile data traffic by bringing high caching and computing capabilities to the edge of networks. Under MEC ...

Online Actor-Critic Reinforcement Learning Control for Uncertain Surface Vessel Systems with External Disturbances
INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2022, vol. 20, no. 3, pp. 1029-1040
Authors: Vu, Van Tu; Tran, Quang Huy; Pham, Thanh Loc; Dao, Phuong Nam
Affiliations: Hai Phong Univ Hai Phong Vietnam; Natl Cheng Kung Univ NCKU Dept Mech Engn Tainan Taiwan; Hanoi Univ Sci & Technol Sch Elect Engn 01 Dai Co Viet Hanoi Vietnam
This article addresses a trajectory tracking control approach for an uncertain surface vessel using the new cascade structure of adaptive reinforcement learning (ARL) algorithm and kinematic controller, feed-forward t...

Improving Exploration in Actor-Critic With Weakly Pessimistic Value Estimation and Optimistic Policy Optimization
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, vol. 35, no. 7, pp. 8783-8796
Authors: Li, Fan; Fu, Mingsheng; Chen, Wenyu; Zhang, Fan; Zhang, Haixian; Qu, Hong; Yi, Zhang
Affiliations: Univ Elect Sci & Technol China Sch Comp Sci & Engn Chengdu 611731 Peoples R China; Sichuan Univ Sch Comp Sci Chengdu 610065 Peoples R China
Deep off-policy actor-critic algorithms have been successfully applied to challenging tasks in continuous control. However, these methods typically suffer from the poor sample efficiency problem, limiting their widesp...

An actor-critic reinforcement learning-based resource management in mobile edge computing systems
INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2020, vol. 11, no. 8, pp. 1875-1889
Authors: Fu, Fang; Zhang, Zhicai; Yu, Fei Richard; Yan, Qiao
Affiliations: Shanxi Univ Sch Phys & Elect Taiyuan Peoples R China; Carleton Univ Coll Syst & Comp Engn Ottawa ON Canada; Shenzhen Univ Coll Comp Sci & Software Engn Shenzhen Peoples R China
Reinforcement learning (RL) as an effective tool has attracted great attention in wireless communication field nowadays. In this paper, we investigate the offloading decision and resource allocation problem in mobile ...

QoS Aware Transcoding for Live Streaming in Edge-Clouds Aided HetNets: An Enhanced Actor-Critic Approach
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2019, vol. 68, no. 11, pp. 11295-11308
Authors: Zhang, Zhicai; Wang, Ru; Yu, F. Richard; Fu, Fang; Yan, Qiao
Affiliations: Shanxi Univ Sch Phys & Elect Engn Taiyuan 030006 Shanxi Peoples R China; Carleton Univ Dept Syst & Comp Engn Ottawa ON K1S 5B6 Canada; Shenzhen Univ Coll Comp Sci & Software Engn Shenzhen 518060 Guangdong Peoples R China
With the advances in hand-held devices (smartphones and tablets, etc.) and high speed wireless networks, users have an explosive growth demand for live streaming service. Due to the diversity of user equipments (UEs),...

Fault Diagnosis for Gas Turbine Rotor Using Actor-Critic Network
International Conference of The Efficiency and Performance Engineering Network (TEPEN)
Authors: Cui, Yingjie; Wang, Hongjun
Affiliations: Beijing Informat Sci & Technol Univ Sch Mech & Elect Engn Beijing 100192 Peoples R China; Beijing Int Sci Cooperat Base High End Equipment Beijing 100192 Peoples R China; Minist Educ Key Lab Modern Measurement & Control Technol Beijing 100192 Peoples R China
As a key component of gas turbine, gas turbine rotor often operates under high speed and variable working conditions, which is extremely prone to failure. Aiming at the problem of low fault diagnosis accuracy of gas t...

Manipulator Motion Planning based on Actor-Critic Reinforcement Learning
40th Chinese Control Conference (CCC)
Authors: Li, Qiang; Nie, Jun; Wang, Haixia; Lu, Xiao; Song, Shibin
Affiliations: Shandong Univ Sci & Technol Coll Elect Engn & Automat Qingdao 266590 Peoples R China
The manipulator control model has the characteristics of high-order, nonlinear, multivariable and strong coupling, which makes it difficult for the manipulator to have good adaptability and autonomy. Aiming at the pro...

A Fast Decentralized Scheduling Method of Cooperative Localization Based on Actor-Critic Deep Reinforcement
3rd International Conference on Information Communication and Software Engineering (ICICSE)
Authors: Di, Xinyue; Guan, Yalin; Yu, Weijia; Lin, Heyun
Affiliations: Commun Univ China Beijing Peoples R China; Guangxi Power Grid Dispatching Control Ctr Nanning Peoples R China
With the emergence of more and more automated vehicles, localization of vehicles has attracted a lot of attention. Among multiple localization methods, cooperative localization is very attractive due to its high cover...

Actor-critic Based Graphical Games for Discrete-time Linear Systems with Input Constraints
39th Chinese Control Conference (CCC)
Authors: Wang, Tian-Xiang; Liang, Yong; Wang, Bing-Chang
Affiliations: Shandong Univ Sch Control Sci & Engn Jinan Shandong Peoples R China
In dynamic graphical games, in order to obtain the optimal strategy for each agent, the traditional method is to solve a set of coupled HJB equations. It is very difficult to solve such problems by traditional methods...

Temporal Detection of Anomalies via Actor-Critic Based Controlled Sensing
IEEE Global Communications Conference (GLOBECOM)
Authors: Joseph, Geethu; Gursoy, M. Cenk; Varshney, Pramod K.
Affiliations: Syracuse Univ Dept Elect Engn & Comp Sci Syracuse NY 13244 USA
We address the problem of monitoring a set of binary stochastic processes and generating an alert when the number of anomalies among them exceeds a threshold. For this, the decision-maker selects and probes a subset o...