咨询与建议

限定检索结果

文献类型

  • 52 篇 期刊文献
  • 24 篇 会议
  • 1 篇 学位论文

馆藏范围

  • 77 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 72 篇 工学
    • 41 篇 计算机科学与技术...
    • 30 篇 电气工程
    • 22 篇 控制科学与工程
    • 18 篇 信息与通信工程
    • 6 篇 交通运输工程
    • 5 篇 软件工程
    • 3 篇 机械工程
    • 3 篇 仪器科学与技术
    • 2 篇 测绘科学与技术
    • 1 篇 化学工程与技术
  • 18 篇 管理学
    • 14 篇 管理科学与工程(可...
    • 2 篇 工商管理
    • 2 篇 图书情报与档案管...
  • 12 篇 理学
    • 10 篇 数学
    • 6 篇 系统科学
    • 1 篇 地球物理学
    • 1 篇 统计学(可授理学、...
  • 3 篇 医学
    • 3 篇 临床医学
    • 1 篇 基础医学(可授医学...
  • 1 篇 经济学
    • 1 篇 理论经济学
    • 1 篇 应用经济学
  • 1 篇 教育学
    • 1 篇 教育学

主题

  • 77 篇 actor-critic alg...
  • 31 篇 reinforcement le...
  • 17 篇 deep reinforceme...
  • 5 篇 deep learning
  • 5 篇 reinforcement le...
  • 4 篇 input constraint...
  • 3 篇 dynamic path pla...
  • 3 篇 task analysis
  • 3 篇 transfer learnin...
  • 3 篇 differential gam...
  • 3 篇 reinforcement le...
  • 3 篇 trajectory
  • 3 篇 active hypothesi...
  • 3 篇 sequential sensi...
  • 3 篇 multi-agent rein...
  • 2 篇 vehicle dynamics
  • 2 篇 quickest state e...
  • 2 篇 sample efficienc...
  • 2 篇 nonzero-sum stoc...
  • 2 篇 industry 4.0

机构

  • 4 篇 syracuse univ de...
  • 4 篇 indian inst sci ...
  • 3 篇 menoufia univ fa...
  • 3 篇 school of contro...
  • 2 篇 lebanese amer un...
  • 2 篇 concordia univ m...
  • 2 篇 harokopio univ a...
  • 2 篇 huazhong univ sc...
  • 2 篇 lakehead univ th...
  • 2 篇 nile univ sesc r...
  • 2 篇 jilin univ key l...
  • 2 篇 univ elect sci &...
  • 2 篇 shenzhen univ co...
  • 2 篇 texas a&m univ d...
  • 2 篇 jilin univ coll ...
  • 1 篇 univ calif berke...
  • 1 篇 zhongguancun lab...
  • 1 篇 univ texas austi...
  • 1 篇 hanoi univ sci &...
  • 1 篇 texas a&m univ c...

作者

  • 3 篇 joseph geethu
  • 3 篇 chronis christos
  • 3 篇 shalaby raafat
  • 3 篇 bhatnagar shalab...
  • 3 篇 varlamis iraklis
  • 3 篇 varshney pramod ...
  • 3 篇 politi elena
  • 3 篇 gursoy m. cenk
  • 3 篇 mahmoud tarek a.
  • 3 篇 abo-zalam belal
  • 3 篇 dimitrakopoulos ...
  • 3 篇 el-hossainy moha...
  • 2 篇 wang bing-chang
  • 2 篇 assi chadi
  • 2 篇 wang yanzhi
  • 2 篇 zhang zhicai
  • 2 篇 lu shuai
  • 2 篇 qiu qinru
  • 2 篇 qu hong
  • 2 篇 parizs richard d...

语言

  • 77 篇 英文
检索条件"主题词=Actor-critic Algorithm"
77 条 记 录,以下是11-20 订阅
排序:
Episodic Memory-Double actor-critic Twin Delayed Deep Deterministic Policy Gradient
收藏 引用
NEURAL NETWORKS 2025年 187卷 107286页
作者: Shu, Man Lu, Shuai Gong, Xiaoyu An, Daolong Li, Songlin Jilin Univ Key Lab Symbol Computat & Knowledge Engn Minist Educ Changchun 130012 Peoples R China Chinese Acad Sci Changchun Inst Opt Fine Mech & Phys Changchun 130033 Peoples R China Jilin Univ Coll Comp Sci & Technol Changchun 130012 Peoples R China Jilin Univ Coll Software Changchun 130012 Peoples R China
Existing deep reinforcement learning (DRL) algorithms suffer from the problem of low sample efficiency. Episodic memory allows DRL algorithms to remember and use past experiences with high return, thereby improving sa... 详细信息
来源: 评论
Intermittent Dynamic Event-Triggered Optimal Control for Networked Control Systems With Input Saturation
收藏 引用
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL 2025年 第6期35卷 1935-1949页
作者: Zhang, Cong Zhang, Xiaodan Xiao, Feng Wei, Bo North China Elect Power Univ State Key Lab Alternate Elect Power Syst Renewable Beijing Peoples R China North China Elect Power Univ Sch Control & Comp Engn Beijing Peoples R China
In this article, we explore an event-triggered optimal control problem for nonlinear networked control systems (NCSs) with input saturation and aperiodic intermittent control. First, a non-quadratic cost function with... 详细信息
来源: 评论
Event-triggered fractional-order fuzzy sliding mode control using online reinforcement learning for uncertain nonlinear systems: Practical validation
收藏 引用
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE 2025年 151卷
作者: Mahmoud, Tarek A. El-Hossainy, Mohammad Abo-Zalam, Belal Shalaby, Raafat Menoufia Univ Fac Elect Engn Dept Ind Elect & Control Engn Menoufia 32952 Egypt Nile Univ SESC Res Ctr Sch Engn & Appl Sci MECT Program Giza 12588 Egypt
In this paper, a novel event-triggered control strategy is proposed for uncertain nonlinear systems by developing a fractional-order fuzzy sliding mode controller based on a fractional-order actor-critic network. The ... 详细信息
来源: 评论
A constrained optimization perspective on actor-critic algorithms and application to network routing
收藏 引用
SYSTEMS & CONTROL LETTERS 2016年 92卷 46-51页
作者: Prashanth, L. A. Prasad, H. L. Bhatnagar, Shalabh Chandra, Prakash Univ Maryland Syst Res Inst College Pk MD 20742 USA Astrome Technol Pvt Ltd Bangalore Karnataka India Indian Inst Sci Dept Comp Sci & Automat Bangalore 560012 Karnataka India Indian Inst Sci Syst Sci & Automat Bangalore 560012 Karnataka India
We propose a novel actor-critic algorithm with guaranteed convergence to an optimal policy for a discounted reward Markov decision process. The actor incorporates a descent direction that is motivated by the solution ... 详细信息
来源: 评论
A Novel Model of Generative Automatic Text Summarization Based on BART
IAENG International Journal of Computer Science
收藏 引用
IAENG International Journal of Computer Science 2025年 第2期52卷 507-514页
作者: Wang, Yahui Chang, Qingxia Meng, Xuelei Foreign Languages School Lanzhou Jiaotong University Gansu Lanzhou730070 China Traffic and Transportation School Lanzhou Jiaotong University Gansu Lanzhou730070 China
To obtain useful information accurately and quickly from the massive text information is the most urgent need for people nowadays. The text automatic summarization technology summarizes and condenses the given source ... 详细信息
来源: 评论
Optimizing Non-Terrestrial Hybrid RF/FSO Links With Reinforcement Learning: Navigating Through Clouds
IEEE OPEN JOURNAL OF THE COMMUNICATIONS SOCIETY
收藏 引用
IEEE OPEN JOURNAL OF THE COMMUNICATIONS SOCIETY 2025年 6卷 793-806页
作者: Almohamad, Abdullateef Ibrahim, Mostafa Ekin, Sabit Hasna, Mazen Althunibat, Saud Qaraqe, Khalid Texas A&M Univ Coll Stn Dept Elect & Comp Engn College Stn TX 77843 USA Texas A&M Univ Coll Stn Dept Engn Technol & Ind Distribut College Stn TX 77843 USA Qatar Univ Elect Engn Dept Doha Qatar Al Hussein Bin Talal Univ Dept Commun Engn Maan Jordan Hamad Bin Khalifa Univ Coll Sci & Engn Doha Qatar
In the pursuit of ubiquitous broadband connectivity, there has been a significant shift towards the vertical expansion of communication networks into space, particularly through the exploitation of low Earth orbit (LE... 详细信息
来源: 评论
Graph attention, learning 2-opt algorithm for the traveling salesman problem
收藏 引用
COMPLEX & INTELLIGENT SYSTEMS 2025年 第1期11卷 1-21页
作者: Luo, Jia Heng, Herui Wu, Geng Ningbo Univ Technol Sch Econ & Management Ningbo 315211 Peoples R China Shanghai Maritime Univ Inst Logist Sci & Engn Shanghai 201306 Peoples R China
In recent years, deep graph neural networks (GNNs) have been used as solvers or helper functions for the traveling salesman problem (TSP), but they are usually used as encoders to generate static node representations ... 详细信息
来源: 评论
Energy-Efficient Content Fetching Strategies in Cache-Enabled D2D Networks via an actor-critic Reinforcement Learning Structure
收藏 引用
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY 2024年 第11期73卷 17485-17495页
作者: Yan, Ming Luo, Meiqi Chan, Chien Aun Gygax, Andre F. Li, Chunguo Chih-Lin, I Commun Univ China Sch Informat & Commun Engn Beijing 100024 Peoples R China Commun Univ China Key Lab Acoust Visual Technol & Intelligent Contro Beijing 100024 Peoples R China Univ Melbourne Dept Elect & Elect Engn Melbourne Vic 3010 Australia Univ Melbourne Fac Business & Econ Melbourne Vic 3010 Australia Southeast Univ Sch Informat Sci & Engn Nanjing 210096 Peoples R China China Mobile Res Inst Beijing 100053 Peoples R China
As one of the important complementary technologies of the fifth-generation (5G) wireless communication and beyond, mobile device-to-device (D2D) edge caching and computing can effectively reduce the pressure on backbo... 详细信息
来源: 评论
actor-critic-Based Optimal Tracking for Partially Unknown Nonlinear Discrete-Time Systems
收藏 引用
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2015年 第1期26卷 140-151页
作者: Kiumarsi, Bahare Lewis, Frank L. Univ Texas Arlington UTA Res Inst Ft Worth TX 76118 USA
This paper presents a partially model-free adaptive optimal control solution to the deterministic nonlinear discrete-time (DT) tracking control problem in the presence of input constraints. The tracking error dynamics... 详细信息
来源: 评论
TACT: A Transfer actor-critic Learning Framework for Energy Saving in Cellular Radio Access Networks
收藏 引用
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS 2014年 第4期13卷 2000-2011页
作者: Li, Rongpeng Zhao, Zhifeng Chen, Xianfu Palicot, Jacques Zhang, Honggang Zhejiang Univ Dept Informat Sci & Elect Engn Hangzhou 310027 Zhejiang Peoples R China Univ Europeenne Bretagne Rennes France Supelec F-35576 Cesson Sevigne France VTT Tech Res Ctr Finland FI-90571 Oulu Finland
Recent works have validated the possibility of improving energy efficiency in radio access networks (RANs), achieved by dynamically turning on/off some base stations (BSs). In this paper, we extend the research over B... 详细信息
来源: 评论