咨询与建议

限定检索结果

文献类型

  • 52 篇 期刊文献
  • 24 篇 会议
  • 1 篇 学位论文

馆藏范围

  • 77 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 72 篇 工学
    • 41 篇 计算机科学与技术...
    • 30 篇 电气工程
    • 22 篇 控制科学与工程
    • 18 篇 信息与通信工程
    • 6 篇 交通运输工程
    • 5 篇 软件工程
    • 3 篇 机械工程
    • 3 篇 仪器科学与技术
    • 2 篇 测绘科学与技术
    • 1 篇 化学工程与技术
  • 18 篇 管理学
    • 14 篇 管理科学与工程(可...
    • 2 篇 工商管理
    • 2 篇 图书情报与档案管...
  • 12 篇 理学
    • 10 篇 数学
    • 6 篇 系统科学
    • 1 篇 地球物理学
    • 1 篇 统计学(可授理学、...
  • 3 篇 医学
    • 3 篇 临床医学
    • 1 篇 基础医学(可授医学...
  • 1 篇 经济学
    • 1 篇 理论经济学
    • 1 篇 应用经济学
  • 1 篇 教育学
    • 1 篇 教育学

主题

  • 77 篇 actor-critic alg...
  • 31 篇 reinforcement le...
  • 17 篇 deep reinforceme...
  • 5 篇 deep learning
  • 5 篇 reinforcement le...
  • 4 篇 input constraint...
  • 3 篇 dynamic path pla...
  • 3 篇 task analysis
  • 3 篇 transfer learnin...
  • 3 篇 differential gam...
  • 3 篇 reinforcement le...
  • 3 篇 trajectory
  • 3 篇 active hypothesi...
  • 3 篇 sequential sensi...
  • 3 篇 multi-agent rein...
  • 2 篇 vehicle dynamics
  • 2 篇 quickest state e...
  • 2 篇 sample efficienc...
  • 2 篇 nonzero-sum stoc...
  • 2 篇 industry 4.0

机构

  • 4 篇 syracuse univ de...
  • 4 篇 indian inst sci ...
  • 3 篇 menoufia univ fa...
  • 3 篇 school of contro...
  • 2 篇 lebanese amer un...
  • 2 篇 concordia univ m...
  • 2 篇 harokopio univ a...
  • 2 篇 huazhong univ sc...
  • 2 篇 lakehead univ th...
  • 2 篇 nile univ sesc r...
  • 2 篇 jilin univ key l...
  • 2 篇 univ elect sci &...
  • 2 篇 shenzhen univ co...
  • 2 篇 texas a&m univ d...
  • 2 篇 jilin univ coll ...
  • 1 篇 univ calif berke...
  • 1 篇 zhongguancun lab...
  • 1 篇 univ texas austi...
  • 1 篇 hanoi univ sci &...
  • 1 篇 texas a&m univ c...

作者

  • 3 篇 joseph geethu
  • 3 篇 chronis christos
  • 3 篇 shalaby raafat
  • 3 篇 bhatnagar shalab...
  • 3 篇 varlamis iraklis
  • 3 篇 varshney pramod ...
  • 3 篇 politi elena
  • 3 篇 gursoy m. cenk
  • 3 篇 mahmoud tarek a.
  • 3 篇 abo-zalam belal
  • 3 篇 dimitrakopoulos ...
  • 3 篇 el-hossainy moha...
  • 2 篇 wang bing-chang
  • 2 篇 assi chadi
  • 2 篇 wang yanzhi
  • 2 篇 zhang zhicai
  • 2 篇 lu shuai
  • 2 篇 qiu qinru
  • 2 篇 qu hong
  • 2 篇 parizs richard d...

语言

  • 77 篇 英文
检索条件"主题词=Actor-critic algorithm"
77 条 记 录,以下是31-40 订阅
排序:
An Online Q-Learning Method for Linear-Quadratic Nonzero-Sum Stochastic Differential Games with Completely Unknown Dynamics
收藏 引用
Journal of Systems Science & Complexity 2024年 第5期37卷 1907-1922页
作者: ZHANG Bao-Qiang WANG Bing-Chang CAO Ying School of Control Science and Engineering Shandong UniversityJinan 250000China
In this paper,the authors design a reinforcement learning algorithm to solve the adaptive linear-quadratic stochastic n-players non-zero sum differential game with completely unknown *** each player,a critic network i... 详细信息
来源: 评论
Adaptive fault-tolerant control for affine non-linear systems based on approximate dynamic programming
收藏 引用
IET CONTROL THEORY AND APPLICATIONS 2016年 第6期10卷 655-663页
作者: Fan, Quan-Yong Yang, Guang-Hong Northeastern Univ Coll Informat Sci & Engn Shenyang 110819 Liaoning Peoples R China Northeastern Univ State Key Lab Synthet Automat Proc Ind Shenyang 110819 Liaoning Peoples R China
This study investigates the fault-tolerant control problem for affine nonlinear systems with time-varying actuator gain and bias faults. In order to handle the actuator faults and guarantee the approximate optimal per... 详细信息
来源: 评论
Adaptive TTL-Based Caching for Content Delivery
收藏 引用
IEEE-ACM TRANSACTIONS ON NETWORKING 2018年 第3期26卷 1063-1077页
作者: Basu, Soumya Sundarrajan, Aditya Ghaderi, Javad Shakkottai, Sanjay Sitaraman, Ramesh Univ Texas Austin Dept Elect & Comp Engn Austin TX 78712 USA Univ Massachusetts Coll Informat & Comp Sci Amherst MA 01003 USA CUNY Dept Elect Engn New York NY 10027 USA
Content delivery networks (CDNs) cache and serve a majority of the user-requested content on the Internet. Designing caching algorithms that automatically adapt to the heterogeneity, burstiness, and non-stationary nat... 详细信息
来源: 评论
Adaptive Deep Reinforcement Learning for Efficient 3D Navigation of Autonomous Underwater Vehicles
收藏 引用
IEEE ACCESS 2024年 12卷 178209-178221页
作者: Politi, Elena Stefanidou, Artemis Chronis, Christos Dimitrakopoulos, George Varlamis, Iraklis Harokopio Univ Athens Dept Informat & Telemat Athens 17779 Greece
The exploration of the underwater environments has recently accelerated with the development of the Autonomous Underwater Vehicle (AUV). One of the key elements for enhancing the autonomy of AUVs navigation across var... 详细信息
来源: 评论
A Maximum Divergence Approach to Optimal Policy in Deep Reinforcement Learning
收藏 引用
IEEE TRANSACTIONS ON CYBERNETICS 2023年 第3期53卷 1499-1510页
作者: Yang, Zhiyou Qu, Hong Fu, Mingsheng Hu, Wang Zhao, Yongze Univ Elect Sci & Technol China Sch Comp Sci & Engn Chengdu 610054 Peoples R China
Model-free reinforcement learning algorithms based on entropy regularized have achieved good performance in control tasks. Those algorithms consider using the entropy-regularized term for the policy to learn a stochas... 详细信息
来源: 评论
Dynamic Navigation in Unconstrained Environments Using Reinforcement Learning algorithms
收藏 引用
IEEE ACCESS 2023年 11卷 117984-118001页
作者: Chronis, Christos Anagnostopoulos, Georgios Politi, Elena Dimitrakopoulos, George Varlamis, Iraklis Harokopio Univ Athens Dept Informat & Telemat Athens 17779 Greece
The potential for the use of drones in logistics and transportation is continuously growing, with multiple applications both in urban and rural environments. The safe navigation of drones in such environments is a maj... 详细信息
来源: 评论
An experimental study on the application of reinforcement learning in injection molding in the spirit of Industry 4.0
收藏 引用
APPLIED SOFT COMPUTING 2024年 167卷
作者: Parizs, Richard Dominik Torok, Daniel Budapest Univ Technol & Econ Dept Polymer Engn Fac Mech Engn Muegyet Rkp 3 H-1111 Budapest Hungary MTA BME Lendulet Lightweight Polymer Composites Re Muegyet Rkp 3 H-1111 Budapest Hungary
The use of reinforcement learning in the injection molding process is a little-researched area in the era of Industry 4.0. The use of a smart decision-making algorithm is necessary for such a complex production method... 详细信息
来源: 评论
Simultaneous locomotion and manipulation control of quadruped robots using reinforcement learning-based adaptive fractional-order sliding-mode control
收藏 引用
TRANSACTIONS OF THE INSTITUTE OF MEASUREMENT AND CONTROL 2023年 第13期45卷 2459-2476页
作者: Farid, Yousef Tarbiat Modares Univ Sch Elect & Comp Engn POB 14115-111 Tehran Iran
This paper investigates a model-free reinforcement learning-based approach that enables the quadruped robot to manipulate objects while maintaining its balance and dynamic stability during walking. At first, the dynam... 详细信息
来源: 评论
Multi-agent graphical games with input constraints:an online learning solution
收藏 引用
Control Theory and Technology 2020年 第2期18卷 148-159页
作者: Tianxiang WANG Bingchang WANG Yong LIANG School of Control Science and Engineering Shandong UniversityJinan Shandong 250061China
This paper studies an online iterative algorithm for solving discrete-time multi-agent dynamic graphical games with input *** order to obtain the optimal strategy of each agent,it is necessary to solve a set of couple... 详细信息
来源: 评论
An approximate dynamic programming method for the optimal control of Alkai-Surfactant-Polymer flooding
收藏 引用
JOURNAL OF PROCESS CONTROL 2018年 64卷 15-26页
作者: Ge, Yulei Li, Shurong Chan, Peng China Univ Petr East China Coll Informat & Control Engn Qingdao 266580 Peoples R China Beijing Univ Posts & Telecommun Automat Sch Beijing 100876 Peoples R China Jiangsu Automat Res Inst Lianyungang 222006 Peoples R China
Since the complexity, coupling, distributed parameter, etc. of alkali-surfactant-polymer (ASP) flooding, common optimization methods cannot acquire the optimal solutions well. This paper brings an optimal control meth... 详细信息
来源: 评论