咨询与建议

限定检索结果

文献类型

  • 53 篇 期刊文献
  • 24 篇 会议
  • 1 篇 学位论文

馆藏范围

  • 78 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 73 篇 工学
    • 41 篇 计算机科学与技术...
    • 31 篇 电气工程
    • 22 篇 控制科学与工程
    • 18 篇 信息与通信工程
    • 7 篇 交通运输工程
    • 5 篇 软件工程
    • 3 篇 机械工程
    • 3 篇 仪器科学与技术
    • 2 篇 测绘科学与技术
    • 1 篇 土木工程
    • 1 篇 化学工程与技术
  • 18 篇 管理学
    • 14 篇 管理科学与工程(可...
    • 2 篇 工商管理
    • 2 篇 图书情报与档案管...
  • 12 篇 理学
    • 10 篇 数学
    • 6 篇 系统科学
    • 1 篇 地球物理学
    • 1 篇 统计学(可授理学、...
  • 3 篇 医学
    • 3 篇 临床医学
    • 1 篇 基础医学(可授医学...
  • 1 篇 经济学
    • 1 篇 理论经济学
    • 1 篇 应用经济学
  • 1 篇 教育学
    • 1 篇 教育学

主题

  • 78 篇 actor-critic alg...
  • 31 篇 reinforcement le...
  • 18 篇 deep reinforceme...
  • 5 篇 deep learning
  • 5 篇 reinforcement le...
  • 4 篇 input constraint...
  • 3 篇 dynamic path pla...
  • 3 篇 task analysis
  • 3 篇 transfer learnin...
  • 3 篇 differential gam...
  • 3 篇 trajectory
  • 3 篇 active hypothesi...
  • 3 篇 sequential sensi...
  • 3 篇 multi-agent rein...
  • 2 篇 vehicle dynamics
  • 2 篇 quickest state e...
  • 2 篇 sample efficienc...
  • 2 篇 nonzero-sum stoc...
  • 2 篇 transformer
  • 2 篇 industry 4.0

机构

  • 4 篇 syracuse univ de...
  • 4 篇 indian inst sci ...
  • 3 篇 menoufia univ fa...
  • 3 篇 school of contro...
  • 2 篇 lebanese amer un...
  • 2 篇 concordia univ m...
  • 2 篇 harokopio univ a...
  • 2 篇 huazhong univ sc...
  • 2 篇 lakehead univ th...
  • 2 篇 nile univ sesc r...
  • 2 篇 jilin univ key l...
  • 2 篇 univ elect sci &...
  • 2 篇 shenzhen univ co...
  • 2 篇 texas a&m univ d...
  • 2 篇 jilin univ coll ...
  • 1 篇 univ calif berke...
  • 1 篇 zhongguancun lab...
  • 1 篇 univ texas austi...
  • 1 篇 hanoi univ sci &...
  • 1 篇 texas a&m univ c...

作者

  • 3 篇 joseph geethu
  • 3 篇 chronis christos
  • 3 篇 shalaby raafat
  • 3 篇 bhatnagar shalab...
  • 3 篇 varlamis iraklis
  • 3 篇 varshney pramod ...
  • 3 篇 politi elena
  • 3 篇 gursoy m. cenk
  • 3 篇 mahmoud tarek a.
  • 3 篇 abo-zalam belal
  • 3 篇 dimitrakopoulos ...
  • 3 篇 el-hossainy moha...
  • 2 篇 wang bing-chang
  • 2 篇 assi chadi
  • 2 篇 wang yanzhi
  • 2 篇 zhang zhicai
  • 2 篇 lu shuai
  • 2 篇 qiu qinru
  • 2 篇 qu hong
  • 2 篇 parizs richard d...

语言

  • 78 篇 英文
检索条件"主题词=Actor-Critic algorithm"
78 条 记 录,以下是41-50 订阅
排序:
Resilient Multi-agent Reinforcement Learning Using Medoid and Soft-medoid Based Aggregation
Resilient Multi-agent Reinforcement Learning Using Medoid an...
收藏 引用
IEEE International Conference on Assured Autonomy (ICAA)
作者: Bhowmick, Chandreyee Shabbir, Mudassir Abbas, Waseem Koutsoukos, Xenofon Vanderbilt Univ Inst Software Integrated Syst 221 Kirkland Hall Nashville TN 37235 USA Univ Texas Dallas Dept Syst Engn Richardson TX USA
A network of reinforcement learning (RL) agents that cooperate with each other by sharing information can improve learning performance of control and coordination tasks when compared to non-cooperative agents. However... 详细信息
来源: 评论
An online Q-learning design for stochastic differential LQ game with completely unknown dynamics  41
An online Q-learning design for stochastic differential LQ g...
收藏 引用
第41届中国控制会议
作者: Baoqiang Zhang Bingchang Wang School of Control Science and Engineering Shandong University Shandong University
In this paper,we design a reinforcement learning algorithm to solve the adaptive optimal control problem of linear quadratic stochastic non-zero sum differential game with n-players and completely unknown *** is diffi... 详细信息
来源: 评论
Reinforcement Learning Based Dynamic Resource Allocation for Massive MTC in Sliced Mobile Networks  14
Reinforcement Learning Based Dynamic Resource Allocation for...
收藏 引用
14th IEEE International Conference on Advanced Infocomm Technology (ICAIT)
作者: Yang, Bei Xu, Yiqian She, Xiaoming Zhu, Jianchi Wei, Fengsheng Cheri, Peng Wang, Jianxiu China Telecom Res Inst Beijing 102209 Peoples R China Univ Elect Sci & Technol China Chengdu 611731 Peoples R China
With the rapid development of the Internet of Things (IoT) systems, the low latency requirement of massive Machine Type Communication ( mMTC) in the IoT is an urgent problem to be solved for future mobile communicatio... 详细信息
来源: 评论
Leveraging UAVs for Coverage in Cell-Free Vehicular Networks: A Deep Reinforcement Learning Approach
收藏 引用
IEEE TRANSACTIONS ON MOBILE COMPUTING 2021年 第9期20卷 2835-2847页
作者: Samir, Moataz Ebrahimi, Dariush Assi, Chadi Sharafeddine, Sanaa Ghrayeb, Ali Concordia Univ Montreal PQ H3G 1M8 Canada Lakehead Univ Thunder Bay ON P7B 5E1 Canada Lebanese Amer Univ Beirut 11022801 Lebanon Texas A&M Univ Doha 23874 Qatar
The success in transitioning towards smart cities relies on the availability of information and communication technologies that meet the demands of this transformation. The terrestrial infrastructure presents itself a... 详细信息
来源: 评论
TASAC: A twin-actor reinforcement learning framework with a stochastic with an to batch control
收藏 引用
CONTROL ENGINEERING PRACTICE 2023年 第1期134卷
作者: Joshi, Tanuja Kodamana, Hariprasad Kandath, Harikumar Kaisare, Niket Indian Inst Technol Delhi Dept Chem Engn New Delhi 110016 India Indian Inst Technol Delhi Yardi Sch Artificial Intelligence New Delhi 110016 India Int Inst Informat Technol Hyderabad Hyderabad 500032 India Indian Inst Technol Madras Dept Chem Engn Chennai 600036 India
Due to their complex nonlinear dynamics and batch-to-batch variability, batch processes pose a challenge for process control. Due to the absence of accurate models and resulting plant-model mismatch, these problems be... 详细信息
来源: 评论
actor-critic Based Graphical Games for Discrete-time Linear Systems with Input Constraints  39
Actor-critic Based Graphical Games for Discrete-time Linear ...
收藏 引用
39th Chinese Control Conference (CCC)
作者: Wang, Tian-Xiang Liang, Yong Wang, Bing-Chang Shandong Univ Sch Control Sci & Engn Jinan Shandong Peoples R China
In dynamic graphical games, in order to obtain the optimal strategy for each agent, the traditional method is to solve a set of coupled HJB equations. It is very difficult to solve such problems by traditional methods... 详细信息
来源: 评论
How to use prior knowledge for injection molding in industry 4.0
收藏 引用
RESULTS IN ENGINEERING 2024年 23卷
作者: Parizs, Richard Dominik Torok, Daniel Budapest Univ Technol & Econ Fac Mech Engn Dept Polymer Engn Muegyetem Rkp 3 H-1111 Budapest Hungary MTA BME Lendulet Lightweight Polymer Composites Re Muegyetem Rkp 3 H-1111 Budapest Hungary
Searching for the optimal injection molding settings for a new product usually requires much time and money. This article proposes a new method that uses reinforcement learning with prior knowledge for the optimizatio... 详细信息
来源: 评论
Drone Elevation Control Based on Python-Unity Integrated Framework for Reinforcement Learning Applications
收藏 引用
DRONES 2023年 第4期7卷 225-225页
作者: Abbass, Mahmoud Abdelkader Bashery Kang, Hyun-Soo Chungbuk Natl Univ Sch Elect & Comp Engn Dept Informat & Commun Engn Cheongju 28644 South Korea Helwan Univ Dept Mech Power Engn Cairo 11772 Egypt
Reinforcement learning (RL) applications require a huge effort to become established in real-world environments, due to the injury and break down risks during interactions between the RL agent and the environment, in ... 详细信息
来源: 评论
Manipulator Motion Planning based on actor-critic Reinforcement Learning
Manipulator Motion Planning based on Actor-Critic Reinforcem...
收藏 引用
第40届中国控制会议
作者: Qiang Li Jun Nie Haixia Wang Xiao Lu Shibin Song College of Electrical Engineering and Automation Shandong University of Science and Technology
The manipulator control model has the characteristics of high-order,nonlinear,multivariable and strong coupling,which makes it difficult for the manipulator to have good adaptability and *** at the problem of poor reu... 详细信息
来源: 评论
A Scalable algorithm for Anomaly Detection via Learning-Based Controlled Sensing
A Scalable Algorithm for Anomaly Detection via Learning-Base...
收藏 引用
IEEE International Conference on Communications (ICC)
作者: Joseph, Geethu Gursoy, M. Cenk Varshney, Pramod K. Syracuse Univ Dept Elect Engn & Comp Sci Syracuse NY 13244 USA
We address the problem of sequentially selecting and observing processes from a given set to find the anomalies among them. The decision maker observes one process at a time and obtains a noisy binary indicator of whe... 详细信息
来源: 评论