Search query: "Subject term = Multi-agent actor-critic algorithm"
3 records found
Multi-Agent Gradient-Based Off-Policy Actor-Critic Algorithm for Distributed Reinforcement Learning
INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2024, vol. 17, no. 1, pp. 1-18
Author: Ren, Jineng. Affiliations: Wenzhou Univ Chashan Univ Town Sch Comp Sci & Artificial Intelligence Wenzhou 325035 Zhejiang Peoples R China; Wenzhou Univ Chashan Univ Town Artificial Intelligence & Adv Mfg Inst Yongjia Wenzhou 325035 Zhejiang Peoples R China
This paper proposes a gradient-based multi-agent actor-critic algorithm for off-policy reinforcement learning using importance sampling. Our algorithm is incremental with full gradients, and its complexity per iterati...
Dynamic Spectrum Sharing Based on Federated Learning and Multi-Agent Actor-Critic Reinforcement Learning
19th IEEE International Wireless Communications and Mobile Computing (IEEE IWCMC)
Authors: Yang, Tongtong; Zhang, Wensheng; Bo, Yulian; Sun, Jian; Wang, Cheng-Xiang. Affiliations: Shandong Univ Shandong Prov Key Lab Wireless Commun Sch Informat Sci & Engn Qingdao 266237 Peoples R China; Southeast Univ Sch Informat Sci & Engn Natl Mobile Commun Res Lab Nanjing 210096 Peoples R China; Purple Mt Labs Nanjing 211111 Peoples R China
To improve spectrum efficiency in emergency communications, a dynamic spectrum sharing (DSS) scheme based on federated learning (FL) and deep reinforcement learning (DRL) is proposed. The operation model foll...
Dynamics of market making algorithms in dealer markets: Learning and tacit collusion
MATHEMATICAL FINANCE, 2024, vol. 34, no. 2, pp. 467-521
Authors: Cont, Rama; Xiong, Wei. Affiliation: Univ Oxford Math Inst Oxford England
The widespread use of market-making algorithms in electronic over-the-counter markets may give rise to unexpected effects resulting from the autonomous learning dynamics of these algorithms. In particular, the possibil...