arXiv

CHEAP TALK DISCOVERY AND UTILIZATION IN MULTI-AGENT REINFORCEMENT LEARNING

Authors: Lo, Yat Long; de Witt, Christian Schroeder; Sokota, Samuel; Foerster, Jakob; Whiteson, Shimon

Affiliations: University of Oxford, Dyson Robot Learning Lab, United Kingdom; FLAIR, University of Oxford, United Kingdom; Carnegie Mellon University, United States; University of Oxford, United Kingdom

Publication: arXiv

Year: 2023


Subject: Reinforcement learning

Abstract: By enabling agents to communicate, recent cooperative multi-agent reinforcement learning (MARL) methods have demonstrated better task performance and more coordinated behavior. Most existing approaches facilitate inter-agent communication by allowing agents to send messages to each other through free communication channels, i.e., cheap talk channels. Current methods require these channels to be constantly accessible and known to the agents a priori. In this work, we lift these requirements such that the agents must discover the cheap talk channels and learn how to use them. Hence, the problem has two main parts: cheap talk discovery (CTD) and cheap talk utilization (CTU). We introduce a novel conceptual framework for both parts and develop a new algorithm based on mutual information maximization that outperforms existing algorithms in CTD/CTU settings. We also release a novel benchmark suite to stimulate future research in CTD/CTU. © 2023, CC BY.
