文献详情 >Cooperative multi-agent actor-... 收藏

Cooperative multi-agent actor-critic control of traffic network flow based on edge computing

交通网络流动的合作多代理人 actorcritic 控制基于边计算

作者：Zhang, Yongnan Zhou, Yonghua Lu, Huapu Fujita, Hamido

作者机构：Beijing Jiaotong Univ Sch Elect & Informat Engn Beijing Peoples R China Tsinghua Univ Inst Transportat Engn Beijing Peoples R China Iwate Prefectural Univ Fac Software & Informat Sci Takizawa Iwate Japan Univ Granada Andalusian Res Inst Data Sci & Computat Intellige Granada Spain

出版物：《FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE》 (下代计算机系统)

年卷期：2021年第123卷

页面：128-141页

核心收录：

学科分类：08[工学] 0812[工学-计算机科学与技术（可授工学、理学学位）]

基　　金：Beijing Natural Science Foundation, China [L191017] National Natural Science Foundation of China Science and Technology Research Program of Beijing, China [Z161100001116093]

主　　题：Distributed deep reinforcement learning Edge computing Traffic network flow control Cooperative multi-agent actor-critic framework

摘要：Most of the existing traffic signal control strategies are hard to satisfy the real-time requirements of traffic big data analysis, knowledge reasoning and decision making for sophisticated traffic dynamics and heterogeneous intersection structures in the context of Internet of Vehicles (IoV). In this paper, we attempt to propose a cooperative multi-agent actor-critic (CMAC) deep reinforcement learning (DRL) approach with value decomposition based on edge computing architecture. The intuition behind CMAC is to decompose the global actor-critic learning tasks into several local actor-critic sub-problems with respect to each intersection. Each agent searches the local optimal decision by actor-critic network that takes the discrete state encoding about several consecutive frames of image-like traffic states as the inputs of the network. Among them, the green ratio output strategy considering multiple constraints is formulated in the output layer of the actor network, so that the continuous control of traffic signals using multi-agent DRL (MADRL) can be realized. Furthermore, a cooperative mechanism that considers contribution weight distributions of local agents to the global traffic pattern is proposed to coordinate multiple local agents to evolve toward global optimization. Especially, some parallel training tasks of CMAC with a large number of computing loads are deployed on the cloud side in the edge computing architecture to accelerate learning and reconstructing knowledge. The well-trained multi-agent model is downloaded from the cloud side into the edge side for real-time decision making of traffic network flow adaptive control. Simulation results with regard to a realistic traffic network demonstrate that the proposed CMAC approach under edge computing architecture outperforms the value-decomposition based multi-agent actor-critic (VMAC), independent multi-agent actor-critic (IMAC), and the fixed timing control (FTC) in terms of alleviating traffic congestion.

本地馆藏 | 借阅须知 | 我要预约

已订购，未入库

sda

目录详情 | 试阅读 |

读者评论与其他读者分享你的观点

学校读者

用户名:未登录

我的评分

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

看过本文的还看了

相关文献

该作者的其他文献

CADAL相关文献

Cooperative multi-agent actor-critic control of traffic network flow based on edge computing

读者评论与其他读者分享你的观点

请选择收藏分类：

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

看过本文的还看了

相关文献

该作者的其他文献

CADAL相关文献

Cooperative multi-agent actor-critic control of traffic network flow based on edge computing

读者评论 与其他读者分享你的观点

请选择收藏分类： 新增自定义分类 确定 取消

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

读者评论与其他读者分享你的观点

请选择收藏分类：