ISBN (Print): 9798400704864
Existing deep reinforcement learning (DRL) methods for multi-objective vehicle routing problems (MOVRPs) typically decompose an MOVRP into subproblems with respective preferences and then train policies to solve the corresponding subproblems. However, this paradigm remains less effective at handling the intricate interactions among subproblems, which holds back the quality of the resulting Pareto solutions. To counteract this limitation, we introduce a collaborative deep reinforcement learning method. We first propose a preference-based attention network (PAN) that allows the DRL agents to reason out solutions to subproblems in parallel, where a shared encoder learns the instance embedding and a decoder is tailored for each agent by preference intervention to construct its respective solution. We then design a collaborative active search (CAS) to further improve solution quality, which updates only a part of the decoder parameters per instance during inference. In the CAS process, we also explicitly foster interactions among neighboring DRL agents by imitation learning, empowering them to exchange insights about elite solutions to similar subproblems. Extensive results on random and benchmark instances verify the efficacy of PAN and CAS, which is particularly pronounced on configurations (i.e., problem sizes or node distributions) beyond those seen during training. Our code is available at https://***/marmotlab/PAN-CAS.
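The abstract describes the architecture only at a high level. As a minimal, hypothetical sketch of the shared-encoder / preference-conditioned-decoder split it outlines: one encoder produces instance embeddings reused by all agents, while each agent's decoder is biased by its preference vector. All class names, dimensions, and the concrete form of the "preference intervention" (here, a linear projection of the preference vector into the decoder query) are assumptions, not the authors' implementation; see the linked repository for that.

```python
# Hypothetical sketch, not the authors' code: shared encoder with
# preference-conditioned decoders for decomposed MOVRP subproblems.
import torch
import torch.nn as nn

class SharedEncoder(nn.Module):
    """One encoder learns node embeddings reused by every agent."""
    def __init__(self, node_dim=3, embed_dim=128, n_layers=3, n_heads=8):
        super().__init__()
        self.proj = nn.Linear(node_dim, embed_dim)
        layer = nn.TransformerEncoderLayer(embed_dim, n_heads, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, n_layers)

    def forward(self, nodes):                   # nodes: (B, N, node_dim)
        return self.encoder(self.proj(nodes))   # (B, N, embed_dim)

class PreferenceDecoder(nn.Module):
    """Per-agent decoder: the preference vector biases the attention query,
    so each agent constructs a solution for its own subproblem."""
    def __init__(self, embed_dim=128, n_objectives=2, n_heads=8):
        super().__init__()
        # "Preference intervention" modeled here as a learned projection.
        self.pref_proj = nn.Linear(n_objectives, embed_dim)
        self.attn = nn.MultiheadAttention(embed_dim, n_heads, batch_first=True)

    def forward(self, node_emb, preference):
        # preference: (B, n_objectives), e.g. weights summing to 1
        q = self.pref_proj(preference).unsqueeze(1)   # (B, 1, embed_dim)
        _, weights = self.attn(q, node_emb, node_emb)
        return weights.squeeze(1)  # per-node attention weights, usable as selection probabilities
```

In a CAS-style inference loop, one might freeze the shared encoder and optimize only a subset of decoder parameters per test instance, e.g. `torch.optim.Adam(decoder.pref_proj.parameters(), lr=1e-4)`; the subset chosen here is illustrative only.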
This paper presents a stochastic partially optimized cyclic shift crossover operator for the optimization of the multi-objective vehicle routing problem with time windows using genetic algorithms. The aim of the paper is to show how the combination of simple stochastic rules and sequential appendage policies addresses a common limitation of the traditional genetic algorithm when optimizing complex combinatorial problems. The limitation in question is the inability of the traditional genetic algorithm to perform local optimization. A series of tests based on the Solomon benchmark instances shows the level of competitiveness of the newly introduced crossover operator. (C) 2016 Elsevier B.V. All rights reserved.
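The abstract does not spell out the operator itself, but a minimal sketch of the underlying idea (order-preserving recombination with a stochastic cyclic shift on a permutation-encoded tour) might look like the following. The function name and the OX-style splice are assumptions, and the paper's "partially optimized" sequential-appendage local search step is deliberately omitted.

```python
# Illustrative sketch only: a cyclic-shift variant of order crossover (OX)
# on a permutation-encoded giant tour; not the paper's exact operator.
import random

def cyclic_shift_crossover(parent1, parent2, rng=random):
    """Copy a random segment from parent1, then fill the remaining positions
    with the leftover customers in the order they appear in parent2 after
    a random cyclic shift. The child is always a valid permutation."""
    n = len(parent1)
    i, j = sorted(rng.sample(range(n), 2))        # random segment bounds
    segment = parent1[i:j + 1]
    kept = set(segment)
    shift = rng.randrange(n)                      # stochastic cyclic shift
    shifted = parent2[shift:] + parent2[:shift]   # rotate parent2
    rest = [c for c in shifted if c not in kept]  # keep permutation feasibility
    return rest[:i] + segment + rest[i:]

if __name__ == "__main__":
    p1 = [1, 2, 3, 4, 5, 6, 7, 8]
    p2 = [8, 6, 4, 2, 7, 5, 3, 1]
    print(cyclic_shift_crossover(p1, p2, random.Random(0)))  # a valid permutation of 1..8
```

In a full GA for the time-windowed problem, such a child would still need route splitting and feasibility repair against capacity and time-window constraints before evaluation.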