Graph Q-learning Assisted Ant Colony Optimization for Vehicle Routing Problems with Time Windows

Proceedings of the Companion Conference on Genetic and Evolutionary Computation Pub Date : 2023-07-15 DOI:10.1145/3583133.3596423

Peng Yue, Shiqing Liu, Yaochu Jin

{"title":"Graph Q-learning Assisted Ant Colony Optimization for Vehicle Routing Problems with Time Windows","authors":"Peng Yue, Shiqing Liu, Yaochu Jin","doi":"10.1145/3583133.3596423","DOIUrl":null,"url":null,"abstract":"Vehicle routing problem with time windows (VRPTW) is a typical class of constrained path planning problems in the field of combinatorial optimization. VRPTW considers a delivery task for a given set of customers with time windows, and the target is to find optimal routes for a group of vehicles that can minimize the total transportation cost. The traditional heuristics suffer from several limitations when solving VRPTW, such as poor scalability, sensitivity to hyperparameters and difficulty in handling complex constraints. Recent advance in machine learning makes it possible to enhance heuristic approaches via learned knowledge. In this paper, we propose a graph Q-learning assisted ant colony optimization algorithm named GQL-ACO to solve VRPTW. Compared to vanilla ant colony optimization (ACO), our proposed method first employs the learned heuristic values by using graph Q learning, instead of handcrafted ones, to define the hyperparameters of ACO. Second, we design a collaborative search strategy by combining ACO and Q-learning effectively, which can adaptively adjust the hyperparameters of ACO based on the search experiences.","PeriodicalId":422029,"journal":{"name":"Proceedings of the Companion Conference on Genetic and Evolutionary Computation","volume":"56 2","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Companion Conference on Genetic and Evolutionary Computation","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3583133.3596423","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Vehicle routing problem with time windows (VRPTW) is a typical class of constrained path planning problems in the field of combinatorial optimization. VRPTW considers a delivery task for a given set of customers with time windows, and the target is to find optimal routes for a group of vehicles that can minimize the total transportation cost. The traditional heuristics suffer from several limitations when solving VRPTW, such as poor scalability, sensitivity to hyperparameters and difficulty in handling complex constraints. Recent advance in machine learning makes it possible to enhance heuristic approaches via learned knowledge. In this paper, we propose a graph Q-learning assisted ant colony optimization algorithm named GQL-ACO to solve VRPTW. Compared to vanilla ant colony optimization (ACO), our proposed method first employs the learned heuristic values by using graph Q learning, instead of handcrafted ones, to define the hyperparameters of ACO. Second, we design a collaborative search strategy by combining ACO and Q-learning effectively, which can adaptively adjust the hyperparameters of ACO based on the search experiences.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

带时间窗车辆路径问题的图q学习辅助蚁群优化

带时间窗的车辆路径问题是组合优化领域中一类典型的约束路径规划问题。VRPTW考虑给定一组有时间窗口的客户的交付任务，目标是为一组车辆找到能够使总运输成本最小化的最佳路线。传统的启发式算法在求解VRPTW时存在可扩展性差、对超参数敏感、处理复杂约束困难等局限性。机器学习的最新进展使得通过学习知识来增强启发式方法成为可能。本文提出了一种图q学习辅助蚁群优化算法GQL-ACO来解决VRPTW问题。与普通蚁群算法相比，本文提出的方法首先利用图Q学习的启发式值来定义蚁群算法的超参数，而不是手工制作的启发式值。其次，将蚁群算法与q -学习有效结合，设计了一种基于搜索经验自适应调整蚁群算法超参数的协同搜索策略;

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Proceedings of the Companion Conference on Genetic and Evolutionary Computation

自引率

0.00%

发文量

期刊最新文献

Graph Q-learning Assisted Ant Colony Optimization for Vehicle Routing Problems with Time Windows Iterative Structure-Based Genetic Programming for Neural Architecture Search Bayesian Optimization For Choice Data Exploring Adaptive Components of SOMA Evaluation of the impact of various modifications to CMA-ES that facilitate its theoretical analysis