具有冲突和时间窗的二维矢量包装问题的基于上下文强盗学习的分支-价格-切割算法

IF 8.3 1区工程技术 Q1 ECONOMICS Transportation Research Part E-Logistics and Transportation Review Pub Date : 2024-11-29 DOI:10.1016/j.tre.2024.103866

Yanru Chen , Mujin Gao , Zongcheng Zhang , Junheng Li , M.I.M. Wahab , Yangsheng Jiang

{"title":"具有冲突和时间窗的二维矢量包装问题的基于上下文强盗学习的分支-价格-切割算法","authors":"Yanru Chen , Mujin Gao , Zongcheng Zhang , Junheng Li , M.I.M. Wahab , Yangsheng Jiang","doi":"10.1016/j.tre.2024.103866","DOIUrl":null,"url":null,"abstract":"<div><div>A two-dimensional vector packing problem with conflicts and time windows (2DVPPCTW) is investigated in this study. It consists of packing items into the minimum number of bins, and items are characterized by different weights, volumes, and time windows. Items also have conflicts and cannot be packed in the same bin. We formulate the 2DVPPCTW as an integer programming model and reformulate it to the master problem and the subproblem based on the Danzig–Wolfe decomposition. An exact algorithm, contextual bandits learning-based branch-and-price-and-cut algorithm (CBL-BPC), is proposed for the 2DVPPCTW with reinforcement learning technique. In particular, we provide a CBL framework for the subproblem, which usually poses considerable computational challenges. Five heuristic algorithms, namely, adaptive large neighborhood search (ALNS), ant colony optimization heuristic (ACO), heuristic dynamic programming (DP), a combination of ALNS and heuristic DP, and a combination of ACO and heuristic DP, are developed as bandit arms in the CBL framework. The CBL framework adaptively chooses one of five heuristics algorithms to solve the subproblem by learning from previous experiences. An exact dynamic programming algorithm is invoked to guarantee optimality once the CBL fails to find a better solution to the subproblem. Rounded capacity inequalities and accelerating strategies are introduced to accelerate the solution. An extensive computational study shows that the CBL-BPC can solve all 800 instances optimally within a reasonable time frame and is highly competitive with state-of-the-art exact and heuristics methods.</div></div>","PeriodicalId":49418,"journal":{"name":"Transportation Research Part E-Logistics and Transportation Review","volume":"193 ","pages":"Article 103866"},"PeriodicalIF":8.3000,"publicationDate":"2024-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Contextual bandits learning-based branch-and-price-and-cut algorithm for the two-dimensional vector packing problem with conflicts and time windows\",\"authors\":\"Yanru Chen , Mujin Gao , Zongcheng Zhang , Junheng Li , M.I.M. Wahab , Yangsheng Jiang\",\"doi\":\"10.1016/j.tre.2024.103866\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>A two-dimensional vector packing problem with conflicts and time windows (2DVPPCTW) is investigated in this study. It consists of packing items into the minimum number of bins, and items are characterized by different weights, volumes, and time windows. Items also have conflicts and cannot be packed in the same bin. We formulate the 2DVPPCTW as an integer programming model and reformulate it to the master problem and the subproblem based on the Danzig–Wolfe decomposition. An exact algorithm, contextual bandits learning-based branch-and-price-and-cut algorithm (CBL-BPC), is proposed for the 2DVPPCTW with reinforcement learning technique. In particular, we provide a CBL framework for the subproblem, which usually poses considerable computational challenges. Five heuristic algorithms, namely, adaptive large neighborhood search (ALNS), ant colony optimization heuristic (ACO), heuristic dynamic programming (DP), a combination of ALNS and heuristic DP, and a combination of ACO and heuristic DP, are developed as bandit arms in the CBL framework. The CBL framework adaptively chooses one of five heuristics algorithms to solve the subproblem by learning from previous experiences. An exact dynamic programming algorithm is invoked to guarantee optimality once the CBL fails to find a better solution to the subproblem. Rounded capacity inequalities and accelerating strategies are introduced to accelerate the solution. An extensive computational study shows that the CBL-BPC can solve all 800 instances optimally within a reasonable time frame and is highly competitive with state-of-the-art exact and heuristics methods.</div></div>\",\"PeriodicalId\":49418,\"journal\":{\"name\":\"Transportation Research Part E-Logistics and Transportation Review\",\"volume\":\"193 \",\"pages\":\"Article 103866\"},\"PeriodicalIF\":8.3000,\"publicationDate\":\"2024-11-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Transportation Research Part E-Logistics and Transportation Review\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1366554524004575\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ECONOMICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Transportation Research Part E-Logistics and Transportation Review","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1366554524004575","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ECONOMICS","Score":null,"Total":0}

引用次数: 0

摘要

研究了一个带冲突和时间窗的二维向量填充问题（2DVPPCTW）。它包括将物品打包到最少数量的箱子中，物品的特征是不同的重量、体积和时间窗口。物品也有冲突，不能放在同一个箱子里。我们将2DVPPCTW作为一个整数规划模型，并基于danzigg - wolfe分解将其重新表述为主问题和子问题。采用强化学习技术，提出了一种精确的2DVPPCTW算法——基于上下文强盗学习的分支-价格-削减算法（CBL-BPC）。特别是，我们为子问题提供了一个CBL框架，这通常会带来相当大的计算挑战。在CBL框架下，开发了自适应大邻域搜索（ALNS）、蚁群优化启发式算法（ACO）、启发式动态规划算法（DP）、自适应大邻域搜索与启发式动态规划相结合算法、蚁群优化与启发式动态规划相结合算法等5种启发式算法。CBL框架自适应地从五种启发式算法中选择一种，通过学习前人的经验来解决子问题。当CBL无法找到子问题的较优解时，调用精确的动态规划算法来保证最优性。引入了四舍五入容量不等式和加速策略来加速求解。一项广泛的计算研究表明，CBL-BPC可以在合理的时间框架内最优地解决所有800个实例，并且与最先进的精确和启发式方法具有很强的竞争力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Contextual bandits learning-based branch-and-price-and-cut algorithm for the two-dimensional vector packing problem with conflicts and time windows

A two-dimensional vector packing problem with conflicts and time windows (2DVPPCTW) is investigated in this study. It consists of packing items into the minimum number of bins, and items are characterized by different weights, volumes, and time windows. Items also have conflicts and cannot be packed in the same bin. We formulate the 2DVPPCTW as an integer programming model and reformulate it to the master problem and the subproblem based on the Danzig–Wolfe decomposition. An exact algorithm, contextual bandits learning-based branch-and-price-and-cut algorithm (CBL-BPC), is proposed for the 2DVPPCTW with reinforcement learning technique. In particular, we provide a CBL framework for the subproblem, which usually poses considerable computational challenges. Five heuristic algorithms, namely, adaptive large neighborhood search (ALNS), ant colony optimization heuristic (ACO), heuristic dynamic programming (DP), a combination of ALNS and heuristic DP, and a combination of ACO and heuristic DP, are developed as bandit arms in the CBL framework. The CBL framework adaptively chooses one of five heuristics algorithms to solve the subproblem by learning from previous experiences. An exact dynamic programming algorithm is invoked to guarantee optimality once the CBL fails to find a better solution to the subproblem. Rounded capacity inequalities and accelerating strategies are introduced to accelerate the solution. An extensive computational study shows that the CBL-BPC can solve all 800 instances optimally within a reasonable time frame and is highly competitive with state-of-the-art exact and heuristics methods.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Transportation Research Part E-Logistics and Transportation Review 工程技术-工程：土木

CiteScore

16.20

自引率

16.00%

发文量

285

审稿时长

62 days

期刊介绍： Transportation Research Part E: Logistics and Transportation Review is a reputable journal that publishes high-quality articles covering a wide range of topics in the field of logistics and transportation research. The journal welcomes submissions on various subjects, including transport economics, transport infrastructure and investment appraisal, evaluation of public policies related to transportation, empirical and analytical studies of logistics management practices and performance, logistics and operations models, and logistics and supply chain management. Part E aims to provide informative and well-researched articles that contribute to the understanding and advancement of the field. The content of the journal is complementary to other prestigious journals in transportation research, such as Transportation Research Part A: Policy and Practice, Part B: Methodological, Part C: Emerging Technologies, Part D: Transport and Environment, and Part F: Traffic Psychology and Behaviour. Together, these journals form a comprehensive and cohesive reference for current research in transportation science.