{"title":"针对有放弃的knapsack问题的强化学习驱动的合作分散搜索","authors":"Juntao Zhao , Mhand Hifi","doi":"10.1016/j.cie.2024.110713","DOIUrl":null,"url":null,"abstract":"<div><div>The knapsack problem with forfeits belongs to the NP-hard combinatorial optimization family and arises in various applications like resource allocation, finance, and logistics, where certain item combinations incur penalties. Efficiently solving such a problem is crucial for optimizing resources while minimizing penalties. This paper proposes a novel reinforcement learning-driven cooperative scatter search algorithm to solve it, combining the robust exploration capabilities of scatter search with the adaptive learning strengths of <span><math><mi>Q</mi></math></span>-learning. The algorithm starts by generating a diverse archive set to ensure broad exploration of the solution space. It then iteratively generates and combines subsets using path-relinking, followed by a two-stage improvement process: <span><math><mi>Q</mi></math></span>-learning for dynamic enhancement and a tabu-based local search for refinement. Experimental evaluations on benchmark instances highlight the proposed method’s competitiveness against state-of-the-art approaches. The method establishes new lower bounds on 22 instances and matches existing bounds on others.</div></div>","PeriodicalId":55220,"journal":{"name":"Computers & Industrial Engineering","volume":"198 ","pages":"Article 110713"},"PeriodicalIF":6.7000,"publicationDate":"2024-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A reinforcement learning-driven cooperative scatter search for the knapsack problem with forfeits\",\"authors\":\"Juntao Zhao , Mhand Hifi\",\"doi\":\"10.1016/j.cie.2024.110713\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The knapsack problem with forfeits belongs to the NP-hard combinatorial optimization family and arises in various applications like resource allocation, finance, and logistics, where certain item combinations incur penalties. Efficiently solving such a problem is crucial for optimizing resources while minimizing penalties. This paper proposes a novel reinforcement learning-driven cooperative scatter search algorithm to solve it, combining the robust exploration capabilities of scatter search with the adaptive learning strengths of <span><math><mi>Q</mi></math></span>-learning. The algorithm starts by generating a diverse archive set to ensure broad exploration of the solution space. It then iteratively generates and combines subsets using path-relinking, followed by a two-stage improvement process: <span><math><mi>Q</mi></math></span>-learning for dynamic enhancement and a tabu-based local search for refinement. Experimental evaluations on benchmark instances highlight the proposed method’s competitiveness against state-of-the-art approaches. The method establishes new lower bounds on 22 instances and matches existing bounds on others.</div></div>\",\"PeriodicalId\":55220,\"journal\":{\"name\":\"Computers & Industrial Engineering\",\"volume\":\"198 \",\"pages\":\"Article 110713\"},\"PeriodicalIF\":6.7000,\"publicationDate\":\"2024-11-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computers & Industrial Engineering\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0360835224008350\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Industrial Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0360835224008350","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
A reinforcement learning-driven cooperative scatter search for the knapsack problem with forfeits
The knapsack problem with forfeits belongs to the NP-hard combinatorial optimization family and arises in various applications like resource allocation, finance, and logistics, where certain item combinations incur penalties. Efficiently solving such a problem is crucial for optimizing resources while minimizing penalties. This paper proposes a novel reinforcement learning-driven cooperative scatter search algorithm to solve it, combining the robust exploration capabilities of scatter search with the adaptive learning strengths of -learning. The algorithm starts by generating a diverse archive set to ensure broad exploration of the solution space. It then iteratively generates and combines subsets using path-relinking, followed by a two-stage improvement process: -learning for dynamic enhancement and a tabu-based local search for refinement. Experimental evaluations on benchmark instances highlight the proposed method’s competitiveness against state-of-the-art approaches. The method establishes new lower bounds on 22 instances and matches existing bounds on others.
期刊介绍:
Computers & Industrial Engineering (CAIE) is dedicated to researchers, educators, and practitioners in industrial engineering and related fields. Pioneering the integration of computers in research, education, and practice, industrial engineering has evolved to make computers and electronic communication integral to its domain. CAIE publishes original contributions focusing on the development of novel computerized methodologies to address industrial engineering problems. It also highlights the applications of these methodologies to issues within the broader industrial engineering and associated communities. The journal actively encourages submissions that push the boundaries of fundamental theories and concepts in industrial engineering techniques.