Solving Orienteering Problems by Hybridizing Evolutionary Algorithm and Deep Reinforcement Learning
Authors: Rui Wang; Wei Liu; Kaiwen Li; Tao Zhang; Ling Wang; Xin Xu
Journal: IEEE Transactions on Artificial Intelligence, vol. 5, no. 11, pp. 5493-5508
Publication date: 2024-06-04
Publication type: Journal Article
DOI: 10.1109/TAI.2024.3409520
URL: https://ieeexplore.ieee.org/document/10547597/
Citations: 0
Abstract
The orienteering problem (OP) has many real-world applications. However, as real-world problem instances grow rapidly in scale, traditional exact, heuristic, and learning-based methods struggle to balance optimization accuracy and efficiency. This study proposes DEA-DYPN, a double-layer optimization framework based on problem decomposition, to solve OPs. Using a diversity evolutionary algorithm (DEA) as the outer optimizer and a dynamic pointer network (DYPN) as the inner optimizer significantly reduces the difficulty of solving large-scale OPs. Several targeted optimization operators are designed to strengthen the search: a greedy population initialization heuristic, an elite strategy, a population restart mechanism, and a fitness-sharing selection strategy. Moreover, a dynamic embedding mechanism is introduced into DYPN to improve its feature-learning ability. Extensive comparative experiments on OP instances with sizes from 20 to 500 validate the algorithm's performance. Further experiments and analyses, including a significance test, stability analysis, complexity analysis, sensitivity analysis, and ablation experiments, provide a comprehensive evaluation. Experimental results show that the proposed DEA-DYPN ranks first according to the Friedman test and outperforms the competitor algorithms by 69%.
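The abstract names a fitness-sharing selection strategy but gives no details. The following is a minimal sketch of the classic fitness-sharing idea, not the authors' operator: an individual's raw fitness is divided by a niche count, so crowded regions of the search space are penalized and diversity is preserved. Everything specific here is an assumption made for illustration: individuals are represented as sets of selected node IDs, similarity is measured with Jaccard distance, and the names `jaccard_distance`, `shared_fitness`, `sigma`, and `alpha` are hypothetical.

```python
from typing import List, Set


def jaccard_distance(a: Set[int], b: Set[int]) -> float:
    """Distance in [0, 1] between two node subsets (assumed representation)."""
    union = a | b
    if not union:
        return 0.0
    return 1.0 - len(a & b) / len(union)


def shared_fitness(
    population: List[Set[int]],
    raw_fitness: List[float],
    sigma: float = 0.5,   # assumed niche radius
    alpha: float = 1.0,   # assumed shape of the sharing function
) -> List[float]:
    """Divide each raw fitness by its niche count (classic fitness sharing)."""
    shared = []
    for i, ind_i in enumerate(population):
        niche_count = 0.0
        for ind_j in population:
            d = jaccard_distance(ind_i, ind_j)
            if d < sigma:
                # Sharing function: 1 at distance 0, falling to 0 at sigma.
                niche_count += 1.0 - (d / sigma) ** alpha
        # niche_count >= 1 because every individual is at distance 0 from itself.
        shared.append(raw_fitness[i] / niche_count)
    return shared


# Hypothetical population: two identical individuals and one distinct one.
population = [{1, 2, 3}, {1, 2, 3}, {7, 8, 9}]
raw = [10.0, 10.0, 8.0]
result = shared_fitness(population, raw)
```

In this toy population the two identical individuals share one niche, so their fitness is halved, while the distinct individual keeps its raw value and would be favored by a selection step that ranks on shared fitness.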