Thomy Phan, Benran Zhang, Shao-Hung Chan, Sven Koenig
{"title":"基于自适应延迟启发式的随时多代理路径搜索","authors":"Thomy Phan, Benran Zhang, Shao-Hung Chan, Sven Koenig","doi":"arxiv-2408.02960","DOIUrl":null,"url":null,"abstract":"Anytime multi-agent path finding (MAPF) is a promising approach to scalable\npath optimization in multi-agent systems. MAPF-LNS, based on Large Neighborhood\nSearch (LNS), is the current state-of-the-art approach where a fast initial\nsolution is iteratively optimized by destroying and repairing selected paths of\nthe solution. Current MAPF-LNS variants commonly use an adaptive selection\nmechanism to choose among multiple destroy heuristics. However, to determine\npromising destroy heuristics, MAPF-LNS requires a considerable amount of\nexploration time. As common destroy heuristics are non-adaptive, any\nperformance bottleneck caused by these heuristics cannot be overcome via\nadaptive heuristic selection alone, thus limiting the overall effectiveness of\nMAPF-LNS in terms of solution cost. In this paper, we propose Adaptive\nDelay-based Destroy-and-Repair Enhanced with Success-based Self-Learning\n(ADDRESS) as a single-destroy-heuristic variant of MAPF-LNS. ADDRESS applies\nrestricted Thompson Sampling to the top-K set of the most delayed agents to\nselect a seed agent for adaptive LNS neighborhood generation. We evaluate\nADDRESS in multiple maps from the MAPF benchmark set and demonstrate cost\nimprovements by at least 50% in large-scale scenarios with up to a thousand\nagents, compared with the original MAPF-LNS and other state-of-the-art methods.","PeriodicalId":501479,"journal":{"name":"arXiv - CS - Artificial Intelligence","volume":"14 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Anytime Multi-Agent Path Finding with an Adaptive Delay-Based Heuristic\",\"authors\":\"Thomy Phan, Benran Zhang, Shao-Hung Chan, Sven Koenig\",\"doi\":\"arxiv-2408.02960\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Anytime multi-agent path finding (MAPF) is a promising approach to scalable\\npath optimization in multi-agent systems. MAPF-LNS, based on Large Neighborhood\\nSearch (LNS), is the current state-of-the-art approach where a fast initial\\nsolution is iteratively optimized by destroying and repairing selected paths of\\nthe solution. Current MAPF-LNS variants commonly use an adaptive selection\\nmechanism to choose among multiple destroy heuristics. However, to determine\\npromising destroy heuristics, MAPF-LNS requires a considerable amount of\\nexploration time. As common destroy heuristics are non-adaptive, any\\nperformance bottleneck caused by these heuristics cannot be overcome via\\nadaptive heuristic selection alone, thus limiting the overall effectiveness of\\nMAPF-LNS in terms of solution cost. In this paper, we propose Adaptive\\nDelay-based Destroy-and-Repair Enhanced with Success-based Self-Learning\\n(ADDRESS) as a single-destroy-heuristic variant of MAPF-LNS. ADDRESS applies\\nrestricted Thompson Sampling to the top-K set of the most delayed agents to\\nselect a seed agent for adaptive LNS neighborhood generation. We evaluate\\nADDRESS in multiple maps from the MAPF benchmark set and demonstrate cost\\nimprovements by at least 50% in large-scale scenarios with up to a thousand\\nagents, compared with the original MAPF-LNS and other state-of-the-art methods.\",\"PeriodicalId\":501479,\"journal\":{\"name\":\"arXiv - CS - Artificial Intelligence\",\"volume\":\"14 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-08-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Artificial Intelligence\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2408.02960\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.02960","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Anytime Multi-Agent Path Finding with an Adaptive Delay-Based Heuristic
Anytime multi-agent path finding (MAPF) is a promising approach to scalable
path optimization in multi-agent systems. MAPF-LNS, based on Large Neighborhood
Search (LNS), is the current state-of-the-art approach where a fast initial
solution is iteratively optimized by destroying and repairing selected paths of
the solution. Current MAPF-LNS variants commonly use an adaptive selection
mechanism to choose among multiple destroy heuristics. However, to determine
promising destroy heuristics, MAPF-LNS requires a considerable amount of
exploration time. As common destroy heuristics are non-adaptive, any
performance bottleneck caused by these heuristics cannot be overcome via
adaptive heuristic selection alone, thus limiting the overall effectiveness of
MAPF-LNS in terms of solution cost. In this paper, we propose Adaptive
Delay-based Destroy-and-Repair Enhanced with Success-based Self-Learning
(ADDRESS) as a single-destroy-heuristic variant of MAPF-LNS. ADDRESS applies
restricted Thompson Sampling to the top-K set of the most delayed agents to
select a seed agent for adaptive LNS neighborhood generation. We evaluate
ADDRESS in multiple maps from the MAPF benchmark set and demonstrate cost
improvements by at least 50% in large-scale scenarios with up to a thousand
agents, compared with the original MAPF-LNS and other state-of-the-art methods.