Thomy Phan, Benran Zhang, Shao-Hung Chan, Sven Koenig
{"title":"Anytime Multi-Agent Path Finding with an Adaptive Delay-Based Heuristic","authors":"Thomy Phan, Benran Zhang, Shao-Hung Chan, Sven Koenig","doi":"arxiv-2408.02960","DOIUrl":null,"url":null,"abstract":"Anytime multi-agent path finding (MAPF) is a promising approach to scalable\npath optimization in multi-agent systems. MAPF-LNS, based on Large Neighborhood\nSearch (LNS), is the current state-of-the-art approach where a fast initial\nsolution is iteratively optimized by destroying and repairing selected paths of\nthe solution. Current MAPF-LNS variants commonly use an adaptive selection\nmechanism to choose among multiple destroy heuristics. However, to determine\npromising destroy heuristics, MAPF-LNS requires a considerable amount of\nexploration time. As common destroy heuristics are non-adaptive, any\nperformance bottleneck caused by these heuristics cannot be overcome via\nadaptive heuristic selection alone, thus limiting the overall effectiveness of\nMAPF-LNS in terms of solution cost. In this paper, we propose Adaptive\nDelay-based Destroy-and-Repair Enhanced with Success-based Self-Learning\n(ADDRESS) as a single-destroy-heuristic variant of MAPF-LNS. ADDRESS applies\nrestricted Thompson Sampling to the top-K set of the most delayed agents to\nselect a seed agent for adaptive LNS neighborhood generation. We evaluate\nADDRESS in multiple maps from the MAPF benchmark set and demonstrate cost\nimprovements by at least 50% in large-scale scenarios with up to a thousand\nagents, compared with the original MAPF-LNS and other state-of-the-art methods.","PeriodicalId":501479,"journal":{"name":"arXiv - CS - Artificial Intelligence","volume":"14 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.02960","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Anytime multi-agent path finding (MAPF) is a promising approach to scalable
path optimization in multi-agent systems. MAPF-LNS, based on Large Neighborhood
Search (LNS), is the current state-of-the-art approach where a fast initial
solution is iteratively optimized by destroying and repairing selected paths of
the solution. Current MAPF-LNS variants commonly use an adaptive selection
mechanism to choose among multiple destroy heuristics. However, to determine
promising destroy heuristics, MAPF-LNS requires a considerable amount of
exploration time. As common destroy heuristics are non-adaptive, any
performance bottleneck caused by these heuristics cannot be overcome via
adaptive heuristic selection alone, thus limiting the overall effectiveness of
MAPF-LNS in terms of solution cost. In this paper, we propose Adaptive
Delay-based Destroy-and-Repair Enhanced with Success-based Self-Learning
(ADDRESS) as a single-destroy-heuristic variant of MAPF-LNS. ADDRESS applies
restricted Thompson Sampling to the top-K set of the most delayed agents to
select a seed agent for adaptive LNS neighborhood generation. We evaluate
ADDRESS in multiple maps from the MAPF benchmark set and demonstrate cost
improvements by at least 50% in large-scale scenarios with up to a thousand
agents, compared with the original MAPF-LNS and other state-of-the-art methods.