{"title":"A graph reinforcement learning framework for real-time distributed multi-robot task allocation","authors":"Dian Zhang, Peng Dong, Pai Peng, Yubo Dong","doi":"10.1007/s42401-024-00334-w","DOIUrl":null,"url":null,"abstract":"<div><p>Dynamic multi-robot task allocation (MRTA) requires real-time responsiveness and adaptability to rapidly changing conditions. Existing methods, primarily based on static data and centralized architectures, often fail in dynamic environments that require decentralized, context-aware decisions. To address these challenges, this paper proposes a novel graph reinforcement learning (GRL) architecture, named Spatial-Temporal Fusing Reinforcement Learning (STFRL), to address real-time distributed target allocation problems in search and rescue scenarios. The proposed policy network includes an encoder, which employs a Temporal-Spatial Fusing Encoder (TSFE) to extract input features and a decoder uses multi-head attention (MHA) to perform distributed allocation based on the encoder’s output and context. The policy network is trained with the REINFORCE algorithm. Experimental comparisons with state-of-the-art baselines demonstrate that STFRL achieves superior performance in path cost, inference speed, and scalability, highlighting its robustness and efficiency in complex, dynamic environments.</p></div>","PeriodicalId":36309,"journal":{"name":"Aerospace Systems","volume":"8 1","pages":"105 - 116"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s42401-024-00334-w.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Aerospace Systems","FirstCategoryId":"1085","ListUrlMain":"https://link.springer.com/article/10.1007/s42401-024-00334-w","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Earth and Planetary Sciences","Score":null,"Total":0}
引用次数: 0
Abstract
Dynamic multi-robot task allocation (MRTA) requires real-time responsiveness and adaptability to rapidly changing conditions. Existing methods, primarily based on static data and centralized architectures, often fail in dynamic environments that require decentralized, context-aware decisions. To address these challenges, this paper proposes a novel graph reinforcement learning (GRL) architecture, named Spatial-Temporal Fusing Reinforcement Learning (STFRL), to address real-time distributed target allocation problems in search and rescue scenarios. The proposed policy network includes an encoder, which employs a Temporal-Spatial Fusing Encoder (TSFE) to extract input features and a decoder uses multi-head attention (MHA) to perform distributed allocation based on the encoder’s output and context. The policy network is trained with the REINFORCE algorithm. Experimental comparisons with state-of-the-art baselines demonstrate that STFRL achieves superior performance in path cost, inference speed, and scalability, highlighting its robustness and efficiency in complex, dynamic environments.
期刊介绍:
Aerospace Systems provides an international, peer-reviewed forum which focuses on system-level research and development regarding aeronautics and astronautics. The journal emphasizes the unique role and increasing importance of informatics on aerospace. It fills a gap in current publishing coverage from outer space vehicles to atmospheric vehicles by highlighting interdisciplinary science, technology and engineering.
Potential topics include, but are not limited to:
Trans-space vehicle systems design and integration
Air vehicle systems
Space vehicle systems
Near-space vehicle systems
Aerospace robotics and unmanned system
Communication, navigation and surveillance
Aerodynamics and aircraft design
Dynamics and control
Aerospace propulsion
Avionics system
Opto-electronic system
Air traffic management
Earth observation
Deep space exploration
Bionic micro-aircraft/spacecraft
Intelligent sensing and Information fusion