{"title":"学习多无人机的连续控制,进行洪水区域覆盖的定向探索","authors":"","doi":"10.1016/j.robot.2024.104774","DOIUrl":null,"url":null,"abstract":"<div><p>Real time on-ground information is of critical value during any natural disaster such as floods. The disaster response teams require the latest ground information of the flooded areas to effectively plan and execute rescue operations. Unmanned Aerial Vehicles (UAVs) are increasingly becoming a tool to perform quick surveys of larger areas such as flood disasters. In this paper, we propose a method to perform critical area coverage of flood-struck regions using multiple autonomous UAVs. A Deep Reinforcement Learning algorithm is proposed to learn continuous multi-UAV controls, incorporating a directed exploration strategy for the DDPG’s target actor, which relies on the D-infinity (DINF) algorithm. The DINF water flow estimation technique utilizes surface elevation data to understand and predict the directed discharge of floodwater. Further, we introduce a Path scatter strategy for the multi-UAV system that inhibits the clustered formation of the UAVs over low-elevated regions. The performance of the proposed D3S (DDPG+DINF+Path Scatter) algorithm is evaluated using various performance metrics, such as average cumulative rewards, number of collisions, and UAVs’ spread observed over the environment. In comparison to the baseline algorithms and other prevalent approaches in the literature, the proposed method is found to be better placed as the results highlight a significantly improved performance by D3S across different metrics.</p></div>","PeriodicalId":49592,"journal":{"name":"Robotics and Autonomous Systems","volume":null,"pages":null},"PeriodicalIF":4.3000,"publicationDate":"2024-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Learning continuous multi-UAV controls with directed explorations for flood area coverage\",\"authors\":\"\",\"doi\":\"10.1016/j.robot.2024.104774\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Real time on-ground information is of critical value during any natural disaster such as floods. The disaster response teams require the latest ground information of the flooded areas to effectively plan and execute rescue operations. Unmanned Aerial Vehicles (UAVs) are increasingly becoming a tool to perform quick surveys of larger areas such as flood disasters. In this paper, we propose a method to perform critical area coverage of flood-struck regions using multiple autonomous UAVs. A Deep Reinforcement Learning algorithm is proposed to learn continuous multi-UAV controls, incorporating a directed exploration strategy for the DDPG’s target actor, which relies on the D-infinity (DINF) algorithm. The DINF water flow estimation technique utilizes surface elevation data to understand and predict the directed discharge of floodwater. Further, we introduce a Path scatter strategy for the multi-UAV system that inhibits the clustered formation of the UAVs over low-elevated regions. The performance of the proposed D3S (DDPG+DINF+Path Scatter) algorithm is evaluated using various performance metrics, such as average cumulative rewards, number of collisions, and UAVs’ spread observed over the environment. In comparison to the baseline algorithms and other prevalent approaches in the literature, the proposed method is found to be better placed as the results highlight a significantly improved performance by D3S across different metrics.</p></div>\",\"PeriodicalId\":49592,\"journal\":{\"name\":\"Robotics and Autonomous Systems\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":4.3000,\"publicationDate\":\"2024-08-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Robotics and Autonomous Systems\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0921889024001581\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Robotics and Autonomous Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0921889024001581","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
Learning continuous multi-UAV controls with directed explorations for flood area coverage
Real time on-ground information is of critical value during any natural disaster such as floods. The disaster response teams require the latest ground information of the flooded areas to effectively plan and execute rescue operations. Unmanned Aerial Vehicles (UAVs) are increasingly becoming a tool to perform quick surveys of larger areas such as flood disasters. In this paper, we propose a method to perform critical area coverage of flood-struck regions using multiple autonomous UAVs. A Deep Reinforcement Learning algorithm is proposed to learn continuous multi-UAV controls, incorporating a directed exploration strategy for the DDPG’s target actor, which relies on the D-infinity (DINF) algorithm. The DINF water flow estimation technique utilizes surface elevation data to understand and predict the directed discharge of floodwater. Further, we introduce a Path scatter strategy for the multi-UAV system that inhibits the clustered formation of the UAVs over low-elevated regions. The performance of the proposed D3S (DDPG+DINF+Path Scatter) algorithm is evaluated using various performance metrics, such as average cumulative rewards, number of collisions, and UAVs’ spread observed over the environment. In comparison to the baseline algorithms and other prevalent approaches in the literature, the proposed method is found to be better placed as the results highlight a significantly improved performance by D3S across different metrics.
期刊介绍:
Robotics and Autonomous Systems will carry articles describing fundamental developments in the field of robotics, with special emphasis on autonomous systems. An important goal of this journal is to extend the state of the art in both symbolic and sensory based robot control and learning in the context of autonomous systems.
Robotics and Autonomous Systems will carry articles on the theoretical, computational and experimental aspects of autonomous systems, or modules of such systems.