Coverage Path Planning for Unmanned Aerial Vehicles in Complex 3D Environments with Deep Reinforcement Learning

2022 IEEE International Conference on Robotics and Biomimetics (ROBIO) Pub Date : 2022-12-05 DOI:10.1109/ROBIO55434.2022.10011936

Julian Bialas, M. Döller

{"title":"Coverage Path Planning for Unmanned Aerial Vehicles in Complex 3D Environments with Deep Reinforcement Learning","authors":"Julian Bialas, M. Döller","doi":"10.1109/ROBIO55434.2022.10011936","DOIUrl":null,"url":null,"abstract":"Coverage path planning (CPP) for unmanned aerial vehicles (UAVs) defines a vital role in the automation process of UAV-supported disaster management. While multiple algorithms exist to solve the CPP problem for planar areas, the proposed algorithm is the first to handle complex three-dimensional environments and also account for power constraints and changing environments. By applying proximal policy optimization to an advantage-based actor-critic deep reinforcement learning model, the proposed framework enables an agent to efficiently cover the target area (TA), considering the orientation of the observation sensor, avoiding collisions as well as no-flying zones (NFZ) and reacting to changing environments. Furthermore, a safe landing mechanism, based on the Dijkstra algorithm, expands the framework to guarantee a successful landing in the respective start and landing zone (SLZ) within the power constraints. The model is trained on real data to learn the optimal control policy. Additionally, the framework was tested and validated on hardware in a drone lab to confirm its effectiveness and capability to perform real-time path planning.","PeriodicalId":151112,"journal":{"name":"2022 IEEE International Conference on Robotics and Biomimetics (ROBIO)","volume":"107 ","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference on Robotics and Biomimetics (ROBIO)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ROBIO55434.2022.10011936","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Coverage path planning (CPP) for unmanned aerial vehicles (UAVs) defines a vital role in the automation process of UAV-supported disaster management. While multiple algorithms exist to solve the CPP problem for planar areas, the proposed algorithm is the first to handle complex three-dimensional environments and also account for power constraints and changing environments. By applying proximal policy optimization to an advantage-based actor-critic deep reinforcement learning model, the proposed framework enables an agent to efficiently cover the target area (TA), considering the orientation of the observation sensor, avoiding collisions as well as no-flying zones (NFZ) and reacting to changing environments. Furthermore, a safe landing mechanism, based on the Dijkstra algorithm, expands the framework to guarantee a successful landing in the respective start and landing zone (SLZ) within the power constraints. The model is trained on real data to learn the optimal control policy. Additionally, the framework was tested and validated on hardware in a drone lab to confirm its effectiveness and capability to perform real-time path planning.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于深度强化学习的复杂三维环境下无人机覆盖路径规划

无人机覆盖路径规划(CPP)在无人机支持的灾害管理自动化过程中起着至关重要的作用。虽然已有多种算法解决平面区域的CPP问题，但本文提出的算法是第一个处理复杂三维环境的算法，并且考虑了功率约束和变化的环境。通过将近端策略优化应用于基于优势的行为者-批评者深度强化学习模型，所提出的框架使智能体能够有效地覆盖目标区域(TA)，考虑观察传感器的方向，避免碰撞和禁飞区(NFZ)，并对不断变化的环境做出反应。此外，基于Dijkstra算法的安全着陆机制扩展了框架，保证在功率约束下在各自的启动区和着陆区(SLZ)成功着陆。利用实际数据对模型进行训练，学习最优控制策略。此外，该框架在无人机实验室的硬件上进行了测试和验证，以确认其有效性和执行实时路径规划的能力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2022 IEEE International Conference on Robotics and Biomimetics (ROBIO)

自引率

0.00%

发文量