Coverage Path Planning for Unmanned Aerial Vehicles in Complex 3D Environments with Deep Reinforcement Learning

Julian Bialas, M. Döller
{"title":"Coverage Path Planning for Unmanned Aerial Vehicles in Complex 3D Environments with Deep Reinforcement Learning","authors":"Julian Bialas, M. Döller","doi":"10.1109/ROBIO55434.2022.10011936","DOIUrl":null,"url":null,"abstract":"Coverage path planning (CPP) for unmanned aerial vehicles (UAVs) defines a vital role in the automation process of UAV-supported disaster management. While multiple algorithms exist to solve the CPP problem for planar areas, the proposed algorithm is the first to handle complex three-dimensional environments and also account for power constraints and changing environments. By applying proximal policy optimization to an advantage-based actor-critic deep reinforcement learning model, the proposed framework enables an agent to efficiently cover the target area (TA), considering the orientation of the observation sensor, avoiding collisions as well as no-flying zones (NFZ) and reacting to changing environments. Furthermore, a safe landing mechanism, based on the Dijkstra algorithm, expands the framework to guarantee a successful landing in the respective start and landing zone (SLZ) within the power constraints. The model is trained on real data to learn the optimal control policy. Additionally, the framework was tested and validated on hardware in a drone lab to confirm its effectiveness and capability to perform real-time path planning.","PeriodicalId":151112,"journal":{"name":"2022 IEEE International Conference on Robotics and Biomimetics (ROBIO)","volume":"107 ","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference on Robotics and Biomimetics (ROBIO)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ROBIO55434.2022.10011936","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Coverage path planning (CPP) for unmanned aerial vehicles (UAVs) defines a vital role in the automation process of UAV-supported disaster management. While multiple algorithms exist to solve the CPP problem for planar areas, the proposed algorithm is the first to handle complex three-dimensional environments and also account for power constraints and changing environments. By applying proximal policy optimization to an advantage-based actor-critic deep reinforcement learning model, the proposed framework enables an agent to efficiently cover the target area (TA), considering the orientation of the observation sensor, avoiding collisions as well as no-flying zones (NFZ) and reacting to changing environments. Furthermore, a safe landing mechanism, based on the Dijkstra algorithm, expands the framework to guarantee a successful landing in the respective start and landing zone (SLZ) within the power constraints. The model is trained on real data to learn the optimal control policy. Additionally, the framework was tested and validated on hardware in a drone lab to confirm its effectiveness and capability to perform real-time path planning.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于深度强化学习的复杂三维环境下无人机覆盖路径规划
无人机覆盖路径规划(CPP)在无人机支持的灾害管理自动化过程中起着至关重要的作用。虽然已有多种算法解决平面区域的CPP问题,但本文提出的算法是第一个处理复杂三维环境的算法,并且考虑了功率约束和变化的环境。通过将近端策略优化应用于基于优势的行为者-批评者深度强化学习模型,所提出的框架使智能体能够有效地覆盖目标区域(TA),考虑观察传感器的方向,避免碰撞和禁飞区(NFZ),并对不断变化的环境做出反应。此外,基于Dijkstra算法的安全着陆机制扩展了框架,保证在功率约束下在各自的启动区和着陆区(SLZ)成功着陆。利用实际数据对模型进行训练,学习最优控制策略。此外,该框架在无人机实验室的硬件上进行了测试和验证,以确认其有效性和执行实时路径规划的能力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Relative Displacement Measurement Based Affine Formation Tracking Control for Nonholonomic Kinematic Agents Steady Tracker: Tracking a Target Stably Using a Quadrotor Adaptive Super-Twisting sliding mode trajectory tracking control of underactuated unmanned surface vehicles based on prescribed performance* Design and Preliminary Evaluation of a Lightweight, Cable-Driven Hip Exoskeleton for Walking Assistance A PSO-based Resource Allocation and Task Assignment Approach for Real-Time Cloud Computing-based Robotic Systems
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1