SimplestNet-Drone: An efficient and Accurate Object Detection Algorithm for Drone Aerial Image Analytics

{"title":"SimplestNet-Drone: An efficient and Accurate Object Detection Algorithm for Drone Aerial Image Analytics","authors":"","doi":"10.1109/DICTA56598.2022.10034564","DOIUrl":null,"url":null,"abstract":"Images captured by drones are extremely difficult to detect due to varying camera angles, distances, sizes, and environmental conditions, making it challenging to accurately detect an object from a height. Nonetheless, object detection plays a crucial role in computer vision and has made significant improvements to images captured by drones. We apply the YOLOv5 framework with modified feature extraction and focus detection. The problem with aerial images is object size and viewing angle from a high altitude, so we proposed a single-stage object detection model called “SimplestNet-Drone”. We included a fourth prediction head to improve the object detection on the smallest objects and improve the detection speed. The algorithm's prediction accuracy is improved by adding an attention model mechanism, which detects attention regions in environments and suppresses unnecessary information. The model was trained and tested on the VisDorne dataset and compared with other object detection models. The model shows great improvement, with a mean average precision of 63.72%, and has improved the Yolo architecture. A real-time implementation of our model can be watched in the following YouTube video: https://youtu.be/De8t4tjtb6w","PeriodicalId":159377,"journal":{"name":"2022 International Conference on Digital Image Computing: Techniques and Applications (DICTA)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Digital Image Computing: Techniques and Applications (DICTA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DICTA56598.2022.10034564","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Images captured by drones are extremely difficult to detect due to varying camera angles, distances, sizes, and environmental conditions, making it challenging to accurately detect an object from a height. Nonetheless, object detection plays a crucial role in computer vision and has made significant improvements to images captured by drones. We apply the YOLOv5 framework with modified feature extraction and focus detection. The problem with aerial images is object size and viewing angle from a high altitude, so we proposed a single-stage object detection model called “SimplestNet-Drone”. We included a fourth prediction head to improve the object detection on the smallest objects and improve the detection speed. The algorithm's prediction accuracy is improved by adding an attention model mechanism, which detects attention regions in environments and suppresses unnecessary information. The model was trained and tested on the VisDorne dataset and compared with other object detection models. The model shows great improvement, with a mean average precision of 63.72%, and has improved the Yolo architecture. A real-time implementation of our model can be watched in the following YouTube video: https://youtu.be/De8t4tjtb6w
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
SimplestNet-Drone:一种高效、准确的无人机航拍图像目标检测算法
由于相机角度、距离、尺寸和环境条件的变化,无人机捕获的图像极难检测,这使得从高处准确检测物体具有挑战性。尽管如此,物体检测在计算机视觉中起着至关重要的作用,并对无人机捕获的图像进行了重大改进。我们将YOLOv5框架应用于改进的特征提取和焦点检测。航空图像的问题是物体大小和高空视角,因此我们提出了一种单级目标检测模型,称为“SimplestNet-Drone”。我们加入了第四个预测头,以提高对最小目标的目标检测,提高检测速度。该算法通过添加注意模型机制来检测环境中的注意区域并抑制不必要的信息,从而提高了算法的预测精度。该模型在VisDorne数据集上进行了训练和测试,并与其他目标检测模型进行了比较。该模型改进了Yolo结构,平均精度达到63.72%。我们的模型的实时实现可以在以下YouTube视频中观看:https://youtu.be/De8t4tjtb6w
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
FootSeg: Automatic Anatomical Segmentation of Foot Bones from Weight-Bearing Cone Beam CT Scans KENGIC: KEyword-driven and N-Gram Graph based Image Captioning Swin-ResUNet: A Swin-Topology Module for Road Extraction from Remote Sensing Images A Multi-Granularity Feature Fusion Model for Pedestrian Attribute Recognition Disentangling Convolutional Neural Network towards an explainable Vehicle Classifier
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1