YOLO-HLFE:基于 YOLOv7、具有混合损失和特征增强功能的无人机透视目标探测器

IF 2.6 4区 综合性期刊 Q2 MULTIDISCIPLINARY SCIENCES Arabian Journal for Science and Engineering Pub Date : 2024-07-01 DOI:10.1007/s13369-024-09188-y
Hao Sun, Jianhao Wang, Ziyu Hu, He Yang, Zhenwei Xu
{"title":"YOLO-HLFE:基于 YOLOv7、具有混合损失和特征增强功能的无人机透视目标探测器","authors":"Hao Sun,&nbsp;Jianhao Wang,&nbsp;Ziyu Hu,&nbsp;He Yang,&nbsp;Zhenwei Xu","doi":"10.1007/s13369-024-09188-y","DOIUrl":null,"url":null,"abstract":"<div><p>Target detection from UAV perspective has been a very hot task in recent years. Due to the flying height of the UAV, the detection targets in the photographs are dense and small in scale, resulting in little available information and difficulty in feature extraction. And the prediction bias of small targets can have a large negative impact on the calculation of losses. So for better use of UAV, YOLO-HLFE is designed on the basis of YOLOv7. The coordinate attention mechanism is added to the MP downsampling structure to comprise MPFE downsampling structure, which makes full use of the location information of the target and enhances the feature extraction capability of the network. The complete intersection over union (CIOU) of YOLOv7 is combined with the Normalized Gaussian Wasserstein Distance loss (NWD) to constitute the CIOU-NWD loss to mitigate the prediction bias problem for small targets. In addition, in order to make the anchor point of the model closer to the target scale of the UAV perspective, the clustering method of the model is improved and the anchor point is re-clustered. In experiment using the sliced VisDrone2021-DET dataset and SeaDronesSeeV2 dataset, the mAP50 and mAP of YOLO-HLFE on sliced VisDrone2021-DET dataset reach 52.3% and 30.0%, which are 2.8% and 0.9% higher than the baseline, respectively.</p></div>","PeriodicalId":54354,"journal":{"name":"Arabian Journal for Science and Engineering","volume":"50 2","pages":"1261 - 1278"},"PeriodicalIF":2.6000,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"YOLO-HLFE: A UAV Perspective Target Detector With Hybrid Loss and Feature Enhancement Based on YOLOv7\",\"authors\":\"Hao Sun,&nbsp;Jianhao Wang,&nbsp;Ziyu Hu,&nbsp;He Yang,&nbsp;Zhenwei Xu\",\"doi\":\"10.1007/s13369-024-09188-y\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Target detection from UAV perspective has been a very hot task in recent years. Due to the flying height of the UAV, the detection targets in the photographs are dense and small in scale, resulting in little available information and difficulty in feature extraction. And the prediction bias of small targets can have a large negative impact on the calculation of losses. So for better use of UAV, YOLO-HLFE is designed on the basis of YOLOv7. The coordinate attention mechanism is added to the MP downsampling structure to comprise MPFE downsampling structure, which makes full use of the location information of the target and enhances the feature extraction capability of the network. The complete intersection over union (CIOU) of YOLOv7 is combined with the Normalized Gaussian Wasserstein Distance loss (NWD) to constitute the CIOU-NWD loss to mitigate the prediction bias problem for small targets. In addition, in order to make the anchor point of the model closer to the target scale of the UAV perspective, the clustering method of the model is improved and the anchor point is re-clustered. In experiment using the sliced VisDrone2021-DET dataset and SeaDronesSeeV2 dataset, the mAP50 and mAP of YOLO-HLFE on sliced VisDrone2021-DET dataset reach 52.3% and 30.0%, which are 2.8% and 0.9% higher than the baseline, respectively.</p></div>\",\"PeriodicalId\":54354,\"journal\":{\"name\":\"Arabian Journal for Science and Engineering\",\"volume\":\"50 2\",\"pages\":\"1261 - 1278\"},\"PeriodicalIF\":2.6000,\"publicationDate\":\"2024-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Arabian Journal for Science and Engineering\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s13369-024-09188-y\",\"RegionNum\":4,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Arabian Journal for Science and Engineering","FirstCategoryId":"103","ListUrlMain":"https://link.springer.com/article/10.1007/s13369-024-09188-y","RegionNum":4,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0

摘要

从无人机角度进行目标检测是近年来非常热门的任务。由于无人机的飞行高度,照片中的探测目标密度大、尺度小,导致可用信息少,特征提取困难。而且小目标的预测偏差会对损失计算产生较大的负面影响。因此,为了更好地利用无人机,在 YOLOv7 的基础上设计了 YOLO-HLFE。在 MP 下采样结构中加入坐标注意机制,组成 MPFE 下采样结构,充分利用目标的位置信息,增强网络的特征提取能力。将 YOLOv7 的完全交集大于联合(CIOU)与归一化高斯瓦瑟斯坦距离损失(NWD)相结合,构成 CIOU-NWD 损失,以减轻对小目标的预测偏差问题。此外,为了使模型的锚点更接近无人机视角下的目标尺度,改进了模型的聚类方法,对锚点进行了重新聚类。在使用切片 VisDrone2021-DET 数据集和 SeaDronesSeeV2 数据集进行的实验中,YOLO-HLFE 在切片 VisDrone2021-DET 数据集上的 mAP50 和 mAP 分别达到 52.3% 和 30.0%,比基线分别高出 2.8% 和 0.9%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

摘要图片

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
YOLO-HLFE: A UAV Perspective Target Detector With Hybrid Loss and Feature Enhancement Based on YOLOv7

Target detection from UAV perspective has been a very hot task in recent years. Due to the flying height of the UAV, the detection targets in the photographs are dense and small in scale, resulting in little available information and difficulty in feature extraction. And the prediction bias of small targets can have a large negative impact on the calculation of losses. So for better use of UAV, YOLO-HLFE is designed on the basis of YOLOv7. The coordinate attention mechanism is added to the MP downsampling structure to comprise MPFE downsampling structure, which makes full use of the location information of the target and enhances the feature extraction capability of the network. The complete intersection over union (CIOU) of YOLOv7 is combined with the Normalized Gaussian Wasserstein Distance loss (NWD) to constitute the CIOU-NWD loss to mitigate the prediction bias problem for small targets. In addition, in order to make the anchor point of the model closer to the target scale of the UAV perspective, the clustering method of the model is improved and the anchor point is re-clustered. In experiment using the sliced VisDrone2021-DET dataset and SeaDronesSeeV2 dataset, the mAP50 and mAP of YOLO-HLFE on sliced VisDrone2021-DET dataset reach 52.3% and 30.0%, which are 2.8% and 0.9% higher than the baseline, respectively.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Arabian Journal for Science and Engineering
Arabian Journal for Science and Engineering MULTIDISCIPLINARY SCIENCES-
CiteScore
5.70
自引率
3.40%
发文量
993
期刊介绍: King Fahd University of Petroleum & Minerals (KFUPM) partnered with Springer to publish the Arabian Journal for Science and Engineering (AJSE). AJSE, which has been published by KFUPM since 1975, is a recognized national, regional and international journal that provides a great opportunity for the dissemination of research advances from the Kingdom of Saudi Arabia, MENA and the world.
期刊最新文献
Effects of Combined Utilization of Active Cooler/Heater and Blade-Shaped Nanoparticles in Base Fluid for Performance Improvement of Thermoelectric Generator Mounted in Between Vented Cavities A Review of the Shear Design Provisions of ACI Code and Eurocode for Self-Compacting Concrete, Recycled Aggregate Concrete, and Geopolymer Concrete Beams Advancements in Vertical Axis Wind Turbine Technologies: A Comprehensive Review Improved Electrochemical Performance of Co3O4 Incorporated MnO2 Nanowires for Energy Storage Applications Biological CO2 Utilization; Current Status, Challenges, and Future Directions for Photosynthetic and Non-photosynthetic Route
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1