Hao Sun, Jianhao Wang, Ziyu Hu, He Yang, Zhenwei Xu
Arabian Journal for Science and Engineering, vol. 50, no. 2, pp. 1261-1278. Published 2024-07-01. DOI: 10.1007/s13369-024-09188-y. https://link.springer.com/article/10.1007/s13369-024-09188-y
YOLO-HLFE: A UAV Perspective Target Detector With Hybrid Loss and Feature Enhancement Based on YOLOv7
Target detection from the UAV perspective has attracted considerable attention in recent years. Because UAVs fly at altitude, the targets in their photographs are dense and small in scale, which leaves little usable information and makes feature extraction difficult. In addition, the prediction bias of small targets can have a large negative impact on the loss calculation. To address these problems, YOLO-HLFE is designed on the basis of YOLOv7. A coordinate attention mechanism is added to the MP downsampling structure to form the MPFE downsampling structure, which makes full use of the target's location information and enhances the feature extraction capability of the network. The complete intersection over union (CIOU) loss of YOLOv7 is combined with the Normalized Gaussian Wasserstein Distance (NWD) loss to form the CIOU-NWD loss, which mitigates the prediction bias problem for small targets. Furthermore, to bring the model's anchors closer to the target scales seen from the UAV perspective, the clustering method of the model is improved and the anchors are re-clustered. In experiments on the sliced VisDrone2021-DET dataset and the SeaDronesSeeV2 dataset, the mAP50 and mAP of YOLO-HLFE on sliced VisDrone2021-DET reach 52.3% and 30.0%, which are 2.8% and 0.9% higher than the baseline, respectively.
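The abstract does not say how the CIOU and NWD terms are combined, so the following is only a minimal sketch of one plausible form: a weighted sum of the two losses for a single pair of boxes in `(cx, cy, w, h)` format. The function names, the mixing weight `alpha`, and the normalizing constant `c` are assumptions made for illustration and are not taken from the paper. The CIOU and NWD terms themselves follow their commonly used definitions.

```python
import math


def ciou(box_a, box_b, eps=1e-9):
    """Complete IoU between two boxes given as (cx, cy, w, h)."""
    ax, ay, aw, ah = box_a
    bx, by, bw, bh = box_b
    # Convert centre/size to corner coordinates.
    ax1, ay1, ax2, ay2 = ax - aw / 2, ay - ah / 2, ax + aw / 2, ay + ah / 2
    bx1, by1, bx2, by2 = bx - bw / 2, by - bh / 2, bx + bw / 2, by + bh / 2
    # Plain IoU.
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h
    union = aw * ah + bw * bh - inter + eps
    iou = inter / union
    # Squared diagonal of the smallest box enclosing both boxes.
    cw = max(ax2, bx2) - min(ax1, bx1)
    ch = max(ay2, by2) - min(ay1, by1)
    c2 = cw ** 2 + ch ** 2 + eps
    # Squared distance between the two centres.
    rho2 = (ax - bx) ** 2 + (ay - by) ** 2
    # Aspect-ratio consistency term and its trade-off weight.
    v = (4 / math.pi ** 2) * (math.atan(bw / (bh + eps)) - math.atan(aw / (ah + eps))) ** 2
    trade_off = v / (1 - iou + v + eps)
    return iou - rho2 / c2 - trade_off * v


def nwd(box_a, box_b, c=12.8):
    """Normalized Gaussian Wasserstein Distance similarity in (0, 1].

    Each box is modelled as a 2D Gaussian with mean (cx, cy) and
    covariance diag(w^2 / 4, h^2 / 4). `c` is a dataset-dependent
    constant; the default here is a placeholder (assumption).
    """
    ax, ay, aw, ah = box_a
    bx, by, bw, bh = box_b
    w2 = math.sqrt(
        (ax - bx) ** 2 + (ay - by) ** 2
        + ((aw - bw) / 2) ** 2 + ((ah - bh) / 2) ** 2
    )
    return math.exp(-w2 / c)


def ciou_nwd_loss(pred, target, alpha=0.5, c=12.8):
    """Hypothetical hybrid loss: weighted sum of (1 - CIoU) and (1 - NWD).

    `alpha` is an assumed mixing weight, not a value from the paper.
    """
    return alpha * (1 - ciou(pred, target)) + (1 - alpha) * (1 - nwd(pred, target, c))


if __name__ == "__main__":
    target = (10.0, 10.0, 4.0, 4.0)
    near_miss = (16.0, 10.0, 4.0, 4.0)  # same size, no overlap with target
    print("CIoU:", ciou(near_miss, target))
    print("NWD :", nwd(near_miss, target))
    print("loss:", ciou_nwd_loss(near_miss, target))
```

The example pair illustrates why an NWD term can help with small targets: a shift of a few pixels can drive the IoU of two tiny boxes to zero, while the NWD similarity still decreases smoothly with the size of the shift.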
About the journal:
King Fahd University of Petroleum & Minerals (KFUPM) partnered with Springer to publish the Arabian Journal for Science and Engineering (AJSE).
AJSE, which has been published by KFUPM since 1975, is a recognized national, regional and international journal that provides a great opportunity for the dissemination of research advances from the Kingdom of Saudi Arabia, MENA and the world.