{"title":"MFFNet:用于无人机红外物体探测的轻量级多特征融合网络","authors":"Yunlei Chen , Ziyan Liu , Lihui Zhang , Yingyu Wu , Qian Zhang , Xuhui Zheng","doi":"10.1016/j.ejrs.2024.03.001","DOIUrl":null,"url":null,"abstract":"<div><p>In light of issues such as unnoticeable texture features and limited resolution of infrared image objects, a lightweight multi-scale feature fusion method for UAV infrared object recognition is presented to enhance the performance of UAVs carrying intelligent devices to detect infrared objects. By changing the anchorless frame strategy of the YOLOX method, a lightweight Multi-Feature Fusion Network (MFFNet) for UAV IR image object recognition is proposed. First, a lightweight backbone network is built using ShuffleNetv2_block, spatial pyramid pooling, and other modules to reduce the network's number of parameters and inference time while maintaining its capacity to extract features. Second, we develop a multi-feature fusion module to improve the detection capabilities of the model for IR objects by fusing the local features and the overall characteristics of IR objects since the texture features of IR objects are challenging to employ, but the boundary information is evident. The boundary frame regression loss is then optimized using SIoU by comparing the predicted frame to the actual frame in terms of angle, distance, shape, and IoU (Intersection over Union), which forces the model to reach the optimum predicted box more quickly.</p></div>","PeriodicalId":48539,"journal":{"name":"Egyptian Journal of Remote Sensing and Space Sciences","volume":"27 2","pages":"Pages 268-276"},"PeriodicalIF":3.7000,"publicationDate":"2024-03-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1110982324000218/pdfft?md5=85d30684c98bfb92e8845e2acca9c06c&pid=1-s2.0-S1110982324000218-main.pdf","citationCount":"0","resultStr":"{\"title\":\"MFFNet: A lightweight multi-feature fusion network for UAV infrared object detection\",\"authors\":\"Yunlei Chen , Ziyan Liu , Lihui Zhang , Yingyu Wu , Qian Zhang , Xuhui Zheng\",\"doi\":\"10.1016/j.ejrs.2024.03.001\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>In light of issues such as unnoticeable texture features and limited resolution of infrared image objects, a lightweight multi-scale feature fusion method for UAV infrared object recognition is presented to enhance the performance of UAVs carrying intelligent devices to detect infrared objects. By changing the anchorless frame strategy of the YOLOX method, a lightweight Multi-Feature Fusion Network (MFFNet) for UAV IR image object recognition is proposed. First, a lightweight backbone network is built using ShuffleNetv2_block, spatial pyramid pooling, and other modules to reduce the network's number of parameters and inference time while maintaining its capacity to extract features. Second, we develop a multi-feature fusion module to improve the detection capabilities of the model for IR objects by fusing the local features and the overall characteristics of IR objects since the texture features of IR objects are challenging to employ, but the boundary information is evident. The boundary frame regression loss is then optimized using SIoU by comparing the predicted frame to the actual frame in terms of angle, distance, shape, and IoU (Intersection over Union), which forces the model to reach the optimum predicted box more quickly.</p></div>\",\"PeriodicalId\":48539,\"journal\":{\"name\":\"Egyptian Journal of Remote Sensing and Space Sciences\",\"volume\":\"27 2\",\"pages\":\"Pages 268-276\"},\"PeriodicalIF\":3.7000,\"publicationDate\":\"2024-03-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S1110982324000218/pdfft?md5=85d30684c98bfb92e8845e2acca9c06c&pid=1-s2.0-S1110982324000218-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Egyptian Journal of Remote Sensing and Space Sciences\",\"FirstCategoryId\":\"89\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1110982324000218\",\"RegionNum\":3,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENVIRONMENTAL SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Egyptian Journal of Remote Sensing and Space Sciences","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1110982324000218","RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
摘要
针对红外图像物体纹理特征不明显、分辨率有限等问题,提出了一种用于无人机红外物体识别的轻量级多尺度特征融合方法,以提高搭载智能设备的无人机探测红外物体的性能。通过改变 YOLOX 方法的无锚帧策略,提出了一种用于无人机红外图像物体识别的轻量级多特征融合网络(MFFNet)。首先,利用 ShuffleNetv2_block、空间金字塔池化等模块构建了轻量级骨干网络,在保持特征提取能力的同时减少了网络的参数数量和推理时间。其次,我们开发了一个多特征融合模块,通过融合红外物体的局部特征和整体特征来提高模型对红外物体的检测能力,因为红外物体的纹理特征很难利用,但边界信息却很明显。然后利用 SIoU 对边界框回归损失进行优化,将预测框与实际框在角度、距离、形状和 IoU(Intersection over Union)方面进行比较,从而迫使模型更快地达到最佳预测框。
MFFNet: A lightweight multi-feature fusion network for UAV infrared object detection
In light of issues such as unnoticeable texture features and limited resolution of infrared image objects, a lightweight multi-scale feature fusion method for UAV infrared object recognition is presented to enhance the performance of UAVs carrying intelligent devices to detect infrared objects. By changing the anchorless frame strategy of the YOLOX method, a lightweight Multi-Feature Fusion Network (MFFNet) for UAV IR image object recognition is proposed. First, a lightweight backbone network is built using ShuffleNetv2_block, spatial pyramid pooling, and other modules to reduce the network's number of parameters and inference time while maintaining its capacity to extract features. Second, we develop a multi-feature fusion module to improve the detection capabilities of the model for IR objects by fusing the local features and the overall characteristics of IR objects since the texture features of IR objects are challenging to employ, but the boundary information is evident. The boundary frame regression loss is then optimized using SIoU by comparing the predicted frame to the actual frame in terms of angle, distance, shape, and IoU (Intersection over Union), which forces the model to reach the optimum predicted box more quickly.
期刊介绍:
The Egyptian Journal of Remote Sensing and Space Sciences (EJRS) encompasses a comprehensive range of topics within Remote Sensing, Geographic Information Systems (GIS), planetary geology, and space technology development, including theories, applications, and modeling. EJRS aims to disseminate high-quality, peer-reviewed research focusing on the advancement of remote sensing and GIS technologies and their practical applications for effective planning, sustainable development, and environmental resource conservation. The journal particularly welcomes innovative papers with broad scientific appeal.