Multibranch Mutual-Guiding Learning for Infrared Small Target Detection

IF 8.6 1区 地球科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC IEEE Transactions on Geoscience and Remote Sensing Pub Date : 2025-01-07 DOI:10.1109/TGRS.2025.3526754
Qiang Li;Wei Zhang;Wanxuan Lu;Qi Wang
{"title":"Multibranch Mutual-Guiding Learning for Infrared Small Target Detection","authors":"Qiang Li;Wei Zhang;Wanxuan Lu;Qi Wang","doi":"10.1109/TGRS.2025.3526754","DOIUrl":null,"url":null,"abstract":"At present, many infrared target detection approaches focus on designing modules that address the two key characteristics of targets: their weak signals and small size. However, these approaches often fail to fully leverage guided learning for weak and small target content, resulting in suboptimal detection performance, particularly in terms of shape preservation and target positioning. To tackle this challenge, this article proposes a multibranch mutual-guiding learning network (MMLNet) that enhances the accuracy of infrared target detection, even in the absence of clear morphological and textural features in images. The method consists of three branches: edge, positioning, and detection, each of which is designed with a specialized module from a unique perspective. In the detection branch, we introduce a multidimensional lossless encoder optimized through a downsampling strategy and multilevel feature fusion to mitigate feature loss in small targets. In the positioning branch, a target positioning strategy is proposed to explicitly identify candidate targets from the image by means of a learnable multikernel pattern. In the edge branch, a simple architecture is adopted to enhance the ability of the model to preserve the target shape. To effectively utilize the knowledge of different branches, a mutual-guiding fusion module is developed to adjust information within and between branches. The manner adaptively utilizes the specific knowledge from each input branch. The experimental results demonstrate that the proposed method achieves comparable performance, and the visualization results show the advantages of our method in shape preservation and positioning of the targets. Our code is publicly available at <uri>https://github.com/qianngli/MMLNet</uri>.","PeriodicalId":13213,"journal":{"name":"IEEE Transactions on Geoscience and Remote Sensing","volume":"63 ","pages":"1-10"},"PeriodicalIF":8.6000,"publicationDate":"2025-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Geoscience and Remote Sensing","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10830282/","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

Abstract

At present, many infrared target detection approaches focus on designing modules that address the two key characteristics of targets: their weak signals and small size. However, these approaches often fail to fully leverage guided learning for weak and small target content, resulting in suboptimal detection performance, particularly in terms of shape preservation and target positioning. To tackle this challenge, this article proposes a multibranch mutual-guiding learning network (MMLNet) that enhances the accuracy of infrared target detection, even in the absence of clear morphological and textural features in images. The method consists of three branches: edge, positioning, and detection, each of which is designed with a specialized module from a unique perspective. In the detection branch, we introduce a multidimensional lossless encoder optimized through a downsampling strategy and multilevel feature fusion to mitigate feature loss in small targets. In the positioning branch, a target positioning strategy is proposed to explicitly identify candidate targets from the image by means of a learnable multikernel pattern. In the edge branch, a simple architecture is adopted to enhance the ability of the model to preserve the target shape. To effectively utilize the knowledge of different branches, a mutual-guiding fusion module is developed to adjust information within and between branches. The manner adaptively utilizes the specific knowledge from each input branch. The experimental results demonstrate that the proposed method achieves comparable performance, and the visualization results show the advantages of our method in shape preservation and positioning of the targets. Our code is publicly available at https://github.com/qianngli/MMLNet.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
红外小目标检测的多分支互导学习
目前,许多红外目标检测方法都侧重于设计模块,以解决目标的两个关键特征:信号弱和体积小。然而,这些方法往往不能充分利用引导学习对弱和小的目标内容,导致不理想的检测性能,特别是在形状保存和目标定位方面。为了解决这一挑战,本文提出了一种多分支相互引导学习网络(MMLNet),即使在图像中没有清晰的形态和纹理特征的情况下,也能提高红外目标检测的准确性。该方法包括边缘、定位和检测三个分支,每个分支都从独特的角度设计了专门的模块。在检测分支中,我们引入了一种多维无损编码器,该编码器通过降采样策略和多级特征融合进行优化,以减轻小目标的特征损失。在定位分支中,提出了一种目标定位策略,利用可学习的多核模式从图像中明确识别候选目标。在边缘分支中,采用了一种简单的结构,增强了模型保持目标形状的能力。为了有效地利用不同分支机构的知识,开发了一个相互引导的融合模块来调整分支机构内部和分支机构之间的信息。该方法自适应地利用来自每个输入分支的特定知识。实验结果表明,所提方法取得了相当的性能,可视化结果显示了所提方法在目标形状保持和定位方面的优势。我们的代码可以在https://github.com/qianngli/MMLNet上公开获得。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
IEEE Transactions on Geoscience and Remote Sensing
IEEE Transactions on Geoscience and Remote Sensing 工程技术-地球化学与地球物理
CiteScore
11.50
自引率
28.00%
发文量
1912
审稿时长
4.0 months
期刊介绍: IEEE Transactions on Geoscience and Remote Sensing (TGRS) is a monthly publication that focuses on the theory, concepts, and techniques of science and engineering as applied to sensing the land, oceans, atmosphere, and space; and the processing, interpretation, and dissemination of this information.
期刊最新文献
An Improved Mahalanobis Distance Method for Smoke Detection Based on Fine-Grained Background Identification An Automatic Layer Extraction Algorithm for Ice Sounding Radar Data Based on Curvelet Transform (CT) and Minimum Spanning Tree (MST) Estimations of wind direction and CO 2 emissions from power plants with DaQi-1 Satellite Adaptive Modeling for Air Quality: A Continual Learning Framework for PM 2.5 Estimation in Vietnam LKA-GFNet: Language Knowledge-Augmented Graph Fusion for Tri-Source Heterogeneous Remote Sensing Data Classification
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1