Infrared Small Target Detection in Satellite Videos: A New Dataset and a Novel Recurrent Feature Refinement Framework

IF 8.6 1区 地球科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC IEEE Transactions on Geoscience and Remote Sensing Pub Date : 2025-02-14 DOI:10.1109/TGRS.2025.3542368
Xinyi Ying;Li Liu;Zaiping Lin;Yangsi Shi;Yingqian Wang;Ruojing Li;Xu Cao;Boyang Li;Shilin Zhou;Wei An
{"title":"Infrared Small Target Detection in Satellite Videos: A New Dataset and a Novel Recurrent Feature Refinement Framework","authors":"Xinyi Ying;Li Liu;Zaiping Lin;Yangsi Shi;Yingqian Wang;Ruojing Li;Xu Cao;Boyang Li;Shilin Zhou;Wei An","doi":"10.1109/TGRS.2025.3542368","DOIUrl":null,"url":null,"abstract":"Multiframe infrared small target (MIRST) detection in satellite videos has been a long-standing, fundamental yet challenging task for decades, and the challenges can be summarized as follows. First, the extremely small target size, highly complex clutter & noise and various satellite motions result in limited feature representation, high false alarms and difficult motion analyses. In addition, existing methods are primarily designed for static or slightly adjusted perspectives captured by short-distance platforms, which cannot generalize well to complex background motion in satellite videos. Second, the lack of a large-scale publicly available MIRST dataset in satellite videos greatly hinders the algorithm development. To address the aforementioned challenges, in this article, we first build a large-scale dataset for MIRST detection in satellite videos (namely IRSatVideo-LEO), and then develop a recurrent feature refinement (RFR) framework as the baseline method for satellite motion estimation and compensation. Specifically, IRSatVideo-LEO is a semi-simulated dataset with synthesized satellite motion, target appearance, trajectory, and intensity, which can provide a standard toolbox for satellite video generation and a reliable evaluation platform to facilitate algorithm development. For the baseline method, RFR is proposed to be equipped with existing powerful CNN-based methods for long-term temporal dependency exploitation and integrated motion compensation and MIRST detection. Specifically, a pyramid deformable alignment (PDA) module is proposed to achieve effective feature alignment, and a temporal-spatial–frequent modulation (TSFM) module is proposed to achieve efficient feature aggregation and enhancement. Extensive experiments have been conducted to demonstrate the effectiveness and superiority of our scheme. The comparative results show that ResUNet equipped with RFR outperforms the state-of-the-art MIRST detection methods. The dataset and code are available at <uri>https://github.com/XinyiYing/RFR</uri>.","PeriodicalId":13213,"journal":{"name":"IEEE Transactions on Geoscience and Remote Sensing","volume":"63 ","pages":"1-18"},"PeriodicalIF":8.6000,"publicationDate":"2025-02-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Geoscience and Remote Sensing","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10887334/","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

Abstract

Multiframe infrared small target (MIRST) detection in satellite videos has been a long-standing, fundamental yet challenging task for decades, and the challenges can be summarized as follows. First, the extremely small target size, highly complex clutter & noise and various satellite motions result in limited feature representation, high false alarms and difficult motion analyses. In addition, existing methods are primarily designed for static or slightly adjusted perspectives captured by short-distance platforms, which cannot generalize well to complex background motion in satellite videos. Second, the lack of a large-scale publicly available MIRST dataset in satellite videos greatly hinders the algorithm development. To address the aforementioned challenges, in this article, we first build a large-scale dataset for MIRST detection in satellite videos (namely IRSatVideo-LEO), and then develop a recurrent feature refinement (RFR) framework as the baseline method for satellite motion estimation and compensation. Specifically, IRSatVideo-LEO is a semi-simulated dataset with synthesized satellite motion, target appearance, trajectory, and intensity, which can provide a standard toolbox for satellite video generation and a reliable evaluation platform to facilitate algorithm development. For the baseline method, RFR is proposed to be equipped with existing powerful CNN-based methods for long-term temporal dependency exploitation and integrated motion compensation and MIRST detection. Specifically, a pyramid deformable alignment (PDA) module is proposed to achieve effective feature alignment, and a temporal-spatial–frequent modulation (TSFM) module is proposed to achieve efficient feature aggregation and enhancement. Extensive experiments have been conducted to demonstrate the effectiveness and superiority of our scheme. The comparative results show that ResUNet equipped with RFR outperforms the state-of-the-art MIRST detection methods. The dataset and code are available at https://github.com/XinyiYing/RFR.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
卫星视频中的红外小目标检测:一个新的数据集和一种新的递归特征细化框架
几十年来,卫星视频中的多帧红外小目标(MIRST)检测一直是一项长期、基础但具有挑战性的任务,挑战可以概括为以下几点。首先,极小的目标尺寸,高度复杂的杂波和噪声以及各种卫星运动导致特征表示有限,高虚警和运动分析困难。此外,现有方法主要针对短距离平台捕获的静态或略微调整的视角,不能很好地推广到卫星视频中复杂的背景运动。其次,卫星视频中缺乏大规模公开可用的MIRST数据集,极大地阻碍了算法的发展。为了解决上述挑战,在本文中,我们首先构建了一个用于卫星视频中MIRST检测的大规模数据集(即IRSatVideo-LEO),然后开发了一个循环特征细化(RFR)框架作为卫星运动估计和补偿的基线方法。具体来说,IRSatVideo-LEO是一个综合了卫星运动、目标外观、轨迹和强度的半模拟数据集,可以为卫星视频生成提供一个标准工具箱,并为算法开发提供一个可靠的评估平台。对于基线方法,提出将RFR与现有的强大的基于cnn的方法相结合,进行长期时间依赖性开发,并集成运动补偿和MIRST检测。具体来说,提出了金字塔形变对齐(PDA)模块来实现有效的特征对齐,提出了时空频率调制(TSFM)模块来实现有效的特征聚合和增强。大量的实验证明了该方案的有效性和优越性。对比结果表明,配备RFR的ResUNet优于最先进的MIRST检测方法。数据集和代码可在https://github.com/XinyiYing/RFR上获得。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
IEEE Transactions on Geoscience and Remote Sensing
IEEE Transactions on Geoscience and Remote Sensing 工程技术-地球化学与地球物理
CiteScore
11.50
自引率
28.00%
发文量
1912
审稿时长
4.0 months
期刊介绍: IEEE Transactions on Geoscience and Remote Sensing (TGRS) is a monthly publication that focuses on the theory, concepts, and techniques of science and engineering as applied to sensing the land, oceans, atmosphere, and space; and the processing, interpretation, and dissemination of this information.
期刊最新文献
Triplet Contrastive Learning for Multi-Object Tracking in Satellite Videos Satellite-GS: Enhanced 2D Gaussian Splatting for Robust Satellite Reconstruction A Machine Learning Approach for Chlorophyll-a Estimation in Coastal Waters from Top-of-Atmosphere VIIRS Satellite Data A Spatio-Temporal Hierarchical Diffusion Framework for Training-Free Perceptual Remote Sensing Image Compression Small Target Detection in UAV Remote Sensing Images Based on Adaptive Cross-modal Feature Fusion and Attention Mechanism
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1