{"title":"A Pixel Expansion-Based Improvement in Dense Nesting Structures for Infrared Small Target Detection","authors":"Zhichao Zhao;Hao Wang;Haiyan Li;Jundon Yang;Pengfei Yu","doi":"10.1109/LGRS.2025.3547899","DOIUrl":null,"url":null,"abstract":"In infrared detection, the identification of weak targets is often hindered by low pixel count and resolution, leading to a scarcity of semantic details for small targets and significant blurring of boundary information. To solve these problems, we propose a novel approach for the enhanced cross-stage feature matching network (ECFNet) to learn features beyond a single-scale data source by introducing pixel expansion branching in this letter. First, we introduce the feature attention enhancement module (FAEM), which uses rapid edge feature extraction to effectively enhance the boundary information of weak targets after pixel expansion, thereby improving the network’s fine-grained detection capability. Moreover, inspired by the Monte Carlo attention mechanism used in medical image processing, we introduce the stage randomness enhancement module (SREM) to direct the network’s focus toward small target regions rather than background noise during the learning process, allowing the network to adapt to various random situations independent of a fixed structure. Furthermore, we design a cross-feature matching module (CFMM), which effectively aggregates shallow profile information and deeper semantics at the center of the network, facilitating efficient information transfer and precise feature assignment, thereby narrowing the feature gap between the encoding and decoding stages. Our network achieves an intersection-over-union (IoU) ratio of 78.51% on the publicly available NUAA-SIRST dataset. The experimental results of NUDT-SIRDT and infrared small target detection (IRSTD)-1k can be found in <uri>https://github.com/bobo66597/ECF</uri>.","PeriodicalId":91017,"journal":{"name":"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society","volume":"22 ","pages":"1-5"},"PeriodicalIF":0.0000,"publicationDate":"2025-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10909503/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In infrared detection, the identification of weak targets is often hindered by low pixel count and resolution, leading to a scarcity of semantic details for small targets and significant blurring of boundary information. To solve these problems, we propose a novel approach for the enhanced cross-stage feature matching network (ECFNet) to learn features beyond a single-scale data source by introducing pixel expansion branching in this letter. First, we introduce the feature attention enhancement module (FAEM), which uses rapid edge feature extraction to effectively enhance the boundary information of weak targets after pixel expansion, thereby improving the network’s fine-grained detection capability. Moreover, inspired by the Monte Carlo attention mechanism used in medical image processing, we introduce the stage randomness enhancement module (SREM) to direct the network’s focus toward small target regions rather than background noise during the learning process, allowing the network to adapt to various random situations independent of a fixed structure. Furthermore, we design a cross-feature matching module (CFMM), which effectively aggregates shallow profile information and deeper semantics at the center of the network, facilitating efficient information transfer and precise feature assignment, thereby narrowing the feature gap between the encoding and decoding stages. Our network achieves an intersection-over-union (IoU) ratio of 78.51% on the publicly available NUAA-SIRST dataset. The experimental results of NUDT-SIRDT and infrared small target detection (IRSTD)-1k can be found in https://github.com/bobo66597/ECF.