{"title":"基于注意力和部分学习卷积的红外海事小目标检测网络","authors":"Enzhong Zhao, Lili Dong, Xinlei Chu, Mengge Wang","doi":"10.1016/j.infrared.2025.105748","DOIUrl":null,"url":null,"abstract":"<div><div>Infrared maritime targets (IMTs) captured by infrared sensors lack distinct detailed features and have a low signal-to-noise ratio, which renders the detection of such targets more challenging than typical target detection tasks. To improve detection accuracy, this manuscript exploits the characteristics of IMTs and proposes a detection network based on attention and partial learning convolution (APLCnet). Firstly, a projected-global self-attention module (PGSAM) is designed based on the projected saliency of IMTs. By encoding long-range features, PGSAM effectively captures the contextual relationships between targets and their backgrounds. Secondly, an attention module based on the local-wide convolutional block attention module (LWCBAM) is devised. By introducing local attention and attention over a larger receptive field, this module helps the network highlight important spatial and channel features after feature fusion, preventing critical weak targets from being overwhelmed during the fusion process. Additionally, inspired by traditional edge detection operators, a partial learning convolutional module (PLCM) is designed and applied to shallow features, which enhances the focus on small IMTs, improving the saliency of weak and small IMTs. Experimental results demonstrate that the proposed network effectively improves the accuracy of IMT detection while maintaining higher robustness.</div></div>","PeriodicalId":13549,"journal":{"name":"Infrared Physics & Technology","volume":"146 ","pages":"Article 105748"},"PeriodicalIF":3.1000,"publicationDate":"2025-02-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Infrared maritime small target detection network based on attention and partial learning convolution\",\"authors\":\"Enzhong Zhao, Lili Dong, Xinlei Chu, Mengge Wang\",\"doi\":\"10.1016/j.infrared.2025.105748\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Infrared maritime targets (IMTs) captured by infrared sensors lack distinct detailed features and have a low signal-to-noise ratio, which renders the detection of such targets more challenging than typical target detection tasks. To improve detection accuracy, this manuscript exploits the characteristics of IMTs and proposes a detection network based on attention and partial learning convolution (APLCnet). Firstly, a projected-global self-attention module (PGSAM) is designed based on the projected saliency of IMTs. By encoding long-range features, PGSAM effectively captures the contextual relationships between targets and their backgrounds. Secondly, an attention module based on the local-wide convolutional block attention module (LWCBAM) is devised. By introducing local attention and attention over a larger receptive field, this module helps the network highlight important spatial and channel features after feature fusion, preventing critical weak targets from being overwhelmed during the fusion process. Additionally, inspired by traditional edge detection operators, a partial learning convolutional module (PLCM) is designed and applied to shallow features, which enhances the focus on small IMTs, improving the saliency of weak and small IMTs. Experimental results demonstrate that the proposed network effectively improves the accuracy of IMT detection while maintaining higher robustness.</div></div>\",\"PeriodicalId\":13549,\"journal\":{\"name\":\"Infrared Physics & Technology\",\"volume\":\"146 \",\"pages\":\"Article 105748\"},\"PeriodicalIF\":3.1000,\"publicationDate\":\"2025-02-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Infrared Physics & Technology\",\"FirstCategoryId\":\"101\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1350449525000416\",\"RegionNum\":3,\"RegionCategory\":\"物理与天体物理\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"INSTRUMENTS & INSTRUMENTATION\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Infrared Physics & Technology","FirstCategoryId":"101","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1350449525000416","RegionNum":3,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"INSTRUMENTS & INSTRUMENTATION","Score":null,"Total":0}
Infrared maritime small target detection network based on attention and partial learning convolution
Infrared maritime targets (IMTs) captured by infrared sensors lack distinct detailed features and have a low signal-to-noise ratio, which renders the detection of such targets more challenging than typical target detection tasks. To improve detection accuracy, this manuscript exploits the characteristics of IMTs and proposes a detection network based on attention and partial learning convolution (APLCnet). Firstly, a projected-global self-attention module (PGSAM) is designed based on the projected saliency of IMTs. By encoding long-range features, PGSAM effectively captures the contextual relationships between targets and their backgrounds. Secondly, an attention module based on the local-wide convolutional block attention module (LWCBAM) is devised. By introducing local attention and attention over a larger receptive field, this module helps the network highlight important spatial and channel features after feature fusion, preventing critical weak targets from being overwhelmed during the fusion process. Additionally, inspired by traditional edge detection operators, a partial learning convolutional module (PLCM) is designed and applied to shallow features, which enhances the focus on small IMTs, improving the saliency of weak and small IMTs. Experimental results demonstrate that the proposed network effectively improves the accuracy of IMT detection while maintaining higher robustness.
期刊介绍:
The Journal covers the entire field of infrared physics and technology: theory, experiment, application, devices and instrumentation. Infrared'' is defined as covering the near, mid and far infrared (terahertz) regions from 0.75um (750nm) to 1mm (300GHz.) Submissions in the 300GHz to 100GHz region may be accepted at the editors discretion if their content is relevant to shorter wavelengths. Submissions must be primarily concerned with and directly relevant to this spectral region.
Its core topics can be summarized as the generation, propagation and detection, of infrared radiation; the associated optics, materials and devices; and its use in all fields of science, industry, engineering and medicine.
Infrared techniques occur in many different fields, notably spectroscopy and interferometry; material characterization and processing; atmospheric physics, astronomy and space research. Scientific aspects include lasers, quantum optics, quantum electronics, image processing and semiconductor physics. Some important applications are medical diagnostics and treatment, industrial inspection and environmental monitoring.