Zhonghao Chen , Hongmin Gao , Zhengtao Lu , Yiyan Zhang , Yao Ding , Xin Li , Bing Zhang
{"title":"MDA-HTD: Mask-driven dual autoencoders meet hyperspectral target detection","authors":"Zhonghao Chen , Hongmin Gao , Zhengtao Lu , Yiyan Zhang , Yao Ding , Xin Li , Bing Zhang","doi":"10.1016/j.ipm.2025.104106","DOIUrl":null,"url":null,"abstract":"<div><div>Existing background learning-based hyperspectral (HS) target detection (HTD) methods employ an adversarial autoencoder (AAE) to model the background distribution of HS images, which may lead to pattern collapse. To tackle this challenge, we propose the mask-driven dual autoencoder for HTD (MDA-HTD), an adversarial learning-free spectral learning framework capable of effectively and robustly reconstructing background pixels. Our approach involves three steps: background sample generation, background distribution learning, and HS image reconstruction for target detection. First, we design a locally spatially constrained background spectral selection (LSCBS<span><math><msup><mrow></mrow><mrow><mn>2</mn></mrow></msup></math></span>) strategy to generate high-confidence background samples and produce coarse target detection results. Next, we introduce the MDA network, consisting of an autoencoder (AE) and a spectral masking autoencoder (SMAE). Specifically, the AE is driven to learn the background spectral features more robustly by constraining the consistency between the AE and SMAE latent features. To mask spectral bands without losing spectral trend information, we propose neighborhood spectral averaging masking strategy. In the detection stage, the trained AE reconstructs all pixels in the HS image, and detection results are obtained by comparing the reconstructed and original images. These results are fused with coarse detection results using a quadratic exponential nonlinear filter for the final detection outcome. Experimental evaluations on four publicly recognized benchmarks and one synthetic data demonstrate that the proposed MDA-HTD achieves state-of-the-art detection performance, surpassing previous AAE-based methods by an average of 0.79%, 2.1%, 0.32%, 3.1%, and 3.8% in AUC value</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"62 4","pages":"Article 104106"},"PeriodicalIF":7.4000,"publicationDate":"2025-03-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Processing & Management","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0306457325000482","RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Existing background learning-based hyperspectral (HS) target detection (HTD) methods employ an adversarial autoencoder (AAE) to model the background distribution of HS images, which may lead to pattern collapse. To tackle this challenge, we propose the mask-driven dual autoencoder for HTD (MDA-HTD), an adversarial learning-free spectral learning framework capable of effectively and robustly reconstructing background pixels. Our approach involves three steps: background sample generation, background distribution learning, and HS image reconstruction for target detection. First, we design a locally spatially constrained background spectral selection (LSCBS) strategy to generate high-confidence background samples and produce coarse target detection results. Next, we introduce the MDA network, consisting of an autoencoder (AE) and a spectral masking autoencoder (SMAE). Specifically, the AE is driven to learn the background spectral features more robustly by constraining the consistency between the AE and SMAE latent features. To mask spectral bands without losing spectral trend information, we propose neighborhood spectral averaging masking strategy. In the detection stage, the trained AE reconstructs all pixels in the HS image, and detection results are obtained by comparing the reconstructed and original images. These results are fused with coarse detection results using a quadratic exponential nonlinear filter for the final detection outcome. Experimental evaluations on four publicly recognized benchmarks and one synthetic data demonstrate that the proposed MDA-HTD achieves state-of-the-art detection performance, surpassing previous AAE-based methods by an average of 0.79%, 2.1%, 0.32%, 3.1%, and 3.8% in AUC value
期刊介绍:
Information Processing and Management is dedicated to publishing cutting-edge original research at the convergence of computing and information science. Our scope encompasses theory, methods, and applications across various domains, including advertising, business, health, information science, information technology marketing, and social computing.
We aim to cater to the interests of both primary researchers and practitioners by offering an effective platform for the timely dissemination of advanced and topical issues in this interdisciplinary field. The journal places particular emphasis on original research articles, research survey articles, research method articles, and articles addressing critical applications of research. Join us in advancing knowledge and innovation at the intersection of computing and information science.