{"title":"Attention based network for fusion of polarimetric and contextual features for polarimetric synthetic aperture radar image classification","authors":"Maryam Imani","doi":"10.1016/j.engappai.2024.109665","DOIUrl":null,"url":null,"abstract":"<div><div>Polarimetric synthetic aperture radar (PolSAR) images containing polarimetric, scattering and contextual features are useful radar data for ground surface classification. Appropriate feature extraction and fusion by using a small set of available labeled samples is an important and challenging task. Several transformers with self-attention mechanism have recently achieved great success for PolSAR image classification. While almost all methods just exploit the self-attention features from the PolSAR cube, the feature fusion method proposed in this work, which is called attention based scattering and contextual (ASC) network, utilizes the polarimetric self-attention beside two cross-attention blocks. The cross-attention blocks extract the polarimetric-scattering dependencies and polarimetric-contextual interactions, individually. The proposed ASC network uses three inputs: the PolSAR cube, the scattering feature maps obtained by clustering of the entropy-alpha features, and the segmentation maps obtained by a super-pixel generation algorithm. The features extracted by self- and cross-attention blocks are fused together, and the residual learning improves the feature learning. While transformers and attention-based networks usually need large training sets, the proposed ASC network shows high efficiency with relatively low number of training samples in various real and synthetic PolSAR images. For example, in the Flevoland PolSAR image containing 15 classes acquired by AIRSAR in L-band, with using 100 training samples per class (less than 1% of labeled samples), the ASC network achieves the overall accuracy of 99.51, which is statistically preferred than the self-attention-based network according to the McNemars test.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"139 ","pages":"Article 109665"},"PeriodicalIF":7.5000,"publicationDate":"2024-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197624018232","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Polarimetric synthetic aperture radar (PolSAR) images containing polarimetric, scattering and contextual features are useful radar data for ground surface classification. Appropriate feature extraction and fusion by using a small set of available labeled samples is an important and challenging task. Several transformers with self-attention mechanism have recently achieved great success for PolSAR image classification. While almost all methods just exploit the self-attention features from the PolSAR cube, the feature fusion method proposed in this work, which is called attention based scattering and contextual (ASC) network, utilizes the polarimetric self-attention beside two cross-attention blocks. The cross-attention blocks extract the polarimetric-scattering dependencies and polarimetric-contextual interactions, individually. The proposed ASC network uses three inputs: the PolSAR cube, the scattering feature maps obtained by clustering of the entropy-alpha features, and the segmentation maps obtained by a super-pixel generation algorithm. The features extracted by self- and cross-attention blocks are fused together, and the residual learning improves the feature learning. While transformers and attention-based networks usually need large training sets, the proposed ASC network shows high efficiency with relatively low number of training samples in various real and synthetic PolSAR images. For example, in the Flevoland PolSAR image containing 15 classes acquired by AIRSAR in L-band, with using 100 training samples per class (less than 1% of labeled samples), the ASC network achieves the overall accuracy of 99.51, which is statistically preferred than the self-attention-based network according to the McNemars test.
期刊介绍:
Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.