{"title":"CenterFormer: A Center Spatial–Spectral Attention Transformer Network for Hyperspectral Image Classification","authors":"Chenjing Jia;Xiaohua Zhang;Hongyun Meng;Shuxiang Xia;Licheng Jiao","doi":"10.1109/JSTARS.2025.3529985","DOIUrl":null,"url":null,"abstract":"Feature extraction is crucial for hyperspectral image classification (HSIC), and transformer-based methods have demonstrated significant potential in this field due to their exceptional global modeling capabilities. However, existing transformer-based methods use patches of fixed size and shape as input, which, while leveraging information from neighboring similar pixels to some extent, may also introduce heterogeneous pixels from nonhomogeneous regions, leading to a decrease in classification accuracy. In addition, since the goal of HSIC is to classify the center pixel, the attention calculation in these methods may focus on pixels unrelated to the center pixel, further impacting the accuracy of the classification. To address these issues, a novel transformer framework called CenterFormer is proposed, which enhances the center pixel to fully leverage the rich spatial and spectral information. Specifically, a multigranularity feature extractor is designed to effectively capture the fine-grained and coarse-grained spatial–spectral features of hyperspectral images, mitigating performance degradation caused by heterogeneous pixels. Moreover, a transformer encoder with center spatial–spectral attention is introduced, which enhances the center pixel and models global spatial–spectral information to improve classification performance. Finally, an adaptive classifier balances the classification results from different granularity branches, further enhancing the performance of CenterFormer. Comparative experiments conducted on four challenging datasets validate the model's effectiveness. Experimental results show that our model achieves an improvement in overall accuracy of up to 2.83<inline-formula><tex-math>$\\% $</tex-math></inline-formula> compared to the current state-of-the-art methods.","PeriodicalId":13116,"journal":{"name":"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing","volume":"18 ","pages":"5523-5539"},"PeriodicalIF":4.7000,"publicationDate":"2025-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10841983","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10841983/","RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
Feature extraction is crucial for hyperspectral image classification (HSIC), and transformer-based methods have demonstrated significant potential in this field due to their exceptional global modeling capabilities. However, existing transformer-based methods use patches of fixed size and shape as input, which, while leveraging information from neighboring similar pixels to some extent, may also introduce heterogeneous pixels from nonhomogeneous regions, leading to a decrease in classification accuracy. In addition, since the goal of HSIC is to classify the center pixel, the attention calculation in these methods may focus on pixels unrelated to the center pixel, further impacting the accuracy of the classification. To address these issues, a novel transformer framework called CenterFormer is proposed, which enhances the center pixel to fully leverage the rich spatial and spectral information. Specifically, a multigranularity feature extractor is designed to effectively capture the fine-grained and coarse-grained spatial–spectral features of hyperspectral images, mitigating performance degradation caused by heterogeneous pixels. Moreover, a transformer encoder with center spatial–spectral attention is introduced, which enhances the center pixel and models global spatial–spectral information to improve classification performance. Finally, an adaptive classifier balances the classification results from different granularity branches, further enhancing the performance of CenterFormer. Comparative experiments conducted on four challenging datasets validate the model's effectiveness. Experimental results show that our model achieves an improvement in overall accuracy of up to 2.83$\% $ compared to the current state-of-the-art methods.
期刊介绍:
The IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing addresses the growing field of applications in Earth observations and remote sensing, and also provides a venue for the rapidly expanding special issues that are being sponsored by the IEEE Geosciences and Remote Sensing Society. The journal draws upon the experience of the highly successful “IEEE Transactions on Geoscience and Remote Sensing” and provide a complementary medium for the wide range of topics in applied earth observations. The ‘Applications’ areas encompasses the societal benefit areas of the Global Earth Observations Systems of Systems (GEOSS) program. Through deliberations over two years, ministers from 50 countries agreed to identify nine areas where Earth observation could positively impact the quality of life and health of their respective countries. Some of these are areas not traditionally addressed in the IEEE context. These include biodiversity, health and climate. Yet it is the skill sets of IEEE members, in areas such as observations, communications, computers, signal processing, standards and ocean engineering, that form the technical underpinnings of GEOSS. Thus, the Journal attracts a broad range of interests that serves both present members in new ways and expands the IEEE visibility into new areas.