Guanglian Zhang;Lan Zhang;Zhanxu Zhang;Jiangwei Deng;Lifeng Bian;Chen Yang
{"title":"DECT:用于多源遥感数据分类的扩散增强型 CNN 变换器","authors":"Guanglian Zhang;Lan Zhang;Zhanxu Zhang;Jiangwei Deng;Lifeng Bian;Chen Yang","doi":"10.1109/JSTARS.2024.3479212","DOIUrl":null,"url":null,"abstract":"Methods for joint classification of hyperspectral images (HSIs) with high dimensionality and spectral correlation and other sensor data (e.g., optical, infrared, radar, etc.) are important directions in the field of remote sensing. To better learn the feature representation of diffusion features (HSI), the unsupervised global modeling property of diffusion is utilized to mine the potential features of HSI to obtain diffusion features as input data. In addition, to fuse HSI features, HSI diffusion features, and other data features, a three-input diffusion-enhanced CNN–transformer (DECT) network based on CNN and transformer is proposed for feature extraction and fusion. First, the primary features are extracted by hierarchical CNN after premodal fusion. Second, considering the high dimensionality of HSI, spectral pooling attention interaction is designed for feature extraction and aggregation of information from different attentions. Finally, the inverted bottleneck convolutional transformer is designed to aggregate multisource information to enhance feature reuse and aggregate local and contextual information. It is shown on three publicly available datasets that DECT outperforms current state-of-the-art methods.","PeriodicalId":13116,"journal":{"name":"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing","volume":"17 ","pages":"19288-19301"},"PeriodicalIF":4.7000,"publicationDate":"2024-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10716525","citationCount":"0","resultStr":"{\"title\":\"DECT: Diffusion-Enhanced CNN–Transformer for Multisource Remote Sensing Data Classification\",\"authors\":\"Guanglian Zhang;Lan Zhang;Zhanxu Zhang;Jiangwei Deng;Lifeng Bian;Chen Yang\",\"doi\":\"10.1109/JSTARS.2024.3479212\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Methods for joint classification of hyperspectral images (HSIs) with high dimensionality and spectral correlation and other sensor data (e.g., optical, infrared, radar, etc.) are important directions in the field of remote sensing. To better learn the feature representation of diffusion features (HSI), the unsupervised global modeling property of diffusion is utilized to mine the potential features of HSI to obtain diffusion features as input data. In addition, to fuse HSI features, HSI diffusion features, and other data features, a three-input diffusion-enhanced CNN–transformer (DECT) network based on CNN and transformer is proposed for feature extraction and fusion. First, the primary features are extracted by hierarchical CNN after premodal fusion. Second, considering the high dimensionality of HSI, spectral pooling attention interaction is designed for feature extraction and aggregation of information from different attentions. Finally, the inverted bottleneck convolutional transformer is designed to aggregate multisource information to enhance feature reuse and aggregate local and contextual information. It is shown on three publicly available datasets that DECT outperforms current state-of-the-art methods.\",\"PeriodicalId\":13116,\"journal\":{\"name\":\"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing\",\"volume\":\"17 \",\"pages\":\"19288-19301\"},\"PeriodicalIF\":4.7000,\"publicationDate\":\"2024-10-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10716525\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10716525/\",\"RegionNum\":2,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10716525/","RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
DECT: Diffusion-Enhanced CNN–Transformer for Multisource Remote Sensing Data Classification
Methods for joint classification of hyperspectral images (HSIs) with high dimensionality and spectral correlation and other sensor data (e.g., optical, infrared, radar, etc.) are important directions in the field of remote sensing. To better learn the feature representation of diffusion features (HSI), the unsupervised global modeling property of diffusion is utilized to mine the potential features of HSI to obtain diffusion features as input data. In addition, to fuse HSI features, HSI diffusion features, and other data features, a three-input diffusion-enhanced CNN–transformer (DECT) network based on CNN and transformer is proposed for feature extraction and fusion. First, the primary features are extracted by hierarchical CNN after premodal fusion. Second, considering the high dimensionality of HSI, spectral pooling attention interaction is designed for feature extraction and aggregation of information from different attentions. Finally, the inverted bottleneck convolutional transformer is designed to aggregate multisource information to enhance feature reuse and aggregate local and contextual information. It is shown on three publicly available datasets that DECT outperforms current state-of-the-art methods.
期刊介绍:
The IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing addresses the growing field of applications in Earth observations and remote sensing, and also provides a venue for the rapidly expanding special issues that are being sponsored by the IEEE Geosciences and Remote Sensing Society. The journal draws upon the experience of the highly successful “IEEE Transactions on Geoscience and Remote Sensing” and provide a complementary medium for the wide range of topics in applied earth observations. The ‘Applications’ areas encompasses the societal benefit areas of the Global Earth Observations Systems of Systems (GEOSS) program. Through deliberations over two years, ministers from 50 countries agreed to identify nine areas where Earth observation could positively impact the quality of life and health of their respective countries. Some of these are areas not traditionally addressed in the IEEE context. These include biodiversity, health and climate. Yet it is the skill sets of IEEE members, in areas such as observations, communications, computers, signal processing, standards and ocean engineering, that form the technical underpinnings of GEOSS. Thus, the Journal attracts a broad range of interests that serves both present members in new ways and expands the IEEE visibility into new areas.