Deep 3D-2D Convolutional Neural Networks Combined With Mobinenetv2 For Hyperspectral Image Classification

Proceedings of the 2023 5th International Conference on Image Processing and Machine Vision Pub Date : 2023-01-13 DOI:10.1145/3582177.3582185

DouglasOmwenga Nyabuga

{"title":"Deep 3D-2D Convolutional Neural Networks Combined With Mobinenetv2 For Hyperspectral Image Classification","authors":"DouglasOmwenga Nyabuga","doi":"10.1145/3582177.3582185","DOIUrl":null,"url":null,"abstract":"Convolutional neural networks (CNNs), one of the most successful models for visual identification, have shown excellent performance outcomes in different visual recognition challenges, attracting much interest in recent years. However, deploying CNN models to hyperspectral imaging (HSI) data continues to be a challenge due to the strongly correlated bands and insufficient training sets. Furthermore, HSI categorization is hugely dependent on spectral-spatial information. Hence, a 2D-CNN is a possible technique to analyze these features. However, because of the volume and spectral dimensions, a 3D CNN can be an option but is more computationally expensive. Furthermore, the models underperform in areas with comparable spectrums due to their inability to extract feature maps of high quality. This work, therefore, proposes a 3D/2D CNN combined with the MobineNetV2 model that uses both spectral-spatial feature maps to achieve competitive performance. First, the HSI data cube is split into small overlapping 3-D batches using the principal component analysis (PCA) to get the desired dimensions. These batches are then processed to build 3-D feature maps over many contiguous bands using a 3D convolutional kernel function, which retains the spectral properties. The performance of our model is validated using three benchmark HSI data sets (i.e., Pavia University, Indian Pines, and Salinas Scene). The results are then compared with different state-of-the-art (SOTA) methods.","PeriodicalId":170327,"journal":{"name":"Proceedings of the 2023 5th International Conference on Image Processing and Machine Vision","volume":"27 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2023 5th International Conference on Image Processing and Machine Vision","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3582177.3582185","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Convolutional neural networks (CNNs), one of the most successful models for visual identification, have shown excellent performance outcomes in different visual recognition challenges, attracting much interest in recent years. However, deploying CNN models to hyperspectral imaging (HSI) data continues to be a challenge due to the strongly correlated bands and insufficient training sets. Furthermore, HSI categorization is hugely dependent on spectral-spatial information. Hence, a 2D-CNN is a possible technique to analyze these features. However, because of the volume and spectral dimensions, a 3D CNN can be an option but is more computationally expensive. Furthermore, the models underperform in areas with comparable spectrums due to their inability to extract feature maps of high quality. This work, therefore, proposes a 3D/2D CNN combined with the MobineNetV2 model that uses both spectral-spatial feature maps to achieve competitive performance. First, the HSI data cube is split into small overlapping 3-D batches using the principal component analysis (PCA) to get the desired dimensions. These batches are then processed to build 3-D feature maps over many contiguous bands using a 3D convolutional kernel function, which retains the spectral properties. The performance of our model is validated using three benchmark HSI data sets (i.e., Pavia University, Indian Pines, and Salinas Scene). The results are then compared with different state-of-the-art (SOTA) methods.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

结合Mobinenetv2的深度3D-2D卷积神经网络用于高光谱图像分类

卷积神经网络(Convolutional neural networks, cnn)是视觉识别领域最成功的模型之一，近年来在不同的视觉识别挑战中表现出优异的性能，引起了人们的广泛关注。然而，将CNN模型部署到高光谱成像(HSI)数据中仍然是一个挑战，因为波段相关性强，训练集不足。此外，恒指分类很大程度上依赖于光谱空间信息。因此，2D-CNN是分析这些特征的一种可能的技术。然而，由于体积和光谱尺寸，3D CNN可以作为一种选择，但计算成本更高。此外，由于无法提取高质量的特征图，这些模型在具有可比光谱的区域中表现不佳。因此，这项工作提出了一个3D/2D CNN与MobineNetV2模型相结合，该模型使用光谱空间特征映射来获得具有竞争力的性能。首先，使用主成分分析(PCA)将HSI数据立方体分割成小的重叠的3-D批次，以获得所需的维度。然后使用保留光谱特性的3D卷积核函数对这些批次进行处理，在许多连续的波段上构建3D特征图。使用三个基准HSI数据集(即Pavia University, Indian Pines和Salinas Scene)验证了我们模型的性能。然后将结果与不同的最先进(SOTA)方法进行比较。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Proceedings of the 2023 5th International Conference on Image Processing and Machine Vision

自引率

0.00%

发文量