Jianxin Jia , Xiaorou Zheng , Yueming Wang , Yuwei Chen , Mika Karjalainen , Shoubin Dong , Runuo Lu , Jianyu Wang , Juha Hyyppä
{"title":"The effect of artificial intelligence evolving on hyperspectral imagery with different signal-to-noise ratio, spectral and spatial resolutions","authors":"Jianxin Jia , Xiaorou Zheng , Yueming Wang , Yuwei Chen , Mika Karjalainen , Shoubin Dong , Runuo Lu , Jianyu Wang , Juha Hyyppä","doi":"10.1016/j.rse.2024.114291","DOIUrl":null,"url":null,"abstract":"<div><p>Hyperspectral images are increasingly being used in classification and identification. Data users prefer hyperspectral imagery with high spatial resolution, finer spectral resolution, and high signal-to-noise ratio (SNR). However, tradeoffs exist in these core parameters in imagery acquired by different hyperspectral sensor systems. Data users may find it difficult to utilize all the advantages of hyperspectral imagery. How to select hyperspectral data with optimal parameter configuration has been one of the essential issues for data users, which also affects the back-end applications. With advancements in computer science, various artificial intelligence algorithms from conventional machine learning to deep learning have been utilized for hyperspectral images classification and identification. Few researchers study the mechanism between the core parameters of hyperspectral imaging spectrometers and advanced artificial intelligence algorithms, which affects the application efficiency and accuracy. In this paper, we delved into the evolution of machine learning and deep learning models applied to imagery acquired by different hyperspectral sensor systems having different SNR, spectral, and spatial resolutions. Additionally, we also considered the tradeoffs among the core parameters of hyperspectral imagers. We used two conventional machine learning models, including the classification and regression tree (CART) and random forest (RF), two deep learning methods based on convolution neural network architectures—3D convolutional neural network (3D-CNN) and hamida, and two deep learning methods based on vision transformers architectures—transformer models vision transformer (VIT) and robust vision transformer (RVT), to compare the characteristics of different algorithms. In addition, five hyperspectral datasets with different species categories and scene distributions and aggregated datasets with different spatial resolutions, spectral resolutions, and SNRs were used to validate our study. The experimental results indicate that: (1) The overall accuracy (OA) using CART, RF, 3D-CNN, and VIT models decreased with coarser spectral resolution, but almost remained unchanged using the RVT classifier. The number of class and classification species affect the results. (2) The influence of spatial resolution on classification accuracy is related to the scene complexity, target size, and classification purpose. The coarser spatial resolution can achieve higher OA than the original spatial resolution for the uniform scene distribution. For the datasets with small objects and intersections of different species, OA first increased, plateaued, and then decreased with coarser spatial resolution. (3) The SNR has an obvious impact on OA for the CART and RF classifiers, and the impact decreased for deep learning models, especially for the VIT and RVT models, which were almost unaffected by SNR. Additionally, slight variations in experimental results were observed for datasets with different scene distributions and categories. Furthermore, we conducted a detailed analysis of the role of traditional machine learning and deep learning models in the experimental outcomes. The study can provide insights into understanding the relationship between the core parameters of hyperspectral imager and the artificial intelligence algorithms used for hyperspectral classification. It serves to bridge the knowledge gap between the front-end hyperspectral imager, mid-end model, and back-end applications, and further promote the development of hyperspectral imaging technology.</p></div>","PeriodicalId":417,"journal":{"name":"Remote Sensing of Environment","volume":null,"pages":null},"PeriodicalIF":11.1000,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0034425724003092/pdfft?md5=b6b63290497a1f21b4c5543bf078af7e&pid=1-s2.0-S0034425724003092-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Remote Sensing of Environment","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0034425724003092","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Hyperspectral images are increasingly being used in classification and identification. Data users prefer hyperspectral imagery with high spatial resolution, finer spectral resolution, and high signal-to-noise ratio (SNR). However, tradeoffs exist in these core parameters in imagery acquired by different hyperspectral sensor systems. Data users may find it difficult to utilize all the advantages of hyperspectral imagery. How to select hyperspectral data with optimal parameter configuration has been one of the essential issues for data users, which also affects the back-end applications. With advancements in computer science, various artificial intelligence algorithms from conventional machine learning to deep learning have been utilized for hyperspectral images classification and identification. Few researchers study the mechanism between the core parameters of hyperspectral imaging spectrometers and advanced artificial intelligence algorithms, which affects the application efficiency and accuracy. In this paper, we delved into the evolution of machine learning and deep learning models applied to imagery acquired by different hyperspectral sensor systems having different SNR, spectral, and spatial resolutions. Additionally, we also considered the tradeoffs among the core parameters of hyperspectral imagers. We used two conventional machine learning models, including the classification and regression tree (CART) and random forest (RF), two deep learning methods based on convolution neural network architectures—3D convolutional neural network (3D-CNN) and hamida, and two deep learning methods based on vision transformers architectures—transformer models vision transformer (VIT) and robust vision transformer (RVT), to compare the characteristics of different algorithms. In addition, five hyperspectral datasets with different species categories and scene distributions and aggregated datasets with different spatial resolutions, spectral resolutions, and SNRs were used to validate our study. The experimental results indicate that: (1) The overall accuracy (OA) using CART, RF, 3D-CNN, and VIT models decreased with coarser spectral resolution, but almost remained unchanged using the RVT classifier. The number of class and classification species affect the results. (2) The influence of spatial resolution on classification accuracy is related to the scene complexity, target size, and classification purpose. The coarser spatial resolution can achieve higher OA than the original spatial resolution for the uniform scene distribution. For the datasets with small objects and intersections of different species, OA first increased, plateaued, and then decreased with coarser spatial resolution. (3) The SNR has an obvious impact on OA for the CART and RF classifiers, and the impact decreased for deep learning models, especially for the VIT and RVT models, which were almost unaffected by SNR. Additionally, slight variations in experimental results were observed for datasets with different scene distributions and categories. Furthermore, we conducted a detailed analysis of the role of traditional machine learning and deep learning models in the experimental outcomes. The study can provide insights into understanding the relationship between the core parameters of hyperspectral imager and the artificial intelligence algorithms used for hyperspectral classification. It serves to bridge the knowledge gap between the front-end hyperspectral imager, mid-end model, and back-end applications, and further promote the development of hyperspectral imaging technology.
期刊介绍:
Remote Sensing of Environment (RSE) serves the Earth observation community by disseminating results on the theory, science, applications, and technology that contribute to advancing the field of remote sensing. With a thoroughly interdisciplinary approach, RSE encompasses terrestrial, oceanic, and atmospheric sensing.
The journal emphasizes biophysical and quantitative approaches to remote sensing at local to global scales, covering a diverse range of applications and techniques.
RSE serves as a vital platform for the exchange of knowledge and advancements in the dynamic field of remote sensing.