Evaluating epistemic uncertainty estimation strategies in vegetation trait retrieval using hybrid models and imaging spectroscopy data

IF 11.1 1区 地球科学 Q1 ENVIRONMENTAL SCIENCES Remote Sensing of Environment Pub Date : 2024-06-04 DOI:10.1016/j.rse.2024.114228
José Luis García-Soria , Miguel Morata , Katja Berger , Ana Belén Pascual-Venteo , Juan Pablo Rivera-Caicedo , Jochem Verrelst
{"title":"Evaluating epistemic uncertainty estimation strategies in vegetation trait retrieval using hybrid models and imaging spectroscopy data","authors":"José Luis García-Soria ,&nbsp;Miguel Morata ,&nbsp;Katja Berger ,&nbsp;Ana Belén Pascual-Venteo ,&nbsp;Juan Pablo Rivera-Caicedo ,&nbsp;Jochem Verrelst","doi":"10.1016/j.rse.2024.114228","DOIUrl":null,"url":null,"abstract":"<div><p>The new-generation satellite imaging spectrometers provide an unprecedented data stream to be processed into quantifiable vegetation traits. Hybrid models have gained widespread acceptance in recent years due to their versatility in converting spectral data into traits. In hybrid models, the retrieval is obtained through a machine learning regression algorithm (MLRA) trained on a wide range of simulated data. For instance, they are currently under development for trait retrieval in preparation for the upcoming Copernicus Hyperspectral Imaging Mission for the Environment (CHIME), among others targeting routine estimation of canopy nitrogen content (CNC). However, like any retrieval algorithm, the process is not error-free, and most MLRAs inherently lack an uncertainty estimation related to the retrieved traits, which implies a risk of misinterpretation when applying the model to real-world data. Therefore, this study aimed to assess epistemic uncertainty estimation strategies (Bayesian method, drop-out, quantile regression, and bootstrapping) alongside the estimation of CNC using competitive MLRAs. Each of the regression models was evaluated using three data sets: (1) simulated scenes with varying noise using the SCOPE 2.1 radiative transfer model, (2) hyperspectral images from the PRISMA sensor, and (3) field-measured data. Analysis of generated uncertainty intervals led to the following findings: First, Gaussian processes regression (GPR) offers meaningful uncertainties, primarily attributable to spectral data degradation, which provide supplementary insights into the quality of trait mapping. Second, bootstrapping uncertainties can be used as quality indicators of the reliability of the estimates retrieved by hybrid models. Yet, its variability depends on the used MLRA, which impedes trusting its variance as a confidence interval. Third, quantile regression forest (QRF), despite not being top-performing algorithm, exhibit outstanding robustness estimations and uncertainty when the spectral data is degraded, either by Gaussian noise or by striping, often occurring in satellite imagery. Fourth, bootstrapped kernel ridge regression (KRR) demonstrated comparable performance to the benchmark algorithm GPR; the retrievals and uncertainties of these two MLRAs were highly correlated. Fifth, bootstrapped partial least squares regression (PLSR) estimations and uncertainties exhibit poor robustness to noise degradation, with normalized root mean square error (NRMSE) increasing from 19% to 112%. Additionally, a GUI tool was integrated into the ARTMO software package for assessing epistemic uncertainties from the embedded regression algorithms, providing a trait mapping quality indicator for mapping applications, and improving decision-making.</p></div>","PeriodicalId":417,"journal":{"name":"Remote Sensing of Environment","volume":null,"pages":null},"PeriodicalIF":11.1000,"publicationDate":"2024-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0034425724002463/pdfft?md5=d568747656a99eeda88c8639e7927b04&pid=1-s2.0-S0034425724002463-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Remote Sensing of Environment","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0034425724002463","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

The new-generation satellite imaging spectrometers provide an unprecedented data stream to be processed into quantifiable vegetation traits. Hybrid models have gained widespread acceptance in recent years due to their versatility in converting spectral data into traits. In hybrid models, the retrieval is obtained through a machine learning regression algorithm (MLRA) trained on a wide range of simulated data. For instance, they are currently under development for trait retrieval in preparation for the upcoming Copernicus Hyperspectral Imaging Mission for the Environment (CHIME), among others targeting routine estimation of canopy nitrogen content (CNC). However, like any retrieval algorithm, the process is not error-free, and most MLRAs inherently lack an uncertainty estimation related to the retrieved traits, which implies a risk of misinterpretation when applying the model to real-world data. Therefore, this study aimed to assess epistemic uncertainty estimation strategies (Bayesian method, drop-out, quantile regression, and bootstrapping) alongside the estimation of CNC using competitive MLRAs. Each of the regression models was evaluated using three data sets: (1) simulated scenes with varying noise using the SCOPE 2.1 radiative transfer model, (2) hyperspectral images from the PRISMA sensor, and (3) field-measured data. Analysis of generated uncertainty intervals led to the following findings: First, Gaussian processes regression (GPR) offers meaningful uncertainties, primarily attributable to spectral data degradation, which provide supplementary insights into the quality of trait mapping. Second, bootstrapping uncertainties can be used as quality indicators of the reliability of the estimates retrieved by hybrid models. Yet, its variability depends on the used MLRA, which impedes trusting its variance as a confidence interval. Third, quantile regression forest (QRF), despite not being top-performing algorithm, exhibit outstanding robustness estimations and uncertainty when the spectral data is degraded, either by Gaussian noise or by striping, often occurring in satellite imagery. Fourth, bootstrapped kernel ridge regression (KRR) demonstrated comparable performance to the benchmark algorithm GPR; the retrievals and uncertainties of these two MLRAs were highly correlated. Fifth, bootstrapped partial least squares regression (PLSR) estimations and uncertainties exhibit poor robustness to noise degradation, with normalized root mean square error (NRMSE) increasing from 19% to 112%. Additionally, a GUI tool was integrated into the ARTMO software package for assessing epistemic uncertainties from the embedded regression algorithms, providing a trait mapping quality indicator for mapping applications, and improving decision-making.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用混合模型和成像光谱数据评估植被性状检索中的认识不确定性估计策略
新一代卫星成像光谱仪提供了前所未有的数据流,可将其处理为可量化的植被特征。近年来,混合模型因其在将光谱数据转化为特征方面的多功能性而得到广泛认可。在混合模型中,检索是通过在大量模拟数据上训练的机器学习回归算法(MLRA)获得的。例如,目前正在为即将到来的哥白尼环境高光谱成像任务(CHIME)的性状检索进行开发,其中包括针对冠层氮含量(CNC)的常规估算。然而,与任何检索算法一样,这一过程并非毫无差错,而且大多数 MLRA 本身缺乏与检索性状相关的不确定性估计,这意味着将模型应用于实际数据时存在误读风险。因此,本研究在使用竞争性 MLRA 估计 CNC 的同时,还评估了认识不确定性估计策略(贝叶斯法、剔除、量化回归和引导)。使用三个数据集对每个回归模型进行了评估:(1) 使用 SCOPE 2.1 辐射传递模型模拟的具有不同噪声的场景;(2) PRISMA 传感器的高光谱图像;(3) 实地测量数据。对生成的不确定性区间进行分析后得出以下结论:首先,高斯过程回归(GPR)提供了有意义的不确定性,主要归因于光谱数据退化,这为了解性状映射的质量提供了补充。其次,自举不确定性可作为混合模型估计值可靠性的质量指标。然而,其可变性取决于所使用的 MLRA,这就妨碍了将其方差作为置信区间。第三,量子回归森林(QRF)尽管不是性能最好的算法,但在光谱数据因高斯噪声或条纹(经常发生在卫星图像中)而退化时,其估计值和不确定性的鲁棒性表现突出。第四,引导核岭回归(KRR)与基准算法 GPR 的性能相当;这两种 MLRA 的检索和不确定性高度相关。第五,自举偏最小二乘回归(PLSR)估计值和不确定性对噪声衰减的鲁棒性较差,归一化均方根误差(NRMSE)从 19% 增加到 112%。此外,ARTMO 软件包还集成了一个图形用户界面工具,用于评估嵌入式回归算法的认识不确定性,为测绘应用提供了一个性状测绘质量指标,并改进了决策。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Remote Sensing of Environment
Remote Sensing of Environment 环境科学-成像科学与照相技术
CiteScore
25.10
自引率
8.90%
发文量
455
审稿时长
53 days
期刊介绍: Remote Sensing of Environment (RSE) serves the Earth observation community by disseminating results on the theory, science, applications, and technology that contribute to advancing the field of remote sensing. With a thoroughly interdisciplinary approach, RSE encompasses terrestrial, oceanic, and atmospheric sensing. The journal emphasizes biophysical and quantitative approaches to remote sensing at local to global scales, covering a diverse range of applications and techniques. RSE serves as a vital platform for the exchange of knowledge and advancements in the dynamic field of remote sensing.
期刊最新文献
River ice breakup classification using dual- (HH&HV) or compact-polarization RADARSAT Constellation Mission data Characterizing fire and fire atmospheric states from space using collocated hyperspectral infrared sounding and narrow-band imagery Comparative validation of recent 10 m-resolution global land cover maps Automating the detection of hydrological barriers and fragmentation in wetlands using deep learning and InSAR A machine learning scheme for estimating fine-resolution grassland aboveground biomass over China with Sentinel-1/2 satellite images
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1