{"title":"用于量化 ChemCam 火星光谱和实验室光谱中氧化钾的集合方法","authors":"Mohit Dubey, Diane Oyen, Patrick Gasda","doi":"10.1016/j.sab.2024.106945","DOIUrl":null,"url":null,"abstract":"<div><p>In this paper we test new approaches for predicting the amount of element oxides in rock samples from the ChemCam instrument suite onboard the NASA Curiosity rover by focusing on <span><math><msub><mi>K</mi><mn>2</mn></msub><mi>O</mi></math></span>. Using the expanded dataset compiled by Gasda et al. (2021) with and without the Earth to Mars (E2M and NoE2M) transformation discussed in Clegg et al. (2017) we trained blended submodels using the “double blending” technique and compared these to ensemble methods (Random Forest, ExtraTrees, and Gradient Boosting Regression). We found that ensemble methods performed similar to blended submodels when looking at RMSE-P on the laboratory spectra and provided significant advantages when looking at spectra coming from Mars. For the full model, blended submodels achieved an RMSE-P of 0.62 and 0.60 (E2M and NoE2M respectively) while Gradient Boosting Regression resulted in a slightly improved RMSE-P of 0.59 and 0.60. More importantly, by employing a local RMSE-P estimation technique where model performance is evaluated based on nearby test samples we found that using ensemble methods can lower the quantification limit for <span><math><msub><mi>K</mi><mn>2</mn></msub><mi>O</mi></math></span> from the current value of ≈0.6 wt% to ≈0.08 wt% using Extra Trees and Random Forest. This would allow for a much larger range of <span><math><msub><mi>K</mi><mn>2</mn></msub><mi>O</mi></math></span> values to be quantified on Mars with greater certainty given that most targets seen on Mars tend to have <1 wt% <span><math><msub><mi>K</mi><mn>2</mn></msub><mi>O</mi></math></span>. Finally, we used both Mean Decrease in Impurity (MDI) and permutation importance techniques to investigate the wavelengths used by the ensemble methods and found that they correspond to known potassium emission lines. This suggests that ensemble methods can provide an easier to train and improved alternative to blended submodels for predicting potassium compositions from Laser Induced Breakdown Spectroscopy (LIBS) data.</p></div>","PeriodicalId":21890,"journal":{"name":"Spectrochimica Acta Part B: Atomic Spectroscopy","volume":"216 ","pages":"Article 106945"},"PeriodicalIF":3.2000,"publicationDate":"2024-05-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Ensemble methods for quantification of potassium oxide in ChemCam Mars and laboratory spectra\",\"authors\":\"Mohit Dubey, Diane Oyen, Patrick Gasda\",\"doi\":\"10.1016/j.sab.2024.106945\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>In this paper we test new approaches for predicting the amount of element oxides in rock samples from the ChemCam instrument suite onboard the NASA Curiosity rover by focusing on <span><math><msub><mi>K</mi><mn>2</mn></msub><mi>O</mi></math></span>. Using the expanded dataset compiled by Gasda et al. (2021) with and without the Earth to Mars (E2M and NoE2M) transformation discussed in Clegg et al. (2017) we trained blended submodels using the “double blending” technique and compared these to ensemble methods (Random Forest, ExtraTrees, and Gradient Boosting Regression). We found that ensemble methods performed similar to blended submodels when looking at RMSE-P on the laboratory spectra and provided significant advantages when looking at spectra coming from Mars. For the full model, blended submodels achieved an RMSE-P of 0.62 and 0.60 (E2M and NoE2M respectively) while Gradient Boosting Regression resulted in a slightly improved RMSE-P of 0.59 and 0.60. More importantly, by employing a local RMSE-P estimation technique where model performance is evaluated based on nearby test samples we found that using ensemble methods can lower the quantification limit for <span><math><msub><mi>K</mi><mn>2</mn></msub><mi>O</mi></math></span> from the current value of ≈0.6 wt% to ≈0.08 wt% using Extra Trees and Random Forest. This would allow for a much larger range of <span><math><msub><mi>K</mi><mn>2</mn></msub><mi>O</mi></math></span> values to be quantified on Mars with greater certainty given that most targets seen on Mars tend to have <1 wt% <span><math><msub><mi>K</mi><mn>2</mn></msub><mi>O</mi></math></span>. Finally, we used both Mean Decrease in Impurity (MDI) and permutation importance techniques to investigate the wavelengths used by the ensemble methods and found that they correspond to known potassium emission lines. This suggests that ensemble methods can provide an easier to train and improved alternative to blended submodels for predicting potassium compositions from Laser Induced Breakdown Spectroscopy (LIBS) data.</p></div>\",\"PeriodicalId\":21890,\"journal\":{\"name\":\"Spectrochimica Acta Part B: Atomic Spectroscopy\",\"volume\":\"216 \",\"pages\":\"Article 106945\"},\"PeriodicalIF\":3.2000,\"publicationDate\":\"2024-05-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Spectrochimica Acta Part B: Atomic Spectroscopy\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0584854724000892\",\"RegionNum\":2,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"SPECTROSCOPY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Spectrochimica Acta Part B: Atomic Spectroscopy","FirstCategoryId":"92","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0584854724000892","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"SPECTROSCOPY","Score":null,"Total":0}
Ensemble methods for quantification of potassium oxide in ChemCam Mars and laboratory spectra
In this paper we test new approaches for predicting the amount of element oxides in rock samples from the ChemCam instrument suite onboard the NASA Curiosity rover by focusing on . Using the expanded dataset compiled by Gasda et al. (2021) with and without the Earth to Mars (E2M and NoE2M) transformation discussed in Clegg et al. (2017) we trained blended submodels using the “double blending” technique and compared these to ensemble methods (Random Forest, ExtraTrees, and Gradient Boosting Regression). We found that ensemble methods performed similar to blended submodels when looking at RMSE-P on the laboratory spectra and provided significant advantages when looking at spectra coming from Mars. For the full model, blended submodels achieved an RMSE-P of 0.62 and 0.60 (E2M and NoE2M respectively) while Gradient Boosting Regression resulted in a slightly improved RMSE-P of 0.59 and 0.60. More importantly, by employing a local RMSE-P estimation technique where model performance is evaluated based on nearby test samples we found that using ensemble methods can lower the quantification limit for from the current value of ≈0.6 wt% to ≈0.08 wt% using Extra Trees and Random Forest. This would allow for a much larger range of values to be quantified on Mars with greater certainty given that most targets seen on Mars tend to have <1 wt% . Finally, we used both Mean Decrease in Impurity (MDI) and permutation importance techniques to investigate the wavelengths used by the ensemble methods and found that they correspond to known potassium emission lines. This suggests that ensemble methods can provide an easier to train and improved alternative to blended submodels for predicting potassium compositions from Laser Induced Breakdown Spectroscopy (LIBS) data.
期刊介绍:
Spectrochimica Acta Part B: Atomic Spectroscopy, is intended for the rapid publication of both original work and reviews in the following fields:
Atomic Emission (AES), Atomic Absorption (AAS) and Atomic Fluorescence (AFS) spectroscopy;
Mass Spectrometry (MS) for inorganic analysis covering Spark Source (SS-MS), Inductively Coupled Plasma (ICP-MS), Glow Discharge (GD-MS), and Secondary Ion Mass Spectrometry (SIMS).
Laser induced atomic spectroscopy for inorganic analysis, including non-linear optical laser spectroscopy, covering Laser Enhanced Ionization (LEI), Laser Induced Fluorescence (LIF), Resonance Ionization Spectroscopy (RIS) and Resonance Ionization Mass Spectrometry (RIMS); Laser Induced Breakdown Spectroscopy (LIBS); Cavity Ringdown Spectroscopy (CRDS), Laser Ablation Inductively Coupled Plasma Atomic Emission Spectroscopy (LA-ICP-AES) and Laser Ablation Inductively Coupled Plasma Mass Spectrometry (LA-ICP-MS).
X-ray spectrometry, X-ray Optics and Microanalysis, including X-ray fluorescence spectrometry (XRF) and related techniques, in particular Total-reflection X-ray Fluorescence Spectrometry (TXRF), and Synchrotron Radiation-excited Total reflection XRF (SR-TXRF).
Manuscripts dealing with (i) fundamentals, (ii) methodology development, (iii)instrumentation, and (iv) applications, can be submitted for publication.