{"title":"Comprehensive comparison on different wavelength selection methods using several near-infrared spectral datasets with different dimensionalities","authors":"Tao Wang , Yun Zheng , Lilan Xu , Yong-Huan Yun","doi":"10.1016/j.saa.2025.125767","DOIUrl":null,"url":null,"abstract":"<div><div>NIR spectroscopy is widely used in chemical analysis, agricultural science, food safety, and other fields, but its high dimensionality and data redundancy bring analytical challenges. This study aims to compare the performance of different wavelength selection methods in NIR spectral datasets with different dimensionalities to provide a reference for researchers. The wavelength selection methods in this study were classified into four categories according to their principles, which are partial least squares (PLS) parameter-based methods, intelligent optimization algorithms (IOA)-based methods, model population analysis (MPA)-based methods and wavelength interval selection (WIS) methods. The performance of the models was compared in terms of R<sup>2</sup><sub>C</sub>, R<sup>2</sup><sub>P</sub>, root mean square error of calibration (RMSEC), root mean square error of prediction (RMSEP), the number of selected variables, computational time, and the improvement ratio of RMSEP (iRMSEP). The results showed that the models established by MPA-based and WIS methods were more stable and superior to the other categories of wavelength selection methods in most datasets. During the twenty characteristic wavelength selection methods in this study, bootstrapping soft shrinkage (BOSS) and genetic algorithm interval partial least squares (GA-iPLS) show the best performance at the overall level.</div></div>","PeriodicalId":433,"journal":{"name":"Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy","volume":"331 ","pages":"Article 125767"},"PeriodicalIF":4.3000,"publicationDate":"2025-01-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy","FirstCategoryId":"92","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1386142525000733","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"SPECTROSCOPY","Score":null,"Total":0}
引用次数: 0
Abstract
NIR spectroscopy is widely used in chemical analysis, agricultural science, food safety, and other fields, but its high dimensionality and data redundancy bring analytical challenges. This study aims to compare the performance of different wavelength selection methods in NIR spectral datasets with different dimensionalities to provide a reference for researchers. The wavelength selection methods in this study were classified into four categories according to their principles, which are partial least squares (PLS) parameter-based methods, intelligent optimization algorithms (IOA)-based methods, model population analysis (MPA)-based methods and wavelength interval selection (WIS) methods. The performance of the models was compared in terms of R2C, R2P, root mean square error of calibration (RMSEC), root mean square error of prediction (RMSEP), the number of selected variables, computational time, and the improvement ratio of RMSEP (iRMSEP). The results showed that the models established by MPA-based and WIS methods were more stable and superior to the other categories of wavelength selection methods in most datasets. During the twenty characteristic wavelength selection methods in this study, bootstrapping soft shrinkage (BOSS) and genetic algorithm interval partial least squares (GA-iPLS) show the best performance at the overall level.
期刊介绍:
Spectrochimica Acta, Part A: Molecular and Biomolecular Spectroscopy (SAA) is an interdisciplinary journal which spans from basic to applied aspects of optical spectroscopy in chemistry, medicine, biology, and materials science.
The journal publishes original scientific papers that feature high-quality spectroscopic data and analysis. From the broad range of optical spectroscopies, the emphasis is on electronic, vibrational or rotational spectra of molecules, rather than on spectroscopy based on magnetic moments.
Criteria for publication in SAA are novelty, uniqueness, and outstanding quality. Routine applications of spectroscopic techniques and computational methods are not appropriate.
Topics of particular interest of Spectrochimica Acta Part A include, but are not limited to:
Spectroscopy and dynamics of bioanalytical, biomedical, environmental, and atmospheric sciences,
Novel experimental techniques or instrumentation for molecular spectroscopy,
Novel theoretical and computational methods,
Novel applications in photochemistry and photobiology,
Novel interpretational approaches as well as advances in data analysis based on electronic or vibrational spectroscopy.