Renjie Fang, Xin Han, Xiangxian Li, Jingjing Tong, Minguang Gao, Yang Wang
{"title":"在近红外光谱中选择变量,为生物水溶液中的血红蛋白含量建模","authors":"Renjie Fang, Xin Han, Xiangxian Li, Jingjing Tong, Minguang Gao, Yang Wang","doi":"10.1007/s10812-024-01801-0","DOIUrl":null,"url":null,"abstract":"<p>The background differences in water content of different samples have a very strong influence on the robustness of near-infrared spectroscopy (NIRS). For this reason, this study simulated typical biological water matrix samples with formulated hemoglobin (Hb), glucose (Glc), and distilled water, and attempted to use four different intelligent spectral variable selection algorithms [Competitive Adaptive Reweighted Sampling (CARS), Randomized Frog Hopping Algorithm (RF), Genetic Algorithm (GA), and Variable Projection Importance Algorithm (VIP)] to perform the Hb water interference-resistant feature band preferences, while combining partial least squares (PLS) in parallel to build a robust quantitative model of Hb. In addition, the applicability and validity of the model were validated using three prediction sets <i>P</i><sub>1</sub>, <i>P</i><sub>2</sub>, <i>P</i><sub>3</sub> with different water backgrounds (the formulation method and composition were kept the same, and only the water content increased sequentially). The results showed that RF, GA, and VIP could effectively screen out the characteristic wavelengths of Hb with low sensitivity to water changes and successfully correct the water effect, but due to the large number of characteristic variables they screened out and the existence of a large number of redundant and water interference variables, this ultimately made the model's robustness less than ideal. The CARS algorithm performed the best, and the RMSEP of the three prediction sets were 0.016, 0.017, and 0.038, which is closer to the RMSECV of the calibration set. Therefore, NIRS combined with the variable selection can reduce the effect of water on model robustness and improve the prediction accuracy of the model by the method of selecting effective wave number intervals, and CARS may be one of the ideal algorithms to solve such problems.</p>","PeriodicalId":609,"journal":{"name":"Journal of Applied Spectroscopy","volume":"91 4","pages":"928 - 935"},"PeriodicalIF":0.8000,"publicationDate":"2024-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Variable Selection in Near-Infrared Spectra for Modeling of Hemoglobin Content in Bio-Water Solutions\",\"authors\":\"Renjie Fang, Xin Han, Xiangxian Li, Jingjing Tong, Minguang Gao, Yang Wang\",\"doi\":\"10.1007/s10812-024-01801-0\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>The background differences in water content of different samples have a very strong influence on the robustness of near-infrared spectroscopy (NIRS). For this reason, this study simulated typical biological water matrix samples with formulated hemoglobin (Hb), glucose (Glc), and distilled water, and attempted to use four different intelligent spectral variable selection algorithms [Competitive Adaptive Reweighted Sampling (CARS), Randomized Frog Hopping Algorithm (RF), Genetic Algorithm (GA), and Variable Projection Importance Algorithm (VIP)] to perform the Hb water interference-resistant feature band preferences, while combining partial least squares (PLS) in parallel to build a robust quantitative model of Hb. In addition, the applicability and validity of the model were validated using three prediction sets <i>P</i><sub>1</sub>, <i>P</i><sub>2</sub>, <i>P</i><sub>3</sub> with different water backgrounds (the formulation method and composition were kept the same, and only the water content increased sequentially). The results showed that RF, GA, and VIP could effectively screen out the characteristic wavelengths of Hb with low sensitivity to water changes and successfully correct the water effect, but due to the large number of characteristic variables they screened out and the existence of a large number of redundant and water interference variables, this ultimately made the model's robustness less than ideal. The CARS algorithm performed the best, and the RMSEP of the three prediction sets were 0.016, 0.017, and 0.038, which is closer to the RMSECV of the calibration set. Therefore, NIRS combined with the variable selection can reduce the effect of water on model robustness and improve the prediction accuracy of the model by the method of selecting effective wave number intervals, and CARS may be one of the ideal algorithms to solve such problems.</p>\",\"PeriodicalId\":609,\"journal\":{\"name\":\"Journal of Applied Spectroscopy\",\"volume\":\"91 4\",\"pages\":\"928 - 935\"},\"PeriodicalIF\":0.8000,\"publicationDate\":\"2024-09-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Applied Spectroscopy\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s10812-024-01801-0\",\"RegionNum\":4,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"SPECTROSCOPY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Applied Spectroscopy","FirstCategoryId":"92","ListUrlMain":"https://link.springer.com/article/10.1007/s10812-024-01801-0","RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"SPECTROSCOPY","Score":null,"Total":0}
Variable Selection in Near-Infrared Spectra for Modeling of Hemoglobin Content in Bio-Water Solutions
The background differences in water content of different samples have a very strong influence on the robustness of near-infrared spectroscopy (NIRS). For this reason, this study simulated typical biological water matrix samples with formulated hemoglobin (Hb), glucose (Glc), and distilled water, and attempted to use four different intelligent spectral variable selection algorithms [Competitive Adaptive Reweighted Sampling (CARS), Randomized Frog Hopping Algorithm (RF), Genetic Algorithm (GA), and Variable Projection Importance Algorithm (VIP)] to perform the Hb water interference-resistant feature band preferences, while combining partial least squares (PLS) in parallel to build a robust quantitative model of Hb. In addition, the applicability and validity of the model were validated using three prediction sets P1, P2, P3 with different water backgrounds (the formulation method and composition were kept the same, and only the water content increased sequentially). The results showed that RF, GA, and VIP could effectively screen out the characteristic wavelengths of Hb with low sensitivity to water changes and successfully correct the water effect, but due to the large number of characteristic variables they screened out and the existence of a large number of redundant and water interference variables, this ultimately made the model's robustness less than ideal. The CARS algorithm performed the best, and the RMSEP of the three prediction sets were 0.016, 0.017, and 0.038, which is closer to the RMSECV of the calibration set. Therefore, NIRS combined with the variable selection can reduce the effect of water on model robustness and improve the prediction accuracy of the model by the method of selecting effective wave number intervals, and CARS may be one of the ideal algorithms to solve such problems.
期刊介绍:
Journal of Applied Spectroscopy reports on many key applications of spectroscopy in chemistry, physics, metallurgy, and biology. An increasing number of papers focus on the theory of lasers, as well as the tremendous potential for the practical applications of lasers in numerous fields and industries.