Shengxin Li , Ziyan Zhang , Zhiran Zhang , Sen Zhou , Mengkai Liu , Xichao Li , Zheng Zheng , Jie Sun
{"title":"基于nir的山梨多组分检测光谱变量选择方法的比较分析","authors":"Shengxin Li , Ziyan Zhang , Zhiran Zhang , Sen Zhou , Mengkai Liu , Xichao Li , Zheng Zheng , Jie Sun","doi":"10.1016/j.microc.2025.113128","DOIUrl":null,"url":null,"abstract":"<div><div>The moisture, oil, protein, and nervonic acid contents are crucial criteria for assessing the nutritional value and processing quality of <em>Xanthoceras sorbifolia</em> Bunge seed kernels (XSKs). Near-infrared spectroscopy (NIRS) offers a viable alternative to conventional chemical methods for rapid and non-destructive sample analysis. This study employed a highly adaptive method known as competitive adaptive reweighted sampling (CARS), which facilitates variable selection for different components and streamlines the feature-variable screening process. Using this method, various algorithms were applied to establish NIRS detection models for the four components of XSKs, and accuracy evaluations were conducted using the prediction set determination coefficient (<span><math><msubsup><mtext>R</mtext><mrow><mtext>P</mtext></mrow><mtext>2</mtext></msubsup></math></span>), root-mean-square error (RMSEP), and other parameters. The results indicated that CARS effectively extracted key information on various components in NIRS. Based on the feature variables selected by CARS, the partial least squares regression (PLSR) model exhibited optimal performance regarding moisture (<span><math><msubsup><mtext>R</mtext><mrow><mtext>P</mtext></mrow><mtext>2</mtext></msubsup></math></span> = 0.978, RMSEP = 0.117 %) and nervonic acid content (<span><math><msubsup><mtext>R</mtext><mrow><mtext>P</mtext></mrow><mtext>2</mtext></msubsup></math></span> = 0.988, RMSEP = 0.072 %). Meanwhile, the particle swarm optimization-enhanced support vector regression (PSO-SVR) model and back-propagation neural networks (BPNN) achieved superior performance for protein (<span><math><msubsup><mtext>R</mtext><mrow><mtext>P</mtext></mrow><mtext>2</mtext></msubsup></math></span> = 0.989, RMSEP = 0.243 %) and oil content (<span><math><msubsup><mtext>R</mtext><mrow><mtext>P</mtext></mrow><mtext>2</mtext></msubsup></math></span> = 0.978, RMSEP = 0.467 %), respectively. These models enable rapid and non-destructive detection of multi-component content, thereby facilitating the expeditious identification and selection of XSKs with high nutritional value and superior quality.</div></div>","PeriodicalId":391,"journal":{"name":"Microchemical Journal","volume":"212 ","pages":"Article 113128"},"PeriodicalIF":4.9000,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Comparative analysis of spectral variable selection methods for NIR-based multi-component detection of Xanthoceras sorbifolium Bunge seed kernels\",\"authors\":\"Shengxin Li , Ziyan Zhang , Zhiran Zhang , Sen Zhou , Mengkai Liu , Xichao Li , Zheng Zheng , Jie Sun\",\"doi\":\"10.1016/j.microc.2025.113128\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The moisture, oil, protein, and nervonic acid contents are crucial criteria for assessing the nutritional value and processing quality of <em>Xanthoceras sorbifolia</em> Bunge seed kernels (XSKs). Near-infrared spectroscopy (NIRS) offers a viable alternative to conventional chemical methods for rapid and non-destructive sample analysis. This study employed a highly adaptive method known as competitive adaptive reweighted sampling (CARS), which facilitates variable selection for different components and streamlines the feature-variable screening process. Using this method, various algorithms were applied to establish NIRS detection models for the four components of XSKs, and accuracy evaluations were conducted using the prediction set determination coefficient (<span><math><msubsup><mtext>R</mtext><mrow><mtext>P</mtext></mrow><mtext>2</mtext></msubsup></math></span>), root-mean-square error (RMSEP), and other parameters. The results indicated that CARS effectively extracted key information on various components in NIRS. Based on the feature variables selected by CARS, the partial least squares regression (PLSR) model exhibited optimal performance regarding moisture (<span><math><msubsup><mtext>R</mtext><mrow><mtext>P</mtext></mrow><mtext>2</mtext></msubsup></math></span> = 0.978, RMSEP = 0.117 %) and nervonic acid content (<span><math><msubsup><mtext>R</mtext><mrow><mtext>P</mtext></mrow><mtext>2</mtext></msubsup></math></span> = 0.988, RMSEP = 0.072 %). Meanwhile, the particle swarm optimization-enhanced support vector regression (PSO-SVR) model and back-propagation neural networks (BPNN) achieved superior performance for protein (<span><math><msubsup><mtext>R</mtext><mrow><mtext>P</mtext></mrow><mtext>2</mtext></msubsup></math></span> = 0.989, RMSEP = 0.243 %) and oil content (<span><math><msubsup><mtext>R</mtext><mrow><mtext>P</mtext></mrow><mtext>2</mtext></msubsup></math></span> = 0.978, RMSEP = 0.467 %), respectively. These models enable rapid and non-destructive detection of multi-component content, thereby facilitating the expeditious identification and selection of XSKs with high nutritional value and superior quality.</div></div>\",\"PeriodicalId\":391,\"journal\":{\"name\":\"Microchemical Journal\",\"volume\":\"212 \",\"pages\":\"Article 113128\"},\"PeriodicalIF\":4.9000,\"publicationDate\":\"2025-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Microchemical Journal\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0026265X25004825\",\"RegionNum\":2,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/2/23 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"CHEMISTRY, ANALYTICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Microchemical Journal","FirstCategoryId":"92","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0026265X25004825","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/2/23 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"CHEMISTRY, ANALYTICAL","Score":null,"Total":0}
Comparative analysis of spectral variable selection methods for NIR-based multi-component detection of Xanthoceras sorbifolium Bunge seed kernels
The moisture, oil, protein, and nervonic acid contents are crucial criteria for assessing the nutritional value and processing quality of Xanthoceras sorbifolia Bunge seed kernels (XSKs). Near-infrared spectroscopy (NIRS) offers a viable alternative to conventional chemical methods for rapid and non-destructive sample analysis. This study employed a highly adaptive method known as competitive adaptive reweighted sampling (CARS), which facilitates variable selection for different components and streamlines the feature-variable screening process. Using this method, various algorithms were applied to establish NIRS detection models for the four components of XSKs, and accuracy evaluations were conducted using the prediction set determination coefficient (), root-mean-square error (RMSEP), and other parameters. The results indicated that CARS effectively extracted key information on various components in NIRS. Based on the feature variables selected by CARS, the partial least squares regression (PLSR) model exhibited optimal performance regarding moisture ( = 0.978, RMSEP = 0.117 %) and nervonic acid content ( = 0.988, RMSEP = 0.072 %). Meanwhile, the particle swarm optimization-enhanced support vector regression (PSO-SVR) model and back-propagation neural networks (BPNN) achieved superior performance for protein ( = 0.989, RMSEP = 0.243 %) and oil content ( = 0.978, RMSEP = 0.467 %), respectively. These models enable rapid and non-destructive detection of multi-component content, thereby facilitating the expeditious identification and selection of XSKs with high nutritional value and superior quality.
期刊介绍:
The Microchemical Journal is a peer reviewed journal devoted to all aspects and phases of analytical chemistry and chemical analysis. The Microchemical Journal publishes articles which are at the forefront of modern analytical chemistry and cover innovations in the techniques to the finest possible limits. This includes fundamental aspects, instrumentation, new developments, innovative and novel methods and applications including environmental and clinical field.
Traditional classical analytical methods such as spectrophotometry and titrimetry as well as established instrumentation methods such as flame and graphite furnace atomic absorption spectrometry, gas chromatography, and modified glassy or carbon electrode electrochemical methods will be considered, provided they show significant improvements and novelty compared to the established methods.