Chen Zhai , Wenxiu Wang , Man Gao , Xiaohui Feng , Shengjie Zhang , Chengjing Qian
{"title":"Rapid classification of rice according to storage duration via near-infrared spectroscopy and machine learning","authors":"Chen Zhai , Wenxiu Wang , Man Gao , Xiaohui Feng , Shengjie Zhang , Chengjing Qian","doi":"10.1016/j.talo.2024.100343","DOIUrl":null,"url":null,"abstract":"<div><p>Rice is the most important staple crop for more than half of the world's population. As rice quality can deteriorate during storage, methods that can effectively classify rice according to its storage duration are essential. However, existing methods of assessing rice storage time are time-consuming, laborious, and incompatible with modern industrial processing technologies. Therefore, we investigated the ability of near-infrared spectroscopy combined with machine learning algorithms to distinguish rice storage duration. A total of 482 rice samples were analyzed, which included 74, 100, and 308 samples produced during 2015–2016, 2017–2018, and 2020–2021, respectively. Five pre-processing methods were initially applied to the spectra to enhance the accuracy of the discrimination model. Subsequently, two-dimensional correlation spectroscopy and competitive adaptive reweighted sampling (CARS) were used to extract the characteristic spectra associated with storage time. Finally, three pattern recognition methods (K-nearest neighbor analysis, linear discriminant analysis, and least squares support vector machine (LS-SVM)) were compared for their effectiveness in constructing classification models. The results indicated that the best model for identifying the storage duration of rice was established after spectral pre-processing with the standard normal variate and first derivative, using the CARS algorithm to select feature wavelengths, and applying the LS-SVM modeling method, which together yielded correct identification rates of 99.72 % and 91.67 % for the calibration and validation sets, respectively. Thus, we propose near-infrared spectroscopy coupled with machine learning algorithms as an effective approach for classifying rice according to storage duration, which can facilitate evaluations of rice freshness in the market.</p></div>","PeriodicalId":436,"journal":{"name":"Talanta Open","volume":"10 ","pages":"Article 100343"},"PeriodicalIF":4.1000,"publicationDate":"2024-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666831924000572/pdfft?md5=ecf4a28b6aa669c677142b1a2d572865&pid=1-s2.0-S2666831924000572-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Talanta Open","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666831924000572","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, ANALYTICAL","Score":null,"Total":0}
引用次数: 0
Abstract
Rice is the most important staple crop for more than half of the world's population. As rice quality can deteriorate during storage, methods that can effectively classify rice according to its storage duration are essential. However, existing methods of assessing rice storage time are time-consuming, laborious, and incompatible with modern industrial processing technologies. Therefore, we investigated the ability of near-infrared spectroscopy combined with machine learning algorithms to distinguish rice storage duration. A total of 482 rice samples were analyzed, which included 74, 100, and 308 samples produced during 2015–2016, 2017–2018, and 2020–2021, respectively. Five pre-processing methods were initially applied to the spectra to enhance the accuracy of the discrimination model. Subsequently, two-dimensional correlation spectroscopy and competitive adaptive reweighted sampling (CARS) were used to extract the characteristic spectra associated with storage time. Finally, three pattern recognition methods (K-nearest neighbor analysis, linear discriminant analysis, and least squares support vector machine (LS-SVM)) were compared for their effectiveness in constructing classification models. The results indicated that the best model for identifying the storage duration of rice was established after spectral pre-processing with the standard normal variate and first derivative, using the CARS algorithm to select feature wavelengths, and applying the LS-SVM modeling method, which together yielded correct identification rates of 99.72 % and 91.67 % for the calibration and validation sets, respectively. Thus, we propose near-infrared spectroscopy coupled with machine learning algorithms as an effective approach for classifying rice according to storage duration, which can facilitate evaluations of rice freshness in the market.