通过近红外光谱和机器学习,根据储存时间对大米进行快速分类

IF 4.1 Q1 CHEMISTRY, ANALYTICAL Talanta Open Pub Date : 2024-07-14 DOI:10.1016/j.talo.2024.100343
Chen Zhai , Wenxiu Wang , Man Gao , Xiaohui Feng , Shengjie Zhang , Chengjing Qian
{"title":"通过近红外光谱和机器学习,根据储存时间对大米进行快速分类","authors":"Chen Zhai ,&nbsp;Wenxiu Wang ,&nbsp;Man Gao ,&nbsp;Xiaohui Feng ,&nbsp;Shengjie Zhang ,&nbsp;Chengjing Qian","doi":"10.1016/j.talo.2024.100343","DOIUrl":null,"url":null,"abstract":"<div><p>Rice is the most important staple crop for more than half of the world's population. As rice quality can deteriorate during storage, methods that can effectively classify rice according to its storage duration are essential. However, existing methods of assessing rice storage time are time-consuming, laborious, and incompatible with modern industrial processing technologies. Therefore, we investigated the ability of near-infrared spectroscopy combined with machine learning algorithms to distinguish rice storage duration. A total of 482 rice samples were analyzed, which included 74, 100, and 308 samples produced during 2015–2016, 2017–2018, and 2020–2021, respectively. Five pre-processing methods were initially applied to the spectra to enhance the accuracy of the discrimination model. Subsequently, two-dimensional correlation spectroscopy and competitive adaptive reweighted sampling (CARS) were used to extract the characteristic spectra associated with storage time. Finally, three pattern recognition methods (K-nearest neighbor analysis, linear discriminant analysis, and least squares support vector machine (LS-SVM)) were compared for their effectiveness in constructing classification models. The results indicated that the best model for identifying the storage duration of rice was established after spectral pre-processing with the standard normal variate and first derivative, using the CARS algorithm to select feature wavelengths, and applying the LS-SVM modeling method, which together yielded correct identification rates of 99.72 % and 91.67 % for the calibration and validation sets, respectively. Thus, we propose near-infrared spectroscopy coupled with machine learning algorithms as an effective approach for classifying rice according to storage duration, which can facilitate evaluations of rice freshness in the market.</p></div>","PeriodicalId":436,"journal":{"name":"Talanta Open","volume":"10 ","pages":"Article 100343"},"PeriodicalIF":4.1000,"publicationDate":"2024-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666831924000572/pdfft?md5=ecf4a28b6aa669c677142b1a2d572865&pid=1-s2.0-S2666831924000572-main.pdf","citationCount":"0","resultStr":"{\"title\":\"Rapid classification of rice according to storage duration via near-infrared spectroscopy and machine learning\",\"authors\":\"Chen Zhai ,&nbsp;Wenxiu Wang ,&nbsp;Man Gao ,&nbsp;Xiaohui Feng ,&nbsp;Shengjie Zhang ,&nbsp;Chengjing Qian\",\"doi\":\"10.1016/j.talo.2024.100343\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Rice is the most important staple crop for more than half of the world's population. As rice quality can deteriorate during storage, methods that can effectively classify rice according to its storage duration are essential. However, existing methods of assessing rice storage time are time-consuming, laborious, and incompatible with modern industrial processing technologies. Therefore, we investigated the ability of near-infrared spectroscopy combined with machine learning algorithms to distinguish rice storage duration. A total of 482 rice samples were analyzed, which included 74, 100, and 308 samples produced during 2015–2016, 2017–2018, and 2020–2021, respectively. Five pre-processing methods were initially applied to the spectra to enhance the accuracy of the discrimination model. Subsequently, two-dimensional correlation spectroscopy and competitive adaptive reweighted sampling (CARS) were used to extract the characteristic spectra associated with storage time. Finally, three pattern recognition methods (K-nearest neighbor analysis, linear discriminant analysis, and least squares support vector machine (LS-SVM)) were compared for their effectiveness in constructing classification models. The results indicated that the best model for identifying the storage duration of rice was established after spectral pre-processing with the standard normal variate and first derivative, using the CARS algorithm to select feature wavelengths, and applying the LS-SVM modeling method, which together yielded correct identification rates of 99.72 % and 91.67 % for the calibration and validation sets, respectively. Thus, we propose near-infrared spectroscopy coupled with machine learning algorithms as an effective approach for classifying rice according to storage duration, which can facilitate evaluations of rice freshness in the market.</p></div>\",\"PeriodicalId\":436,\"journal\":{\"name\":\"Talanta Open\",\"volume\":\"10 \",\"pages\":\"Article 100343\"},\"PeriodicalIF\":4.1000,\"publicationDate\":\"2024-07-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S2666831924000572/pdfft?md5=ecf4a28b6aa669c677142b1a2d572865&pid=1-s2.0-S2666831924000572-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Talanta Open\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2666831924000572\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CHEMISTRY, ANALYTICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Talanta Open","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666831924000572","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, ANALYTICAL","Score":null,"Total":0}
引用次数: 0

摘要

大米是世界上一半以上人口最重要的主食作物。由于大米在储存过程中质量会下降,因此根据储存时间对大米进行有效分类的方法至关重要。然而,现有的大米储存时间评估方法费时、费力,而且与现代工业加工技术不兼容。因此,我们研究了近红外光谱与机器学习算法相结合来区分大米储藏时间的能力。共分析了 482 份大米样品,其中包括 2015-2016 年、2017-2018 年和 2020-2021 年分别生产的 74 份、100 份和 308 份样品。最初对光谱采用了五种预处理方法,以提高判别模型的准确性。随后,使用二维相关光谱法和竞争性自适应再加权采样法(CARS)提取与存储时间相关的特征光谱。最后,比较了三种模式识别方法(K-近邻分析、线性判别分析和最小二乘支持向量机(LS-SVM))在构建分类模型方面的有效性。结果表明,在使用标准正态变分和一阶导数进行光谱预处理、使用 CARS 算法选择特征波长并应用 LS-SVM 建模方法后,建立了识别水稻储藏期的最佳模型,在校准集和验证集上的正确识别率分别为 99.72 % 和 91.67 %。因此,我们建议将近红外光谱仪与机器学习算法相结合,作为一种根据储存时间对大米进行分类的有效方法,从而促进市场上对大米新鲜度的评估。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

摘要图片

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Rapid classification of rice according to storage duration via near-infrared spectroscopy and machine learning

Rice is the most important staple crop for more than half of the world's population. As rice quality can deteriorate during storage, methods that can effectively classify rice according to its storage duration are essential. However, existing methods of assessing rice storage time are time-consuming, laborious, and incompatible with modern industrial processing technologies. Therefore, we investigated the ability of near-infrared spectroscopy combined with machine learning algorithms to distinguish rice storage duration. A total of 482 rice samples were analyzed, which included 74, 100, and 308 samples produced during 2015–2016, 2017–2018, and 2020–2021, respectively. Five pre-processing methods were initially applied to the spectra to enhance the accuracy of the discrimination model. Subsequently, two-dimensional correlation spectroscopy and competitive adaptive reweighted sampling (CARS) were used to extract the characteristic spectra associated with storage time. Finally, three pattern recognition methods (K-nearest neighbor analysis, linear discriminant analysis, and least squares support vector machine (LS-SVM)) were compared for their effectiveness in constructing classification models. The results indicated that the best model for identifying the storage duration of rice was established after spectral pre-processing with the standard normal variate and first derivative, using the CARS algorithm to select feature wavelengths, and applying the LS-SVM modeling method, which together yielded correct identification rates of 99.72 % and 91.67 % for the calibration and validation sets, respectively. Thus, we propose near-infrared spectroscopy coupled with machine learning algorithms as an effective approach for classifying rice according to storage duration, which can facilitate evaluations of rice freshness in the market.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Talanta Open
Talanta Open Chemistry-Analytical Chemistry
CiteScore
5.20
自引率
0.00%
发文量
86
审稿时长
49 days
期刊最新文献
Continuous-flow analysis of nitrogen compounds in environmental water using a copper–zinc reduction coil Development of engineered Zn-MOF/g-C3N4 based photoelectrochemical system for real-time sensors and removal of naproxen in wastewater Development of nanomaterial-supported molecularly imprinted polymer/receptor-like sensor for the detection of rosuvastatin from binary mixtures Trace-level quantification of NDMA in levosulpuride active pharmaceutical ingredient and tablet formulation Using UFLC-MS/MS Enhancing isomer specificity in mass spectrometry by combining silver ion adduction and ion mobility
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1