Measurement of four main catechins content in green tea based on visible and near-infrared spectroscopy using optimized machine learning algorithm

IF 4 2区 农林科学 Q2 CHEMISTRY, APPLIED Journal of Food Composition and Analysis Pub Date : 2024-11-17 DOI:10.1016/j.jfca.2024.106990
Wei Luo , Wenyoujia Li , Shuling Liu , Qicheng Li , Haihua Huang , Hailiang Zhang
{"title":"Measurement of four main catechins content in green tea based on visible and near-infrared spectroscopy using optimized machine learning algorithm","authors":"Wei Luo ,&nbsp;Wenyoujia Li ,&nbsp;Shuling Liu ,&nbsp;Qicheng Li ,&nbsp;Haihua Huang ,&nbsp;Hailiang Zhang","doi":"10.1016/j.jfca.2024.106990","DOIUrl":null,"url":null,"abstract":"<div><div>The catechin content is critical for assessing the nutritional value and commercial worth of green tea, making detection methods valuable for its quality evaluation. This study utilized visible and near-infrared spectroscopy (Vis-NIRS, 400–2500 nm) combined with an optimized machine learning model, integrating the global search whale optimization algorithm (GSWOA) and kernel extreme learning machine (KELM), to predict the content of four main catechins (EC, ECG, EGC, and EGCG) in fresh tea leaves. Multiple preprocessing and effective wavelength selection methods were applied to improve prediction accuracy. The results showed that the distribution of effective wavelengths verified the importance of the visible spectrum. The GSWOA-KELM demonstrated superior performance, optimizing the modeling capabilities of KELM and outperforming other models, including partial least squares regression (PLSR) and random forest (RF), with <span><math><msubsup><mrow><mi>R</mi></mrow><mrow><mi>P</mi></mrow><mrow><mn>2</mn></mrow></msubsup></math></span> values exceeding 0.98 and RMSEP values below 3.3 for all catechins. Notably, even without additional data processing, the model maintained good effects, achieving a minimum <span><math><msubsup><mrow><mi>R</mi></mrow><mrow><mi>p</mi></mrow><mrow><mn>2</mn></mrow></msubsup></math></span> of 0.9751 and a maximum RMSEP of 3.3654. Therefore, the proposed approach proved to be both accurate and rapid for non-destructive, online catechin analysis and had practical value for tea appreciation.</div></div>","PeriodicalId":15867,"journal":{"name":"Journal of Food Composition and Analysis","volume":"138 ","pages":"Article 106990"},"PeriodicalIF":4.0000,"publicationDate":"2024-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Food Composition and Analysis","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S088915752401024X","RegionNum":2,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, APPLIED","Score":null,"Total":0}
引用次数: 0

Abstract

The catechin content is critical for assessing the nutritional value and commercial worth of green tea, making detection methods valuable for its quality evaluation. This study utilized visible and near-infrared spectroscopy (Vis-NIRS, 400–2500 nm) combined with an optimized machine learning model, integrating the global search whale optimization algorithm (GSWOA) and kernel extreme learning machine (KELM), to predict the content of four main catechins (EC, ECG, EGC, and EGCG) in fresh tea leaves. Multiple preprocessing and effective wavelength selection methods were applied to improve prediction accuracy. The results showed that the distribution of effective wavelengths verified the importance of the visible spectrum. The GSWOA-KELM demonstrated superior performance, optimizing the modeling capabilities of KELM and outperforming other models, including partial least squares regression (PLSR) and random forest (RF), with RP2 values exceeding 0.98 and RMSEP values below 3.3 for all catechins. Notably, even without additional data processing, the model maintained good effects, achieving a minimum Rp2 of 0.9751 and a maximum RMSEP of 3.3654. Therefore, the proposed approach proved to be both accurate and rapid for non-destructive, online catechin analysis and had practical value for tea appreciation.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用优化的机器学习算法,基于可见光和近红外光谱测量绿茶中的四种主要儿茶素含量
儿茶素含量是评估绿茶营养价值和商业价值的关键,因此检测方法对绿茶质量评估非常重要。本研究利用可见光和近红外光谱仪(Vis-NIRS,400-2500 nm)结合优化的机器学习模型,整合了全局搜索鲸鱼优化算法(GSWOA)和核极端学习机(KELM),预测了新鲜茶叶中四种主要儿茶素(EC、ECG、EGC和EGCG)的含量。采用多种预处理和有效波长选择方法来提高预测精度。结果表明,有效波长的分布验证了可见光谱的重要性。GSWOA-KELM 性能优越,优化了 KELM 的建模能力,并优于其他模型,包括偏最小二乘回归(PLSR)和随机森林(RF),所有儿茶素的 RP2 值均超过 0.98,RMSEP 值均低于 3.3。值得注意的是,即使不进行额外的数据处理,该模型也能保持良好的效果,最小 Rp2 值为 0.9751,最大 RMSEP 值为 3.3654。因此,事实证明所提出的方法既准确又快速,适用于非破坏性的在线儿茶素分析,对茶叶鉴赏具有实用价值。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Journal of Food Composition and Analysis
Journal of Food Composition and Analysis 工程技术-食品科技
CiteScore
6.20
自引率
11.60%
发文量
601
审稿时长
53 days
期刊介绍: The Journal of Food Composition and Analysis publishes manuscripts on scientific aspects of data on the chemical composition of human foods, with particular emphasis on actual data on composition of foods; analytical methods; studies on the manipulation, storage, distribution and use of food composition data; and studies on the statistics, use and distribution of such data and data systems. The Journal''s basis is nutrient composition, with increasing emphasis on bioactive non-nutrient and anti-nutrient components. Papers must provide sufficient description of the food samples, analytical methods, quality control procedures and statistical treatments of the data to permit the end users of the food composition data to evaluate the appropriateness of such data in their projects. The Journal does not publish papers on: microbiological compounds; sensory quality; aromatics/volatiles in food and wine; essential oils; organoleptic characteristics of food; physical properties; or clinical papers and pharmacology-related papers.
期刊最新文献
Exploring the effects of barley (Hordeum vulgare L.) germination on chemical composition, phytic acid, and potential malt prebiotic properties A unique cuvette and near-infrared spectral imaging for fast and accurate quantification of milk compositions Research on rapid determination methods for main compositions and sensory quality of pumpkins based on hyperspectral imaging technology S-methyl-L-cysteine sulfoxide and glucosinolate levels in Australian-sourced Brassica vegetables before and after domestic cooking A sensitive UHPLC-electrospray tandem mass spectrometry method for the simultaneous quantification of 8 oxysterols and cholesterol in oysters (Crassostrea gigas) with different thermal cooking procedures
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1