Wei Luo , Wenyoujia Li , Shuling Liu , Qicheng Li , Haihua Huang , Hailiang Zhang
{"title":"Measurement of four main catechins content in green tea based on visible and near-infrared spectroscopy using optimized machine learning algorithm","authors":"Wei Luo , Wenyoujia Li , Shuling Liu , Qicheng Li , Haihua Huang , Hailiang Zhang","doi":"10.1016/j.jfca.2024.106990","DOIUrl":null,"url":null,"abstract":"<div><div>The catechin content is critical for assessing the nutritional value and commercial worth of green tea, making detection methods valuable for its quality evaluation. This study utilized visible and near-infrared spectroscopy (Vis-NIRS, 400–2500 nm) combined with an optimized machine learning model, integrating the global search whale optimization algorithm (GSWOA) and kernel extreme learning machine (KELM), to predict the content of four main catechins (EC, ECG, EGC, and EGCG) in fresh tea leaves. Multiple preprocessing and effective wavelength selection methods were applied to improve prediction accuracy. The results showed that the distribution of effective wavelengths verified the importance of the visible spectrum. The GSWOA-KELM demonstrated superior performance, optimizing the modeling capabilities of KELM and outperforming other models, including partial least squares regression (PLSR) and random forest (RF), with <span><math><msubsup><mrow><mi>R</mi></mrow><mrow><mi>P</mi></mrow><mrow><mn>2</mn></mrow></msubsup></math></span> values exceeding 0.98 and RMSEP values below 3.3 for all catechins. Notably, even without additional data processing, the model maintained good effects, achieving a minimum <span><math><msubsup><mrow><mi>R</mi></mrow><mrow><mi>p</mi></mrow><mrow><mn>2</mn></mrow></msubsup></math></span> of 0.9751 and a maximum RMSEP of 3.3654. Therefore, the proposed approach proved to be both accurate and rapid for non-destructive, online catechin analysis and had practical value for tea appreciation.</div></div>","PeriodicalId":15867,"journal":{"name":"Journal of Food Composition and Analysis","volume":"138 ","pages":"Article 106990"},"PeriodicalIF":4.0000,"publicationDate":"2024-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Food Composition and Analysis","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S088915752401024X","RegionNum":2,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, APPLIED","Score":null,"Total":0}
引用次数: 0
Abstract
The catechin content is critical for assessing the nutritional value and commercial worth of green tea, making detection methods valuable for its quality evaluation. This study utilized visible and near-infrared spectroscopy (Vis-NIRS, 400–2500 nm) combined with an optimized machine learning model, integrating the global search whale optimization algorithm (GSWOA) and kernel extreme learning machine (KELM), to predict the content of four main catechins (EC, ECG, EGC, and EGCG) in fresh tea leaves. Multiple preprocessing and effective wavelength selection methods were applied to improve prediction accuracy. The results showed that the distribution of effective wavelengths verified the importance of the visible spectrum. The GSWOA-KELM demonstrated superior performance, optimizing the modeling capabilities of KELM and outperforming other models, including partial least squares regression (PLSR) and random forest (RF), with values exceeding 0.98 and RMSEP values below 3.3 for all catechins. Notably, even without additional data processing, the model maintained good effects, achieving a minimum of 0.9751 and a maximum RMSEP of 3.3654. Therefore, the proposed approach proved to be both accurate and rapid for non-destructive, online catechin analysis and had practical value for tea appreciation.
期刊介绍:
The Journal of Food Composition and Analysis publishes manuscripts on scientific aspects of data on the chemical composition of human foods, with particular emphasis on actual data on composition of foods; analytical methods; studies on the manipulation, storage, distribution and use of food composition data; and studies on the statistics, use and distribution of such data and data systems. The Journal''s basis is nutrient composition, with increasing emphasis on bioactive non-nutrient and anti-nutrient components. Papers must provide sufficient description of the food samples, analytical methods, quality control procedures and statistical treatments of the data to permit the end users of the food composition data to evaluate the appropriateness of such data in their projects.
The Journal does not publish papers on: microbiological compounds; sensory quality; aromatics/volatiles in food and wine; essential oils; organoleptic characteristics of food; physical properties; or clinical papers and pharmacology-related papers.