Eitan Fass , Eldar Shlomi , Carmit Ziv , Oren Glickman , David Helman
{"title":"基于高光谱成像的收获前番茄果实质量监测机器学习模型","authors":"Eitan Fass , Eldar Shlomi , Carmit Ziv , Oren Glickman , David Helman","doi":"10.1016/j.compag.2024.109788","DOIUrl":null,"url":null,"abstract":"<div><div>Traditional methods for assessing tomato quality are time-consuming, expensive, and limited in scope. Here we developed a non-destructive spectral-based model using a handheld hyperspectral camera with 204 bands at the 400–1000 nm range, focusing on data reduction, paving the way for an economically viable device designed to assess seven key tomato quality parameters. We collected 567 fruits from five cultivars of various types and analyzed them for weight, firmness, total soluble solids (TSS), citric acid, ascorbic acid, lycopene, and pH after taking hyperspectral images of the fruits. Five commonly used spectral indices, thousands of normalized difference spectral index (NDSI) combinations, a multivariable regression model (MVR), and three machine learning (ML) algorithms (random forest – RF, extreme gradient boosting – XGBoost, and artificial neural network – ANN) were employed to predict the quality parameters from as few bands as possible. Results show that the ML models with bands selected via a hotspot overlapping method significantly improved quality prediction compared to the common spectral index approaches. Among ML algorithms, RF stood out with the best results with R<sup>2</sup> of 0.94 for weight, 0.89 for firmness, 0.79 for lycopene, 0.72 for TSS, 0.67 for pH, 0.62 for citric acid, and 0.45 for ascorbic acid, with the only exception of ANN, which was slightly better for weight and lycopene (R<sup>2</sup> of 0.95 and 0.85, respectively). Overall, models with only five bands were enough to predict all seven quality parameters with comparable performance to models with a larger number of bands. Our study offers an efficient and cost-effective method for assessing pre-harvest tomato quality, benefiting farmers and the food industry, as well as scientific research on fruit development and nutrition.</div></div>","PeriodicalId":50627,"journal":{"name":"Computers and Electronics in Agriculture","volume":"229 ","pages":"Article 109788"},"PeriodicalIF":8.9000,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Machine learning models based on hyperspectral imaging for pre-harvest tomato fruit quality monitoring\",\"authors\":\"Eitan Fass , Eldar Shlomi , Carmit Ziv , Oren Glickman , David Helman\",\"doi\":\"10.1016/j.compag.2024.109788\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Traditional methods for assessing tomato quality are time-consuming, expensive, and limited in scope. Here we developed a non-destructive spectral-based model using a handheld hyperspectral camera with 204 bands at the 400–1000 nm range, focusing on data reduction, paving the way for an economically viable device designed to assess seven key tomato quality parameters. We collected 567 fruits from five cultivars of various types and analyzed them for weight, firmness, total soluble solids (TSS), citric acid, ascorbic acid, lycopene, and pH after taking hyperspectral images of the fruits. Five commonly used spectral indices, thousands of normalized difference spectral index (NDSI) combinations, a multivariable regression model (MVR), and three machine learning (ML) algorithms (random forest – RF, extreme gradient boosting – XGBoost, and artificial neural network – ANN) were employed to predict the quality parameters from as few bands as possible. Results show that the ML models with bands selected via a hotspot overlapping method significantly improved quality prediction compared to the common spectral index approaches. Among ML algorithms, RF stood out with the best results with R<sup>2</sup> of 0.94 for weight, 0.89 for firmness, 0.79 for lycopene, 0.72 for TSS, 0.67 for pH, 0.62 for citric acid, and 0.45 for ascorbic acid, with the only exception of ANN, which was slightly better for weight and lycopene (R<sup>2</sup> of 0.95 and 0.85, respectively). Overall, models with only five bands were enough to predict all seven quality parameters with comparable performance to models with a larger number of bands. Our study offers an efficient and cost-effective method for assessing pre-harvest tomato quality, benefiting farmers and the food industry, as well as scientific research on fruit development and nutrition.</div></div>\",\"PeriodicalId\":50627,\"journal\":{\"name\":\"Computers and Electronics in Agriculture\",\"volume\":\"229 \",\"pages\":\"Article 109788\"},\"PeriodicalIF\":8.9000,\"publicationDate\":\"2025-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computers and Electronics in Agriculture\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0168169924011797\",\"RegionNum\":1,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/12/19 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"AGRICULTURE, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers and Electronics in Agriculture","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0168169924011797","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/12/19 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"AGRICULTURE, MULTIDISCIPLINARY","Score":null,"Total":0}
Machine learning models based on hyperspectral imaging for pre-harvest tomato fruit quality monitoring
Traditional methods for assessing tomato quality are time-consuming, expensive, and limited in scope. Here we developed a non-destructive spectral-based model using a handheld hyperspectral camera with 204 bands at the 400–1000 nm range, focusing on data reduction, paving the way for an economically viable device designed to assess seven key tomato quality parameters. We collected 567 fruits from five cultivars of various types and analyzed them for weight, firmness, total soluble solids (TSS), citric acid, ascorbic acid, lycopene, and pH after taking hyperspectral images of the fruits. Five commonly used spectral indices, thousands of normalized difference spectral index (NDSI) combinations, a multivariable regression model (MVR), and three machine learning (ML) algorithms (random forest – RF, extreme gradient boosting – XGBoost, and artificial neural network – ANN) were employed to predict the quality parameters from as few bands as possible. Results show that the ML models with bands selected via a hotspot overlapping method significantly improved quality prediction compared to the common spectral index approaches. Among ML algorithms, RF stood out with the best results with R2 of 0.94 for weight, 0.89 for firmness, 0.79 for lycopene, 0.72 for TSS, 0.67 for pH, 0.62 for citric acid, and 0.45 for ascorbic acid, with the only exception of ANN, which was slightly better for weight and lycopene (R2 of 0.95 and 0.85, respectively). Overall, models with only five bands were enough to predict all seven quality parameters with comparable performance to models with a larger number of bands. Our study offers an efficient and cost-effective method for assessing pre-harvest tomato quality, benefiting farmers and the food industry, as well as scientific research on fruit development and nutrition.
期刊介绍:
Computers and Electronics in Agriculture provides international coverage of advancements in computer hardware, software, electronic instrumentation, and control systems applied to agricultural challenges. Encompassing agronomy, horticulture, forestry, aquaculture, and animal farming, the journal publishes original papers, reviews, and applications notes. It explores the use of computers and electronics in plant or animal agricultural production, covering topics like agricultural soils, water, pests, controlled environments, and waste. The scope extends to on-farm post-harvest operations and relevant technologies, including artificial intelligence, sensors, machine vision, robotics, networking, and simulation modeling. Its companion journal, Smart Agricultural Technology, continues the focus on smart applications in production agriculture.