Investigating the contribution of image time series observations to cauliflower harvest-readiness prediction.

IF 3 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Frontiers in Artificial Intelligence Pub Date : 2024-09-18 eCollection Date: 2024-01-01 DOI:10.3389/frai.2024.1416323

Jana Kierdorf, Timo Tjarden Stomberg, Lukas Drees, Uwe Rascher, Ribana Roscher

{"title":"Investigating the contribution of image time series observations to cauliflower harvest-readiness prediction.","authors":"Jana Kierdorf, Timo Tjarden Stomberg, Lukas Drees, Uwe Rascher, Ribana Roscher","doi":"10.3389/frai.2024.1416323","DOIUrl":null,"url":null,"abstract":"<p><p>Cauliflower cultivation is subject to high-quality control criteria during sales, which underlines the importance of accurate harvest timing. Using time series data for plant phenotyping can provide insights into the dynamic development of cauliflower and allow more accurate predictions of when the crop is ready for harvest than single-time observations. However, data acquisition on a daily or weekly basis is resource-intensive, making selection of acquisition days highly important. We investigate which data acquisition days and development stages positively affect the model accuracy to get insights into prediction-relevant observation days and aid future data acquisition planning. We analyze harvest-readiness using the cauliflower image time series of the GrowliFlower dataset. We use an adjusted ResNet18 classification model, including positional encoding of the data acquisition dates to add implicit information about development. The explainable machine learning approach GroupSHAP analyzes time points' contributions. Time points with the lowest mean absolute contribution are excluded from the time series to determine their effect on model accuracy. Using image time series rather than single time points, we achieve an increase in accuracy of 4%. GroupSHAP allows the selection of time points that positively affect the model accuracy. By using seven selected time points instead of all 11 ones, the accuracy improves by an additional 4%, resulting in an overall accuracy of 89.3%. The selection of time points may therefore lead to a reduction in data collection in the future.</p>","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"7 ","pages":"1416323"},"PeriodicalIF":3.0000,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11445755/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/frai.2024.1416323","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Cauliflower cultivation is subject to high-quality control criteria during sales, which underlines the importance of accurate harvest timing. Using time series data for plant phenotyping can provide insights into the dynamic development of cauliflower and allow more accurate predictions of when the crop is ready for harvest than single-time observations. However, data acquisition on a daily or weekly basis is resource-intensive, making selection of acquisition days highly important. We investigate which data acquisition days and development stages positively affect the model accuracy to get insights into prediction-relevant observation days and aid future data acquisition planning. We analyze harvest-readiness using the cauliflower image time series of the GrowliFlower dataset. We use an adjusted ResNet18 classification model, including positional encoding of the data acquisition dates to add implicit information about development. The explainable machine learning approach GroupSHAP analyzes time points' contributions. Time points with the lowest mean absolute contribution are excluded from the time series to determine their effect on model accuracy. Using image time series rather than single time points, we achieve an increase in accuracy of 4%. GroupSHAP allows the selection of time points that positively affect the model accuracy. By using seven selected time points instead of all 11 ones, the accuracy improves by an additional 4%, resulting in an overall accuracy of 89.3%. The selection of time points may therefore lead to a reduction in data collection in the future.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

研究图像时间序列观测对花椰菜收获准备预测的贡献。

花椰菜种植在销售过程中需要遵守高质量的控制标准，这凸显了准确收获时间的重要性。利用时间序列数据进行植物表型分析，可以深入了解花椰菜的动态生长过程，并能比单次观测更准确地预测作物的收获时间。然而，每天或每周采集数据需要耗费大量资源，因此采集日的选择非常重要。我们研究了哪些数据采集日和发展阶段会对模型的准确性产生积极影响，以便深入了解与预测相关的观测日，并帮助制定未来的数据采集计划。我们使用 GrowliFlower 数据集的花椰菜图像时间序列分析收获准备情况。我们使用调整后的 ResNet18 分类模型，包括数据采集日期的位置编码，以增加有关发育的隐含信息。可解释的机器学习方法 GroupSHAP 分析了时间点的贡献。从时间序列中剔除平均绝对贡献最小的时间点，以确定其对模型准确性的影响。通过使用图像时间序列而不是单个时间点，我们将准确率提高了 4%。GroupSHAP 可以选择对模型准确性有积极影响的时间点。通过使用 7 个选定的时间点而不是全部 11 个时间点，准确率又提高了 4%，从而使总体准确率达到 89.3%。因此，时间点的选择可能会导致未来数据收集的减少。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊