Durum wheat yield forecasting using machine learning

IF 8.2 Q1 AGRICULTURE, MULTIDISCIPLINARY Artificial Intelligence in Agriculture Pub Date : 2022-01-01 DOI:10.1016/j.aiia.2022.09.003
Nabila Chergui
{"title":"Durum wheat yield forecasting using machine learning","authors":"Nabila Chergui","doi":"10.1016/j.aiia.2022.09.003","DOIUrl":null,"url":null,"abstract":"<div><p>A reliable and accurate forecasting model for crop yields is crucial for effective decision-making in every agricultural sector. Machine learning approaches allow for building such predictive models, but the quality of predictions decreases if data is scarce. In this work, we proposed data-augmentation for wheat yield forecasting in the presence of small data sets of two distinct Provinces in Algeria. We first increased the dimension of each data set by adding more features, and then we augmented the size of the data by merging the two data sets. To assess the effectiveness of data-augmentation approaches, we conducted three sets of experiments based on three data sets: the primary data sets, data sets with additional features and the augmented data sets obtained by merging, using five regression models (Support Vector Regression, Random Forest, Extreme Learning Machine, Artificial Neural Network, Deep Neural Network). To evaluate the models, we used cross-validation; the results showed an overall increase in performance with the augmented data. DNN outperformed the other models for the first Province with a Root Mean Square Error (RMSE) of 0.04 q/ha and R_Squared (<em>R</em><sup>2</sup>) of 0.96, whereas the Random Forest outperformed the other models for the second Province with RMSE of 0.05 q/ha. The data-augmentation approach proposed in this study showed encouraging results.</p></div>","PeriodicalId":52814,"journal":{"name":"Artificial Intelligence in Agriculture","volume":null,"pages":null},"PeriodicalIF":8.2000,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2589721722000137/pdfft?md5=4964a697dabfe27531e6ff34bdc2d2dd&pid=1-s2.0-S2589721722000137-main.pdf","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial Intelligence in Agriculture","FirstCategoryId":"1087","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2589721722000137","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURE, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 5

Abstract

A reliable and accurate forecasting model for crop yields is crucial for effective decision-making in every agricultural sector. Machine learning approaches allow for building such predictive models, but the quality of predictions decreases if data is scarce. In this work, we proposed data-augmentation for wheat yield forecasting in the presence of small data sets of two distinct Provinces in Algeria. We first increased the dimension of each data set by adding more features, and then we augmented the size of the data by merging the two data sets. To assess the effectiveness of data-augmentation approaches, we conducted three sets of experiments based on three data sets: the primary data sets, data sets with additional features and the augmented data sets obtained by merging, using five regression models (Support Vector Regression, Random Forest, Extreme Learning Machine, Artificial Neural Network, Deep Neural Network). To evaluate the models, we used cross-validation; the results showed an overall increase in performance with the augmented data. DNN outperformed the other models for the first Province with a Root Mean Square Error (RMSE) of 0.04 q/ha and R_Squared (R2) of 0.96, whereas the Random Forest outperformed the other models for the second Province with RMSE of 0.05 q/ha. The data-augmentation approach proposed in this study showed encouraging results.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用机器学习预测硬粒小麦产量
一个可靠和准确的作物产量预测模型对于每个农业部门的有效决策至关重要。机器学习方法允许建立这样的预测模型,但如果数据稀缺,预测的质量会下降。在这项工作中,我们建议在阿尔及利亚两个不同省份的小数据集存在的情况下,对小麦产量预测进行数据增强。我们首先通过添加更多的特征来增加每个数据集的维度,然后通过合并两个数据集来增加数据的大小。为了评估数据增强方法的有效性,我们使用五种回归模型(支持向量回归、随机森林、极限学习机、人工神经网络、深度神经网络),基于三个数据集进行了三组实验:原始数据集、附加特征数据集和合并后的增强数据集。为了评估模型,我们使用交叉验证;结果显示,随着数据的增强,性能总体上有所提高。DNN在第一个省的表现优于其他模型,RMSE为0.04 q/ha, R_Squared (R2)为0.96,而随机森林在第二个省的表现优于其他模型,RMSE为0.05 q/ha。本研究提出的数据增强方法取得了令人鼓舞的结果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Artificial Intelligence in Agriculture
Artificial Intelligence in Agriculture Engineering-Engineering (miscellaneous)
CiteScore
21.60
自引率
0.00%
发文量
18
审稿时长
12 weeks
期刊最新文献
Prediction of spatial heterogeneity in nutrient-limited sub-tropical maize yield: Implications for precision management in the eastern Indo-Gangetic Plains UAV-based field watermelon detection and counting using YOLOv8s with image panorama stitching and overlap partitioning Comparing YOLOv8 and Mask R-CNN for instance segmentation in complex orchard environments A comprehensive survey on weed and crop classification using machine learning and deep learning Computer vision in smart agriculture and precision farming: Techniques and applications
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1