Durum wheat yield forecasting using machine learning

IF 8.2 Q1 AGRICULTURE, MULTIDISCIPLINARY Artificial Intelligence in Agriculture Pub Date : 2022-01-01 DOI:10.1016/j.aiia.2022.09.003

Nabila Chergui

{"title":"Durum wheat yield forecasting using machine learning","authors":"Nabila Chergui","doi":"10.1016/j.aiia.2022.09.003","DOIUrl":null,"url":null,"abstract":"<div><p>A reliable and accurate forecasting model for crop yields is crucial for effective decision-making in every agricultural sector. Machine learning approaches allow for building such predictive models, but the quality of predictions decreases if data is scarce. In this work, we proposed data-augmentation for wheat yield forecasting in the presence of small data sets of two distinct Provinces in Algeria. We first increased the dimension of each data set by adding more features, and then we augmented the size of the data by merging the two data sets. To assess the effectiveness of data-augmentation approaches, we conducted three sets of experiments based on three data sets: the primary data sets, data sets with additional features and the augmented data sets obtained by merging, using five regression models (Support Vector Regression, Random Forest, Extreme Learning Machine, Artificial Neural Network, Deep Neural Network). To evaluate the models, we used cross-validation; the results showed an overall increase in performance with the augmented data. DNN outperformed the other models for the first Province with a Root Mean Square Error (RMSE) of 0.04 q/ha and R_Squared (<em>R</em><sup>2</sup>) of 0.96, whereas the Random Forest outperformed the other models for the second Province with RMSE of 0.05 q/ha. The data-augmentation approach proposed in this study showed encouraging results.</p></div>","PeriodicalId":52814,"journal":{"name":"Artificial Intelligence in Agriculture","volume":"6 ","pages":"Pages 156-166"},"PeriodicalIF":8.2000,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2589721722000137/pdfft?md5=4964a697dabfe27531e6ff34bdc2d2dd&pid=1-s2.0-S2589721722000137-main.pdf","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial Intelligence in Agriculture","FirstCategoryId":"1087","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2589721722000137","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURE, MULTIDISCIPLINARY","Score":null,"Total":0}

引用次数: 5

Abstract

A reliable and accurate forecasting model for crop yields is crucial for effective decision-making in every agricultural sector. Machine learning approaches allow for building such predictive models, but the quality of predictions decreases if data is scarce. In this work, we proposed data-augmentation for wheat yield forecasting in the presence of small data sets of two distinct Provinces in Algeria. We first increased the dimension of each data set by adding more features, and then we augmented the size of the data by merging the two data sets. To assess the effectiveness of data-augmentation approaches, we conducted three sets of experiments based on three data sets: the primary data sets, data sets with additional features and the augmented data sets obtained by merging, using five regression models (Support Vector Regression, Random Forest, Extreme Learning Machine, Artificial Neural Network, Deep Neural Network). To evaluate the models, we used cross-validation; the results showed an overall increase in performance with the augmented data. DNN outperformed the other models for the first Province with a Root Mean Square Error (RMSE) of 0.04 q/ha and R_Squared (R²) of 0.96, whereas the Random Forest outperformed the other models for the second Province with RMSE of 0.05 q/ha. The data-augmentation approach proposed in this study showed encouraging results.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

利用机器学习预测硬粒小麦产量

一个可靠和准确的作物产量预测模型对于每个农业部门的有效决策至关重要。机器学习方法允许建立这样的预测模型，但如果数据稀缺，预测的质量会下降。在这项工作中，我们建议在阿尔及利亚两个不同省份的小数据集存在的情况下，对小麦产量预测进行数据增强。我们首先通过添加更多的特征来增加每个数据集的维度，然后通过合并两个数据集来增加数据的大小。为了评估数据增强方法的有效性，我们使用五种回归模型(支持向量回归、随机森林、极限学习机、人工神经网络、深度神经网络)，基于三个数据集进行了三组实验:原始数据集、附加特征数据集和合并后的增强数据集。为了评估模型，我们使用交叉验证;结果显示，随着数据的增强，性能总体上有所提高。DNN在第一个省的表现优于其他模型，RMSE为0.04 q/ha, R_Squared (R2)为0.96，而随机森林在第二个省的表现优于其他模型，RMSE为0.05 q/ha。本研究提出的数据增强方法取得了令人鼓舞的结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊