Comparative evaluation of statistical and machine learning models for weather-driven wheat yield forecasting across different districts of Punjab

IF 1.827 Q2 Earth and Planetary Sciences Arabian Journal of Geosciences Pub Date : 2024-09-21 DOI:10.1007/s12517-024-12077-1
Kulwinder Kaur Gill, Kavita Bhatt,  Akansha, Parul Setiya, Sandeep Singh Sandhu, Baljeet Kaur
{"title":"Comparative evaluation of statistical and machine learning models for weather-driven wheat yield forecasting across different districts of Punjab","authors":"Kulwinder Kaur Gill,&nbsp;Kavita Bhatt,&nbsp; Akansha,&nbsp;Parul Setiya,&nbsp;Sandeep Singh Sandhu,&nbsp;Baljeet Kaur","doi":"10.1007/s12517-024-12077-1","DOIUrl":null,"url":null,"abstract":"<div><p>Predicting crop yields before harvest is important for making and carrying out policies about food safety, transportation costs, import-export, storage, and selling of agricultural goods. The weather is a key factor in crop growth and its development. Therefore, models that include meteorological variables can predict reliable forecasts for crop output; however, selecting the appropriate model for use in agricultural production forecasting can be challenging. This study investigates the development of wheat yield prediction models using various multivariate analysis techniques and weather indices derived from meteorological data collected over 22 years in Punjab, India. Five different modeling approaches, including stepwise multiple linear regression (SMLR), LASSO, elastic net (ELNET), artificial neural network (ANN), and ridge regression, were employed and compared for their effectiveness in predicting wheat yield. The models were calibrated using data from 17 years (2000–01 to 2016–17) and validated using data from the subsequent 5 years (2017–18 to 2021–22). Evaluation metrics such as <i>R</i><sup>2</sup>, root mean square error (RMSE), normalized root mean square error (NRMSE), mean biased error (MBE), and modeling efficiency (EF) were utilized to assess model performance. The results indicate varying degrees of performance across districts and modeling techniques. ANN demonstrated the highest performance during both calibration and validation periods, followed closely by LASSO and ELNET. However, certain districts showed discrepancies in model fit, with some models performing better than others depending on the specific district. Overall, ANN emerged as the most reliable approach for wheat yield prediction in Punjab followed by ELNET and LASSO, offering valuable insights for agricultural planning and management. This comprehensive analysis provides valuable contributions to the field of crop yield prediction, enhancing understanding of the complex interactions between weather variables and agricultural outcomes.</p></div>","PeriodicalId":476,"journal":{"name":"Arabian Journal of Geosciences","volume":"17 10","pages":""},"PeriodicalIF":1.8270,"publicationDate":"2024-09-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Arabian Journal of Geosciences","FirstCategoryId":"1085","ListUrlMain":"https://link.springer.com/article/10.1007/s12517-024-12077-1","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Earth and Planetary Sciences","Score":null,"Total":0}
引用次数: 0

Abstract

Predicting crop yields before harvest is important for making and carrying out policies about food safety, transportation costs, import-export, storage, and selling of agricultural goods. The weather is a key factor in crop growth and its development. Therefore, models that include meteorological variables can predict reliable forecasts for crop output; however, selecting the appropriate model for use in agricultural production forecasting can be challenging. This study investigates the development of wheat yield prediction models using various multivariate analysis techniques and weather indices derived from meteorological data collected over 22 years in Punjab, India. Five different modeling approaches, including stepwise multiple linear regression (SMLR), LASSO, elastic net (ELNET), artificial neural network (ANN), and ridge regression, were employed and compared for their effectiveness in predicting wheat yield. The models were calibrated using data from 17 years (2000–01 to 2016–17) and validated using data from the subsequent 5 years (2017–18 to 2021–22). Evaluation metrics such as R2, root mean square error (RMSE), normalized root mean square error (NRMSE), mean biased error (MBE), and modeling efficiency (EF) were utilized to assess model performance. The results indicate varying degrees of performance across districts and modeling techniques. ANN demonstrated the highest performance during both calibration and validation periods, followed closely by LASSO and ELNET. However, certain districts showed discrepancies in model fit, with some models performing better than others depending on the specific district. Overall, ANN emerged as the most reliable approach for wheat yield prediction in Punjab followed by ELNET and LASSO, offering valuable insights for agricultural planning and management. This comprehensive analysis provides valuable contributions to the field of crop yield prediction, enhancing understanding of the complex interactions between weather variables and agricultural outcomes.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
旁遮普省不同地区天气驱动的小麦产量预测统计模型和机器学习模型的比较评估
在收获前预测作物产量对于制定和执行有关食品安全、运输成本、进出口、储存和农产品销售的政策非常重要。天气是影响作物生长和发育的关键因素。因此,包含气象变量的模型可以预测作物产量的可靠预报;然而,选择适当的模型用于农业生产预报可能具有挑战性。本研究利用各种多元分析技术和从印度旁遮普省 22 年气象数据中获得的气象指数,对小麦产量预测模型的开发进行了研究。研究采用了五种不同的建模方法,包括逐步多元线性回归(SMLR)、LASSO、弹性网(ELNET)、人工神经网络(ANN)和脊回归,并比较了它们在预测小麦产量方面的有效性。利用 17 年(2000-01 至 2016-17 年)的数据对模型进行了校准,并利用随后 5 年(2017-18 至 2021-22 年)的数据对模型进行了验证。利用 R2、均方根误差 (RMSE)、归一化均方根误差 (NRMSE)、平均偏差 (MBE) 和建模效率 (EF) 等评价指标来评估模型性能。结果表明,不同地区和建模技术的性能各不相同。在校准和验证期间,ANN 的性能最高,LASSO 和 ELNET 紧随其后。然而,某些地区在模型拟合方面存在差异,根据具体地区的不同,某些模型的表现优于其他模型。总体而言,方差分析是预测旁遮普省小麦产量最可靠的方法,其次是 ELNET 和 LASSO,为农业规划和管理提供了有价值的见解。这项综合分析为作物产量预测领域做出了宝贵贡献,加深了人们对天气变量与农业结果之间复杂互动关系的理解。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Arabian Journal of Geosciences
Arabian Journal of Geosciences GEOSCIENCES, MULTIDISCIPLINARY-
自引率
0.00%
发文量
1587
审稿时长
6.7 months
期刊介绍: The Arabian Journal of Geosciences is the official journal of the Saudi Society for Geosciences and publishes peer-reviewed original and review articles on the entire range of Earth Science themes, focused on, but not limited to, those that have regional significance to the Middle East and the Euro-Mediterranean Zone. Key topics therefore include; geology, hydrogeology, earth system science, petroleum sciences, geophysics, seismology and crustal structures, tectonics, sedimentology, palaeontology, metamorphic and igneous petrology, natural hazards, environmental sciences and sustainable development, geoarchaeology, geomorphology, paleo-environment studies, oceanography, atmospheric sciences, GIS and remote sensing, geodesy, mineralogy, volcanology, geochemistry and metallogenesis.
期刊最新文献
Numerical investigation on energy efficiency of horizontal heat pump systems in buildings heating and cooling: case study of Mostaganem (Algeria) Stability analysis of overburden rocks—a new approach An up-to-date perspective on technological accidents triggered by natural events Investigation of radiation shielding parameters of different heavy metallic glass compositions for gamma radiations Microbial dynamics in soil: Impacts on fertility, nutrient cycling, and soil properties for sustainable geosciences—people, planet, and prosperity
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1