Machine Learning predictive model of grapevine yield based on agroclimatic patterns

Manisha S. Sirsat , João Mendes-Moreira , Carlos Ferreira , Mario Cunha
{"title":"Machine Learning predictive model of grapevine yield based on agroclimatic patterns","authors":"Manisha S. Sirsat ,&nbsp;João Mendes-Moreira ,&nbsp;Carlos Ferreira ,&nbsp;Mario Cunha","doi":"10.1016/j.eaef.2019.07.003","DOIUrl":null,"url":null,"abstract":"<div><p>Grapevine yield prediction during phenostage and particularly, before harvest is highly significant as advanced forecasting could be a great value for superior grapevine management. The main contribution of the current study is to develop predictive model for each phenology that predicts yield during growing stages of grapevine and to identify highly relevant predictive variables. Current study uses climatic conditions, grapevine yield, phenological dates, fertilizer information, soil analysis and maturation index data to construct the relational dataset. After words, we use several approaches to pre-process the data to put it into tabular format. For instance, generalization of climatic variables using phenological dates. Random Forest, LASSO and Elasticnet in generalized linear models, and Spikeslab are feature selection embedded methods which are used to overcome dataset dimensionality issue. We used 10-fold cross validation to evaluate predictive model by partitioning the dataset into training set to train the model and test set to evaluate it by calculating Root Mean Squared Error (RMSE) and Relative Root Mean Squared Error (RRMSE). Results of the study show that rf_PF, rf_PC and rf_MH are optimal models for flowering (PF), colouring (PC) and harvest (MH) phenology respectively which estimate 1484.5, 1504.2 and 1459.4 (Kg/ha) low RMSE and 24.6%, 24.9% and 24.2% RRMSE, respectively as compared to other models. These models also identify some derived climatic variables as major variables for grapevine yield prediction. The reliability and early-indication ability of these forecast models justify their use by institutions and economists in decision making, adoption of technical improvements, and fraud detection.</p></div>","PeriodicalId":38965,"journal":{"name":"Engineering in Agriculture, Environment and Food","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.eaef.2019.07.003","citationCount":"20","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering in Agriculture, Environment and Food","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1881836618302106","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Engineering","Score":null,"Total":0}
引用次数: 20

Abstract

Grapevine yield prediction during phenostage and particularly, before harvest is highly significant as advanced forecasting could be a great value for superior grapevine management. The main contribution of the current study is to develop predictive model for each phenology that predicts yield during growing stages of grapevine and to identify highly relevant predictive variables. Current study uses climatic conditions, grapevine yield, phenological dates, fertilizer information, soil analysis and maturation index data to construct the relational dataset. After words, we use several approaches to pre-process the data to put it into tabular format. For instance, generalization of climatic variables using phenological dates. Random Forest, LASSO and Elasticnet in generalized linear models, and Spikeslab are feature selection embedded methods which are used to overcome dataset dimensionality issue. We used 10-fold cross validation to evaluate predictive model by partitioning the dataset into training set to train the model and test set to evaluate it by calculating Root Mean Squared Error (RMSE) and Relative Root Mean Squared Error (RRMSE). Results of the study show that rf_PF, rf_PC and rf_MH are optimal models for flowering (PF), colouring (PC) and harvest (MH) phenology respectively which estimate 1484.5, 1504.2 and 1459.4 (Kg/ha) low RMSE and 24.6%, 24.9% and 24.2% RRMSE, respectively as compared to other models. These models also identify some derived climatic variables as major variables for grapevine yield prediction. The reliability and early-indication ability of these forecast models justify their use by institutions and economists in decision making, adoption of technical improvements, and fraud detection.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于农业气候模式的葡萄产量机器学习预测模型
葡萄在表型期,特别是收获前的产量预测非常重要,因为先进的预测对葡萄的优质管理有很大的价值。本研究的主要贡献是建立了葡萄各物候期产量预测模型,并确定了高度相关的预测变量。目前研究利用气候条件、葡萄产量、物候日期、肥料信息、土壤分析和成熟指数数据构建相关数据集。在单词之后,我们使用几种方法对数据进行预处理,将其放入表格格式。例如,利用物候日期概括气候变量。随机森林、广义线性模型中的LASSO和Elasticnet以及Spikeslab是用来克服数据集维数问题的特征选择嵌入方法。我们通过将数据集划分为训练集来训练模型,并通过计算均方根误差(RMSE)和相对均方根误差(RRMSE)来评估测试集,使用10倍交叉验证来评估预测模型。结果表明,rf_PF、rf_PC和rf_MH分别是开花(PF)、显色(PC)和收获(MH)物候的最优模型,其RMSE值分别为1484.5、1504.2和1459.4 (Kg/ha), RRMSE值分别为24.6%、24.9%和24.2%。这些模型还确定了一些衍生的气候变量作为葡萄产量预测的主要变量。这些预测模型的可靠性和早期指示能力证明了机构和经济学家在决策、采用技术改进和欺诈检测中使用它们的合理性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Engineering in Agriculture, Environment and Food
Engineering in Agriculture, Environment and Food Engineering-Industrial and Manufacturing Engineering
CiteScore
1.00
自引率
0.00%
发文量
4
期刊介绍: Engineering in Agriculture, Environment and Food (EAEF) is devoted to the advancement and dissemination of scientific and technical knowledge concerning agricultural machinery, tillage, terramechanics, precision farming, agricultural instrumentation, sensors, bio-robotics, systems automation, processing of agricultural products and foods, quality evaluation and food safety, waste treatment and management, environmental control, energy utilization agricultural systems engineering, bio-informatics, computer simulation, computational mechanics, farm work systems and mechanized cropping. It is an international English E-journal published and distributed by the Asian Agricultural and Biological Engineering Association (AABEA). Authors should submit the manuscript file written by MS Word through a web site. The manuscript must be approved by the author''s organization prior to submission if required. Contact the societies which you belong to, if you have any question on manuscript submission or on the Journal EAEF.
期刊最新文献
Life cycle assessment of apple exported from Japan to Taiwan and potential environmental impact abatement Phenotyping system for precise monitoring of potato crops during growth Production and characterization of levan by <i>Bacillus siamensis</i> at flask and bioreactor The minimal exoskeleton, a passive exoskeleton to simplify pruning and fruit collection A vision-based road detection system for the navigation of an agricultural autonomous tractor
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1