Impacts of meteorological variables and machine learning algorithms on rice yield prediction in Korea

IF 3 3区 地球科学 Q2 BIOPHYSICS International Journal of Biometeorology Pub Date : 2023-09-05 DOI:10.1007/s00484-023-02544-x
Subin Ha, Yong-Tak Kim, Eun-Soon Im, Jina Hur, Sera Jo, Yong-Seok Kim, Kyo‑Moon Shim
{"title":"Impacts of meteorological variables and machine learning algorithms on rice yield prediction in Korea","authors":"Subin Ha,&nbsp;Yong-Tak Kim,&nbsp;Eun-Soon Im,&nbsp;Jina Hur,&nbsp;Sera Jo,&nbsp;Yong-Seok Kim,&nbsp;Kyo‑Moon Shim","doi":"10.1007/s00484-023-02544-x","DOIUrl":null,"url":null,"abstract":"<div><p>As crop productivity is greatly influenced by weather conditions, many attempts have been made to estimate crop yields using meteorological data and have achieved great progress with the development of machine learning. However, most yield prediction models are developed based on observational data, and the utilization of climate model output in yield prediction has been addressed in very few studies. In this study, we estimate rice yields in South Korea using the meteorological variables provided by ERA5 reanalysis data (ERA-O) and its dynamically downscaled data (ERA-DS). After ERA-O and ERA-DS are validated against observations (OBS), two different machine learning models, Support Vector Machine (SVM) and Long Short-Term Memory (LSTM), are trained with different combinations of eight meteorological variables (mean temperature, maximum temperature, minimum temperature, precipitation, diurnal temperature range, solar irradiance, mean wind speed, and relative humidity) obtained from OBS, ERA-O, and ERA-DS at weekly and monthly timescales from May to September. Regardless of the model type and the source of the input data, training a model with weekly datasets leads to better yield estimates compared to monthly datasets. LSTM generally outperforms SVM, especially when the model is trained with ERA-DS data at a weekly timescale. The best yield estimates are produced by the LSTM model trained with all eight variables at a weekly timescale. Altogether this study shows the significance of high spatial and temporal resolution of input meteorological data in yield prediction, which can also serve to substantiate the added value of dynamical downscaling.</p></div>","PeriodicalId":588,"journal":{"name":"International Journal of Biometeorology","volume":"67 11","pages":"1825 - 1838"},"PeriodicalIF":3.0000,"publicationDate":"2023-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Biometeorology","FirstCategoryId":"89","ListUrlMain":"https://link.springer.com/article/10.1007/s00484-023-02544-x","RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOPHYSICS","Score":null,"Total":0}
引用次数: 0

Abstract

As crop productivity is greatly influenced by weather conditions, many attempts have been made to estimate crop yields using meteorological data and have achieved great progress with the development of machine learning. However, most yield prediction models are developed based on observational data, and the utilization of climate model output in yield prediction has been addressed in very few studies. In this study, we estimate rice yields in South Korea using the meteorological variables provided by ERA5 reanalysis data (ERA-O) and its dynamically downscaled data (ERA-DS). After ERA-O and ERA-DS are validated against observations (OBS), two different machine learning models, Support Vector Machine (SVM) and Long Short-Term Memory (LSTM), are trained with different combinations of eight meteorological variables (mean temperature, maximum temperature, minimum temperature, precipitation, diurnal temperature range, solar irradiance, mean wind speed, and relative humidity) obtained from OBS, ERA-O, and ERA-DS at weekly and monthly timescales from May to September. Regardless of the model type and the source of the input data, training a model with weekly datasets leads to better yield estimates compared to monthly datasets. LSTM generally outperforms SVM, especially when the model is trained with ERA-DS data at a weekly timescale. The best yield estimates are produced by the LSTM model trained with all eight variables at a weekly timescale. Altogether this study shows the significance of high spatial and temporal resolution of input meteorological data in yield prediction, which can also serve to substantiate the added value of dynamical downscaling.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
气象变量和机器学习算法对韩国水稻产量预测的影响。
由于作物生产力在很大程度上受到天气条件的影响,人们已经尝试使用气象数据来估计作物产量,并随着机器学习的发展取得了巨大进展。然而,大多数产量预测模型都是基于观测数据开发的,很少有研究涉及气候模型输出在产量预测中的应用。在本研究中,我们使用ERA5再分析数据(ERA-O)及其动态缩减数据(ERA-DS)提供的气象变量来估计韩国的水稻产量。在ERA-O和ERA-DS根据观察结果(OBS)进行验证后,两种不同的机器学习模型,支持向量机(SVM)和长短期记忆(LSTM),使用从OBS、ERA-O和ERA-DS获得的八个气象变量(平均温度、最高温度、最低温度、降水量、昼夜温度范围、太阳辐照度、平均风速和相对湿度)的不同组合,在5月至9月的每周和每月时间尺度上进行训练。无论模型类型和输入数据的来源如何,与月度数据集相比,用每周数据集训练模型可以获得更好的产量估计。LSTM通常优于SVM,尤其是当模型在每周的时间尺度上使用ERA-DS数据进行训练时。最佳产量估计是由LSTM模型在每周的时间尺度上用所有八个变量进行训练产生的。总之,本研究表明了输入气象数据的高时空分辨率在产量预测中的重要性,这也有助于证实动态降尺度的附加值。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
6.40
自引率
9.40%
发文量
183
审稿时长
1 months
期刊介绍: The Journal publishes original research papers, review articles and short communications on studies examining the interactions between living organisms and factors of the natural and artificial atmospheric environment. Living organisms extend from single cell organisms, to plants and animals, including humans. The atmospheric environment includes climate and weather, electromagnetic radiation, and chemical and biological pollutants. The journal embraces basic and applied research and practical aspects such as living conditions, agriculture, forestry, and health. The journal is published for the International Society of Biometeorology, and most membership categories include a subscription to the Journal.
期刊最新文献
An exploratory study of household conditions and youth personal exposure to extreme heat during a heatwave in urban Nashville, Tennessee, USA. Planning for a cooler metropolitan area: a perspective on the long-term interaction of urban expansion, surface urban heat islands and blue-green spaces' cooling impact. Salivary heat shock protein 70 as a potential non-invasive biomarker of environmental thermal stress in dairy cattle. Climate variability links to changes in Rioja wine (Spain). Influence of hydrotherapy on change in weight: a narrative review.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1