一种实用的降雨预测集成学习方法

Soumili Ghosh, Mahendra Kumar Gourisaria, Biswajit Sahoo, Himansu Das
{"title":"一种实用的降雨预测集成学习方法","authors":"Soumili Ghosh, Mahendra Kumar Gourisaria, Biswajit Sahoo, Himansu Das","doi":"10.1007/s43926-023-00044-3","DOIUrl":null,"url":null,"abstract":"Abstract Heavy rainfall and precipitation play a massive role in shaping the socio-agricultural landscape of a country. Being one of the key indicators of climate change, natural disasters, and of the general topology of a region, rainfall prediction is a gift of estimation that can be used for multiple beneficial causes. Machine learning has an impressive repertoire in aiding prediction and estimation of rainfall. This paper aims to find the effect of ensemble learning, a subset of machine learning, on a rainfall prediction dataset, to increase the predictability of the models used. The classification models used in this paper were tested once individually, and then with applied ensemble techniques like bagging and boosting, on a rainfall dataset based in Australia. The objective of this paper is to demonstrate a reduction in bias and variance via ensemble learning techniques while also analyzing the increase or decrease in the aforementioned metrics. The study shows an overall reduction in bias by an average of 6% using boosting, and an average reduction in variance by 13.6%. Model performance was observed to become more generalized by lowering the false negative rate by an average of more than 20%. The techniques explored in this paper can be further utilized to improve model performance even further via hyper-parameter tuning.","PeriodicalId":34751,"journal":{"name":"Discover Internet of Things","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2023-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A pragmatic ensemble learning approach for rainfall prediction\",\"authors\":\"Soumili Ghosh, Mahendra Kumar Gourisaria, Biswajit Sahoo, Himansu Das\",\"doi\":\"10.1007/s43926-023-00044-3\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Abstract Heavy rainfall and precipitation play a massive role in shaping the socio-agricultural landscape of a country. Being one of the key indicators of climate change, natural disasters, and of the general topology of a region, rainfall prediction is a gift of estimation that can be used for multiple beneficial causes. Machine learning has an impressive repertoire in aiding prediction and estimation of rainfall. This paper aims to find the effect of ensemble learning, a subset of machine learning, on a rainfall prediction dataset, to increase the predictability of the models used. The classification models used in this paper were tested once individually, and then with applied ensemble techniques like bagging and boosting, on a rainfall dataset based in Australia. The objective of this paper is to demonstrate a reduction in bias and variance via ensemble learning techniques while also analyzing the increase or decrease in the aforementioned metrics. The study shows an overall reduction in bias by an average of 6% using boosting, and an average reduction in variance by 13.6%. Model performance was observed to become more generalized by lowering the false negative rate by an average of more than 20%. The techniques explored in this paper can be further utilized to improve model performance even further via hyper-parameter tuning.\",\"PeriodicalId\":34751,\"journal\":{\"name\":\"Discover Internet of Things\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-10-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Discover Internet of Things\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1007/s43926-023-00044-3\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Discover Internet of Things","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s43926-023-00044-3","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

强降雨和降水在塑造一个国家的社会农业景观中起着巨大的作用。作为气候变化、自然灾害和一个地区总体拓扑结构的关键指标之一,降雨预测是一种可以用于多种有益原因的估计礼物。机器学习在帮助预测和估计降雨量方面有着令人印象深刻的能力。本文旨在找到集成学习(机器学习的一个子集)对降雨预测数据集的影响,以提高所使用模型的可预测性。本文中使用的分类模型分别进行了一次测试,然后在澳大利亚的降雨数据集上应用了套袋和增强等综合技术。本文的目的是通过集成学习技术证明偏差和方差的减少,同时也分析上述指标的增加或减少。研究表明,使用增强技术,总体偏差平均减少了6%,方差平均减少了13.6%。通过将假阴性率平均降低20%以上,观察到模型性能变得更加一般化。本文探讨的技术可以进一步利用,通过超参数调优进一步提高模型性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
A pragmatic ensemble learning approach for rainfall prediction
Abstract Heavy rainfall and precipitation play a massive role in shaping the socio-agricultural landscape of a country. Being one of the key indicators of climate change, natural disasters, and of the general topology of a region, rainfall prediction is a gift of estimation that can be used for multiple beneficial causes. Machine learning has an impressive repertoire in aiding prediction and estimation of rainfall. This paper aims to find the effect of ensemble learning, a subset of machine learning, on a rainfall prediction dataset, to increase the predictability of the models used. The classification models used in this paper were tested once individually, and then with applied ensemble techniques like bagging and boosting, on a rainfall dataset based in Australia. The objective of this paper is to demonstrate a reduction in bias and variance via ensemble learning techniques while also analyzing the increase or decrease in the aforementioned metrics. The study shows an overall reduction in bias by an average of 6% using boosting, and an average reduction in variance by 13.6%. Model performance was observed to become more generalized by lowering the false negative rate by an average of more than 20%. The techniques explored in this paper can be further utilized to improve model performance even further via hyper-parameter tuning.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Discover Internet of Things
Discover Internet of Things Internet of Things (IoT)-
CiteScore
7.50
自引率
0.00%
发文量
6
审稿时长
28 days
期刊介绍: Discover Internet of Things is part of the Discover journal series committed to providing a streamlined submission process, rapid review and publication, and a high level of author service at every stage. It is an open access, community-focussed journal publishing research from across all fields relevant to the Internet of Things (IoT), providing cutting-edge and state-of-art research findings to researchers, academicians, students, and engineers. Discover Internet of Things is a broad, open access journal publishing research from across all fields relevant to IoT. Discover Internet of Things covers concepts at the component, hardware, and system level as well as programming, operating systems, software, applications and other technology-oriented research topics. The journal is uniquely interdisciplinary because its scope spans several research communities, ranging from computer systems to communication, optimisation, big data analytics, and application. It is also intended that articles published in Discover Internet of Things may help to support and accelerate Sustainable Development Goal 9: ‘Build resilient infrastructure, promote inclusive and sustainable industrialization and foster innovation’. Discover Internet of Things welcomes all observational, experimental, theoretical, analytical, mathematical modelling, data-driven, and applied approaches that advance the study of all aspects of IoT research.
期刊最新文献
Determining critical nodes in optimal cost attacks on networked infrastructures Comparative analysis of audio classification with MFCC and STFT features using machine learning techniques Innovative image interpolation based reversible data hiding for secure communication Is iterative feature selection technique efficient enough? A comparative performance analysis of RFECV feature selection technique in ransomware classification using SHAP Internet of things (IoT)-based on real-time and remote boiler fuel monitoring system: a case of Raha Beverages Company Limited, Arusha-Tanzania
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1