Rainfall Prediction using XGB Model with the Australian Dataset

EAI Endorsed Transactions on Energy Web Pub Date : 2024-03-12 DOI:10.4108/ew.5386

Surendra Reddy Vinta, Rashika Peeriga

引用次数: 0

Abstract

Rainfall prediction is a critical field of study with several practical uses, including agriculture, water management, and disaster preparedness. In this work, we examine the performance of several machine learning models in forecasting rainfall using a dataset of Australian rainfall observations from Kaggle. Six models are compared: random forest (RF), logistic regression (LogReg), Gaussian Naive Bayes (GNB), k-nearest neighbours (kNN), support vector classifier (SVC), and XGBoost (XGB). Missing value imputation and feature selection were used to preprocess the dataset. To analyse the models, we employed cross-validation and performance indicators such as accuracy, precision, recall, and F1-score. According to our findings, the RF and XGB models fared the best, with accuracy ratings of 87% and 85%, respectively. With accuracy ratings below 70%, the GNB and SVC models performed the poorest. Our findings imply that machine learning algorithms can be useful tools for predicting rainfall, but careful model selection and evaluation are required for reliable results.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

利用 XGB 模型和澳大利亚数据集进行降雨预测

降雨预测是一个重要的研究领域，具有多种实际用途，包括农业、水资源管理和备灾。在这项工作中，我们利用 Kaggle 提供的澳大利亚降雨观测数据集，检验了几种机器学习模型在降雨预测方面的性能。我们比较了六种模型：随机森林（RF）、逻辑回归（LogReg）、高斯直觉贝叶斯（GNB）、k-近邻（kNN）、支持向量分类器（SVC）和 XGBoost（XGB）。缺失值估算和特征选择用于数据集的预处理。为了分析模型，我们采用了交叉验证和准确率、精确度、召回率和 F1 分数等性能指标。根据我们的研究结果，RF 和 XGB 模型表现最佳，准确率分别为 87% 和 85%。GNB 和 SVC 模型的准确率低于 70%，表现最差。我们的研究结果表明，机器学习算法是预测降雨量的有用工具，但要获得可靠的结果，还需要对模型进行仔细选择和评估。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

EAI Endorsed Transactions on Energy Web

自引率

0.00%

发文量