Rainfall Prediction using XGB Model with the Australian Dataset
Surendra Reddy Vinta, Rashika Peeriga
EAI Endorsed Transactions on Energy Web, published 2024-03-12
DOI: 10.4108/ew.5386 (https://doi.org/10.4108/ew.5386)
Citations: 0
Abstract
Rainfall prediction is an important field of study with practical uses in agriculture, water management, and disaster preparedness. In this work, we examine the performance of several machine learning models in forecasting rainfall using a dataset of Australian rainfall observations from Kaggle. Six models are compared: random forest (RF), logistic regression (LogReg), Gaussian Naive Bayes (GNB), k-nearest neighbours (kNN), support vector classifier (SVC), and XGBoost (XGB). The dataset was preprocessed with missing-value imputation and feature selection. To evaluate the models, we employed cross-validation and performance metrics such as accuracy, precision, recall, and F1-score. According to our findings, the RF and XGB models performed best, with accuracies of 87% and 85%, respectively. The GNB and SVC models performed worst, with accuracies below 70%. Our findings suggest that machine learning algorithms can be useful tools for predicting rainfall, but careful model selection and evaluation are required for reliable results.
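The four performance indicators named in the abstract can be illustrated with a minimal sketch for a binary rain / no-rain label. This is not the authors' code: the function name `binary_metrics`, the `BinaryMetrics` container, and the toy label lists are hypothetical, and the labels are made up for illustration rather than drawn from the Australian dataset.

```python
from dataclasses import dataclass


@dataclass
class BinaryMetrics:
    accuracy: float
    precision: float
    recall: float
    f1: float


def binary_metrics(y_true, y_pred, positive=1):
    """Compute accuracy, precision, recall and F1 for binary labels.

    Hypothetical helper for illustration; `positive` marks the "rain" class.
    """
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if not y_true:
        raise ValueError("at least one sample is required")

    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    tn = sum(1 for t, p in pairs if t != positive and p != positive)

    accuracy = (tp + tn) / len(pairs)
    # Fall back to 0.0 when a denominator is zero (no predicted or true positives).
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    return BinaryMetrics(accuracy, precision, recall, f1)


# Toy labels (assumed, not from the paper): 1 = rain, 0 = no rain.
y_true = [1, 0, 0, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 0, 0, 1, 1, 0]
print(binary_metrics(y_true, y_pred))
```

Reporting precision, recall, and F1 alongside accuracy matters when one class dominates, since a model that always predicts the majority class can still score a high accuracy.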