Machine Learning for Weather Forecasting: XGBoost vs SVM vs Random Forest in Predicting Temperature for Visakhapatnam

Q3 Computer Science International Journal of Intelligent Systems and Applications in Engineering Pub Date : 2023-10-08 DOI:10.5815/ijisa.2023.05.05

Deep Karan Singh, Nisha Rawat

{"title":"Machine Learning for Weather Forecasting: XGBoost vs SVM vs Random Forest in Predicting Temperature for Visakhapatnam","authors":"Deep Karan Singh, Nisha Rawat","doi":"10.5815/ijisa.2023.05.05","DOIUrl":null,"url":null,"abstract":"Climate change, a significant and lasting alteration in global weather patterns, is profoundly impacting the stability and predictability of global temperature regimes. As the world continues to grapple with the far-reaching effects of climate change, accurate and timely temperature predictions have become pivotal to various sectors, including agriculture, energy, public health and many more. Crucially, precise temperature forecasting assists in developing effective climate change mitigation and adaptation strategies. With the advent of machine learning techniques, we now have powerful tools that can learn from vast climatic datasets and provide improved predictive performance. This study delves into the comparison of three such advanced machine learning models—XGBoost, Support Vector Machine (SVM), and Random Forest—in predicting daily maximum and minimum temperatures using a 45-year dataset of Visakhapatnam airport. Each model was rigorously trained and evaluated based on key performance metrics including training loss, Mean Absolute Error (MAE), Mean Squared Error (MSE), Root Mean Squared Error (RMSE), R2 score, Mean Absolute Percentage Error (MAPE), and Explained Variance Score. Although there was no clear dominance of a single model across all metrics, SVM and Random Forest showed slightly superior performance on several measures. These findings not only highlight the potential of machine learning techniques in enhancing the accuracy of temperature forecasting but also stress the importance of selecting an appropriate model and performance metrics aligned with the requirements of the task at hand. This research accomplishes a thorough comparative analysis, conducts a rigorous evaluation of the models, highlights the significance of model selection.","PeriodicalId":14067,"journal":{"name":"International Journal of Intelligent Systems and Applications in Engineering","volume":"29 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Intelligent Systems and Applications in Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5815/ijisa.2023.05.05","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Computer Science","Score":null,"Total":0}

引用次数: 0

Abstract

Climate change, a significant and lasting alteration in global weather patterns, is profoundly impacting the stability and predictability of global temperature regimes. As the world continues to grapple with the far-reaching effects of climate change, accurate and timely temperature predictions have become pivotal to various sectors, including agriculture, energy, public health and many more. Crucially, precise temperature forecasting assists in developing effective climate change mitigation and adaptation strategies. With the advent of machine learning techniques, we now have powerful tools that can learn from vast climatic datasets and provide improved predictive performance. This study delves into the comparison of three such advanced machine learning models—XGBoost, Support Vector Machine (SVM), and Random Forest—in predicting daily maximum and minimum temperatures using a 45-year dataset of Visakhapatnam airport. Each model was rigorously trained and evaluated based on key performance metrics including training loss, Mean Absolute Error (MAE), Mean Squared Error (MSE), Root Mean Squared Error (RMSE), R2 score, Mean Absolute Percentage Error (MAPE), and Explained Variance Score. Although there was no clear dominance of a single model across all metrics, SVM and Random Forest showed slightly superior performance on several measures. These findings not only highlight the potential of machine learning techniques in enhancing the accuracy of temperature forecasting but also stress the importance of selecting an appropriate model and performance metrics aligned with the requirements of the task at hand. This research accomplishes a thorough comparative analysis, conducts a rigorous evaluation of the models, highlights the significance of model selection.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

天气预报的机器学习:XGBoost vs SVM vs随机森林预测维萨卡帕特南的温度

气候变化是全球天气模式的一个重大而持久的变化，正在深刻地影响着全球温度制度的稳定性和可预测性。随着世界继续努力应对气候变化的深远影响，准确和及时的温度预测已成为各个部门的关键，包括农业、能源、公共卫生等。至关重要的是，精确的温度预报有助于制定有效的气候变化减缓和适应战略。随着机器学习技术的出现，我们现在有了强大的工具，可以从大量的气候数据集中学习，并提供改进的预测性能。本研究利用维萨卡帕特南机场45年的数据集，深入研究了三种先进的机器学习模型——xgboost、支持向量机(SVM)和随机森林——在预测日最高和最低温度方面的比较。每个模型都经过严格的训练，并根据关键性能指标进行评估，包括训练损失、平均绝对误差(MAE)、均方误差(MSE)、均方根平方误差(RMSE)、R2评分、平均绝对百分比误差(MAPE)和解释方差评分。虽然单一模型在所有指标中没有明显的优势，但SVM和随机森林在几个指标上表现出略微优越的性能。这些发现不仅突出了机器学习技术在提高温度预测准确性方面的潜力，而且还强调了选择合适的模型和符合手头任务要求的性能指标的重要性。本研究完成了深入的比较分析，对模型进行了严格的评价，突出了模型选择的意义。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

International Journal of Intelligent Systems and Applications in Engineering Computer Science-Computer Graphics and Computer-Aided Design

CiteScore

1.30

自引率

0.00%

发文量