Assessing the accuracy of various statistical models for forecasting PM\(_{2.5}\): a case study from diverse regions of Gandhinagar and Ahmedabad

IF 3 4区 环境科学与生态学 Q3 ENVIRONMENTAL SCIENCES Environmental Monitoring and Assessment Pub Date : 2024-12-21 DOI:10.1007/s10661-024-13550-w
Sajeed I. Ghanchi, Dishant M. Pandya, Manan Shah
{"title":"Assessing the accuracy of various statistical models for forecasting PM\\(_{2.5}\\): a case study from diverse regions of Gandhinagar and Ahmedabad","authors":"Sajeed I. Ghanchi,&nbsp;Dishant M. Pandya,&nbsp;Manan Shah","doi":"10.1007/s10661-024-13550-w","DOIUrl":null,"url":null,"abstract":"<div><p>PM<span>\\(_{2.5}\\)</span> is the most hazardous air pollutant due to its smaller size, which allows deeper bodily penetration. Three diverse regions from Gujarat, India, namely Sector 10, Maninagar, and Vatva, which have green space, high population concentration, and industries, respectively, were chosen to forecast PM<span>\\(_{2.5}\\)</span> concentration for the next day. Four statistical models, including Multiple Linear Regression (MLR), Principal Component Regression (PCR), Simple Exponential Smoothing (SES), and Autoregressive Integrated Moving Average (ARIMA), were chosen to forecast PM<span>\\(_{2.5}\\)</span> levels. For this study, data of various pollutants and meteorological parameters were collected from February 2019 to September 2023. Analysis of the seasonal patterns of PM<span>\\(_{2.5}\\)</span> revealed elevated concentrations during post-monsoon and winter, in contrast to reduced levels during summer and monsoon. Statistical analysis revealed that the concentration of PM<span>\\(_{2.5}\\)</span> in Sector 10 is much lower than in the other two regions. The analysis of the test results, utilising various accuracy measures like RMSE, MAE, MAPE, IA, and others, indicated that Sector 10 achieved the highest precision in its results. While assessing the models’ accuracy on the test data, the ARIMA model demonstrated the highest level of precision. The average RMSE, MAE, and MAPE values for the ARIMA model were 12.63, 8.59, and 0.24, respectively. In the comparison of the performance between these statistical models and the neural network-based Multilayer Perceptron (MLP) model, it was observed that the statistical model demonstrated superior performance over the MLP model.</p></div>","PeriodicalId":544,"journal":{"name":"Environmental Monitoring and Assessment","volume":"197 1","pages":""},"PeriodicalIF":3.0000,"publicationDate":"2024-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Monitoring and Assessment","FirstCategoryId":"93","ListUrlMain":"https://link.springer.com/article/10.1007/s10661-024-13550-w","RegionNum":4,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

PM\(_{2.5}\) is the most hazardous air pollutant due to its smaller size, which allows deeper bodily penetration. Three diverse regions from Gujarat, India, namely Sector 10, Maninagar, and Vatva, which have green space, high population concentration, and industries, respectively, were chosen to forecast PM\(_{2.5}\) concentration for the next day. Four statistical models, including Multiple Linear Regression (MLR), Principal Component Regression (PCR), Simple Exponential Smoothing (SES), and Autoregressive Integrated Moving Average (ARIMA), were chosen to forecast PM\(_{2.5}\) levels. For this study, data of various pollutants and meteorological parameters were collected from February 2019 to September 2023. Analysis of the seasonal patterns of PM\(_{2.5}\) revealed elevated concentrations during post-monsoon and winter, in contrast to reduced levels during summer and monsoon. Statistical analysis revealed that the concentration of PM\(_{2.5}\) in Sector 10 is much lower than in the other two regions. The analysis of the test results, utilising various accuracy measures like RMSE, MAE, MAPE, IA, and others, indicated that Sector 10 achieved the highest precision in its results. While assessing the models’ accuracy on the test data, the ARIMA model demonstrated the highest level of precision. The average RMSE, MAE, and MAPE values for the ARIMA model were 12.63, 8.59, and 0.24, respectively. In the comparison of the performance between these statistical models and the neural network-based Multilayer Perceptron (MLP) model, it was observed that the statistical model demonstrated superior performance over the MLP model.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
评估预测PM的各种统计模型的准确性\(_{2.5}\):来自甘地纳加尔和艾哈迈达巴德不同地区的案例研究
PM (_{2.5}\)是最危险的空气污染物,因为它的尺寸较小,可以更深地渗透人体。我们选择了印度古吉拉特邦的三个不同地区,即第10区、马尼纳加尔和瓦特瓦,这三个地区分别有绿地、人口密集区和工业区,用来预测第二天的 PM (_{2.5}\)浓度。选择了四种统计模型,包括多元线性回归(MLR)、主成分回归(PCR)、简单指数平滑(SES)和自回归综合移动平均(ARIMA)来预测 PM(_{2.5}\)水平。本研究收集了 2019 年 2 月至 2023 年 9 月期间各种污染物和气象参数的数据。PM (_{2.5}\)的季节模式分析显示,季风后和冬季的浓度较高,而夏季和季风期间的浓度则较低。统计分析显示,第 10 区的 PM(_{2.5}\)浓度远远低于其他两个区域。利用 RMSE、MAE、MAPE、IA 等各种精度指标对测试结果进行的分析表明,第 10 区的结果精度最高。在评估模型对测试数据的精确度时,ARIMA 模型的精确度最高。ARIMA 模型的平均 RMSE、MAE 和 MAPE 值分别为 12.63、8.59 和 0.24。在比较这些统计模型和基于神经网络的多层感知器(MLP)模型的性能时,发现统计模型的性能优于 MLP 模型。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Environmental Monitoring and Assessment
Environmental Monitoring and Assessment 环境科学-环境科学
CiteScore
4.70
自引率
6.70%
发文量
1000
审稿时长
7.3 months
期刊介绍: Environmental Monitoring and Assessment emphasizes technical developments and data arising from environmental monitoring and assessment, the use of scientific principles in the design of monitoring systems at the local, regional and global scales, and the use of monitoring data in assessing the consequences of natural resource management actions and pollution risks to man and the environment.
期刊最新文献
Evaluation of sanitation-related pathogenic exposure and epidemiology: bacterial contamination in soil from pit latrines and association with childhood diarrhea. Post-harvest pathways of microplastic contamination in commercially valuable fish from processing centers along the Gulf of Mannar, southeast coast of India. Quantifying and simulating above-ground biomass and carbon sequestration under various land cover scenarios in planted forest using machine learning and Earth observation. Statistical characteristics, source apportionment, and health risk assessment of heavy metals in shallow groundwater: a typical agricultural area of China. Wastewater-based surveillance of ESKAPE pathogens in resource-limited countries (RLCs): an essential tool for monitoring antibiotic resistance.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1