{"title":"Improving Air Pollution Forecasting in Smart Cities using Clustering Techniques","authors":"M. Muntean","doi":"10.1109/ECAI58194.2023.10193934","DOIUrl":null,"url":null,"abstract":"Air quality is a main concern for smart cities policies nowadays. This paper presents an approach for predicting Particulate Matter (PM) for air pollution using partitioning clustering techniques. Instead of predicting PM10 values for the entire dataset, better results were obtained when forecasting air pollution values for each discovered cluster. In the clustering stage, the k-means algorithm was applied, and four clusters were discovered. It could be noticed that two clusters corresponded to normal PM10 values, having the centroid values equal to 16, respectively 1, and the other two clusters stored high rates of pollution (with centroid values equal to 53, respectively 34). The forecasting results were more accurate when learning a cluster at a time with a specific classifier. After this step, several forecasting models were applied for each obtained cluster, and a conclusion that K-Nearest Neighbors and Neural Networks models had best performance in predicting the PM10 values is finally made.","PeriodicalId":391483,"journal":{"name":"2023 15th International Conference on Electronics, Computers and Artificial Intelligence (ECAI)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 15th International Conference on Electronics, Computers and Artificial Intelligence (ECAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ECAI58194.2023.10193934","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Air quality is a main concern for smart cities policies nowadays. This paper presents an approach for predicting Particulate Matter (PM) for air pollution using partitioning clustering techniques. Instead of predicting PM10 values for the entire dataset, better results were obtained when forecasting air pollution values for each discovered cluster. In the clustering stage, the k-means algorithm was applied, and four clusters were discovered. It could be noticed that two clusters corresponded to normal PM10 values, having the centroid values equal to 16, respectively 1, and the other two clusters stored high rates of pollution (with centroid values equal to 53, respectively 34). The forecasting results were more accurate when learning a cluster at a time with a specific classifier. After this step, several forecasting models were applied for each obtained cluster, and a conclusion that K-Nearest Neighbors and Neural Networks models had best performance in predicting the PM10 values is finally made.