Mariana Villarreal-Marines, Michael Pérez-Rodríguez, Yasmany Mancilla, Gabriela Ortiz, Alberto Mendoza
{"title":"Field calibration of fine particulate matter low-cost sensors in a highly industrialized semi-arid conurbation","authors":"Mariana Villarreal-Marines, Michael Pérez-Rodríguez, Yasmany Mancilla, Gabriela Ortiz, Alberto Mendoza","doi":"10.1038/s41612-024-00837-5","DOIUrl":null,"url":null,"abstract":"Low-cost sensors (LCS) for suspended particulate matter with an aerodynamic diameter less than or equal to 2.5 microns (PM2.5) have attracted worldwide attention for crowdsourcing air quality data. Here, we analyze one year’s worth of PM2.5 data from light-scattering LCS deployed in Monterrey, Mexico, one of the most polluted conurbations of Latin America. We also tested the Extreme Gradient Boosting (XGBoost) algorithm for classification and field calibration of the PM2.5 data derived from the LCS. Regression model performance increased from a low baseline (compared to other studies) of R2 ≈ 0.3 to R2 ≈ 0.5, with XGBoost outperforming the other machine learning algorithms tested. Differences in local climate and emission conditions emphasize the significance of considering regional distinctions when interpreting and comparing LCS responses and field calibration efforts. When using rank-level confusion matrices, True Positive air quality classification of predicted PM2.5 levels by XGBoost rated between 71% and 88%.","PeriodicalId":19438,"journal":{"name":"npj Climate and Atmospheric Science","volume":" ","pages":"1-12"},"PeriodicalIF":8.5000,"publicationDate":"2024-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.nature.com/articles/s41612-024-00837-5.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"npj Climate and Atmospheric Science","FirstCategoryId":"89","ListUrlMain":"https://www.nature.com/articles/s41612-024-00837-5","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"METEOROLOGY & ATMOSPHERIC SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Low-cost sensors (LCS) for suspended particulate matter with an aerodynamic diameter less than or equal to 2.5 microns (PM2.5) have attracted worldwide attention for crowdsourcing air quality data. Here, we analyze one year’s worth of PM2.5 data from light-scattering LCS deployed in Monterrey, Mexico, one of the most polluted conurbations of Latin America. We also tested the Extreme Gradient Boosting (XGBoost) algorithm for classification and field calibration of the PM2.5 data derived from the LCS. Regression model performance increased from a low baseline (compared to other studies) of R2 ≈ 0.3 to R2 ≈ 0.5, with XGBoost outperforming the other machine learning algorithms tested. Differences in local climate and emission conditions emphasize the significance of considering regional distinctions when interpreting and comparing LCS responses and field calibration efforts. When using rank-level confusion matrices, True Positive air quality classification of predicted PM2.5 levels by XGBoost rated between 71% and 88%.
期刊介绍:
npj Climate and Atmospheric Science is an open-access journal encompassing the relevant physical, chemical, and biological aspects of atmospheric and climate science. The journal places particular emphasis on regional studies that unveil new insights into specific localities, including examinations of local atmospheric composition, such as aerosols.
The range of topics covered by the journal includes climate dynamics, climate variability, weather and climate prediction, climate change, ocean dynamics, weather extremes, air pollution, atmospheric chemistry (including aerosols), the hydrological cycle, and atmosphere–ocean and atmosphere–land interactions. The journal welcomes studies employing a diverse array of methods, including numerical and statistical modeling, the development and application of in situ observational techniques, remote sensing, and the development or evaluation of new reanalyses.