Shobitha Shetty, Paul D Hamer, Kerstin Stebel, Arve Kylling, Amirhossein Hassani, Terje Koren Berntsen, Philipp Schneider
{"title":"Daily high-resolution surface PM<sub>2.5</sub> estimation over Europe by ML-based downscaling of the CAMS regional forecast.","authors":"Shobitha Shetty, Paul D Hamer, Kerstin Stebel, Arve Kylling, Amirhossein Hassani, Terje Koren Berntsen, Philipp Schneider","doi":"10.1016/j.envres.2024.120363","DOIUrl":null,"url":null,"abstract":"<p><p>Fine particulate matter (PM<sub>2.5</sub>) is a key air quality indicator due to its adverse health impacts. Accurate PM<sub>2.5</sub> assessment requires high-resolution (e.g., atleast 1 km) daily data, yet current methods face challenges in balancing accuracy, coverage, and resolution. Chemical transport models such as those from the Copernicus Atmosphere Monitoring Service (CAMS) offer continuous data but their relatively coarse resolution can introduce uncertainties. Here we present a synergistic Machine Learning (ML)-based approach called S-MESH (Satellite and ML-based Estimation of Surface air quality at High resolution) for estimating daily surface PM<sub>2.5</sub> over Europe at 1 km spatial resolution and demonstrate its performance for the years 2021 and 2022. The approach enhances and downscales the CAMS regional ensemble 24 h PM<sub>2.5</sub> forecast by training a stacked XGBoost model against station observations, effectively integrating satellite-derived data and modeled meteorological variables. Overall, against station observations, S-MESH (mean absolute error (MAE) of 3.54 μg/m<sup>3</sup>) shows higher accuracy than the CAMS forecast (MAE of 4.18 μg/m<sup>3</sup>) and is approaching the accuracy of the CAMS regional interim reanalysis (MAE of 3.21 μg/m<sup>3</sup>), while exhibiting a significantly reduced mean bias (MB of -0.3 μg/m<sup>3</sup> vs. -1.5 μg/m<sup>3</sup> for the reanalysis). At the same time, S-MESH requires substantially less computational resources and processing time. At concentrations >20 μg/m<sup>3</sup>, S-MESH outperforms the reanalysis (MB of -7.3 μg/m<sup>3</sup> and -10.3 μg/m<sup>3</sup> respectively), and reliably captures high pollution events in both space and time. In the eastern study area, where the reanalysis often underestimates, S-MESH better captures high levels of PM<sub>2.5</sub> mostly from residential heating. S-MESH effectively tracks day-to-day variability, with a temporal relative absolute error of 5% (reanalysis 10%). Exhibiting good performance at high pollution events coupled with its high spatial resolution and rapid estimation speed, S-MESH can be highly relevant for air quality assessments where both resolution and timeliness are critical.</p>","PeriodicalId":312,"journal":{"name":"Environmental Research","volume":" ","pages":"120363"},"PeriodicalIF":7.7000,"publicationDate":"2024-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Research","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1016/j.envres.2024.120363","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Fine particulate matter (PM2.5) is a key air quality indicator due to its adverse health impacts. Accurate PM2.5 assessment requires high-resolution (e.g., atleast 1 km) daily data, yet current methods face challenges in balancing accuracy, coverage, and resolution. Chemical transport models such as those from the Copernicus Atmosphere Monitoring Service (CAMS) offer continuous data but their relatively coarse resolution can introduce uncertainties. Here we present a synergistic Machine Learning (ML)-based approach called S-MESH (Satellite and ML-based Estimation of Surface air quality at High resolution) for estimating daily surface PM2.5 over Europe at 1 km spatial resolution and demonstrate its performance for the years 2021 and 2022. The approach enhances and downscales the CAMS regional ensemble 24 h PM2.5 forecast by training a stacked XGBoost model against station observations, effectively integrating satellite-derived data and modeled meteorological variables. Overall, against station observations, S-MESH (mean absolute error (MAE) of 3.54 μg/m3) shows higher accuracy than the CAMS forecast (MAE of 4.18 μg/m3) and is approaching the accuracy of the CAMS regional interim reanalysis (MAE of 3.21 μg/m3), while exhibiting a significantly reduced mean bias (MB of -0.3 μg/m3 vs. -1.5 μg/m3 for the reanalysis). At the same time, S-MESH requires substantially less computational resources and processing time. At concentrations >20 μg/m3, S-MESH outperforms the reanalysis (MB of -7.3 μg/m3 and -10.3 μg/m3 respectively), and reliably captures high pollution events in both space and time. In the eastern study area, where the reanalysis often underestimates, S-MESH better captures high levels of PM2.5 mostly from residential heating. S-MESH effectively tracks day-to-day variability, with a temporal relative absolute error of 5% (reanalysis 10%). Exhibiting good performance at high pollution events coupled with its high spatial resolution and rapid estimation speed, S-MESH can be highly relevant for air quality assessments where both resolution and timeliness are critical.
期刊介绍:
The Environmental Research journal presents a broad range of interdisciplinary research, focused on addressing worldwide environmental concerns and featuring innovative findings. Our publication strives to explore relevant anthropogenic issues across various environmental sectors, showcasing practical applications in real-life settings.