Shobitha Shetty, Paul D Hamer, Kerstin Stebel, Arve Kylling, Amirhossein Hassani, Terje Koren Berntsen, Philipp Schneider
{"title":"通过对 CAMS 区域预报进行基于 ML 的降尺度处理,估算出欧洲的每日高分辨率地表 PM2.5。","authors":"Shobitha Shetty, Paul D Hamer, Kerstin Stebel, Arve Kylling, Amirhossein Hassani, Terje Koren Berntsen, Philipp Schneider","doi":"10.1016/j.envres.2024.120363","DOIUrl":null,"url":null,"abstract":"<p><p>Fine particulate matter (PM<sub>2.5</sub>) is a key air quality indicator due to its adverse health impacts. Accurate PM<sub>2.5</sub> assessment requires high-resolution (e.g., atleast 1 km) daily data, yet current methods face challenges in balancing accuracy, coverage, and resolution. Chemical transport models such as those from the Copernicus Atmosphere Monitoring Service (CAMS) offer continuous data but their relatively coarse resolution can introduce uncertainties. Here we present a synergistic Machine Learning (ML)-based approach called S-MESH (Satellite and ML-based Estimation of Surface air quality at High resolution) for estimating daily surface PM<sub>2.5</sub> over Europe at 1 km spatial resolution and demonstrate its performance for the years 2021 and 2022. The approach enhances and downscales the CAMS regional ensemble 24h PM<sub>2.5</sub> forecast by training a stacked XGBoost model against station observations, effectively integrating satellite-derived data and modeled meteorological variables. Overall, against station observations, S-MESH (mean absolute error (MAE) of 3.54 μg/m<sup>3</sup>) shows higher accuracy than the CAMS forecast (MAE of 4.18 μg/m<sup>3</sup>) and is approaching the accuracy of the CAMS regional interim reanalysis (MAE of 3.21 μg/m<sup>3</sup>), while exhibiting a significantly reduced mean bias (MB of -0.3 μg/m<sup>3</sup> vs. -1.5 μg/m<sup>3</sup> for the reanalysis). At the same time, S-MESH requires substantially less computational resources and processing time. At concentrations >20 μg/m<sup>3</sup>, S-MESH outperforms the reanalysis (MB of -7.3 μg/m<sup>3</sup> and -10.3 μg/m<sup>3</sup> respectively), and reliably captures high pollution events in both space and time. In the eastern study area, where the reanalysis often underestimates, S-MESH better captures high levels of PM<sub>2.5</sub> mostly from residential heating. S-MESH effectively tracks day-to-day variability, with a temporal relative absolute error of 5% (reanalysis 10%). Exhibiting good performance at high pollution events coupled with its high spatial resolution and rapid estimation speed, S-MESH can be highly relevant for air quality assessments where both resolution and timeliness are critical.</p>","PeriodicalId":312,"journal":{"name":"Environmental Research","volume":" ","pages":"120363"},"PeriodicalIF":7.7000,"publicationDate":"2024-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Daily high-resolution surface PM<sub>2.5</sub> estimation over Europe by ML-based downscaling of the CAMS regional forecast.\",\"authors\":\"Shobitha Shetty, Paul D Hamer, Kerstin Stebel, Arve Kylling, Amirhossein Hassani, Terje Koren Berntsen, Philipp Schneider\",\"doi\":\"10.1016/j.envres.2024.120363\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Fine particulate matter (PM<sub>2.5</sub>) is a key air quality indicator due to its adverse health impacts. Accurate PM<sub>2.5</sub> assessment requires high-resolution (e.g., atleast 1 km) daily data, yet current methods face challenges in balancing accuracy, coverage, and resolution. Chemical transport models such as those from the Copernicus Atmosphere Monitoring Service (CAMS) offer continuous data but their relatively coarse resolution can introduce uncertainties. Here we present a synergistic Machine Learning (ML)-based approach called S-MESH (Satellite and ML-based Estimation of Surface air quality at High resolution) for estimating daily surface PM<sub>2.5</sub> over Europe at 1 km spatial resolution and demonstrate its performance for the years 2021 and 2022. The approach enhances and downscales the CAMS regional ensemble 24h PM<sub>2.5</sub> forecast by training a stacked XGBoost model against station observations, effectively integrating satellite-derived data and modeled meteorological variables. Overall, against station observations, S-MESH (mean absolute error (MAE) of 3.54 μg/m<sup>3</sup>) shows higher accuracy than the CAMS forecast (MAE of 4.18 μg/m<sup>3</sup>) and is approaching the accuracy of the CAMS regional interim reanalysis (MAE of 3.21 μg/m<sup>3</sup>), while exhibiting a significantly reduced mean bias (MB of -0.3 μg/m<sup>3</sup> vs. -1.5 μg/m<sup>3</sup> for the reanalysis). At the same time, S-MESH requires substantially less computational resources and processing time. At concentrations >20 μg/m<sup>3</sup>, S-MESH outperforms the reanalysis (MB of -7.3 μg/m<sup>3</sup> and -10.3 μg/m<sup>3</sup> respectively), and reliably captures high pollution events in both space and time. In the eastern study area, where the reanalysis often underestimates, S-MESH better captures high levels of PM<sub>2.5</sub> mostly from residential heating. S-MESH effectively tracks day-to-day variability, with a temporal relative absolute error of 5% (reanalysis 10%). Exhibiting good performance at high pollution events coupled with its high spatial resolution and rapid estimation speed, S-MESH can be highly relevant for air quality assessments where both resolution and timeliness are critical.</p>\",\"PeriodicalId\":312,\"journal\":{\"name\":\"Environmental Research\",\"volume\":\" \",\"pages\":\"120363\"},\"PeriodicalIF\":7.7000,\"publicationDate\":\"2024-11-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Environmental Research\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://doi.org/10.1016/j.envres.2024.120363\",\"RegionNum\":2,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENVIRONMENTAL SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Research","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1016/j.envres.2024.120363","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
Daily high-resolution surface PM2.5 estimation over Europe by ML-based downscaling of the CAMS regional forecast.
Fine particulate matter (PM2.5) is a key air quality indicator due to its adverse health impacts. Accurate PM2.5 assessment requires high-resolution (e.g., atleast 1 km) daily data, yet current methods face challenges in balancing accuracy, coverage, and resolution. Chemical transport models such as those from the Copernicus Atmosphere Monitoring Service (CAMS) offer continuous data but their relatively coarse resolution can introduce uncertainties. Here we present a synergistic Machine Learning (ML)-based approach called S-MESH (Satellite and ML-based Estimation of Surface air quality at High resolution) for estimating daily surface PM2.5 over Europe at 1 km spatial resolution and demonstrate its performance for the years 2021 and 2022. The approach enhances and downscales the CAMS regional ensemble 24h PM2.5 forecast by training a stacked XGBoost model against station observations, effectively integrating satellite-derived data and modeled meteorological variables. Overall, against station observations, S-MESH (mean absolute error (MAE) of 3.54 μg/m3) shows higher accuracy than the CAMS forecast (MAE of 4.18 μg/m3) and is approaching the accuracy of the CAMS regional interim reanalysis (MAE of 3.21 μg/m3), while exhibiting a significantly reduced mean bias (MB of -0.3 μg/m3 vs. -1.5 μg/m3 for the reanalysis). At the same time, S-MESH requires substantially less computational resources and processing time. At concentrations >20 μg/m3, S-MESH outperforms the reanalysis (MB of -7.3 μg/m3 and -10.3 μg/m3 respectively), and reliably captures high pollution events in both space and time. In the eastern study area, where the reanalysis often underestimates, S-MESH better captures high levels of PM2.5 mostly from residential heating. S-MESH effectively tracks day-to-day variability, with a temporal relative absolute error of 5% (reanalysis 10%). Exhibiting good performance at high pollution events coupled with its high spatial resolution and rapid estimation speed, S-MESH can be highly relevant for air quality assessments where both resolution and timeliness are critical.
期刊介绍:
The Environmental Research journal presents a broad range of interdisciplinary research, focused on addressing worldwide environmental concerns and featuring innovative findings. Our publication strives to explore relevant anthropogenic issues across various environmental sectors, showcasing practical applications in real-life settings.