{"title":"通过多元线性回归、一般加法模型和随机森林方法对 PM2.5 气象归一化进行比较","authors":"Ling Qi , Haotian Zheng , Dian Ding , Shuxiao Wang","doi":"10.1016/j.atmosenv.2024.120854","DOIUrl":null,"url":null,"abstract":"<div><div>PM<sub>2.5</sub> is still one of the major atmospheric pollutants worldwide. Extracting contributions of anthropogenic emission control from the observed PM<sub>2.5</sub> variations (PM<sub>2.5_anth</sub>), which are also strongly affected by meteorological changes, is critical for effective pollution control. Statistical and machine learning methods are usually used for such purpose, but the effectiveness of these methods is hard to evaluate due to the lack of observed anthropogenic contributions. In this study, we use the chemical transport model GEOS-Chem standard simulation to mimic PM<sub>2.5</sub> variability in the real atmosphere, and use the model simulation with fixed meteorological fields as the “true value” for PM<sub>2.5_anth</sub>. We evaluate the effectiveness of three methods in meteorological normalization of PM<sub>2.5</sub> on decadal (2006–2017) and synoptic (one month) scale: multiple linear regression (MLR), general additive model (GAM), and random forest (RF) algorithm. For meteorological normalization of PM<sub>2.5</sub> on decadal scale, 67–72% of the MLR simulations show positive biases and 56–75% of the RF simulations show negative biases. The “true value” of PM<sub>2.5_anth</sub> falls within the range of meteorological normalization results of the three methods in most cases, but consistent positive/negative biases are observed in ∼30% of the cases, when meteorological changes dominate PM<sub>2.5</sub> variability. In addition, the biases are correlated to the contribution of meteorological changes. As such, multiple statistical or machine learning methods are recommended to quantify the uncertainties associated with method choice in cases anthropogenic emission changes dominate PM<sub>2.5</sub> variability. On synoptic scale, RF performs better in reproducing the daily variations of the PM<sub>2.5_anth</sub> differences than MLR (GAM) in all (83% of) the cases, and is recommended for meteorological normalization of PM<sub>2.5</sub> in short-term in eastern China.</div></div>","PeriodicalId":250,"journal":{"name":"Atmospheric Environment","volume":"338 ","pages":"Article 120854"},"PeriodicalIF":4.2000,"publicationDate":"2024-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A comparison of meteorological normalization of PM2.5 by multiple linear regression, general additive model, and random forest methods\",\"authors\":\"Ling Qi , Haotian Zheng , Dian Ding , Shuxiao Wang\",\"doi\":\"10.1016/j.atmosenv.2024.120854\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>PM<sub>2.5</sub> is still one of the major atmospheric pollutants worldwide. Extracting contributions of anthropogenic emission control from the observed PM<sub>2.5</sub> variations (PM<sub>2.5_anth</sub>), which are also strongly affected by meteorological changes, is critical for effective pollution control. Statistical and machine learning methods are usually used for such purpose, but the effectiveness of these methods is hard to evaluate due to the lack of observed anthropogenic contributions. In this study, we use the chemical transport model GEOS-Chem standard simulation to mimic PM<sub>2.5</sub> variability in the real atmosphere, and use the model simulation with fixed meteorological fields as the “true value” for PM<sub>2.5_anth</sub>. We evaluate the effectiveness of three methods in meteorological normalization of PM<sub>2.5</sub> on decadal (2006–2017) and synoptic (one month) scale: multiple linear regression (MLR), general additive model (GAM), and random forest (RF) algorithm. For meteorological normalization of PM<sub>2.5</sub> on decadal scale, 67–72% of the MLR simulations show positive biases and 56–75% of the RF simulations show negative biases. The “true value” of PM<sub>2.5_anth</sub> falls within the range of meteorological normalization results of the three methods in most cases, but consistent positive/negative biases are observed in ∼30% of the cases, when meteorological changes dominate PM<sub>2.5</sub> variability. In addition, the biases are correlated to the contribution of meteorological changes. As such, multiple statistical or machine learning methods are recommended to quantify the uncertainties associated with method choice in cases anthropogenic emission changes dominate PM<sub>2.5</sub> variability. On synoptic scale, RF performs better in reproducing the daily variations of the PM<sub>2.5_anth</sub> differences than MLR (GAM) in all (83% of) the cases, and is recommended for meteorological normalization of PM<sub>2.5</sub> in short-term in eastern China.</div></div>\",\"PeriodicalId\":250,\"journal\":{\"name\":\"Atmospheric Environment\",\"volume\":\"338 \",\"pages\":\"Article 120854\"},\"PeriodicalIF\":4.2000,\"publicationDate\":\"2024-10-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Atmospheric Environment\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1352231024005296\",\"RegionNum\":2,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENVIRONMENTAL SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Atmospheric Environment","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1352231024005296","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
A comparison of meteorological normalization of PM2.5 by multiple linear regression, general additive model, and random forest methods
PM2.5 is still one of the major atmospheric pollutants worldwide. Extracting contributions of anthropogenic emission control from the observed PM2.5 variations (PM2.5_anth), which are also strongly affected by meteorological changes, is critical for effective pollution control. Statistical and machine learning methods are usually used for such purpose, but the effectiveness of these methods is hard to evaluate due to the lack of observed anthropogenic contributions. In this study, we use the chemical transport model GEOS-Chem standard simulation to mimic PM2.5 variability in the real atmosphere, and use the model simulation with fixed meteorological fields as the “true value” for PM2.5_anth. We evaluate the effectiveness of three methods in meteorological normalization of PM2.5 on decadal (2006–2017) and synoptic (one month) scale: multiple linear regression (MLR), general additive model (GAM), and random forest (RF) algorithm. For meteorological normalization of PM2.5 on decadal scale, 67–72% of the MLR simulations show positive biases and 56–75% of the RF simulations show negative biases. The “true value” of PM2.5_anth falls within the range of meteorological normalization results of the three methods in most cases, but consistent positive/negative biases are observed in ∼30% of the cases, when meteorological changes dominate PM2.5 variability. In addition, the biases are correlated to the contribution of meteorological changes. As such, multiple statistical or machine learning methods are recommended to quantify the uncertainties associated with method choice in cases anthropogenic emission changes dominate PM2.5 variability. On synoptic scale, RF performs better in reproducing the daily variations of the PM2.5_anth differences than MLR (GAM) in all (83% of) the cases, and is recommended for meteorological normalization of PM2.5 in short-term in eastern China.
期刊介绍:
Atmospheric Environment has an open access mirror journal Atmospheric Environment: X, sharing the same aims and scope, editorial team, submission system and rigorous peer review.
Atmospheric Environment is the international journal for scientists in different disciplines related to atmospheric composition and its impacts. The journal publishes scientific articles with atmospheric relevance of emissions and depositions of gaseous and particulate compounds, chemical processes and physical effects in the atmosphere, as well as impacts of the changing atmospheric composition on human health, air quality, climate change, and ecosystems.