利用可解释机器学习了解韩国首尔夏季 PM1 构成的驱动因素

Qihua Hu, Jihye Moon and Hwajin Kim*, 
{"title":"利用可解释机器学习了解韩国首尔夏季 PM1 构成的驱动因素","authors":"Qihua Hu,&nbsp;Jihye Moon and Hwajin Kim*,&nbsp;","doi":"10.1021/acsestair.3c0011610.1021/acsestair.3c00116","DOIUrl":null,"url":null,"abstract":"<p >This study leverages explainable machine learning, specifically XGBoost models with Shapley Additive Explanations (SHAP), to explore the chemical properties of atmospheric aerosols in Seoul, Korea, during the summer of 2019. Focusing on non-refractory particulate matter (NR-PM<sub>1</sub>) properties measured by high-resolution time-of-flight aerosol mass spectrometry (HR-ToF-AMS), the research extends to organic aerosol (OA) sources identified via positive matrix factorization of high-resolution MS data. The models achieved good predictive accuracy (<i>R</i><sup>2</sup> &gt; 0.90) for all species concentrations, except for hydrocarbon-like OA (HOA) due to frequent concentration fluctuations. The model outcomes aligned well with those previously achieved using conventional methods (chemical transport model and correlational analysis), confirming that relative humidity is associated with nocturnal nitrate concentration and photochemistry associated with sulfate concentration in the summertime in Seoul. Importantly, the models revealed mostly nonlinear relationships between atmospheric factors, such as temperature and particulate matter (PM) components, thereby deepening the understanding of formation processes. Notably, different potential formation mechanisms were discerned for more oxidized oxygenated OA (MO-OOA) and oxidized primary OA (OPOA). For MO-OOA, SHAP analysis showed a plateau in SHAP values at an O<sub><i>x</i></sub> concentration of 0.085 ppm, which suggested potential fragmentation from further oxidation and agreed with previous chamber experiments. Conversely, the lack of a plateau in the O<sub><i>x</i></sub> values for OPOA implied potential ongoing oxidation, suggesting a higher and longer atmospheric oxidation potential. This approach offers rapid and potential insights into complex atmospheric aerosol formation processes. It is essential to acknowledge that SHAP values do not establish causality, and knowledge of the underlying physical and chemical processes was required to conclude valid and comprehensive interpretations of the ML results.</p><p >This study represents a pioneering effort in applying explainable machine learning techniques to HR-ToF-AMS data, achieving rapid results yet providing potential insights of OA formation mechanisms, such as the oxidation turning point for MO-OOA and OPOA.</p>","PeriodicalId":100014,"journal":{"name":"ACS ES&T Air","volume":"1 9","pages":"960–972 960–972"},"PeriodicalIF":0.0000,"publicationDate":"2024-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.acs.org/doi/epdf/10.1021/acsestair.3c00116","citationCount":"0","resultStr":"{\"title\":\"Understanding the Driving Forces of Summer PM1 Composition in Seoul, Korea, with Explainable Machine Learning\",\"authors\":\"Qihua Hu,&nbsp;Jihye Moon and Hwajin Kim*,&nbsp;\",\"doi\":\"10.1021/acsestair.3c0011610.1021/acsestair.3c00116\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p >This study leverages explainable machine learning, specifically XGBoost models with Shapley Additive Explanations (SHAP), to explore the chemical properties of atmospheric aerosols in Seoul, Korea, during the summer of 2019. Focusing on non-refractory particulate matter (NR-PM<sub>1</sub>) properties measured by high-resolution time-of-flight aerosol mass spectrometry (HR-ToF-AMS), the research extends to organic aerosol (OA) sources identified via positive matrix factorization of high-resolution MS data. The models achieved good predictive accuracy (<i>R</i><sup>2</sup> &gt; 0.90) for all species concentrations, except for hydrocarbon-like OA (HOA) due to frequent concentration fluctuations. The model outcomes aligned well with those previously achieved using conventional methods (chemical transport model and correlational analysis), confirming that relative humidity is associated with nocturnal nitrate concentration and photochemistry associated with sulfate concentration in the summertime in Seoul. Importantly, the models revealed mostly nonlinear relationships between atmospheric factors, such as temperature and particulate matter (PM) components, thereby deepening the understanding of formation processes. Notably, different potential formation mechanisms were discerned for more oxidized oxygenated OA (MO-OOA) and oxidized primary OA (OPOA). For MO-OOA, SHAP analysis showed a plateau in SHAP values at an O<sub><i>x</i></sub> concentration of 0.085 ppm, which suggested potential fragmentation from further oxidation and agreed with previous chamber experiments. Conversely, the lack of a plateau in the O<sub><i>x</i></sub> values for OPOA implied potential ongoing oxidation, suggesting a higher and longer atmospheric oxidation potential. This approach offers rapid and potential insights into complex atmospheric aerosol formation processes. It is essential to acknowledge that SHAP values do not establish causality, and knowledge of the underlying physical and chemical processes was required to conclude valid and comprehensive interpretations of the ML results.</p><p >This study represents a pioneering effort in applying explainable machine learning techniques to HR-ToF-AMS data, achieving rapid results yet providing potential insights of OA formation mechanisms, such as the oxidation turning point for MO-OOA and OPOA.</p>\",\"PeriodicalId\":100014,\"journal\":{\"name\":\"ACS ES&T Air\",\"volume\":\"1 9\",\"pages\":\"960–972 960–972\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-07-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://pubs.acs.org/doi/epdf/10.1021/acsestair.3c00116\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ACS ES&T Air\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://pubs.acs.org/doi/10.1021/acsestair.3c00116\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS ES&T Air","FirstCategoryId":"1085","ListUrlMain":"https://pubs.acs.org/doi/10.1021/acsestair.3c00116","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

本研究利用可解释的机器学习,特别是带有夏普利相加解释(SHAP)的 XGBoost 模型,探索 2019 年夏季韩国首尔大气气溶胶的化学特性。研究重点是通过高分辨率飞行时间气溶胶质谱(HR-ToF-AMS)测量的非难降解颗粒物(NR-PM1)特性,并扩展到通过高分辨率质谱数据的正矩阵因式分解确定的有机气溶胶(OA)来源。模型对所有物种的浓度都达到了良好的预测精度(R2 > 0.90),但碳氢化合物类有机气溶胶(HOA)除外,因为其浓度波动频繁。模型结果与之前使用传统方法(化学传输模型和相关分析)得出的结果非常吻合,证实相对湿度与首尔夏季的夜间硝酸盐浓度相关,光化学与硫酸盐浓度相关。重要的是,模型揭示了温度和颗粒物(PM)成分等大气因素之间的大部分非线性关系,从而加深了对形成过程的理解。值得注意的是,对于更氧化的含氧 OA(MO-OOA)和氧化的原生 OA(OPOA),发现了不同的潜在形成机制。对于 MO-OOA,SHAP 分析表明,当氧化物浓度为 0.085 ppm 时,SHAP 值会达到一个高点,这表明进一步氧化可能会导致碎裂,这与之前的室实验结果一致。相反,OPOA 的 Ox 值没有出现高原,这意味着可能正在发生氧化,表明大气中的氧化潜能更高、时间更长。这种方法可以快速、潜在地了解复杂的大气气溶胶形成过程。必须承认的是,SHAP 值并不能确定因果关系,要对 ML 结果做出有效而全面的解释,还需要了解基本的物理和化学过程。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

摘要图片

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Understanding the Driving Forces of Summer PM1 Composition in Seoul, Korea, with Explainable Machine Learning

This study leverages explainable machine learning, specifically XGBoost models with Shapley Additive Explanations (SHAP), to explore the chemical properties of atmospheric aerosols in Seoul, Korea, during the summer of 2019. Focusing on non-refractory particulate matter (NR-PM1) properties measured by high-resolution time-of-flight aerosol mass spectrometry (HR-ToF-AMS), the research extends to organic aerosol (OA) sources identified via positive matrix factorization of high-resolution MS data. The models achieved good predictive accuracy (R2 > 0.90) for all species concentrations, except for hydrocarbon-like OA (HOA) due to frequent concentration fluctuations. The model outcomes aligned well with those previously achieved using conventional methods (chemical transport model and correlational analysis), confirming that relative humidity is associated with nocturnal nitrate concentration and photochemistry associated with sulfate concentration in the summertime in Seoul. Importantly, the models revealed mostly nonlinear relationships between atmospheric factors, such as temperature and particulate matter (PM) components, thereby deepening the understanding of formation processes. Notably, different potential formation mechanisms were discerned for more oxidized oxygenated OA (MO-OOA) and oxidized primary OA (OPOA). For MO-OOA, SHAP analysis showed a plateau in SHAP values at an Ox concentration of 0.085 ppm, which suggested potential fragmentation from further oxidation and agreed with previous chamber experiments. Conversely, the lack of a plateau in the Ox values for OPOA implied potential ongoing oxidation, suggesting a higher and longer atmospheric oxidation potential. This approach offers rapid and potential insights into complex atmospheric aerosol formation processes. It is essential to acknowledge that SHAP values do not establish causality, and knowledge of the underlying physical and chemical processes was required to conclude valid and comprehensive interpretations of the ML results.

This study represents a pioneering effort in applying explainable machine learning techniques to HR-ToF-AMS data, achieving rapid results yet providing potential insights of OA formation mechanisms, such as the oxidation turning point for MO-OOA and OPOA.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Issue Editorial Masthead Issue Publication Information Atmospheric Radical Chemistry Evolution: A Chemical Scan of the Atmosphere Wildfire Seasons, Prenatal PM2.5 Exposure, and Respiratory Infections by Age 1 Year: A Population-Based Case-Control Analysis of Critical Developmental Windows Chemical Characterization of Organic Aerosol Tracers Derived from Burning Biomass Indigenous to Sub-Saharan Africa: Fresh Emissions versus Photochemical Aging
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1