LGHAP: a Long-term Gap-free High-resolution Air Pollutants concentration dataset derived via tensor flow based multimodal data fusion

K. Bai, Ke Li, Mingliang Ma, Kaitao Li, Zhengqiang Li, Jianping Guo, N. Chang, Zhuo Tan, Di Han
{"title":"LGHAP: a Long-term Gap-free High-resolution Air Pollutants concentration dataset derived via tensor flow based multimodal data fusion","authors":"K. Bai, Ke Li, Mingliang Ma, Kaitao Li, Zhengqiang Li, Jianping Guo, N. Chang, Zhuo Tan, Di Han","doi":"10.5194/essd-2021-404","DOIUrl":null,"url":null,"abstract":"Abstract. Developing a big data analytics framework for generating a Long-term Gap-free High-resolution Air Pollutants concentration dataset (abbreviated as LGHAP) is of great significance for environmental management and earth system science analysis. By synergistically integrating multimodal aerosol data acquired from diverse sources via a tensor flow based data fusion method, a gap-free aerosol optical depth (AOD) dataset with daily 1-km resolution covering the period of 2000–2020 in China was generated. Specifically, data gaps in daily AOD imageries from MODIS aboard Terra were reconstructed based on a set of AOD data tensors acquired from satellites, numerical analysis, and in situ air quality data via integrative efforts of spatial pattern recognition for high dimensional gridded image analysis and knowledge transfer in statistical data mining. To our knowledge, this is the first long-term gap-free high resolution AOD dataset in China, from which spatially contiguous PM2.5 and PM10 concentrations were estimated using an ensemble learning approach. Ground validation results indicate that the LGHAP AOD data are in a good agreement with in situ AOD observations from AERONET, with R of 0.91 and RMSE equaling to 0.21. Meanwhile, PM2.5 and PM10 estimations also agreed well with ground measurements, with R of 0.95 and 0.94 and RMSE of 12.03 and 19.56 μg m−3, respectively. Overall, the LGHAP provides a suite of long-term gap free gridded maps with high-resolution to better examine aerosol changes in China over the past two decades, from which three distinct variation periods of haze pollution were revealed in China. Additionally, the proportion of population exposed to unhealthy PM2.5 was increased from 50.60 % in 2000 to 63.81 % in 2014 across China, which was then drastically reduced to 34.03 % in 2020. Overall, the generated LGHAP aerosol dataset has a great potential to trigger multidisciplinary applications in earth observations, climate change, public health, ecosystem assessment, and environmental management. The daily resolution AOD, PM2.5, and PM10 datasets can be publicly accessed at https://doi.org/10.5281/zenodo.5652257 (Bai et al., 2021a), https://doi.org/10.5281/zenodo.5652265 (Bai et al., 2021b), and https://doi.org/10.5281/zenodo.5652263 (Bai et al., 2021c), respectively. Meanwhile, monthly and annual mean datasets can be found at https://doi.org/10.5281/zenodo.5655797 (Bai et al., 2021d) and https://doi.org/10.5281/zenodo.5655807 (Bai et al., 2021e), respectively. Python, Matlab, R, and IDL codes were also provided to help users read and visualize these data.\n","PeriodicalId":326085,"journal":{"name":"Earth System Science Data Discussions","volume":"37 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Earth System Science Data Discussions","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5194/essd-2021-404","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

Abstract. Developing a big data analytics framework for generating a Long-term Gap-free High-resolution Air Pollutants concentration dataset (abbreviated as LGHAP) is of great significance for environmental management and earth system science analysis. By synergistically integrating multimodal aerosol data acquired from diverse sources via a tensor flow based data fusion method, a gap-free aerosol optical depth (AOD) dataset with daily 1-km resolution covering the period of 2000–2020 in China was generated. Specifically, data gaps in daily AOD imageries from MODIS aboard Terra were reconstructed based on a set of AOD data tensors acquired from satellites, numerical analysis, and in situ air quality data via integrative efforts of spatial pattern recognition for high dimensional gridded image analysis and knowledge transfer in statistical data mining. To our knowledge, this is the first long-term gap-free high resolution AOD dataset in China, from which spatially contiguous PM2.5 and PM10 concentrations were estimated using an ensemble learning approach. Ground validation results indicate that the LGHAP AOD data are in a good agreement with in situ AOD observations from AERONET, with R of 0.91 and RMSE equaling to 0.21. Meanwhile, PM2.5 and PM10 estimations also agreed well with ground measurements, with R of 0.95 and 0.94 and RMSE of 12.03 and 19.56 μg m−3, respectively. Overall, the LGHAP provides a suite of long-term gap free gridded maps with high-resolution to better examine aerosol changes in China over the past two decades, from which three distinct variation periods of haze pollution were revealed in China. Additionally, the proportion of population exposed to unhealthy PM2.5 was increased from 50.60 % in 2000 to 63.81 % in 2014 across China, which was then drastically reduced to 34.03 % in 2020. Overall, the generated LGHAP aerosol dataset has a great potential to trigger multidisciplinary applications in earth observations, climate change, public health, ecosystem assessment, and environmental management. The daily resolution AOD, PM2.5, and PM10 datasets can be publicly accessed at https://doi.org/10.5281/zenodo.5652257 (Bai et al., 2021a), https://doi.org/10.5281/zenodo.5652265 (Bai et al., 2021b), and https://doi.org/10.5281/zenodo.5652263 (Bai et al., 2021c), respectively. Meanwhile, monthly and annual mean datasets can be found at https://doi.org/10.5281/zenodo.5655797 (Bai et al., 2021d) and https://doi.org/10.5281/zenodo.5655807 (Bai et al., 2021e), respectively. Python, Matlab, R, and IDL codes were also provided to help users read and visualize these data.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
lghhap:基于张量流的多模态数据融合获得的长期无间隙高分辨率空气污染物浓度数据集
摘要构建长期无间隙高分辨率大气污染物浓度数据集(简称LGHAP)的大数据分析框架,对环境管理和地球系统科学分析具有重要意义。通过基于张量流的数据融合方法,对不同来源的多模态气溶胶数据进行协同整合,生成了2000-2020年中国无间隙气溶胶光学深度(AOD)日分辨率1 km数据集。具体而言,基于从卫星获取的AOD数据张量、数值分析和现场空气质量数据,通过高维网格图像分析的空间模式识别和统计数据挖掘的知识转移的综合努力,重构Terra上MODIS每日AOD图像的数据缺口。据我们所知,这是中国第一个长期无间隙高分辨率AOD数据集,使用集成学习方法估算了空间连续的PM2.5和PM10浓度。地面验证结果表明,lglhap AOD数据与AERONET现场AOD观测值吻合较好,R为0.91,RMSE为0.21。PM2.5和PM10的R值分别为0.95和0.94,RMSE分别为12.03和19.56 μ m−3。总体而言,LGHAP提供了一套长期无间隙高分辨率网格图,以更好地研究过去20年中国的气溶胶变化,从中揭示了中国雾霾污染的三个不同变化期。此外,全国暴露于不健康PM2.5的人口比例从2000年的50.60%上升到2014年的63.81%,然后大幅下降到2020年的34.03%。总的来说,生成的LGHAP气溶胶数据集在地球观测、气候变化、公共卫生、生态系统评估和环境管理等领域具有很大的应用潜力。日分辨率AOD、PM2.5和PM10数据集可分别在https://doi.org/10.5281/zenodo.5652257 (Bai et al., 2021a)、https://doi.org/10.5281/zenodo.5652265 (Bai et al., 2021b)和https://doi.org/10.5281/zenodo.5652263 (Bai et al., 2021c)公开访问。同时,月均数据集可在https://doi.org/10.5281/zenodo.5655797 (Bai et al., 2021d)和https://doi.org/10.5281/zenodo.5655807 (Bai et al., 2021e)上找到。还提供了Python、Matlab、R和IDL代码来帮助用户读取和可视化这些数据。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
New SMOS SSS maps in the framework of the Earth Observation data For Science and Innovation in the Black Sea LGHAP: a Long-term Gap-free High-resolution Air Pollutants concentration dataset derived via tensor flow based multimodal data fusion Pre- and post-production processes along supply chains increasingly dominate GHG emissions from agri-food systems globally and in most countries Last Interglacial sea-level data points from Northwest Europe A machine learning approach to address air quality changes during the COVID-19 lockdown in Buenos Aires, Argentina
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1