Pub Date : 2024-05-30DOI: 10.5194/essd-16-2525-2024
Francesca Lappin, Gijs de Boer, Petra Klein, Jonathan Hamilton, Michelle Spencer, Radiance Calmer, Antonio R. Segales, Michael Rhodes, Tyler M. Bell, Justin Buchli, Kelsey Britt, Elizabeth Asher, Isaac Medina, Brian Butterworth, Leia Otterstatter, Madison Ritsch, Bryony Puxley, Angelina Miller, Arianna Jordan, Ceu Gomez-Faulk, Elizabeth Smith, Steven Borenstein, Troy Thornberry, Brian Argrow, Elizabeth Pillar-Little
Abstract. The main goal of the TRacking Aerosol Convection interactions ExpeRiment (TRACER) project was to further understand the role that regional circulations and aerosol loading play in the convective cloud life cycle across the greater Houston, Texas, area. To accomplish this goal, the United States Department of Energy and research partners collaborated to deploy atmospheric observing systems across the region. Cloud and precipitation radars, radiosondes, and air quality sensors captured atmospheric and cloud characteristics. A dense lower-atmospheric dataset was developed using ground-based remote sensors, a tethersonde, and uncrewed aerial systems (UASs). TRACER-UAS is a subproject that deployed two UAS platforms to gather high-resolution observations in the lower atmosphere between 1 June and 30 September 2022. The University of Oklahoma CopterSonde and the University of Colorado Boulder RAAVEN (Robust Autonomous Aerial Vehicle – Endurant Nimble) were flown at two coastal locations between the Gulf of Mexico and Houston. The University of Colorado Boulder RAAVEN gathered measurements of atmospheric thermodynamic state, winds and turbulence, and aerosol size distribution. Meanwhile, the University of Oklahoma CopterSonde system operated on a regular basis to resolve the vertical structure of the thermodynamic and kinematic state. Together, a complementary dataset of over 200 flight hours across 61 d was generated, and data from each platform proved to be in strong agreement. In this paper, the platforms and respective data collection and processing are described. The dataset described herein provides information on boundary layer evolution, the sea breeze circulation, conditions prior to and nearby deep convection, and the vertical structure and evolution of aerosols. The quality-controlled TRACER-UAS observations from the CopterSonde and RAAVEN can be found at https://doi.org/10.5439/1969004 (Lappin, 2023) and https://doi.org/10.5439/1985470 (de Boer, 2023), respectively.
{"title":"Data collected using small uncrewed aircraft systems during the TRacking Aerosol Convection interactions ExpeRiment (TRACER)","authors":"Francesca Lappin, Gijs de Boer, Petra Klein, Jonathan Hamilton, Michelle Spencer, Radiance Calmer, Antonio R. Segales, Michael Rhodes, Tyler M. Bell, Justin Buchli, Kelsey Britt, Elizabeth Asher, Isaac Medina, Brian Butterworth, Leia Otterstatter, Madison Ritsch, Bryony Puxley, Angelina Miller, Arianna Jordan, Ceu Gomez-Faulk, Elizabeth Smith, Steven Borenstein, Troy Thornberry, Brian Argrow, Elizabeth Pillar-Little","doi":"10.5194/essd-16-2525-2024","DOIUrl":"https://doi.org/10.5194/essd-16-2525-2024","url":null,"abstract":"Abstract. The main goal of the TRacking Aerosol Convection interactions ExpeRiment (TRACER) project was to further understand the role that regional circulations and aerosol loading play in the convective cloud life cycle across the greater Houston, Texas, area. To accomplish this goal, the United States Department of Energy and research partners collaborated to deploy atmospheric observing systems across the region. Cloud and precipitation radars, radiosondes, and air quality sensors captured atmospheric and cloud characteristics. A dense lower-atmospheric dataset was developed using ground-based remote sensors, a tethersonde, and uncrewed aerial systems (UASs). TRACER-UAS is a subproject that deployed two UAS platforms to gather high-resolution observations in the lower atmosphere between 1 June and 30 September 2022. The University of Oklahoma CopterSonde and the University of Colorado Boulder RAAVEN (Robust Autonomous Aerial Vehicle – Endurant Nimble) were flown at two coastal locations between the Gulf of Mexico and Houston. The University of Colorado Boulder RAAVEN gathered measurements of atmospheric thermodynamic state, winds and turbulence, and aerosol size distribution. Meanwhile, the University of Oklahoma CopterSonde system operated on a regular basis to resolve the vertical structure of the thermodynamic and kinematic state. Together, a complementary dataset of over 200 flight hours across 61 d was generated, and data from each platform proved to be in strong agreement. In this paper, the platforms and respective data collection and processing are described. The dataset described herein provides information on boundary layer evolution, the sea breeze circulation, conditions prior to and nearby deep convection, and the vertical structure and evolution of aerosols. The quality-controlled TRACER-UAS observations from the CopterSonde and RAAVEN can be found at https://doi.org/10.5439/1969004 (Lappin, 2023) and https://doi.org/10.5439/1985470 (de Boer, 2023), respectively.","PeriodicalId":48747,"journal":{"name":"Earth System Science Data","volume":null,"pages":null},"PeriodicalIF":11.4,"publicationDate":"2024-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141177230","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Abstract. This paper presents an observational dataset on submesoscale eddies obtained from high–resolution chlorophyll–a data captured by GOCI I. Our methodology involves a combination of digital image processing, filtering, and object detection techniques, along with specific chlorophyll–a image enhancement procedure to extract essential information about submesoscale eddies. This information includes their time, polarity, geographical coordinates of the eddy center, eddy radius, coordinates of the upper left and lower right corners of the prediction box, area of the eddy's inner ellipse, and confidence score. The dataset spans eight time intervals, ranging from 00:00 to 08:00 (UTC) daily, covering the period from April 1, 2011, to March 31, 2021. A total of 19,136 anticyclonic eddies and 93,897 cyclonic eddies were identified with a confidence minimum of 0.2. The mean radius of anticyclonic eddies is 24.44 km (range 2.5 km to 44.25 km), while that of cyclonic eddies is 12.34 km (range 1.75 km to 44 km). This unprecedented hourly resolution dataset on submesoscale eddies offers valuable insights into their distribution, morphology, and energy dissipation. It significantly contributes to our understanding of marine environments, ecosystems and the improvement of climate model predictions. The dataset is available at https://doi.org/10.5281/zenodo.7694115 (Wang and Yang, 2023).
{"title":"A Submesoscale Eddy Identification Dataset in the Northwest Pacific Ocean Derived from GOCI I Chlorophyll–a Data based on Deep Learning","authors":"Yan Wang, Jie Yang, Ge Chen","doi":"10.5194/essd-2024-188","DOIUrl":"https://doi.org/10.5194/essd-2024-188","url":null,"abstract":"<strong>Abstract.</strong> This paper presents an observational dataset on submesoscale eddies obtained from high–resolution chlorophyll–a data captured by GOCI I. Our methodology involves a combination of digital image processing, filtering, and object detection techniques, along with specific chlorophyll–a image enhancement procedure to extract essential information about submesoscale eddies. This information includes their time, polarity, geographical coordinates of the eddy center, eddy radius, coordinates of the upper left and lower right corners of the prediction box, area of the eddy's inner ellipse, and confidence score. The dataset spans eight time intervals, ranging from 00:00 to 08:00 (UTC) daily, covering the period from April 1, 2011, to March 31, 2021. A total of 19,136 anticyclonic eddies and 93,897 cyclonic eddies were identified with a confidence minimum of 0.2. The mean radius of anticyclonic eddies is 24.44 km (range 2.5 km to 44.25 km), while that of cyclonic eddies is 12.34 km (range 1.75 km to 44 km). This unprecedented hourly resolution dataset on submesoscale eddies offers valuable insights into their distribution, morphology, and energy dissipation. It significantly contributes to our understanding of marine environments, ecosystems and the improvement of climate model predictions. The dataset is available at https://doi.org/10.5281/zenodo.7694115 (Wang and Yang, 2023).","PeriodicalId":48747,"journal":{"name":"Earth System Science Data","volume":null,"pages":null},"PeriodicalIF":11.4,"publicationDate":"2024-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141177309","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Abstract. Accurate long-term daily cloud-gap-filled fractional snow cover products are essential for climate change and snow hydrological studies in the Asian Water Tower (AWT) region, but existing Moderate Resolution Imaging Spectroradiometer (MODIS) snow cover products are not sufficient. In this study, the multiple-endmember spectral mixture analysis algorithm based on automatic endmember extraction (MESMA-AGE) and the multistep spatiotemporal interpolation algorithm (MSTI) are used to produce the MODIS daily cloud-gap-filled fractional snow cover product over the AWT region (AWT MODIS FSC). The AWT MODIS FSC products have a spatial resolution of 0.005° and span from 2000 to 2022. The 2745 scenes of Landsat-8 images are used for the areal-scale accuracy assessment. The fractional snow cover accuracy metrics, including the coefficient of determination (R2), root mean squared error (RMSE) and mean absolute error (MAE), are 0.80, 0.16 and 0.10, respectively. The binarized identification accuracy metrics, including overall accuracy (OA), producer's accuracy (PA) and user's accuracy (UA), are 95.17 %, 97.34 % and 97.59 %, respectively. Snow depth data observed at 175 meteorological stations are used to evaluate accuracy at the point scale, yielding the following accuracy metrics: an OA of 93.26 %, a PA of 84.41 %, a UA of 82.14 % and a Cohen kappa (CK) value of 0.79. Snow depth observations from meteorological stations are also used to assess the fractional snow cover resulting from different weather conditions, with an OA of 95.36 % (88.96 %), a PA of 87.75 % (82.26 %), a UA of 86.86 % (78.86 %) and a CK of 0.84 (0.72) under the MODIS clear-sky observations (spatiotemporal reconstruction based on the MSTI algorithm). The AWT MODIS FSC product can provide quantitative spatial distribution information on snowpacks for mountain hydrological models, land surface models and numerical weather prediction in the Asian Water Tower region. This dataset is freely available from the National Tibetan Plateau Data Center at https://doi.org/10.11888/Cryos.tpdc.272503 (Jiang et al., 2022) or from the Zenodo platform at https://doi.org/10.5281/zenodo.10005826 (Jiang et al., 2023a).
{"title":"MODIS daily cloud-gap-filled fractional snow cover dataset of the Asian Water Tower region (2000–2022)","authors":"Fangbo Pan, Lingmei Jiang, Gongxue Wang, Jinmei Pan, Jinyu Huang, Cheng Zhang, Huizhen Cui, Jianwei Yang, Zhaojun Zheng, Shengli Wu, Jiancheng Shi","doi":"10.5194/essd-16-2501-2024","DOIUrl":"https://doi.org/10.5194/essd-16-2501-2024","url":null,"abstract":"Abstract. Accurate long-term daily cloud-gap-filled fractional snow cover products are essential for climate change and snow hydrological studies in the Asian Water Tower (AWT) region, but existing Moderate Resolution Imaging Spectroradiometer (MODIS) snow cover products are not sufficient. In this study, the multiple-endmember spectral mixture analysis algorithm based on automatic endmember extraction (MESMA-AGE) and the multistep spatiotemporal interpolation algorithm (MSTI) are used to produce the MODIS daily cloud-gap-filled fractional snow cover product over the AWT region (AWT MODIS FSC). The AWT MODIS FSC products have a spatial resolution of 0.005° and span from 2000 to 2022. The 2745 scenes of Landsat-8 images are used for the areal-scale accuracy assessment. The fractional snow cover accuracy metrics, including the coefficient of determination (R2), root mean squared error (RMSE) and mean absolute error (MAE), are 0.80, 0.16 and 0.10, respectively. The binarized identification accuracy metrics, including overall accuracy (OA), producer's accuracy (PA) and user's accuracy (UA), are 95.17 %, 97.34 % and 97.59 %, respectively. Snow depth data observed at 175 meteorological stations are used to evaluate accuracy at the point scale, yielding the following accuracy metrics: an OA of 93.26 %, a PA of 84.41 %, a UA of 82.14 % and a Cohen kappa (CK) value of 0.79. Snow depth observations from meteorological stations are also used to assess the fractional snow cover resulting from different weather conditions, with an OA of 95.36 % (88.96 %), a PA of 87.75 % (82.26 %), a UA of 86.86 % (78.86 %) and a CK of 0.84 (0.72) under the MODIS clear-sky observations (spatiotemporal reconstruction based on the MSTI algorithm). The AWT MODIS FSC product can provide quantitative spatial distribution information on snowpacks for mountain hydrological models, land surface models and numerical weather prediction in the Asian Water Tower region. This dataset is freely available from the National Tibetan Plateau Data Center at https://doi.org/10.11888/Cryos.tpdc.272503 (Jiang et al., 2022) or from the Zenodo platform at https://doi.org/10.5281/zenodo.10005826 (Jiang et al., 2023a).","PeriodicalId":48747,"journal":{"name":"Earth System Science Data","volume":null,"pages":null},"PeriodicalIF":11.4,"publicationDate":"2024-05-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141177400","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-05-24DOI: 10.5194/essd-16-2483-2024
Hui Shen, Robert N. Spengler, Xinying Zhou, Alison Betts, Peter Weiming Jia, Keliang Zhao, Xiaoqiang Li
Abstract. Due largely to demographic growth, agricultural populations during the Holocene became increasingly more impactful ecosystem engineers. Multidisciplinary research has revealed a deep history of human–environmental dynamics; however, these pre-modern anthropogenic ecosystem transformations and cultural adaptions are still poorly understood. Here, we synthesis anthracological data to explore the complex array of human–environmental interactions in the regions of the prehistoric Silk Road. Our results suggest that these ancient humans were not passively impacted by environmental change; rather, they culturally adapted to, and in turn altered, arid ecosystems. Underpinned by the establishment of complex agricultural systems on the western Loess Plateau, people may have started to manage chestnut trees, likely through conservation of economically significant species, as early as 4600 BP. Since ca. 3500 BP, with the appearance of high-yielding wheat and barley farming in Xinjiang and the Hexi Corridor, people appear to have been cultivating Prunus and Morus trees. We also argue that people were transporting preferred coniferous woods over long distances to meet the need for fuel and timber. After 2500 BP, people in our study area were making conscious selections between wood types for craft production and were also clearly cultivating a wide range of long-generation perennials, showing a remarkable traditional knowledge tied into the arid environment. At the same time, the data suggest that there was significant deforestation throughout the chronology of occupation, including a rapid decline of slow-growing spruce forests and riparian woodlands across northwestern China. The wood charcoal dataset is publicly available at https://doi.org/10.5281/zenodo.8158277 (Shen et al., 2023).
{"title":"Seeing the wood for the trees: active human–environmental interactions in arid northwestern China","authors":"Hui Shen, Robert N. Spengler, Xinying Zhou, Alison Betts, Peter Weiming Jia, Keliang Zhao, Xiaoqiang Li","doi":"10.5194/essd-16-2483-2024","DOIUrl":"https://doi.org/10.5194/essd-16-2483-2024","url":null,"abstract":"Abstract. Due largely to demographic growth, agricultural populations during the Holocene became increasingly more impactful ecosystem engineers. Multidisciplinary research has revealed a deep history of human–environmental dynamics; however, these pre-modern anthropogenic ecosystem transformations and cultural adaptions are still poorly understood. Here, we synthesis anthracological data to explore the complex array of human–environmental interactions in the regions of the prehistoric Silk Road. Our results suggest that these ancient humans were not passively impacted by environmental change; rather, they culturally adapted to, and in turn altered, arid ecosystems. Underpinned by the establishment of complex agricultural systems on the western Loess Plateau, people may have started to manage chestnut trees, likely through conservation of economically significant species, as early as 4600 BP. Since ca. 3500 BP, with the appearance of high-yielding wheat and barley farming in Xinjiang and the Hexi Corridor, people appear to have been cultivating Prunus and Morus trees. We also argue that people were transporting preferred coniferous woods over long distances to meet the need for fuel and timber. After 2500 BP, people in our study area were making conscious selections between wood types for craft production and were also clearly cultivating a wide range of long-generation perennials, showing a remarkable traditional knowledge tied into the arid environment. At the same time, the data suggest that there was significant deforestation throughout the chronology of occupation, including a rapid decline of slow-growing spruce forests and riparian woodlands across northwestern China. The wood charcoal dataset is publicly available at https://doi.org/10.5281/zenodo.8158277 (Shen et al., 2023).","PeriodicalId":48747,"journal":{"name":"Earth System Science Data","volume":null,"pages":null},"PeriodicalIF":11.4,"publicationDate":"2024-05-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141092069","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-05-24DOI: 10.5194/essd-16-2465-2024
Daju Wang, Peiyang Ren, Xiaosheng Xia, Lei Fan, Zhangcai Qin, Xiuzhi Chen, Wenping Yuan
Abstract. Forest harvesting is one of the anthropogenic activities that most significantly affect the carbon budget of forests. However, the absence of explicit spatial information on harvested carbon poses a huge challenge in assessing forest-harvesting impacts, as well as the forest carbon budget. This study utilized provincial-level statistical data on wood harvest, the tree cover loss (TCL) dataset, and a satellite-based vegetation index to develop a Long-term harvEst and Allocation of Forest Biomass (LEAF) dataset. The aim was to provide the spatial location of forest harvesting with a spatial resolution of 30 m and to quantify the post-harvest carbon dynamics. The validations against the surveyed forest harvesting in 133 cities and counties indicated a good performance of the LEAF dataset in capturing the spatial variation of harvested carbon, with a coefficient of determination (R2) of 0.83 between the identified and surveyed harvested carbon. The linear regression slope was up to 0.99. Averaged from 2003 to 2018, forest harvesting removed 68.3 ± 9.3 Mt C yr−1, of which more than 80 % was from selective logging. Of the harvested carbon, 19.6 ± 4.0 %, 2.1 ± 1.1 %, 35.5 ± 12.6 % 6.2 ± 0.3 %, 17.5 ± 0.9 %, and 19.1 ± 9.8 % entered the fuelwood, paper and paperboard, wood-based panels, solid wooden furniture, structural constructions, and residue pools, respectively. Direct combustion of fuelwood was the primary source of carbon emissions after wood harvest. However, carbon can be stored in wood products for a long time, and by 2100, almost 40 % of the carbon harvested during the study period will still be retained. This dataset is expected to provide a foundation and reference for estimating the forestry and national carbon budgets. The 30 m × 30 m harvested-carbon dataset from forests in China can be downloaded at https://doi.org/10.6084/m9.figshare.23641164.v2 (Wang et al., 2023).
{"title":"National forest carbon harvesting and allocation dataset for the period 2003 to 2018","authors":"Daju Wang, Peiyang Ren, Xiaosheng Xia, Lei Fan, Zhangcai Qin, Xiuzhi Chen, Wenping Yuan","doi":"10.5194/essd-16-2465-2024","DOIUrl":"https://doi.org/10.5194/essd-16-2465-2024","url":null,"abstract":"Abstract. Forest harvesting is one of the anthropogenic activities that most significantly affect the carbon budget of forests. However, the absence of explicit spatial information on harvested carbon poses a huge challenge in assessing forest-harvesting impacts, as well as the forest carbon budget. This study utilized provincial-level statistical data on wood harvest, the tree cover loss (TCL) dataset, and a satellite-based vegetation index to develop a Long-term harvEst and Allocation of Forest Biomass (LEAF) dataset. The aim was to provide the spatial location of forest harvesting with a spatial resolution of 30 m and to quantify the post-harvest carbon dynamics. The validations against the surveyed forest harvesting in 133 cities and counties indicated a good performance of the LEAF dataset in capturing the spatial variation of harvested carbon, with a coefficient of determination (R2) of 0.83 between the identified and surveyed harvested carbon. The linear regression slope was up to 0.99. Averaged from 2003 to 2018, forest harvesting removed 68.3 ± 9.3 Mt C yr−1, of which more than 80 % was from selective logging. Of the harvested carbon, 19.6 ± 4.0 %, 2.1 ± 1.1 %, 35.5 ± 12.6 % 6.2 ± 0.3 %, 17.5 ± 0.9 %, and 19.1 ± 9.8 % entered the fuelwood, paper and paperboard, wood-based panels, solid wooden furniture, structural constructions, and residue pools, respectively. Direct combustion of fuelwood was the primary source of carbon emissions after wood harvest. However, carbon can be stored in wood products for a long time, and by 2100, almost 40 % of the carbon harvested during the study period will still be retained. This dataset is expected to provide a foundation and reference for estimating the forestry and national carbon budgets. The 30 m × 30 m harvested-carbon dataset from forests in China can be downloaded at https://doi.org/10.6084/m9.figshare.23641164.v2 (Wang et al., 2023).","PeriodicalId":48747,"journal":{"name":"Earth System Science Data","volume":null,"pages":null},"PeriodicalIF":11.4,"publicationDate":"2024-05-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141092029","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Abstract. High-quality gridded data on industrial water use are vital for research and water resource management. However, such data in China usually have low accuracy. In this study, we developed a gridded dataset of monthly industrial water withdrawal (IWW) for China, which is called the China Industrial Water Withdrawal (CIWW) dataset; this dataset spans a 56-year period from 1965 to 2020 at spatial resolutions of 0.1 and 0.25°. We utilized > 400 000 records of industrial enterprises, monthly industrial product output data, and continuous statistical IWW records from 1965 to 2020 to facilitate spatial scaling, seasonal allocation, and long-term temporal coverage in developing the dataset. Our CIWW dataset is a significant improvement in comparison to previous data for the characterization of the spatial and seasonal patterns of the IWW dynamics in China and achieves better consistency with statistical records at the local scale. The CIWW dataset, together with its methodology and auxiliary data, will be useful for water resource management and hydrological models. This new dataset is now available at https://doi.org/10.6084/m9.figshare.21901074 (Hou and Li, 2023).
{"title":"High-resolution mapping of monthly industrial water withdrawal in China from 1965 to 2020","authors":"Chengcheng Hou, Yan Li, Shan Sang, Xu Zhao, Yanxu Liu, Yinglu Liu, Fang Zhao","doi":"10.5194/essd-16-2449-2024","DOIUrl":"https://doi.org/10.5194/essd-16-2449-2024","url":null,"abstract":"Abstract. High-quality gridded data on industrial water use are vital for research and water resource management. However, such data in China usually have low accuracy. In this study, we developed a gridded dataset of monthly industrial water withdrawal (IWW) for China, which is called the China Industrial Water Withdrawal (CIWW) dataset; this dataset spans a 56-year period from 1965 to 2020 at spatial resolutions of 0.1 and 0.25°. We utilized > 400 000 records of industrial enterprises, monthly industrial product output data, and continuous statistical IWW records from 1965 to 2020 to facilitate spatial scaling, seasonal allocation, and long-term temporal coverage in developing the dataset. Our CIWW dataset is a significant improvement in comparison to previous data for the characterization of the spatial and seasonal patterns of the IWW dynamics in China and achieves better consistency with statistical records at the local scale. The CIWW dataset, together with its methodology and auxiliary data, will be useful for water resource management and hydrological models. This new dataset is now available at https://doi.org/10.6084/m9.figshare.21901074 (Hou and Li, 2023).","PeriodicalId":48747,"journal":{"name":"Earth System Science Data","volume":null,"pages":null},"PeriodicalIF":11.4,"publicationDate":"2024-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141079312","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Abstract. Climate change has precipitated recurrent extreme events and emerged as an imposing global challenge, exerting profound and far-reaching impacts on both the environment and human existence. The Universal Thermal Climate Index (UTCI), serving as an important approach to human comfort assessment, plays a pivotal role in gauging how humans adapt to meteorological conditions and copes with thermal and cold stress. However, the existing UTCI datasets still grapple with limitations in terms of data availability, hindering their effective application across diverse domains. We have produced GloUTCI-M, a monthly UTCI dataset boasting global coverage and an extensive time series spanning March 2000 to October 2022, with a high spatial resolution of 1 km. This dataset is the product of a comprehensive approach leveraging multiple data sources and advanced machine learning models. Our findings underscored the superior predictive capabilities of CatBoost in forecasting the UTCI (mean absolute error, MAE = 0.747 °C; root mean square error, RMSE = 0.943 °C; and coefficient of determination, R2=0.994) when compared to machine learning models such as XGBoost and LightGBM. Utilizing GloUTCI-M, the geographical boundaries of cold stress and thermal stress areas at global scale were effectively delineated. Spanning 2001–2021, the mean annual global UTCI was recorded at 17.24 °C, with a pronounced upward trend. Countries like Russia and Brazil emerged as key contributors to the mean annual global UTCI increasing, while countries like China and India exerted a more inhibitory influence on this trend. Furthermore, in contrast to existing UTCI datasets, GloUTCI-M excelled at portraying UTCI distribution at finer spatial resolutions, augmenting data accuracy. This dataset can enhance our capacity to evaluate thermal stress experienced by humans, offering substantial prospects across a wide array of applications. GloUTCI-M is publicly available at https://doi.org/10.5281/zenodo.8310513 (Yang et al., 2023).
{"title":"GloUTCI-M: a global monthly 1 km Universal Thermal Climate Index dataset from 2000 to 2022","authors":"Zhiwei Yang, Jian Peng, Yanxu Liu, Song Jiang, Xueyan Cheng, Xuebang Liu, Jianquan Dong, Tiantian Hua, Xiaoyu Yu","doi":"10.5194/essd-16-2407-2024","DOIUrl":"https://doi.org/10.5194/essd-16-2407-2024","url":null,"abstract":"Abstract. Climate change has precipitated recurrent extreme events and emerged as an imposing global challenge, exerting profound and far-reaching impacts on both the environment and human existence. The Universal Thermal Climate Index (UTCI), serving as an important approach to human comfort assessment, plays a pivotal role in gauging how humans adapt to meteorological conditions and copes with thermal and cold stress. However, the existing UTCI datasets still grapple with limitations in terms of data availability, hindering their effective application across diverse domains. We have produced GloUTCI-M, a monthly UTCI dataset boasting global coverage and an extensive time series spanning March 2000 to October 2022, with a high spatial resolution of 1 km. This dataset is the product of a comprehensive approach leveraging multiple data sources and advanced machine learning models. Our findings underscored the superior predictive capabilities of CatBoost in forecasting the UTCI (mean absolute error, MAE = 0.747 °C; root mean square error, RMSE = 0.943 °C; and coefficient of determination, R2=0.994) when compared to machine learning models such as XGBoost and LightGBM. Utilizing GloUTCI-M, the geographical boundaries of cold stress and thermal stress areas at global scale were effectively delineated. Spanning 2001–2021, the mean annual global UTCI was recorded at 17.24 °C, with a pronounced upward trend. Countries like Russia and Brazil emerged as key contributors to the mean annual global UTCI increasing, while countries like China and India exerted a more inhibitory influence on this trend. Furthermore, in contrast to existing UTCI datasets, GloUTCI-M excelled at portraying UTCI distribution at finer spatial resolutions, augmenting data accuracy. This dataset can enhance our capacity to evaluate thermal stress experienced by humans, offering substantial prospects across a wide array of applications. GloUTCI-M is publicly available at https://doi.org/10.5281/zenodo.8310513 (Yang et al., 2023).","PeriodicalId":48747,"journal":{"name":"Earth System Science Data","volume":null,"pages":null},"PeriodicalIF":11.4,"publicationDate":"2024-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141079124","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Abstract. The Long-term Gap-free High-resolution Air Pollutants (LGHAP) concentration dataset generated in our previous study has provided spatially contiguous daily aerosol optical depth (AOD) and fine particulate matter (PM2.5) concentrations at a 1 km grid resolution in China since 2000. This advancement empowered unprecedented assessments of regional aerosol variations and their influence on the environment, health, and climate over the past 20 years. However, there is a need to enhance such a high-quality AOD and PM2.5 concentration dataset with new robust features and extended spatial coverage. In this study, we present version 2 of a global-scale LGHAP dataset (LGHAP v2), which was generated using improved big Earth data analytics via a seamless integration of versatile data science, pattern recognition, and machine learning methods. Specifically, multimodal AODs and air quality measurements acquired from relevant satellites, ground monitoring stations, and numerical models were harmonized by harnessing the capability of random-forest-based data-driven models. Subsequently, an improved tensor-flow-based AOD reconstruction algorithm was developed to weave the harmonized multisource AOD products together for filling data gaps in Multi-Angle Implementation of Atmospheric Correction (MAIAC) AOD retrievals from Terra. The results of the ablation experiments demonstrated better performance of the improved tensor-flow-based gap-filling method in terms of both convergence speed and data accuracy. Ground-based validation results indicated good data accuracy of this global gap-free AOD dataset, with a correlation coefficient (R) of 0.85 and a root mean square error (RMSE) of 0.14 compared to the worldwide AOD observations from the AErosol RObotic NETwork (AERONET), outperforming the purely reconstructed AODs (R = 0.83, RMSE = 0.15), but they were slightly worse than raw MAIAC AOD retrievals (R = 0.88, RMSE = 0.11). For PM2.5 concentration mapping, a novel deep-learning approach, termed the SCene-Aware ensemble learning Graph ATtention network (SCAGAT), was hereby applied. While accounting for the scene representativeness of data-driven models across regions, the SCAGAT algorithm performed better during spatial extrapolation, largely reducing modeling biases over regions with limited and/or even absent in situ PM2.5 concentration measurements. The validation results indicated that the gap-free PM2.5 concentration estimates exhibit higher prediction accuracies, with an R of 0.95 and an RMSE of 5.7 µg m−3, compared to PM2.5 concentration measurements obtained from former holdout sites worldwide. Overall, while leveraging state-of-the-art methods in data science and artificial intelligence, a quality-enhanced LGHAP v2 dataset was generated through big Earth data analytics by cohesively weaving together multimodal AODs and air quality measurements from diverse sources. The gap-free, high-resolution, and global coverage merits render the LGHAP v2 dataset
{"title":"LGHAP v2: a global gap-free aerosol optical depth and PM2.5 concentration dataset since 2000 derived via big Earth data analytics","authors":"Kaixu Bai, Ke Li, Liuqing Shao, Xinran Li, Chaoshun Liu, Zhengqiang Li, Mingliang Ma, Di Han, Yibing Sun, Zhe Zheng, Ruijie Li, Ni-Bin Chang, Jianping Guo","doi":"10.5194/essd-16-2425-2024","DOIUrl":"https://doi.org/10.5194/essd-16-2425-2024","url":null,"abstract":"Abstract. The Long-term Gap-free High-resolution Air Pollutants (LGHAP) concentration dataset generated in our previous study has provided spatially contiguous daily aerosol optical depth (AOD) and fine particulate matter (PM2.5) concentrations at a 1 km grid resolution in China since 2000. This advancement empowered unprecedented assessments of regional aerosol variations and their influence on the environment, health, and climate over the past 20 years. However, there is a need to enhance such a high-quality AOD and PM2.5 concentration dataset with new robust features and extended spatial coverage. In this study, we present version 2 of a global-scale LGHAP dataset (LGHAP v2), which was generated using improved big Earth data analytics via a seamless integration of versatile data science, pattern recognition, and machine learning methods. Specifically, multimodal AODs and air quality measurements acquired from relevant satellites, ground monitoring stations, and numerical models were harmonized by harnessing the capability of random-forest-based data-driven models. Subsequently, an improved tensor-flow-based AOD reconstruction algorithm was developed to weave the harmonized multisource AOD products together for filling data gaps in Multi-Angle Implementation of Atmospheric Correction (MAIAC) AOD retrievals from Terra. The results of the ablation experiments demonstrated better performance of the improved tensor-flow-based gap-filling method in terms of both convergence speed and data accuracy. Ground-based validation results indicated good data accuracy of this global gap-free AOD dataset, with a correlation coefficient (R) of 0.85 and a root mean square error (RMSE) of 0.14 compared to the worldwide AOD observations from the AErosol RObotic NETwork (AERONET), outperforming the purely reconstructed AODs (R = 0.83, RMSE = 0.15), but they were slightly worse than raw MAIAC AOD retrievals (R = 0.88, RMSE = 0.11). For PM2.5 concentration mapping, a novel deep-learning approach, termed the SCene-Aware ensemble learning Graph ATtention network (SCAGAT), was hereby applied. While accounting for the scene representativeness of data-driven models across regions, the SCAGAT algorithm performed better during spatial extrapolation, largely reducing modeling biases over regions with limited and/or even absent in situ PM2.5 concentration measurements. The validation results indicated that the gap-free PM2.5 concentration estimates exhibit higher prediction accuracies, with an R of 0.95 and an RMSE of 5.7 µg m−3, compared to PM2.5 concentration measurements obtained from former holdout sites worldwide. Overall, while leveraging state-of-the-art methods in data science and artificial intelligence, a quality-enhanced LGHAP v2 dataset was generated through big Earth data analytics by cohesively weaving together multimodal AODs and air quality measurements from diverse sources. The gap-free, high-resolution, and global coverage merits render the LGHAP v2 dataset ","PeriodicalId":48747,"journal":{"name":"Earth System Science Data","volume":null,"pages":null},"PeriodicalIF":11.4,"publicationDate":"2024-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141079199","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Can Li, Nickolay A. Krotkov, Joanna Joiner, Vitali Fioletov, Chris McLinden, Debora Griffin, Peter J. T. Leonard, Simon Carn, Colin Seftor, Alexander Vasilkov
Abstract. For nearly two decades, the Ozone Monitoring Instrument (OMI) aboard the NASA Aura spacecraft (launched in 2004) and the Ozone Mapping and Profiler Suite (OMPS) aboard the NASA/NOAA Suomi National Polar-orbiting Partnership (SNPP) satellite (launched in 2011) have been providing global monitoring of SO2 column densities from both anthropogenic and volcanic activities. Here, we describe the version 1 NOAA-20 (N20)/OMPS SO2 product, aimed at extending the long-term climate data record. To achieve this goal, we apply a principal component analysis (PCA) retrieval technique, also used for the OMI and SNPP/OMPS SO2 products, to N20/OMPS. For volcanic SO2 retrievals, the algorithm is identical between N20 and SNPP/OMPS and produces consistent retrievals for eruptions such as the 2018 Kilauea and 2019 Raikoke. For anthropogenic SO2 retrievals, the algorithm has been customized for N20/OMPS, considering its greater spatial resolution and reduced signal-to-noise ratio as compared with SNPP/OMPS. Over background areas, N20/OMPS SO2 slant column densities (SCD) show relatively small biases, comparable retrieval noise with SNPP/OMPS (after aggregation to the same spatial resolution), and remarkable stability with essentially no drift during 2018–2023. Over major anthropogenic source areas, the two OMPS retrievals are generally well-correlated but N20/OMPS SO2 is biased low especially for India and the Middle East, where the differences reach ~20 % on average. The reasons for these differences are not fully understood but are partly due to algorithmic differences. Better agreement (typical differences of ~10–15 %) is found over degassing volcanoes. SO2 emissions from large point sources, inferred from N20/OMPS retrievals, agree well with those based on OMI, SNPP/OMPS, and TROPOspheric Monitoring Instrument (TROPOMI), with correlation coefficients > 0.98 and overall differences < 10 %. The ratios between the estimated emissions and their uncertainties offer insights into the ability of different satellite instruments to detect and quantify SO2 sources. While TROPOMI has the highest ratios among all four sensors, ratios from N20/OMPS are slightly greater than OMI and substantially greater than SNPP/OMPS. Overall, our results suggest that the version 1 N20/OMPS SO2 product will successfully continue the long-term OMI and SNPP/OMPS SO2 data records. Efforts currently underway will further enhance the consistency of retrievals between different instruments, facilitating the development of multi-decade, coherent global SO2 datasets across multiple satellites.
{"title":"Version 1 NOAA-20/OMPS Nadir Mapper Total Column SO2 Product: Continuation of NASA Long-term Global Data Record","authors":"Can Li, Nickolay A. Krotkov, Joanna Joiner, Vitali Fioletov, Chris McLinden, Debora Griffin, Peter J. T. Leonard, Simon Carn, Colin Seftor, Alexander Vasilkov","doi":"10.5194/essd-2024-168","DOIUrl":"https://doi.org/10.5194/essd-2024-168","url":null,"abstract":"<strong>Abstract.</strong> For nearly two decades, the Ozone Monitoring Instrument (OMI) aboard the NASA Aura spacecraft (launched in 2004) and the Ozone Mapping and Profiler Suite (OMPS) aboard the NASA/NOAA Suomi National Polar-orbiting Partnership (SNPP) satellite (launched in 2011) have been providing global monitoring of SO<sub>2</sub> column densities from both anthropogenic and volcanic activities. Here, we describe the version 1 NOAA-20 (N20)/OMPS SO<sub>2</sub> product, aimed at extending the long-term climate data record. To achieve this goal, we apply a principal component analysis (PCA) retrieval technique, also used for the OMI and SNPP/OMPS SO<sub>2</sub> products, to N20/OMPS. For volcanic SO<sub>2</sub> retrievals, the algorithm is identical between N20 and SNPP/OMPS and produces consistent retrievals for eruptions such as the 2018 Kilauea and 2019 Raikoke. For anthropogenic SO<sub>2</sub> retrievals, the algorithm has been customized for N20/OMPS, considering its greater spatial resolution and reduced signal-to-noise ratio as compared with SNPP/OMPS. Over background areas, N20/OMPS SO<sub>2</sub> slant column densities (SCD) show relatively small biases, comparable retrieval noise with SNPP/OMPS (after aggregation to the same spatial resolution), and remarkable stability with essentially no drift during 2018–2023. Over major anthropogenic source areas, the two OMPS retrievals are generally well-correlated but N20/OMPS SO<sub>2</sub> is biased low especially for India and the Middle East, where the differences reach ~20 % on average. The reasons for these differences are not fully understood but are partly due to algorithmic differences. Better agreement (typical differences of ~10–15 %) is found over degassing volcanoes. SO<sub>2</sub> emissions from large point sources, inferred from N20/OMPS retrievals, agree well with those based on OMI, SNPP/OMPS, and TROPOspheric Monitoring Instrument (TROPOMI), with correlation coefficients > 0.98 and overall differences < 10 %. The ratios between the estimated emissions and their uncertainties offer insights into the ability of different satellite instruments to detect and quantify SO<sub>2</sub> sources. While TROPOMI has the highest ratios among all four sensors, ratios from N20/OMPS are slightly greater than OMI and substantially greater than SNPP/OMPS. Overall, our results suggest that the version 1 N20/OMPS SO<sub>2</sub> product will successfully continue the long-term OMI and SNPP/OMPS SO<sub>2</sub> data records. Efforts currently underway will further enhance the consistency of retrievals between different instruments, facilitating the development of multi-decade, coherent global SO<sub>2</sub> datasets across multiple satellites.","PeriodicalId":48747,"journal":{"name":"Earth System Science Data","volume":null,"pages":null},"PeriodicalIF":11.4,"publicationDate":"2024-05-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141073897","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Christopher J. Cox, Janet M. Intrieri, Brian Butterworth, Gijs de Boer, Michael R. Gallagher, Jonathan Hamilton, Erik Hulm, Tilden Meyers, Sara M. Morris, Jackson Osborn, P. Ola G. Persson, Benjamin Schmatz, Matthew D. Shupe, James M. Wilczak
Abstract. From autumn 2021 through summer 2023, scientists from the National Oceanic and Atmospheric Administration (NOAA) and partners conducted the Study of Precipitation, the Lower Atmosphere, and Surface for Hydrometeorology (SPLASH) campaign in the East River Watershed of Colorado. One objective of SPLASH was to observe the transfer of energy between the atmosphere and the surface, which was done at several locations. Two remote sites were chosen that did not have access to power utilities. These were along the valley floor near the East River in the vicinity of the unincorporated town of Gothic, Colorado. Energy balance measurements were made at these locations using autonomous, single-level flux towers referred to as Atmospheric Surface Flux Stations (ASFS). The ASFS were deployed on 28 September 2021 at the “Kettle Ponds Annex” site and on 12 October 2021 at the “Avery Picnic” site and operated until 19 July and 21 June 2023, respectively. Measurements included basic meteorology; upward and downward longwave and shortwave radiative fluxes, and subsurface conductive flux, each at 1-minute resolution; 3-d winds from a sonic anemometer and H2O/CO2 from an open-path gas analyser, both at 20 Hz from which sensible, latent heat, and CO2 fluxes were derived; and profiles of soil properties in the upper 0.5 m (both sites) and temperature profiles through the snow (at Avery Picnic), each reported between 10 min and 6 hours. For most measurements, uptime was 96 % (Kettle Ponds) and 89 % (Avery Picnic), and collectively 1,184 days of data were obtained between the stations. The purpose of this manuscript is to document the ASFS deployment at SPLASH, the data acquisition and post-processing of measurements, and to serve as a guide for interested users of the data sets, which are archived under the Creative Commons 4.0 Public Domain licensing at Zenodo.
{"title":"Observations of surface energy fluxes and meteorology in the seasonally snow-covered high-elevation East River Watershed during SPLASH, 2021–2023","authors":"Christopher J. Cox, Janet M. Intrieri, Brian Butterworth, Gijs de Boer, Michael R. Gallagher, Jonathan Hamilton, Erik Hulm, Tilden Meyers, Sara M. Morris, Jackson Osborn, P. Ola G. Persson, Benjamin Schmatz, Matthew D. Shupe, James M. Wilczak","doi":"10.5194/essd-2024-158","DOIUrl":"https://doi.org/10.5194/essd-2024-158","url":null,"abstract":"<strong>Abstract.</strong> From autumn 2021 through summer 2023, scientists from the National Oceanic and Atmospheric Administration (NOAA) and partners conducted the Study of Precipitation, the Lower Atmosphere, and Surface for Hydrometeorology (SPLASH) campaign in the East River Watershed of Colorado. One objective of SPLASH was to observe the transfer of energy between the atmosphere and the surface, which was done at several locations. Two remote sites were chosen that did not have access to power utilities. These were along the valley floor near the East River in the vicinity of the unincorporated town of Gothic, Colorado. Energy balance measurements were made at these locations using autonomous, single-level flux towers referred to as Atmospheric Surface Flux Stations (ASFS). The ASFS were deployed on 28 September 2021 at the “Kettle Ponds Annex” site and on 12 October 2021 at the “Avery Picnic” site and operated until 19 July and 21 June 2023, respectively. Measurements included basic meteorology; upward and downward longwave and shortwave radiative fluxes, and subsurface conductive flux, each at 1-minute resolution; 3-d winds from a sonic anemometer and H<sub>2</sub>O/CO<sub>2</sub> from an open-path gas analyser, both at 20 Hz from which sensible, latent heat, and CO<sub>2</sub> fluxes were derived; and profiles of soil properties in the upper 0.5 m (both sites) and temperature profiles through the snow (at Avery Picnic), each reported between 10 min and 6 hours. For most measurements, uptime was 96 % (Kettle Ponds) and 89 % (Avery Picnic), and collectively 1,184 days of data were obtained between the stations. The purpose of this manuscript is to document the ASFS deployment at SPLASH, the data acquisition and post-processing of measurements, and to serve as a guide for interested users of the data sets, which are archived under the Creative Commons 4.0 Public Domain licensing at Zenodo.","PeriodicalId":48747,"journal":{"name":"Earth System Science Data","volume":null,"pages":null},"PeriodicalIF":11.4,"publicationDate":"2024-05-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141074049","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}