首页 > 最新文献

Earth System Science Data最新文献

英文 中文
ChatEarthNet: A Global-Scale Image-Text Dataset Empowering Vision-Language Geo-Foundation Models ChatEarthNet:支持视觉语言地理基础模型的全球图像-文本数据集
IF 11.4 1区 地球科学 Q1 GEOSCIENCES, MULTIDISCIPLINARY Pub Date : 2024-06-27 DOI: 10.5194/essd-2024-140
Zhenghang Yuan, Zhitong Xiong, Lichao Mou, Xiao Xiang Zhu
Abstract. The rapid development of remote sensing technology has led to an exponential growth in satellite images, yet their inherent complexity often makes them difficult for non-expert users to understand. Natural language, as a carrier of human knowledge, can bridge common users and complicated satellite imagery. Additionally, when paired with visual data, natural language can be utilized to train large vision-language foundation models, significantly improving performance in various tasks. Despite these advancements, the remote sensing community still faces a challenge due to the lack of large- scale, high-quality vision-language datasets for satellite images. To address this challenge, we introduce a new image-text dataset, providing high-quality natural language descriptions for global-scale satellite data. Specifically, we utilize Sentinel-2 data for its global coverage as the foundational image source, employing semantic segmentation labels from the European Space Agency’s WorldCover project to enrich the descriptions of land covers. By conducting in-depth semantic analysis, we formulate detailed prompts to elicit rich descriptions from ChatGPT. We then include a manual verification process to enhance the dataset’s quality further. This step involves manual inspection and correction to refine the dataset. Finally, we offer the community ChatEarthNet, a large-scale image-text dataset characterized by global coverage, high quality, wide-ranging diversity, and detailed descriptions. ChatEarthNet consists of 163,488 image-text pairs with captions generated by ChatGPT3.5 and an additional 10,000 image-text pairs with captions generated by ChatGPT-4V(ision). This dataset has significant potential for both training and evaluating vision-language geo-foundation models for remote sensing. The code is publicly available at https://doi.org/10.5281/zenodo.11004358 (Yuan et al., 2024b), and the ChatEarthNet dataset is at https://doi.org/10.5281/zenodo.11003436 (Yuan et al., 2024c).
摘要遥感技术的飞速发展使卫星图像呈指数级增长,但其固有的复杂性往往使非专业用户难以理解。自然语言作为人类知识的载体,可以在普通用户和复杂的卫星图像之间架起一座桥梁。此外,在与视觉数据配对时,自然语言可用于训练大型视觉语言基础模型,从而显著提高各种任务的性能。尽管取得了这些进步,遥感界仍然面临着一个挑战,那就是缺乏大规模、高质量的卫星图像视觉语言数据集。为了应对这一挑战,我们引入了一个新的图像-文本数据集,为全球范围的卫星数据提供高质量的自然语言描述。具体来说,我们利用 Sentinel-2 数据的全球覆盖范围作为基础图像源,采用欧洲航天局 WorldCover 项目的语义分割标签来丰富土地覆盖的描述。通过深入的语义分析,我们制定了详细的提示,以便从 ChatGPT 中获得丰富的描述。然后,我们加入了人工验证流程,以进一步提高数据集的质量。这一步骤包括人工检查和修正,以完善数据集。最后,我们为社区提供了大型图像-文本数据集 ChatEarthNet,该数据集具有全球覆盖、高质量、广泛多样性和详细描述等特点。ChatEarthNet 包含由 ChatGPT3.5 生成标题的 163,488 对图像-文本,以及由 ChatGPT-4V(ision) 生成标题的另外 10,000 对图像-文本。该数据集在训练和评估遥感视觉语言地理基础模型方面具有巨大潜力。代码可在 https://doi.org/10.5281/zenodo.11004358(Yuan et al.,2024b)上公开获取,ChatEarthNet 数据集可在 https://doi.org/10.5281/zenodo.11003436(Yuan et al.,2024c)上公开获取。
{"title":"ChatEarthNet: A Global-Scale Image-Text Dataset Empowering Vision-Language Geo-Foundation Models","authors":"Zhenghang Yuan, Zhitong Xiong, Lichao Mou, Xiao Xiang Zhu","doi":"10.5194/essd-2024-140","DOIUrl":"https://doi.org/10.5194/essd-2024-140","url":null,"abstract":"<strong>Abstract.</strong> The rapid development of remote sensing technology has led to an exponential growth in satellite images, yet their inherent complexity often makes them difficult for non-expert users to understand. Natural language, as a carrier of human knowledge, can bridge common users and complicated satellite imagery. Additionally, when paired with visual data, natural language can be utilized to train large vision-language foundation models, significantly improving performance in various tasks. Despite these advancements, the remote sensing community still faces a challenge due to the lack of large- scale, high-quality vision-language datasets for satellite images. To address this challenge, we introduce a new image-text dataset, providing high-quality natural language descriptions for global-scale satellite data. Specifically, we utilize Sentinel-2 data for its global coverage as the foundational image source, employing semantic segmentation labels from the European Space Agency’s WorldCover project to enrich the descriptions of land covers. By conducting in-depth semantic analysis, we formulate detailed prompts to elicit rich descriptions from ChatGPT. We then include a manual verification process to enhance the dataset’s quality further. This step involves manual inspection and correction to refine the dataset. Finally, we offer the community ChatEarthNet, a large-scale image-text dataset characterized by global coverage, high quality, wide-ranging diversity, and detailed descriptions. ChatEarthNet consists of 163,488 image-text pairs with captions generated by ChatGPT3.5 and an additional 10,000 image-text pairs with captions generated by ChatGPT-4V(ision). This dataset has significant potential for both training and evaluating vision-language geo-foundation models for remote sensing. The code is publicly available at https://doi.org/10.5281/zenodo.11004358 (Yuan et al., 2024b), and the ChatEarthNet dataset is at https://doi.org/10.5281/zenodo.11003436 (Yuan et al., 2024c).","PeriodicalId":48747,"journal":{"name":"Earth System Science Data","volume":null,"pages":null},"PeriodicalIF":11.4,"publicationDate":"2024-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141461965","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Atmospheric Radiation Measurement (ARM) airborne field campaign data products between 2013 and 2018 2013 年至 2018 年大气辐射测量(ARM)机载实地活动数据产品
IF 11.4 1区 地球科学 Q1 GEOSCIENCES, MULTIDISCIPLINARY Pub Date : 2024-06-27 DOI: 10.5194/essd-2024-97
Fan Mei, Jennifer M. Comstock, Mikhail S. Pekour, Jerome D. Fast, Beat Schmid, Krista L. Gaustad, Shuaiqi Tang, Damao Zhang, John E. Shilling, Jason Tomlinson, Adam C. Varble, Jian Wang, L. Ruby Leung, Lawrence Kleinman, Scot Martin, Sebastien C. Biraud, Brian D. Ermold, Kenneth W. Burk
Abstract. Airborne measurements are pivotal for providing detailed, spatiotemporally resolved information about atmospheric parameters, and aerosol and cloud properties, thereby enhancing our understanding of dynamic atmospheric processes. For 30 years, the U.S. Department of Energy (DOE) Office of Science supported an instrumented Gulfstream-1 (G-1) aircraft for atmospheric field campaigns. Data from the final decade of G-1 operations were archived by the Atmospheric Radiation Measurement (ARM) user facility Data Center and made publicly available at no cost to all registered users. To ensure a consistent data format and to improve the accessibility of the ARM airborne data, an integrated dataset was recently developed covering the final six years of G-1 operations (2013 to 2018). The integrated dataset includes data collected from 236 flights (766.4 hours), which covered the Arctic, the U.S. Southern Great Plains (SGP), the U.S. West Coast, the Eastern North Atlantic (ENA), the Amazon Basin in Brazil, and the Sierras de Córdoba range in Argentina. These comprehensive data streams provide much-needed insight into spatiotemporal variability of thermodynamic quantities, aerosol and cloud states and properties for addressing essential science questions in Earth system process studies. This manuscript describes the DOE ARM merged G-1 datasets, including information on the acquisition, collection, and quality control processes. It further illustrates the usage of this merged dataset to evaluate the Energy Exascale Earth System Model (E3SM) with the Earth System Model Aerosol-Cloud Diagnostics (ESMAC Diags) package.
摘要机载测量对于提供有关大气参数、气溶胶和云特性的详细时空分辨信息至关重要,从而增强了我们对动态大气过程的了解。30 年来,美国能源部(DOE)科学办公室一直支持一架配备仪器的湾流-1(G-1)飞机进行大气实地活动。G-1 最后十年的运行数据由大气辐射测量(ARM)用户设施数据中心存档,并向所有注册用户免费公开。为确保数据格式的一致性并提高 ARM 机载数据的可访问性,最近开发了一个综合数据集,涵盖 G-1 行动的最后六年(2013 年至 2018 年)。综合数据集包括 236 次飞行(766.4 小时)收集的数据,覆盖北极、美国南部大平原 (SGP)、美国西海岸、北大西洋东部 (ENA)、巴西亚马逊盆地和阿根廷科尔多瓦山脉。这些综合数据流为解决地球系统过程研究中的基本科学问题提供了急需的热力学量、气溶胶和云状态及特性的时空变化洞察力。本手稿介绍了 DOE ARM 合并 G-1 数据集,包括有关获取、收集和质量控制过程的信息。它进一步说明了如何使用该合并数据集来评估能源超大规模地球系统模型(ESM)与地球系统模型气溶胶-云诊断(ESMAC Diags)软件包。
{"title":"Atmospheric Radiation Measurement (ARM) airborne field campaign data products between 2013 and 2018","authors":"Fan Mei, Jennifer M. Comstock, Mikhail S. Pekour, Jerome D. Fast, Beat Schmid, Krista L. Gaustad, Shuaiqi Tang, Damao Zhang, John E. Shilling, Jason Tomlinson, Adam C. Varble, Jian Wang, L. Ruby Leung, Lawrence Kleinman, Scot Martin, Sebastien C. Biraud, Brian D. Ermold, Kenneth W. Burk","doi":"10.5194/essd-2024-97","DOIUrl":"https://doi.org/10.5194/essd-2024-97","url":null,"abstract":"<strong>Abstract.</strong> Airborne measurements are pivotal for providing detailed, spatiotemporally resolved information about atmospheric parameters, and aerosol and cloud properties, thereby enhancing our understanding of dynamic atmospheric processes. For 30 years, the U.S. Department of Energy (DOE) Office of Science supported an instrumented Gulfstream-1 (G-1) aircraft for atmospheric field campaigns. Data from the final decade of G-1 operations were archived by the Atmospheric Radiation Measurement (ARM) user facility Data Center and made publicly available at no cost to all registered users. To ensure a consistent data format and to improve the accessibility of the ARM airborne data, an integrated dataset was recently developed covering the final six years of G-1 operations (2013 to 2018). The integrated dataset includes data collected from 236 flights (766.4 hours), which covered the Arctic, the U.S. Southern Great Plains (SGP), the U.S. West Coast, the Eastern North Atlantic (ENA), the Amazon Basin in Brazil, and the Sierras de Córdoba range in Argentina. These comprehensive data streams provide much-needed insight into spatiotemporal variability of thermodynamic quantities, aerosol and cloud states and properties for addressing essential science questions in Earth system process studies. This manuscript describes the DOE ARM merged G-1 datasets, including information on the acquisition, collection, and quality control processes. It further illustrates the usage of this merged dataset to evaluate the Energy Exascale Earth System Model (E3SM) with the Earth System Model Aerosol-Cloud Diagnostics (ESMAC Diags) package.","PeriodicalId":48747,"journal":{"name":"Earth System Science Data","volume":null,"pages":null},"PeriodicalIF":11.4,"publicationDate":"2024-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141462563","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Characterizing uncertainty in shear wave velocity profiles from the Italian seismic microzonation database 表征意大利地震微区数据库剪切波速度剖面的不确定性
IF 11.4 1区 地球科学 Q1 GEOSCIENCES, MULTIDISCIPLINARY Pub Date : 2024-06-27 DOI: 10.5194/essd-2024-104
Federico Mori, Giuseppe Naso, Amerigo Mendicelli, Giancarlo Ciotoli, Chiara Varone, Massimiliano Moscatelli
Abstract. This research uses a large dataset from the Italian Seismic Microzonation Database, containing nearly 15,000 measured shear wave velocity (Vs) profiles across Italy, to investigate the uncertainties in seismic risk assessment. This extensive collection allows a detailed study of the seismic properties of soil with unparalleled precision. Our focus is on evaluating Vs variations with depth within uniformly clustered areas, known as seismic microzones. These zones are carefully identified based on their spatial correlation and homogeneity in geological, geophysical, and geotechnical characteristics, which are critical for accurate prediction of seismic response. We contrast these results with clusters formed purely based on geographic survey density (here defined geographic clusters), thereby assessing the depth of our understanding of the subsurface geological and geophysical context. These results were further compared with those reported in the seismic code and literature. This study of depth-dependent Vs variations helps to refine our models of subsurface seismic behaviour. Our main discoveries show that: 1) uncertainties associated with seismic microzones (geological and geophysical clusters) are consistently lower than those identified in geographic clusters, particularly in the first 30 m of depth; 2) Vs profile variations show negligible increases in uncertainty within a certain range of correlation distances (up to about 4,500 m); 3) uncertainties for seismic microzones are lower than those previously reported in seismic codes and in the literature, indicating the effectiveness and precision of our methodological approach. The results of this study significantly improve local seismic response analysis and highlight the critical role of depth and spatial correlation in understanding seismic hazard. The dataset is available at https://doi.org/10.5281/zenodo.10885590 (Mori et al., 2024).
摘要本研究利用意大利地震微区数据库中的大型数据集(包含意大利各地近 15,000 个测量剪切波速度 (Vs) 剖面)来研究地震风险评估中的不确定性。通过这一庞大的数据集,可以对土壤的地震特性进行详细研究,其精确度无与伦比。我们的重点是评估被称为地震微区的均匀聚集区域内 Vs 随深度的变化。这些区域是根据其空间相关性以及地质、地球物理和岩土特性的同质性精心确定的,这些特性对于准确预测地震反应至关重要。我们将这些结果与纯粹根据地理勘测密度形成的群集(此处定义为地理群集)进行对比,从而评估我们对地下地质和地球物理背景的理解深度。这些结果与地震规范和文献中报告的结果进行了进一步比较。对随深度变化的 Vs 变化的研究有助于完善我们的地下地震行为模型。我们的主要发现表明1)与地震微区(地质和地球物理群组)相关的不确定性始终低于地理群组中确定的不确定性,特别是在深度的前 30 米;2)Vs 剖面变化在一定的相关距离范围内(最多约 4,500 米)显示出可忽略的不确定性增加;3)地震微区的不确定性低于之前地震规范和文献中报告的不确定性,表明我们的方法的有效性和精确性。这项研究的结果极大地改进了当地地震反应分析,并强调了深度和空间相关性在理解地震灾害中的关键作用。该数据集可在 https://doi.org/10.5281/zenodo.10885590 上查阅(Mori et al.)
{"title":"Characterizing uncertainty in shear wave velocity profiles from the Italian seismic microzonation database","authors":"Federico Mori, Giuseppe Naso, Amerigo Mendicelli, Giancarlo Ciotoli, Chiara Varone, Massimiliano Moscatelli","doi":"10.5194/essd-2024-104","DOIUrl":"https://doi.org/10.5194/essd-2024-104","url":null,"abstract":"<strong>Abstract.</strong> This research uses a large dataset from the Italian Seismic Microzonation Database, containing nearly 15,000 measured shear wave velocity (Vs) profiles across Italy, to investigate the uncertainties in seismic risk assessment. This extensive collection allows a detailed study of the seismic properties of soil with unparalleled precision. Our focus is on evaluating Vs variations with depth within uniformly clustered areas, known as seismic microzones. These zones are carefully identified based on their spatial correlation and homogeneity in geological, geophysical, and geotechnical characteristics, which are critical for accurate prediction of seismic response. We contrast these results with clusters formed purely based on geographic survey density (here defined geographic clusters), thereby assessing the depth of our understanding of the subsurface geological and geophysical context. These results were further compared with those reported in the seismic code and literature. This study of depth-dependent Vs variations helps to refine our models of subsurface seismic behaviour. Our main discoveries show that: 1) uncertainties associated with seismic microzones (geological and geophysical clusters) are consistently lower than those identified in geographic clusters, particularly in the first 30 m of depth; 2) Vs profile variations show negligible increases in uncertainty within a certain range of correlation distances (up to about 4,500 m); 3) uncertainties for seismic microzones are lower than those previously reported in seismic codes and in the literature, indicating the effectiveness and precision of our methodological approach. The results of this study significantly improve local seismic response analysis and highlight the critical role of depth and spatial correlation in understanding seismic hazard. The dataset is available at https://doi.org/10.5281/zenodo.10885590 (Mori et al., 2024).","PeriodicalId":48747,"journal":{"name":"Earth System Science Data","volume":null,"pages":null},"PeriodicalIF":11.4,"publicationDate":"2024-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141461814","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Coastal Atmosphere & Sea Time Series (CoASTS) and Bio-Optical mapping of Marine optical Properties (BiOMaP): the CoASTS-BiOMaP dataset 沿海大气与海洋时间序列(CoASTS)和海洋光学特性生物光学绘图(BiOMaP):CoASTS-BiOMaP 数据集
IF 11.4 1区 地球科学 Q1 Earth and Planetary Sciences Pub Date : 2024-06-26 DOI: 10.5194/essd-2024-240
Giuseppe Zibordi, Jean-François Berthon
Abstract. The Coastal Atmosphere & Sea Time Series (CoASTS) and the Bio-Optical mapping of Marine optical Properties (BiOMaP) programs produced bio-optical data supporting satellite ocean color applications for almost two decades. Specifically, relying on the Acqua Alta Oceanographic Tower (AAOT) in the northern Adriatic Sea, from 1995 till 2016 CoASTS delivered time series of marine water apparent and inherent optical properties, in addition to the concentration of major optically significant water constituents. Almost concurrently, from 2000 till 2022 BiOMaP produced equivalent spatially distributed measurements across major European Seas. Both, CoASTS and BiOMaP applied equal standardized instruments, measurement methods, quality control schemes and processing codes to ensure temporal and spatial consistency to data products. This work presents the CoASTS and BiOMaP near surface data products, named CoASTS-BiOMaP, of relevance for ocean color bio-optical modelling and validation activities.
摘要近二十年来,沿海大气与amp; 海洋时间序列(CoASTS)和海洋光学特性生物光学绘图(BiOMaP)计划提供了支持卫星海洋颜色应用的生物光学数据。具体而言,从 1995 年到 2016 年,CoASTS 依靠亚得里亚海北部的 Acqua Alta 海洋学塔(AAOT),提供了海水表观和固有光学特性的时间序列,以及主要光学重要水成分的浓度。几乎与此同时,从 2000 年到 2022 年,BiOMaP 对欧洲主要海域进行了等效的空间分布测量。CoASTS 和 BiOMaP 都采用了相同的标准化仪器、测量方法、质量控制方案和处理代码,以确保数据产品在时间和空间上的一致性。这项工作介绍了 CoASTS 和 BiOMaP 的近表面数据产品,命名为 CoASTS-BiOMaP,与海洋颜色生物光学建模和验证活动有关。
{"title":"Coastal Atmosphere & Sea Time Series (CoASTS) and Bio-Optical mapping of Marine optical Properties (BiOMaP): the CoASTS-BiOMaP dataset","authors":"Giuseppe Zibordi, Jean-François Berthon","doi":"10.5194/essd-2024-240","DOIUrl":"https://doi.org/10.5194/essd-2024-240","url":null,"abstract":"<strong>Abstract.</strong> The <em>Coastal Atmosphere &amp; Sea Time Series</em> (CoASTS) and the <em>Bio-Optical mapping of Marine optical Properties</em> (BiOMaP) programs produced bio-optical data supporting satellite ocean color applications for almost two decades. Specifically, relying on the Acqua Alta Oceanographic Tower (AAOT) in the northern Adriatic Sea, from 1995 till 2016 CoASTS delivered time series of marine water apparent and inherent optical properties, in addition to the concentration of major optically significant water constituents. Almost concurrently, from 2000 till 2022 BiOMaP produced equivalent spatially distributed measurements across major European Seas. Both, CoASTS and BiOMaP applied equal standardized instruments, measurement methods, quality control schemes and processing codes to ensure temporal and spatial consistency to data products. This work presents the CoASTS and BiOMaP near surface data products, named CoASTS-BiOMaP, of relevance for ocean color bio-optical modelling and validation activities.","PeriodicalId":48747,"journal":{"name":"Earth System Science Data","volume":null,"pages":null},"PeriodicalIF":11.4,"publicationDate":"2024-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141452913","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
First release of the Pelagic Size Structure database: global datasets of marine size spectra obtained from plankton imaging devices 首次发布远洋尺寸结构数据库:从浮游生物成像设备获得的全球海洋尺寸光谱数据集
IF 11.4 1区 地球科学 Q1 Earth and Planetary Sciences Pub Date : 2024-06-26 DOI: 10.5194/essd-16-2971-2024
Mathilde Dugenne, Marco Corrales-Ugalde, Jessica Y. Luo, Rainer Kiko, Todd D. O'Brien, Jean-Olivier Irisson, Fabien Lombard, Lars Stemmann, Charles Stock, Clarissa R. Anderson, Marcel Babin, Nagib Bhairy, Sophie Bonnet, Francois Carlotti, Astrid Cornils, E. Taylor Crockford, Patrick Daniel, Corinne Desnos, Laetitia Drago, Amanda Elineau, Alexis Fischer, Nina Grandrémy, Pierre-Luc Grondin, Lionel Guidi, Cecile Guieu, Helena Hauss, Kendra Hayashi, Jenny A. Huggett, Laetitia Jalabert, Lee Karp-Boss, Kasia M. Kenitz, Raphael M. Kudela, Magali Lescot, Claudie Marec, Andrew McDonnell, Zoe Mériguet, Barbara Niehoff, Margaux Noyon, Thelma Panaïotis, Emily Peacock, Marc Picheral, Emilie Riquier, Collin Roesler, Jean-Baptiste Romagnan, Heidi M. Sosik, Gretchen Spencer, Jan Taucher, Chloé Tilliette, Marion Vilain
Abstract. In marine ecosystems, most physiological, ecological, or physical processes are size dependent. These include metabolic rates, the uptake of carbon and other nutrients, swimming and sinking velocities, and trophic interactions, which eventually determine the stocks of commercial species, as well as biogeochemical cycles and carbon sequestration. As such, broad-scale observations of plankton size distribution are important indicators of the general functioning and state of pelagic ecosystems under anthropogenic pressures. Here, we present the first global datasets of the Pelagic Size Structure database (PSSdb), generated from plankton imaging devices. This release includes the bulk particle normalized biovolume size spectrum (NBSS) and the bulk particle size distribution (PSD), along with their related parameters (slope, intercept, and R2) measured within the epipelagic layer (0–200 m) by three imaging sensors: the Imaging FlowCytobot (IFCB), the Underwater Vision Profiler (UVP), and benchtop scanners. Collectively, these instruments effectively image organisms and detrital material in the 7–10 000 µm size range. A total of 92 472 IFCB samples, 3068 UVP profiles, and 2411 scans passed our quality control and were standardized to produce consistent instrument-specific size spectra averaged to 1° × 1° latitude and longitude and by year and month. Our instrument-specific datasets span most major ocean basins, except for the IFCB datasets we have ingested, which were exclusively collected in northern latitudes, and cover decadal time periods (2013–2022 for IFCB, 2008–2021 for UVP, and 1996–2022 for scanners), allowing for a further assessment of the pelagic size spectrum in space and time. The datasets that constitute PSSdb's first release are available at https://doi.org/10.5281/zenodo.11050013 (Dugenne et al., 2024b). In addition, future updates to these data products can be accessed at https://doi.org/10.5281/zenodo.7998799.
摘要在海洋生态系统中,大多数生理、生态或物理过程都与大小有关。这些过程包括新陈代谢率、对碳和其他营养物质的吸收、游动和下沉速度、营养相互作用(最终决定了商业物种的存量)以及生物地球化学循环和碳封存。因此,对浮游生物大小分布的大尺度观测是人类活动压力下浮游生态系统总体功能和状态的重要指标。在此,我们展示了浮游生物尺寸结构数据库(PSSdb)的首批全球数据集,该数据库由浮游生物成像设备生成。此次发布的数据集包括由三种成像传感器(成像流式细胞仪(IFCB)、水下视觉剖面仪(UVP)和台式扫描仪)在上浮游层(0-200 米)内测量的大颗粒归一化生物体积粒度谱(NBSS)和大颗粒粒度分布(PSD)及其相关参数(斜率、截距和 R2)。总体而言,这些仪器能够有效地对 7-10 000 µm 尺寸范围内的生物和碎屑物质进行成像。共有 92 472 份 IFCB 样品、3068 份 UVP 剖面图和 2411 次扫描通过了我们的质量控制,并进行了标准化处理,以产生一致的仪器特定尺寸光谱,按 1° × 1° 的经纬度和年月进行平均。我们的特定仪器数据集跨越了大多数主要大洋盆地,但我们所摄取的 IFCB 数据集除外,这些数据集仅在北纬采集,并覆盖了十年的时间段(IFCB 为 2013-2022 年,UVP 为 2008-2021 年,扫描仪为 1996-2022 年),从而可以在空间和时间上对远洋大小谱进行进一步评估。构成 PSSdb 第一版的数据集可在 https://doi.org/10.5281/zenodo.11050013 上查阅(Dugenne 等,2024b)。此外,这些数据产品的未来更新可在 https://doi.org/10.5281/zenodo.7998799 上查阅。
{"title":"First release of the Pelagic Size Structure database: global datasets of marine size spectra obtained from plankton imaging devices","authors":"Mathilde Dugenne, Marco Corrales-Ugalde, Jessica Y. Luo, Rainer Kiko, Todd D. O'Brien, Jean-Olivier Irisson, Fabien Lombard, Lars Stemmann, Charles Stock, Clarissa R. Anderson, Marcel Babin, Nagib Bhairy, Sophie Bonnet, Francois Carlotti, Astrid Cornils, E. Taylor Crockford, Patrick Daniel, Corinne Desnos, Laetitia Drago, Amanda Elineau, Alexis Fischer, Nina Grandrémy, Pierre-Luc Grondin, Lionel Guidi, Cecile Guieu, Helena Hauss, Kendra Hayashi, Jenny A. Huggett, Laetitia Jalabert, Lee Karp-Boss, Kasia M. Kenitz, Raphael M. Kudela, Magali Lescot, Claudie Marec, Andrew McDonnell, Zoe Mériguet, Barbara Niehoff, Margaux Noyon, Thelma Panaïotis, Emily Peacock, Marc Picheral, Emilie Riquier, Collin Roesler, Jean-Baptiste Romagnan, Heidi M. Sosik, Gretchen Spencer, Jan Taucher, Chloé Tilliette, Marion Vilain","doi":"10.5194/essd-16-2971-2024","DOIUrl":"https://doi.org/10.5194/essd-16-2971-2024","url":null,"abstract":"Abstract. In marine ecosystems, most physiological, ecological, or physical processes are size dependent. These include metabolic rates, the uptake of carbon and other nutrients, swimming and sinking velocities, and trophic interactions, which eventually determine the stocks of commercial species, as well as biogeochemical cycles and carbon sequestration. As such, broad-scale observations of plankton size distribution are important indicators of the general functioning and state of pelagic ecosystems under anthropogenic pressures. Here, we present the first global datasets of the Pelagic Size Structure database (PSSdb), generated from plankton imaging devices. This release includes the bulk particle normalized biovolume size spectrum (NBSS) and the bulk particle size distribution (PSD), along with their related parameters (slope, intercept, and R2) measured within the epipelagic layer (0–200 m) by three imaging sensors: the Imaging FlowCytobot (IFCB), the Underwater Vision Profiler (UVP), and benchtop scanners. Collectively, these instruments effectively image organisms and detrital material in the 7–10 000 µm size range. A total of 92 472 IFCB samples, 3068 UVP profiles, and 2411 scans passed our quality control and were standardized to produce consistent instrument-specific size spectra averaged to 1° × 1° latitude and longitude and by year and month. Our instrument-specific datasets span most major ocean basins, except for the IFCB datasets we have ingested, which were exclusively collected in northern latitudes, and cover decadal time periods (2013–2022 for IFCB, 2008–2021 for UVP, and 1996–2022 for scanners), allowing for a further assessment of the pelagic size spectrum in space and time. The datasets that constitute PSSdb's first release are available at https://doi.org/10.5281/zenodo.11050013 (Dugenne et al., 2024b). In addition, future updates to these data products can be accessed at https://doi.org/10.5281/zenodo.7998799.","PeriodicalId":48747,"journal":{"name":"Earth System Science Data","volume":null,"pages":null},"PeriodicalIF":11.4,"publicationDate":"2024-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141452835","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
BIS-4D: mapping soil properties and their uncertainties at 25 m resolution in the Netherlands BIS-4D:以 25 米分辨率绘制荷兰土壤特性及其不确定性图
IF 11.4 1区 地球科学 Q1 Earth and Planetary Sciences Pub Date : 2024-06-25 DOI: 10.5194/essd-16-2941-2024
Anatol Helfenstein, Vera L. Mulder, Mirjam J. D. Hack-ten Broeke, Maarten van Doorn, Kees Teuling, Dennis J. J. Walvoort, Gerard B. M. Heuvelink
Abstract. In response to the growing societal awareness of the critical role of healthy soils, there has been an increasing demand for accurate and high-resolution soil information to inform national policies and support sustainable land management decisions. Despite advancements in digital soil mapping and initiatives like GlobalSoilMap, quantifying soil variability and its uncertainty across space, depth and time remains a challenge. Therefore, maps of key soil properties are often still missing on a national scale, which is also the case in the Netherlands. To meet this challenge and fill this data gap, we introduce BIS-4D, a high-resolution soil modeling and mapping platform for the Netherlands. BIS-4D delivers maps of soil texture (clay, silt and sand content), bulk density, pH, total nitrogen, oxalate-extractable phosphorus, cation exchange capacity and their uncertainties at 25 m resolution between 0 and 2 m depth in 3D space. Additionally, it provides maps of soil organic matter and its uncertainty in 3D space and time between 1953 and 2023 at the same resolution and depth range. The statistical model uses machine learning informed by soil observations amounting to between 3815 and 855 950, depending on the soil property, and 366 environmental covariates. We assess the accuracy of mean and median predictions using design-based statistical inference of a probability sample and location-grouped 10-fold cross validation (CV) and prediction uncertainty using the prediction interval coverage probability. We found that the accuracy of clay, sand and pH maps was the highest, with the model efficiency coefficient (MEC) ranging between 0.6 and 0.92 depending on depth. Silt, bulk density, soil organic matter, total nitrogen and cation exchange capacity (MEC of 0.27 to 0.78), and especially oxalate-extractable phosphorus (MEC of −0.11 to 0.38) were more difficult to predict. One of the main limitations of BIS-4D is that prediction maps cannot be used to quantify the uncertainty in spatial aggregates. We provide an example of good practice to help users decide whether BIS-4D is suitable for their intended purpose. An overview of all maps and their uncertainties can be found in the Supplement. Openly available code and input data enhance reproducibility and help with future updates. BIS-4D prediction maps can be readily downloaded at https://doi.org/10.4121/0c934ac6-2e95-4422-8360-d3a802766c71 (Helfenstein et al., 2024a). BIS-4D fills the previous data gap of the national-scale GlobalSoilMap product in the Netherlands and will hopefully facilitate the inclusion of soil spatial variability as a routine and integral part of decision support systems.
摘要随着社会对健康土壤的关键作用的认识不断提高,人们对准确的高分辨率土壤信息的需求也日益增加,以便为国家政策提供信息并支持可持续的土地管理决策。尽管数字土壤制图和 GlobalSoilMap 等计划取得了进展,但量化土壤的变异性及其在空间、深度和时间上的不确定性仍然是一项挑战。因此,全国范围内的关键土壤特性地图往往仍然缺失,荷兰的情况也是如此。为了应对这一挑战并填补数据空白,我们推出了荷兰高分辨率土壤建模和绘图平台 BIS-4D。BIS-4D 可提供三维空间 0 至 2 米深度 25 米分辨率的土壤质地(粘土、粉土和沙含量)、容重、pH 值、全氮、草酸盐提取磷、阳离子交换容量及其不确定性图。此外,它还以相同的分辨率和深度范围提供了 1953 年至 2023 年期间三维空间和时间的土壤有机质及其不确定性地图。统计模型采用机器学习方法,根据不同的土壤特性和 366 个环境协变量,对 3815 至 855 950 个土壤进行观测。我们使用基于设计的概率样本统计推断和位置分组 10 倍交叉验证(CV)来评估平均值和中位数预测的准确性,并使用预测区间覆盖概率来评估预测的不确定性。我们发现,粘土图、砂土图和 pH 值图的准确度最高,模型效率系数(MEC)介于 0.6 和 0.92 之间,具体取决于深度。而淤泥、容重、土壤有机质、全氮和阳离子交换容量(模型效率系数为 0.27 至 0.78),尤其是草酸盐提取磷(模型效率系数为 -0.11 至 0.38)则较难预测。BIS-4D 的主要局限之一是预测图不能用于量化空间总量的不确定性。我们提供了一个良好实践范例,以帮助用户决定 BIS-4D 是否适合其预期目的。所有地图及其不确定性的概述可在补编中找到。公开代码和输入数据可提高可重复性,并有助于未来的更新。BIS-4D 预测图可在 https://doi.org/10.4121/0c934ac6-2e95-4422-8360-d3a802766c71(Helfenstein 等,2024a)上下载。BIS-4D 填补了荷兰国家级 GlobalSoilMap 产品之前的数据空白,有望促进将土壤空间变异性作为决策支持系统的常规和组成部分。
{"title":"BIS-4D: mapping soil properties and their uncertainties at 25 m resolution in the Netherlands","authors":"Anatol Helfenstein, Vera L. Mulder, Mirjam J. D. Hack-ten Broeke, Maarten van Doorn, Kees Teuling, Dennis J. J. Walvoort, Gerard B. M. Heuvelink","doi":"10.5194/essd-16-2941-2024","DOIUrl":"https://doi.org/10.5194/essd-16-2941-2024","url":null,"abstract":"Abstract. In response to the growing societal awareness of the critical role of healthy soils, there has been an increasing demand for accurate and high-resolution soil information to inform national policies and support sustainable land management decisions. Despite advancements in digital soil mapping and initiatives like GlobalSoilMap, quantifying soil variability and its uncertainty across space, depth and time remains a challenge. Therefore, maps of key soil properties are often still missing on a national scale, which is also the case in the Netherlands. To meet this challenge and fill this data gap, we introduce BIS-4D, a high-resolution soil modeling and mapping platform for the Netherlands. BIS-4D delivers maps of soil texture (clay, silt and sand content), bulk density, pH, total nitrogen, oxalate-extractable phosphorus, cation exchange capacity and their uncertainties at 25 m resolution between 0 and 2 m depth in 3D space. Additionally, it provides maps of soil organic matter and its uncertainty in 3D space and time between 1953 and 2023 at the same resolution and depth range. The statistical model uses machine learning informed by soil observations amounting to between 3815 and 855 950, depending on the soil property, and 366 environmental covariates. We assess the accuracy of mean and median predictions using design-based statistical inference of a probability sample and location-grouped 10-fold cross validation (CV) and prediction uncertainty using the prediction interval coverage probability. We found that the accuracy of clay, sand and pH maps was the highest, with the model efficiency coefficient (MEC) ranging between 0.6 and 0.92 depending on depth. Silt, bulk density, soil organic matter, total nitrogen and cation exchange capacity (MEC of 0.27 to 0.78), and especially oxalate-extractable phosphorus (MEC of −0.11 to 0.38) were more difficult to predict. One of the main limitations of BIS-4D is that prediction maps cannot be used to quantify the uncertainty in spatial aggregates. We provide an example of good practice to help users decide whether BIS-4D is suitable for their intended purpose. An overview of all maps and their uncertainties can be found in the Supplement. Openly available code and input data enhance reproducibility and help with future updates. BIS-4D prediction maps can be readily downloaded at https://doi.org/10.4121/0c934ac6-2e95-4422-8360-d3a802766c71 (Helfenstein et al., 2024a). BIS-4D fills the previous data gap of the national-scale GlobalSoilMap product in the Netherlands and will hopefully facilitate the inclusion of soil spatial variability as a routine and integral part of decision support systems.","PeriodicalId":48747,"journal":{"name":"Earth System Science Data","volume":null,"pages":null},"PeriodicalIF":11.4,"publicationDate":"2024-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141452927","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
3D-GloBFP: the first global three-dimensional building footprint dataset 3D-GloBFP:首个全球三维建筑足迹数据集
IF 11.4 1区 地球科学 Q1 Earth and Planetary Sciences Pub Date : 2024-06-24 DOI: 10.5194/essd-2024-217
Yangzi Che, Xuecao Li, Xiaoping Liu, Yuhao Wang, Weilin Liao, Xianwei Zheng, Xucai Zhang, Xiaocong Xu, Qian Shi, Jiajun Zhu, Hua Yuan, Yongjiu Dai
Abstract. Understanding urban vertical structures, particularly building heights, is essential for examining the intricate interaction between humans and their environment. Such datasets are indispensable for a variety of applications, including climate modeling, energy consumption analysis, and socioeconomic activities. Despite the importance of this information, previous studies have primarily focused on estimating building heights regionally on a grid scale, often resulting in datasets with limited coverage or spatial resolution. This limitation hampers comprehensive global analyses and the ability to generate actionable insights on finer scales. In this study, we developed a global building height map (3D-GloBFP) at a building footprint scale by leveraging Earth Observation (EO) datasets and advanced machine learning techniques. Our approach integrated multisource remote sensing features and building morphology features to develop height estimation models using the eXtreme Gradient Boosting (XGBoost) regression method across diverse global regions. This methodology allowed us to estimate the heights of individual buildings worldwide, culminating in the creation of the first global three-dimensional (3-D) building footprints (3D-GloBFP). Our evaluation results show that the height estimation models perform exceptionally well on a worldwide scale, with R2 ranging from 0.66 to 0.96 and root mean square errors (RMSEs) ranging from 1.9 m to 14.6 m across 33 subregions. Comparisons with other datasets demonstrate that our 3D-GloBFP closely matches the distribution and spatial pattern of reference heights. Our derived 3-D global building footprint map shows a distinct spatial pattern of building heights across regions, countries, and cities, with building heights gradually decreasing from the city center to the surrounding rural areas. Furthermore, our findings indicate the disparities in built-up infrastructure (i.e., building volume) across different countries and cities. China is the country with the most intensive total built-up infrastructure (5.28×1011 m3, accounting for 23.9 % of the global total), followed by the United States (3.90×1011 m3, accounting for 17.6 % of the global total). Shanghai has the largest volume of built-up infrastructure (2.1×1010 m3) of all representative cities. The derived building-footprint scale height map (3D-GloBFP) reveals the significant heterogeneity of urban built-up environments, providing valuable insights for studies in urban socioeconomic dynamics and climatology. The 3D-GloBFP dataset is available at https://doi.org/10.5281/zenodo.11319913 (Building height of the Americas, Africa, and Oceania in 3D-GloBFP) (Che et al., 2024a), https://doi.org/10.5281/zenodo.11397015 (Building height of Asia in 3D-GloBFP) (Che et al., 2024b), and https://doi.org/10.5281/zenodo.11391077 (Building height of Europe in 3D-Gl
摘要了解城市垂直结构,尤其是建筑高度,对于研究人类与其环境之间错综复杂的互动关系至关重要。此类数据集对于气候建模、能耗分析和社会经济活动等各种应用都是不可或缺的。尽管这些信息非常重要,但以往的研究主要集中在以网格为尺度估算区域内的建筑高度,因此数据集的覆盖范围或空间分辨率往往有限。这种局限性阻碍了全球综合分析以及在更细的尺度上产生可行见解的能力。在这项研究中,我们利用地球观测(EO)数据集和先进的机器学习技术,在建筑物足迹尺度上绘制了全球建筑物高度图(3D-GloBFP)。我们的方法整合了多源遥感特征和建筑形态特征,在全球不同地区使用极梯度提升(XGBoost)回归方法开发了高度估算模型。通过这种方法,我们估算出了全球各个建筑物的高度,并最终创建了首个全球三维(3-D)建筑物足迹(3D-GloBFP)。我们的评估结果表明,高度估算模型在全球范围内表现优异,33 个分区域的 R2 值从 0.66 到 0.96 不等,均方根误差 (RMSE) 从 1.9 米到 14.6 米不等。与其他数据集的比较表明,我们的 3D-GloBFP 与参考高度的分布和空间模式非常吻合。我们得出的三维全球建筑足迹图显示了不同地区、国家和城市之间建筑高度的明显空间模式,建筑高度从城市中心向周边农村地区逐渐降低。此外,我们的研究结果表明,不同国家和城市的已建基础设施(即建筑体量)存在差异。中国是已建基础设施总量最密集的国家(5.28×1011 立方米,占全球总量的 23.9%),其次是美国(3.90×1011 立方米,占全球总量的 17.6%)。在所有具有代表性的城市中,上海的基础设施建设量最大(2.1×1010 立方米)。得出的建筑足迹比例高度图(3D-GloBFP)揭示了城市建成环境的显著异质性,为城市社会经济动态和气候学研究提供了宝贵的见解。3D-GloBFP数据集可在以下网站获取:https://doi.org/10.5281/zenodo.11319913(3D-GloBFP中的美洲、非洲和大洋洲建筑高度)(Che等人,2024a)、https://doi.org/10.5281/zenodo.11397015(3D-GloBFP中的亚洲建筑高度)(Che等人,2024b)和https://doi.org/10.5281/zenodo.11391077(3D-GloBFP中的欧洲建筑高度)(Che等人,2024c)。
{"title":"3D-GloBFP: the first global three-dimensional building footprint dataset","authors":"Yangzi Che, Xuecao Li, Xiaoping Liu, Yuhao Wang, Weilin Liao, Xianwei Zheng, Xucai Zhang, Xiaocong Xu, Qian Shi, Jiajun Zhu, Hua Yuan, Yongjiu Dai","doi":"10.5194/essd-2024-217","DOIUrl":"https://doi.org/10.5194/essd-2024-217","url":null,"abstract":"<strong>Abstract.</strong> Understanding urban vertical structures, particularly building heights, is essential for examining the intricate interaction between humans and their environment. Such datasets are indispensable for a variety of applications, including climate modeling, energy consumption analysis, and socioeconomic activities. Despite the importance of this information, previous studies have primarily focused on estimating building heights regionally on a grid scale, often resulting in datasets with limited coverage or spatial resolution. This limitation hampers comprehensive global analyses and the ability to generate actionable insights on finer scales. In this study, we developed a global building height map (3D-GloBFP) at a building footprint scale by leveraging Earth Observation (EO) datasets and advanced machine learning techniques. Our approach integrated multisource remote sensing features and building morphology features to develop height estimation models using the eXtreme Gradient Boosting (XGBoost) regression method across diverse global regions. This methodology allowed us to estimate the heights of individual buildings worldwide, culminating in the creation of the first global three-dimensional (3-D) building footprints (3D-GloBFP). Our evaluation results show that the height estimation models perform exceptionally well on a worldwide scale, with <em>R</em><sup>2</sup> ranging from 0.66 to 0.96 and root mean square errors (RMSEs) ranging from 1.9 m to 14.6 m across 33 subregions. Comparisons with other datasets demonstrate that our 3D-GloBFP closely matches the distribution and spatial pattern of reference heights. Our derived 3-D global building footprint map shows a distinct spatial pattern of building heights across regions, countries, and cities, with building heights gradually decreasing from the city center to the surrounding rural areas. Furthermore, our findings indicate the disparities in built-up infrastructure (i.e., building volume) across different countries and cities. China is the country with the most intensive total built-up infrastructure (5.28×10<sup>11</sup> m<sup>3</sup>, accounting for 23.9 % of the global total), followed by the United States (3.90×10<sup>11</sup> m<sup>3</sup>, accounting for 17.6 % of the global total). Shanghai has the largest volume of built-up infrastructure (2.1×10<sup>10</sup> m<sup>3</sup>) of all representative cities. The derived building-footprint scale height map (3D-GloBFP) reveals the significant heterogeneity of urban built-up environments, providing valuable insights for studies in urban socioeconomic dynamics and climatology. The 3D-GloBFP dataset is available at https://doi.org/10.5281/zenodo.11319913 (Building height of the Americas, Africa, and Oceania in 3D-GloBFP) (Che et al., 2024a), https://doi.org/10.5281/zenodo.11397015 (Building height of Asia in 3D-GloBFP) (Che et al., 2024b), and https://doi.org/10.5281/zenodo.11391077 (Building height of Europe in 3D-Gl","PeriodicalId":48747,"journal":{"name":"Earth System Science Data","volume":null,"pages":null},"PeriodicalIF":11.4,"publicationDate":"2024-06-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141444752","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MDG625: A daily high-resolution meteorological dataset derived by geopotential-guided attention network in Asia (1940–2023) MDG625:由亚洲位势引导关注网络推导出的高分辨率每日气象数据集(1940-2023 年)
IF 11.4 1区 地球科学 Q1 Earth and Planetary Sciences Pub Date : 2024-06-24 DOI: 10.5194/essd-2024-137
Zijiang Song, Zhixiang Cheng, Yuying Li, Shanshan Yu, Xiaowen Zhang, Lina Yuan, Min Liu
Abstract. The long-term and reliable meteorological reanalysis dataset with high spatial-temporal resolution is crucial for various hydrological and meteorological applications, especially in regions or periods with scarce in situ observations and with limited open-access data. Based on the ERA5 (produced by the European Centre for Medium-Range Weather Forecasts, 0.25°×0.25°, since 1940) and CLDAS (China Meteorological Administration Land Data Assimilation System, 0.0625°×0.0625°, since 2008), we proposed a novel downscaling method Geopotential-guide Attention Network (GeoAN) leveraging the high spatial resolution of CLDAS and the extended historical coverage of ERA5 and produced the daily multi-variable (2 m temperature, surface pressure, and 10 m wind speed) meteorological dataset MDG625 (Song et al., 2024). MDG625 (0.0625° Meteorological Dataset derived by GeoAN) covers most of Asia from 0.125° S to 64.875° N and 60.125° E to 160.125° E since 1940. Compared with other downscaling methods, GeoAN shows better performance with the R2 (2 m temperature, surface pressure, and 10 m wind speed reached 0.990, 0.998, and 0.781, respectively). MDG625 demonstrates superior continuity and consistency from both spatial and temporal perspectives. We anticipate that this GeoAN method and this dataset MDG625 will aid in climate studies of Asia and will contribute to improving the accuracy of reanalysis products from the 1940s. The dataset (Song et al., 2024) is presented in https://doi.org/10.57760/sciencedb.17408 and the code can be found in https://github.com/songzijiang/GeoAN.
摘要具有高时空分辨率的长期可靠的气象再分析数据集对于各种水文和气象应用至关重要,尤其是在原地观测资料匮乏和开放数据有限的地区或时期。基于 ERA5(由欧洲中期天气预报中心制作,0.25°×0.25°,1940 年开始)和 CLDAS(中国气象局陆地数据同化系统,0.0625°×0.利用 CLDAS 的高空间分辨率和 ERA5 的扩展历史覆盖范围,我们提出了一种新的降尺度方法--位势引导注意网络(GeoAN),并生成了日多变量(2 米气温、地面气压和 10 米风速)气象数据集 MDG625(Song et al、2024).MDG625(由 GeoAN 导出的 0.0625°气象数据集)涵盖了自 1940 年以来南纬 0.125°至北纬 64.875°、东经 60.125°至东经 160.125°的亚洲大部分地区。与其他降尺度方法相比,GeoAN 的 R2(2 米气温、地面气压和 10 米风速分别达到 0.990、0.998 和 0.781)表现更佳。从空间和时间角度来看,MDG625 都表现出了卓越的连续性和一致性。我们预计这种 GeoAN 方法和 MDG625 数据集将有助于亚洲的气候研究,并有助于提高 20 世纪 40 年代再分析产品的精度。数据集(Song 等,2024 年)见 https://doi.org/10.57760/sciencedb.17408,代码见 https://github.com/songzijiang/GeoAN。
{"title":"MDG625: A daily high-resolution meteorological dataset derived by geopotential-guided attention network in Asia (1940–2023)","authors":"Zijiang Song, Zhixiang Cheng, Yuying Li, Shanshan Yu, Xiaowen Zhang, Lina Yuan, Min Liu","doi":"10.5194/essd-2024-137","DOIUrl":"https://doi.org/10.5194/essd-2024-137","url":null,"abstract":"<strong>Abstract.</strong> The long-term and reliable meteorological reanalysis dataset with high spatial-temporal resolution is crucial for various hydrological and meteorological applications, especially in regions or periods with scarce in situ observations and with limited open-access data. Based on the ERA5 (produced by the European Centre for Medium-Range Weather Forecasts, 0.25°×0.25°, since 1940) and CLDAS (China Meteorological Administration Land Data Assimilation System, 0.0625°×0.0625°, since 2008), we proposed a novel downscaling method Geopotential-guide Attention Network (GeoAN) leveraging the high spatial resolution of CLDAS and the extended historical coverage of ERA5 and produced the daily multi-variable (2 m temperature, surface pressure, and 10 m wind speed) meteorological dataset MDG625 (Song et al., 2024). MDG625 (0.0625° Meteorological Dataset derived by GeoAN) covers most of Asia from 0.125° S to 64.875° N and 60.125° E to 160.125° E since 1940. Compared with other downscaling methods, GeoAN shows better performance with the R<sup>2</sup> (2 m temperature, surface pressure, and 10 m wind speed reached 0.990, 0.998, and 0.781, respectively). MDG625 demonstrates superior continuity and consistency from both spatial and temporal perspectives. We anticipate that this GeoAN method and this dataset MDG625 will aid in climate studies of Asia and will contribute to improving the accuracy of reanalysis products from the 1940s. The dataset (Song et al., 2024) is presented in https://doi.org/10.57760/sciencedb.17408 and the code can be found in https://github.com/songzijiang/GeoAN.","PeriodicalId":48747,"journal":{"name":"Earth System Science Data","volume":null,"pages":null},"PeriodicalIF":11.4,"publicationDate":"2024-06-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141444765","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A derecho climatology (2004–2021) in the United States based on machine learning identification of bow echoes 基于弓形回波机器学习识别的美国厄尔尼诺气候学(2004-2021 年
IF 11.4 1区 地球科学 Q1 Earth and Planetary Sciences Pub Date : 2024-06-24 DOI: 10.5194/essd-2024-112
Jianfeng Li, Andrew Geiss, Zhe Feng, L. Ruby Leung, Yun Qian, Wenjun Cui
Abstract. Due to their persistent widespread severe winds, derechos pose significant threats to human safety and property, and they are as hazardous and fatal as many tornadoes and hurricanes. Yet, automated detection of derechos remains challenging due to the absence of spatiotemporally continuous observations and the complex criteria employed to define the phenomenon. This study proposes a physically based definition of derechos that contains the key features of derechos described in the literature and allows their automated objective identification using either observations or model simulations. The automated detection is composed of three algorithms: the Flexible Object Tracker algorithm to track mesoscale convective systems (MCSs), a semantic segmentation convolutional neural network to identify bow echoes, and a comprehensive algorithm to classify MCSs as derechos or non-derecho events. Using the new approach, we develop a novel high-resolution (4 km and hourly) observational dataset of derechos over the United States east of the Rocky Mountains from 2004 to 2021. The dataset is analyzed to document the derecho climatology in the United States. Many more derechos (increased by ~50–400 %) are identified in the dataset (~31 events per year) than in previous estimations (~6–21 events per year), but the spatial distribution and seasonal variation patterns resemble earlier studies with a peak occurrence in the Great Plains and Midwest during the warm season. In addition, around 20 % of damaging gust (≥ 25.93 m s-1) reports are produced by derechos during the dataset period over the United States east of the Rocky Mountains. The dataset is available at https://doi.org/10.5281/zenodo.10884046 (Li et al., 2024).
摘要。由于持续大范围的强风,龙卷风对人类安全和财产构成重大威胁,其危害性和致命性不亚于许多龙卷风和飓风。然而,由于缺乏连续的时空观测数据,以及定义这种现象的标准非常复杂,因此自动探测龙卷风仍然具有挑战性。本研究提出了一个基于物理的 "德雷乔斯 "定义,该定义包含了文献中描述的 "德雷乔斯 "的主要特征,并允许使用观测数据或模型模拟自动客观地识别 "德雷乔斯"。自动检测由三种算法组成:用于跟踪中尺度对流系统(MCSs)的灵活目标跟踪算法、用于识别弓形回波的语义分割卷积神经网络,以及用于将中尺度对流系统分类为德雷赫斯或非德雷赫斯事件的综合算法。利用新方法,我们开发了一个新的高分辨率(4 公里和每小时)观测数据集,用于观测 2004 年至 2021 年洛基山脉以东美国上空的低回声。通过分析该数据集,我们记录了美国的厄尔尼诺气候。与之前的估计(每年约 6-21 次)相比,数据集中发现的厄尔尼诺现象要多得多(增加了约 50-400%)(每年约 31 次),但其空间分布和季节变化规律与之前的研究相似,暖季时大平原和中西部地区出现厄尔尼诺现象的高峰。此外,在数据集期间,美国落基山以东地区约有 20% 的破坏性阵风(≥ 25.93 m s-1)报告是由德雷乔斯造成的。该数据集可查阅 https://doi.org/10.5281/zenodo.10884046(Li 等人,2024 年)。
{"title":"A derecho climatology (2004–2021) in the United States based on machine learning identification of bow echoes","authors":"Jianfeng Li, Andrew Geiss, Zhe Feng, L. Ruby Leung, Yun Qian, Wenjun Cui","doi":"10.5194/essd-2024-112","DOIUrl":"https://doi.org/10.5194/essd-2024-112","url":null,"abstract":"<strong>Abstract.</strong> Due to their persistent widespread severe winds, derechos pose significant threats to human safety and property, and they are as hazardous and fatal as many tornadoes and hurricanes. Yet, automated detection of derechos remains challenging due to the absence of spatiotemporally continuous observations and the complex criteria employed to define the phenomenon. This study proposes a physically based definition of derechos that contains the key features of derechos described in the literature and allows their automated objective identification using either observations or model simulations. The automated detection is composed of three algorithms: the Flexible Object Tracker algorithm to track mesoscale convective systems (MCSs), a semantic segmentation convolutional neural network to identify bow echoes, and a comprehensive algorithm to classify MCSs as derechos or non-derecho events. Using the new approach, we develop a novel high-resolution (4 km and hourly) observational dataset of derechos over the United States east of the Rocky Mountains from 2004 to 2021. The dataset is analyzed to document the derecho climatology in the United States. Many more derechos (increased by ~50–400 %) are identified in the dataset (~31 events per year) than in previous estimations (~6–21 events per year), but the spatial distribution and seasonal variation patterns resemble earlier studies with a peak occurrence in the Great Plains and Midwest during the warm season. In addition, around 20 % of damaging gust (≥ 25.93 m s<sup>-1</sup>) reports are produced by derechos during the dataset period over the United States east of the Rocky Mountains. The dataset is available at https://doi.org/10.5281/zenodo.10884046 (Li et al., 2024).","PeriodicalId":48747,"journal":{"name":"Earth System Science Data","volume":null,"pages":null},"PeriodicalIF":11.4,"publicationDate":"2024-06-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141444758","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Development of a high-resolution integrated emission inventory of air pollutants for China 编制中国大气污染物高分辨率综合排放清单
IF 11.4 1区 地球科学 Q1 Earth and Planetary Sciences Pub Date : 2024-06-21 DOI: 10.5194/essd-16-2893-2024
Nana Wu, Guannan Geng, Ruochong Xu, Shigan Liu, Xiaodong Liu, Qinren Shi, Ying Zhou, Yu Zhao, Huan Liu, Yu Song, Junyu Zheng, Qiang Zhang, Kebin He
Abstract. Constructing a highly resolved comprehensive emission dataset for China is challenging due to limited availability of refined information for parameters in a unified bottom-up framework. Here, by developing an integrated modeling framework, we harmonized multi-source heterogeneous data, including several up-to-date emission inventories at national and regional scales and for key species and sources in China to generate a 0.1° resolution inventory for 2017. By source mapping, species mapping, temporal disaggregation, spatial allocation, and spatial–temporal coupling, different emission inventories are normalized in terms of source categories, chemical species, and spatiotemporal resolutions. This achieves the coupling of multi-scale, high-resolution emission inventories with the Multi-resolution Emission Inventory for China (MEIC), forming the high-resolution INTegrated emission inventory of Air pollutants for China (INTAC). We find that INTAC provides more accurate representations for emission magnitudes and spatiotemporal patterns. In 2017, China's emissions of sulfur dioxide (SO2), nitrous oxides (NOx), carbon monoxide (CO), non-methane volatile organic compounds (NMVOCs), ammonia (NH3), PM10 and PM2.5 (particulate matter), black carbon (BC), and organic carbon (OC) were 12.3, 24.5, 141.0, 27.9, 9.2, 11.1, 8.4, 1.3, and 2.2 Tg, respectively. The proportion of point source emissions for SO2, PM10, NOx, and PM2.5 increases from 7 %–19 % in MEIC to 48 %–66 % in INTAC, resulting in improved spatial accuracy, especially mitigating overestimations in densely populated areas. Compared with MEIC, INTAC reduces mean biases in simulated concentrations of major air pollutants by 2–14 µg m−3 across 74 cities, compared against ground observations. The enhanced model performance by INTAC is particularly evident at finer-grid resolutions. Our new dataset is accessible at http://meicmodel.org.cn/intac (last access: 15 April 2024) and https://doi.org/10.5281/zenodo.10459198 (Wu et al., 2024), and it will provide a solid data foundation for fine-scale atmospheric research and air-quality improvement.
摘要由于在统一的自下而上框架中,参数的细化信息有限,为中国构建一个高分辨率的综合排放数据集具有挑战性。在此,通过开发一个综合建模框架,我们协调了多源异构数据,包括多个国家和区域尺度上的最新排放清单以及中国的关键物种和排放源,生成了一个 0.1° 分辨率的 2017 年清单。通过来源映射、物种映射、时间分解、空间分配和时空耦合,不同排放清单在来源类别、化学物种和时空分辨率方面实现了标准化。这就实现了多尺度、高分辨率排放清单与中国多分辨率排放清单(MEIC)的耦合,形成了高分辨率的中国大气污染物综合排放清单(INTAC)。我们发现,INTAC 能更准确地反映排放规模和时空模式。2017年,中国二氧化硫(SO2)、氧化亚氮(NOx)、一氧化碳(CO)、非甲烷挥发性有机物(NMVOCs)、氨(NH3)、可吸入颗粒物(PM10和PM2.5)、黑碳(BC)和有机碳(OC)的排放量分别为12.3、24.5、141.0、27.9、9.2、11.1、8.4、1.3和2.2 Tg。SO2、PM10、NOx和PM2.5的点源排放比例从MEIC的7%-19%增加到INTAC的48%-66%,从而提高了空间精度,特别是减少了人口稠密地区的高估。与 MEIC 相比,INTAC 可将 74 个城市主要空气污染物模拟浓度的平均偏差减少 2-14 µg m-3(与地面观测数据相比)。在更精细的网格分辨率下,INTAC模型性能的提升尤为明显。我们的新数据集可在 http://meicmodel.org.cn/intac(最后访问日期:2024 年 4 月 15 日)和 https://doi.org/10.5281/zenodo.10459198(Wu 等人,2024 年)上访问,它将为精细尺度大气研究和空气质量改善提供坚实的数据基础。
{"title":"Development of a high-resolution integrated emission inventory of air pollutants for China","authors":"Nana Wu, Guannan Geng, Ruochong Xu, Shigan Liu, Xiaodong Liu, Qinren Shi, Ying Zhou, Yu Zhao, Huan Liu, Yu Song, Junyu Zheng, Qiang Zhang, Kebin He","doi":"10.5194/essd-16-2893-2024","DOIUrl":"https://doi.org/10.5194/essd-16-2893-2024","url":null,"abstract":"Abstract. Constructing a highly resolved comprehensive emission dataset for China is challenging due to limited availability of refined information for parameters in a unified bottom-up framework. Here, by developing an integrated modeling framework, we harmonized multi-source heterogeneous data, including several up-to-date emission inventories at national and regional scales and for key species and sources in China to generate a 0.1° resolution inventory for 2017. By source mapping, species mapping, temporal disaggregation, spatial allocation, and spatial–temporal coupling, different emission inventories are normalized in terms of source categories, chemical species, and spatiotemporal resolutions. This achieves the coupling of multi-scale, high-resolution emission inventories with the Multi-resolution Emission Inventory for China (MEIC), forming the high-resolution INTegrated emission inventory of Air pollutants for China (INTAC). We find that INTAC provides more accurate representations for emission magnitudes and spatiotemporal patterns. In 2017, China's emissions of sulfur dioxide (SO2), nitrous oxides (NOx), carbon monoxide (CO), non-methane volatile organic compounds (NMVOCs), ammonia (NH3), PM10 and PM2.5 (particulate matter), black carbon (BC), and organic carbon (OC) were 12.3, 24.5, 141.0, 27.9, 9.2, 11.1, 8.4, 1.3, and 2.2 Tg, respectively. The proportion of point source emissions for SO2, PM10, NOx, and PM2.5 increases from 7 %–19 % in MEIC to 48 %–66 % in INTAC, resulting in improved spatial accuracy, especially mitigating overestimations in densely populated areas. Compared with MEIC, INTAC reduces mean biases in simulated concentrations of major air pollutants by 2–14 µg m−3 across 74 cities, compared against ground observations. The enhanced model performance by INTAC is particularly evident at finer-grid resolutions. Our new dataset is accessible at http://meicmodel.org.cn/intac (last access: 15 April 2024) and https://doi.org/10.5281/zenodo.10459198 (Wu et al., 2024), and it will provide a solid data foundation for fine-scale atmospheric research and air-quality improvement.","PeriodicalId":48747,"journal":{"name":"Earth System Science Data","volume":null,"pages":null},"PeriodicalIF":11.4,"publicationDate":"2024-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141435782","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Earth System Science Data
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1