首页 > 最新文献

Computers & Geosciences最新文献

英文 中文
Super-resolution of 3D Micro-CT images using generative adversarial Networks: Enhancing resolution and segmentation accuracy 使用生成对抗网络的三维微ct图像的超分辨率:提高分辨率和分割精度
IF 4.4 2区 地球科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-07-30 DOI: 10.1016/j.cageo.2025.106018
Evgeny Ugolkov , Xupeng He , Hyung Kwak , Hussein Hoteit
We develop a procedure for substantially improving the quality of segmented 3D micro-Computed Tomography (micro-CT) images of rocks with a Machine Learning (ML) Generative Model. The proposed model enhances the resolution eightfold (8x) and addresses segmentation inaccuracies due to the overlapping X-ray attenuation in micro-CT measurement for different rock minerals and phases. The proposed generative model is a 3D Deep Convolutional Wasserstein Generative Adversarial Network with Gradient Penalty (3D DC WGAN-GP). The algorithm is trained on segmented 3D low-resolution micro-CT images and segmented unpaired complementary 2D high-resolution Laser Scanning Microscope (LSM) images. The algorithm was demonstrated on multiple samples of Berea sandstones. We achieved high-quality super-resolved 3D images with a resolution of 0.44 μm/voxel and accurate segmentation for constituting minerals and pore space. The proposed procedure can significantly expand the modern capabilities of digital rock physics.
我们开发了一种程序,用于通过机器学习(ML)生成模型大幅提高岩石分段三维微计算机断层扫描(micro-CT)图像的质量。该模型将分辨率提高了8倍(8倍),并解决了在不同岩石矿物和相的微ct测量中由于x射线衰减重叠而导致的分割不准确。提出的生成模型是三维深度卷积Wasserstein梯度惩罚生成对抗网络(3D DC WGAN-GP)。该算法分别对三维低分辨率微ct图像和二维高分辨率激光扫描显微镜(LSM)图像进行分割训练。该算法在Berea砂岩的多个样本上进行了验证。我们获得了分辨率为0.44 μm/体素的高质量超分辨率3D图像,并对构成矿物和孔隙空间进行了精确分割。所提出的程序可以显著扩展数字岩石物理的现代能力。
{"title":"Super-resolution of 3D Micro-CT images using generative adversarial Networks: Enhancing resolution and segmentation accuracy","authors":"Evgeny Ugolkov ,&nbsp;Xupeng He ,&nbsp;Hyung Kwak ,&nbsp;Hussein Hoteit","doi":"10.1016/j.cageo.2025.106018","DOIUrl":"10.1016/j.cageo.2025.106018","url":null,"abstract":"<div><div>We develop a procedure for substantially improving the quality of segmented 3D micro-Computed Tomography (micro-CT) images of rocks with a Machine Learning (ML) Generative Model. The proposed model enhances the resolution eightfold (8x) and addresses segmentation inaccuracies due to the overlapping X-ray attenuation in micro-CT measurement for different rock minerals and phases. The proposed generative model is a 3D Deep Convolutional Wasserstein Generative Adversarial Network with Gradient Penalty (3D DC WGAN-GP). The algorithm is trained on segmented 3D low-resolution micro-CT images and segmented unpaired complementary 2D high-resolution Laser Scanning Microscope (LSM) images. The algorithm was demonstrated on multiple samples of Berea sandstones. We achieved high-quality super-resolved 3D images with a resolution of 0.44 μm/voxel and accurate segmentation for constituting minerals and pore space. The proposed procedure can significantly expand the modern capabilities of digital rock physics.</div></div>","PeriodicalId":55221,"journal":{"name":"Computers & Geosciences","volume":"205 ","pages":"Article 106018"},"PeriodicalIF":4.4,"publicationDate":"2025-07-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144763906","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An improved method for pore size distribution measurement of porous geomaterials based on microscopic images 基于显微图像的多孔岩土材料孔径分布测量方法的改进
IF 4.4 2区 地球科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-07-28 DOI: 10.1016/j.cageo.2025.106017
Shijia Ma, Jiangfeng Liu, Zhipeng Wang, Ruinian Sun, Xinyue Zhang, Hongyang Ni
Pore size distribution (PSD) is vital for characterizing microscopic information and fluid transport in geomaterials, but traditional methods struggle with irregular pore shapes and digital imaging errors, often leading to inaccurate results. This study presents an improved morphological transformation-based algorithm that iteratively fills voids with maximal circles or spheres and introduces an optimized scheme for small-pore representation, significantly reducing measurement errors. Validation on eight 2D scanning electron microscope and six 3D computer tomography images shows the proposed method achieves up to 67 % lower relative error for small pore sizes and produces permeability predictions with a mean deviation within 3 % of experimental values, outperforming established techniques. Statistical analysis confirms that, for most samples, predicted permeability values fall within or approaching the 95 % confidence interval of measured data, demonstrating robust consistency across imaging sources and magnifications. Furthermore, the quantitative evaluation of pore geometry and PSD curves using different methods reveals that complex and randomly distributed pore geometries strongly influence PSD curve morphology, underscoring the importance of geometric characterization. These advancements enable more reliable and repeatable pore structure quantification, offering practical value for geoscience and engineering applications.
孔隙尺寸分布(PSD)对于表征岩土材料的微观信息和流体运移至关重要,但传统的方法难以处理不规则的孔隙形状和数字成像误差,往往导致结果不准确。本研究提出了一种改进的基于形态变换的算法,该算法迭代地用最大的圆或球填充空隙,并引入了一种优化的小孔隙表示方案,显著降低了测量误差。在8张二维扫描电镜和6张三维计算机断层扫描图像上的验证表明,该方法在小孔隙尺寸下的相对误差降低了67%,渗透率预测的平均偏差在实验值的3%以内,优于现有技术。统计分析证实,对于大多数样品,预测渗透率值落在或接近测量数据的95%置信区间内,显示出不同成像源和放大倍数的强大一致性。此外,利用不同的方法对孔隙几何形状和PSD曲线进行定量评价,揭示了复杂和随机分布的孔隙几何形状对PSD曲线形态的强烈影响,强调了几何表征的重要性。这些进步使孔隙结构量化更加可靠和可重复,为地球科学和工程应用提供了实用价值。
{"title":"An improved method for pore size distribution measurement of porous geomaterials based on microscopic images","authors":"Shijia Ma,&nbsp;Jiangfeng Liu,&nbsp;Zhipeng Wang,&nbsp;Ruinian Sun,&nbsp;Xinyue Zhang,&nbsp;Hongyang Ni","doi":"10.1016/j.cageo.2025.106017","DOIUrl":"10.1016/j.cageo.2025.106017","url":null,"abstract":"<div><div>Pore size distribution (PSD) is vital for characterizing microscopic information and fluid transport in geomaterials, but traditional methods struggle with irregular pore shapes and digital imaging errors, often leading to inaccurate results. This study presents an improved morphological transformation-based algorithm that iteratively fills voids with maximal circles or spheres and introduces an optimized scheme for small-pore representation, significantly reducing measurement errors. Validation on eight 2D scanning electron microscope and six 3D computer tomography images shows the proposed method achieves up to 67 % lower relative error for small pore sizes and produces permeability predictions with a mean deviation within 3 % of experimental values, outperforming established techniques. Statistical analysis confirms that, for most samples, predicted permeability values fall within or approaching the 95 % confidence interval of measured data, demonstrating robust consistency across imaging sources and magnifications. Furthermore, the quantitative evaluation of pore geometry and PSD curves using different methods reveals that complex and randomly distributed pore geometries strongly influence PSD curve morphology, underscoring the importance of geometric characterization. These advancements enable more reliable and repeatable pore structure quantification, offering practical value for geoscience and engineering applications.</div></div>","PeriodicalId":55221,"journal":{"name":"Computers & Geosciences","volume":"205 ","pages":"Article 106017"},"PeriodicalIF":4.4,"publicationDate":"2025-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144748729","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DisperPy: A machine learning based tool to automatically pick group velocity dispersion curves from earthquakes 色散:一个基于机器学习的工具,可以自动从地震中选择群速度色散曲线
IF 4.2 2区 地球科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-07-16 DOI: 10.1016/j.cageo.2025.106015
André V.S. Nascimento , Carlos A.M. Chaves , Susanne T.R. Maciel , George S. França , Giuliano S. Marotta
Seismology has made significant progress in high-resolution Earth imaging, largely driven by the increasing volume of freely available data. As a result, automated tools and machine learning algorithms are becoming essential for processing this vast amount of information. We present DisperPy, an open-source Python library developed to automatically extract group velocity dispersion curves from earthquake data. The analysis framework of DisperPy is structured around two primary tasks: (1) assessing the quality of waveforms to determine if dispersion extraction is feasible, and (2) measuring the group velocity dispersion curve for suitable waveforms. To address the first task, DisperPy uses a convolutional neural network trained on dispersion spectrograms to classify waveform quality. The model, based on the ResNet-34 architecture, is initialized with ImageNet-pretrained weights and fine-tuned using the fastai deep learning library. In the test set, the network achieves an accuracy of 92 % in distinguishing between high- and low-quality dispersion images. For the second task, DisperPy employs unsupervised learning techniques, starting with a Gaussian mixture model to separate dispersion energy from background noise, followed by k-means to separate the dispersion energy into clusters, making it easier to track amplitude maxima and then construct initial dispersion curves. Finally, a refinement of the initial dispersion is achieved using both the density-based spatial clustering of applications with noise algorithm and data quality criteria to remove possible outliers. To further test DisperPy, we conduct a surface wave tomography experiment across the contiguous United States using freely available vertical-component broadband waveforms. After processing the data with DisperPy and removing low-quality waveforms, the final dataset consisted of 194,325 unique dispersion curves. Consistent with previous studies, our maps reveal a prominent velocity dichotomy, with low velocities in the tectonically active western US and high velocities in the stable central and eastern US.
地震学在高分辨率地球成像方面取得了重大进展,这主要是由于免费数据量的增加。因此,自动化工具和机器学习算法对于处理如此大量的信息变得至关重要。我们提出了一个开源的Python库DisperPy,用于从地震数据中自动提取群速度色散曲线。色散分析框架围绕两个主要任务构建:(1)评估波形质量以确定色散提取是否可行;(2)测量合适波形的群速度色散曲线。为了解决第一个任务,DisperPy使用在色散谱图上训练的卷积神经网络对波形质量进行分类。该模型基于ResNet-34架构,使用imagenet预训练的权值进行初始化,并使用fastai深度学习库进行微调。在测试集中,该网络在区分高质量和低质量色散图像方面达到了92%的准确率。对于第二个任务,DisperPy采用无监督学习技术,从高斯混合模型开始将色散能量从背景噪声中分离出来,然后使用k-means将色散能量分离成簇,从而更容易跟踪振幅最大值,然后构建初始色散曲线。最后,使用基于密度的空间聚类应用和噪声算法和数据质量标准来去除可能的异常值,从而实现初始离散度的细化。为了进一步测试色散,我们使用免费提供的垂直分量宽带波形在美国邻近地区进行了表面波层析成像实验。在对数据进行色散处理并去除低质量波形后,最终数据集由194,325条独特的色散曲线组成。与之前的研究一致,我们的地图显示了一个明显的速度二分法,在构造活跃的美国西部,速度较低,而在稳定的美国中部和东部,速度较高。
{"title":"DisperPy: A machine learning based tool to automatically pick group velocity dispersion curves from earthquakes","authors":"André V.S. Nascimento ,&nbsp;Carlos A.M. Chaves ,&nbsp;Susanne T.R. Maciel ,&nbsp;George S. França ,&nbsp;Giuliano S. Marotta","doi":"10.1016/j.cageo.2025.106015","DOIUrl":"10.1016/j.cageo.2025.106015","url":null,"abstract":"<div><div>Seismology has made significant progress in high-resolution Earth imaging, largely driven by the increasing volume of freely available data. As a result, automated tools and machine learning algorithms are becoming essential for processing this vast amount of information. We present <em>DisperPy</em>, an open-source Python library developed to automatically extract group velocity dispersion curves from earthquake data. The analysis framework of <em>DisperPy</em> is structured around two primary tasks: (1) assessing the quality of waveforms to determine if dispersion extraction is feasible, and (2) measuring the group velocity dispersion curve for suitable waveforms. To address the first task, <em>DisperPy</em> uses a convolutional neural network trained on dispersion spectrograms to classify waveform quality. The model, based on the ResNet-34 architecture, is initialized with ImageNet-pretrained weights and fine-tuned using the fastai deep learning library. In the test set, the network achieves an accuracy of 92 % in distinguishing between high- and low-quality dispersion images. For the second task, <em>DisperPy</em> employs unsupervised learning techniques, starting with a Gaussian mixture model to separate dispersion energy from background noise, followed by <em>k-means</em> to separate the dispersion energy into clusters, making it easier to track amplitude maxima and then construct initial dispersion curves. Finally, a refinement of the initial dispersion is achieved using both the density-based spatial clustering of applications with noise algorithm and data quality criteria to remove possible outliers. To further test <em>DisperPy</em>, we conduct a surface wave tomography experiment across the contiguous United States using freely available vertical-component broadband waveforms. After processing the data with <em>DisperPy</em> and removing low-quality waveforms, the final dataset consisted of 194,325 unique dispersion curves. Consistent with previous studies, our maps reveal a prominent velocity dichotomy, with low velocities in the tectonically active western US and high velocities in the stable central and eastern US.</div></div>","PeriodicalId":55221,"journal":{"name":"Computers & Geosciences","volume":"205 ","pages":"Article 106015"},"PeriodicalIF":4.2,"publicationDate":"2025-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144662576","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Adaptive Preconditioned Conjugate Gradient Regularization (APCGR) algorithm with Sigmoid Function (SF) constraint for efficient three-dimensional (3D) gravity focusing inversion 基于Sigmoid函数(SF)约束的自适应预条件共轭梯度正则化(APCGR)算法用于三维重力聚焦反演
IF 4.2 2区 地球科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-07-16 DOI: 10.1016/j.cageo.2025.106014
Wenjin Chen, Xiaolong Tan
We introduce a novel focused gravity inversion algorithm and develop corresponding software, highlighting three key innovations. First, we propose the Adaptive Preconditioned Conjugate Gradient Regularization algorithm, which efficiently and adaptively determines the regularization parameter. Second, we incorporate the Sigmoid Function to stabilize the inversion process, significantly accelerating iterative convergence. Third, we have developed a user-friendly software with a graphical user interface for this new method, utilizing the popular high-level and interactive programming language MATLAB. To promote knowledge sharing and resource accessibility, we have made the software open-source. To validate our approach, we tested the algorithm on both synthetic and real gravity data, demonstrating its exceptional capability to accurately reconstruct the 3D density distribution of complex subsurface structures. Furthermore, we conducted a comparative analysis between the new algorithm, the conjugate gradient method constrained by SF, and the standard conjugate gradient method. The results indicate that the new method requires fewer iterations and exhibits higher computational efficiency.
本文介绍了一种新的重力聚焦反演算法,并开发了相应的软件,重点介绍了三个关键创新点。首先,提出了自适应预条件共轭梯度正则化算法,该算法能有效地自适应确定正则化参数。其次,我们引入了Sigmoid函数来稳定反演过程,显著加快了迭代收敛。第三,我们利用流行的高级交互式编程语言MATLAB,为这种新方法开发了具有图形用户界面的用户友好软件。为了促进知识共享和资源可及性,我们对软件进行了开源。为了验证我们的方法,我们在合成和真实重力数据上测试了该算法,证明了其精确重建复杂地下结构三维密度分布的卓越能力。在此基础上,对新算法、受SF约束的共轭梯度法和标准共轭梯度法进行了对比分析。结果表明,该方法迭代次数少,计算效率高。
{"title":"An Adaptive Preconditioned Conjugate Gradient Regularization (APCGR) algorithm with Sigmoid Function (SF) constraint for efficient three-dimensional (3D) gravity focusing inversion","authors":"Wenjin Chen,&nbsp;Xiaolong Tan","doi":"10.1016/j.cageo.2025.106014","DOIUrl":"10.1016/j.cageo.2025.106014","url":null,"abstract":"<div><div>We introduce a novel focused gravity inversion algorithm and develop corresponding software, highlighting three key innovations. First, we propose the Adaptive Preconditioned Conjugate Gradient Regularization algorithm, which efficiently and adaptively determines the regularization parameter. Second, we incorporate the Sigmoid Function to stabilize the inversion process, significantly accelerating iterative convergence. Third, we have developed a user-friendly software with a graphical user interface for this new method, utilizing the popular high-level and interactive programming language MATLAB. To promote knowledge sharing and resource accessibility, we have made the software open-source. To validate our approach, we tested the algorithm on both synthetic and real gravity data, demonstrating its exceptional capability to accurately reconstruct the 3D density distribution of complex subsurface structures. Furthermore, we conducted a comparative analysis between the new algorithm, the conjugate gradient method constrained by SF, and the standard conjugate gradient method. The results indicate that the new method requires fewer iterations and exhibits higher computational efficiency.</div></div>","PeriodicalId":55221,"journal":{"name":"Computers & Geosciences","volume":"205 ","pages":"Article 106014"},"PeriodicalIF":4.2,"publicationDate":"2025-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144654450","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A novel class-imbalance learning framework for fluid recognition: Application to Qingshimao-Gaoshawo tight-sand gas reservoirs in the Ordos Basin, China 一种新的类不平衡学习框架在流体识别中的应用——以鄂尔多斯盆地青石茂—高沙窝致密砂岩气藏为例
IF 4.2 2区 地球科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-07-14 DOI: 10.1016/j.cageo.2025.105993
Jun Yi , ZhongLi Qi , XiangChengZhen Li , Fuqiang Lai , Wei Zhou
The mathematical model-based methods used for conventional oil and gas resources often perform poorly in fluid recognition of tight-sand reservoir, due to the mutual interference of various factors such as reservoir lithology and pore structure. Booming artificial intelligence technologies and accumulating logging data provide a solid foundation for the application of machine learning methods as new tools for fluid identification. However, there is often a serious class imbalance, which can easily lead to the inability to achieve ideal classification results, in the proportion of categories of the collected well logging data. Consequently, this issue has become a huge challenge for the academic and industrial communities. To address this, a novel class-imbalance learning framework for fluid recognition (CILF) is proposed to tight-sand gas reservoirs of Qingshimao-Gaoshawo area of Ordos Basin, in China. Specifically, an improved label propagation algorithm based on semi-supervised learning (SS-LPA) is designed at the data level, which can reduce the imbalance rate of raw data to some extent after assigning high-confidence labels to unlabeled samples. At the model level, Q-network, as an effective reinforcement learning approach, is introduced into ensemble learning framework (QNEL), which can enhance the multi-classification accuracy of fluid identification by training multiple baseline models that are given different weights for feedback on imbalanced data. The experimental results from 35 tight-sand wells in Qingshimao-Gaoshawo area of Ordos Basin validate the effectiveness of the proposed framework. Specifically, the performance of CILF is the best on all three typical evaluation metrics, and it outperforms others in 12 out of a total of 18 categories. In terms of the average scores for six categories, the precision, recall rate, and F1 score of the proposed framework reach 0.988, 0.984, and 0.985, respectively.
常规油气资源基于数学模型的方法由于储层岩性、孔隙结构等多种因素的相互干扰,在致密砂岩储层流体识别中往往表现不佳。人工智能技术的蓬勃发展和测井数据的积累为机器学习方法作为流体识别新工具的应用提供了坚实的基础。然而,所采集的测井资料的类别比例往往存在严重的类别不平衡,容易导致无法获得理想的分类结果。因此,这一问题已成为学术界和工业界面临的巨大挑战。针对这一问题,提出了一种新的流体识别类不平衡学习框架(CILF),并应用于鄂尔多斯盆地青石茂—高沙窝致密砂岩气藏。具体而言,在数据层面设计了一种改进的基于半监督学习的标签传播算法(SS-LPA),通过对未标记的样本分配高置信度标签,可以在一定程度上降低原始数据的不平衡率。在模型层面,将Q-network作为一种有效的强化学习方法引入到集成学习框架(QNEL)中,通过训练多个基线模型,赋予不同的权值对不平衡数据进行反馈,提高流体识别的多分类精度。鄂尔多斯盆地青石茂—高沙窝地区35口致密砂岩井的实验结果验证了该框架的有效性。具体来说,在所有三个典型的评估指标上,CILF的表现都是最好的,并且在总共18个类别中的12个类别中表现优于其他类别。从6个类别的平均得分来看,所提框架的准确率、召回率和F1得分分别达到0.988、0.984和0.985。
{"title":"A novel class-imbalance learning framework for fluid recognition: Application to Qingshimao-Gaoshawo tight-sand gas reservoirs in the Ordos Basin, China","authors":"Jun Yi ,&nbsp;ZhongLi Qi ,&nbsp;XiangChengZhen Li ,&nbsp;Fuqiang Lai ,&nbsp;Wei Zhou","doi":"10.1016/j.cageo.2025.105993","DOIUrl":"10.1016/j.cageo.2025.105993","url":null,"abstract":"<div><div>The mathematical model-based methods used for conventional oil and gas resources often perform poorly in fluid recognition of tight-sand reservoir, due to the mutual interference of various factors such as reservoir lithology and pore structure. Booming artificial intelligence technologies and accumulating logging data provide a solid foundation for the application of machine learning methods as new tools for fluid identification. However, there is often a serious class imbalance, which can easily lead to the inability to achieve ideal classification results, in the proportion of categories of the collected well logging data. Consequently, this issue has become a huge challenge for the academic and industrial communities. To address this, a novel class-imbalance learning framework for fluid recognition (CILF) is proposed to tight-sand gas reservoirs of Qingshimao-Gaoshawo area of Ordos Basin, in China. Specifically, an improved label propagation algorithm based on semi-supervised learning (SS-LPA) is designed at the data level, which can reduce the imbalance rate of raw data to some extent after assigning high-confidence labels to unlabeled samples. At the model level, <span><math><mi>Q</mi></math></span>-network, as an effective reinforcement learning approach, is introduced into ensemble learning framework (QNEL), which can enhance the multi-classification accuracy of fluid identification by training multiple baseline models that are given different weights for feedback on imbalanced data. The experimental results from 35 tight-sand wells in Qingshimao-Gaoshawo area of Ordos Basin validate the effectiveness of the proposed framework. Specifically, the performance of CILF is the best on all three typical evaluation metrics, and it outperforms others in 12 out of a total of 18 categories. In terms of the average scores for six categories, the precision, recall rate, and F1 score of the proposed framework reach 0.988, 0.984, and 0.985, respectively.</div></div>","PeriodicalId":55221,"journal":{"name":"Computers & Geosciences","volume":"205 ","pages":"Article 105993"},"PeriodicalIF":4.2,"publicationDate":"2025-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144654451","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Seismicity-constrained fault detection and characterization with a multitask machine learning model 基于多任务机器学习模型的地震约束故障检测与表征
IF 4.2 2区 地球科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-07-14 DOI: 10.1016/j.cageo.2025.105999
Kai Gao, Ting Chen
Geological fault detection and characterization are crucial for understanding subsurface dynamics across scales. While methods for fault delineation based on either seismicity location analysis or seismic image reflector discontinuity are well-established, a systematic approach that integrates both data types remains absent. We develop a novel machine learning model that unifies seismic reflector images and seismicity location information to automatically identify geological faults and characterize their geometrical properties. The model encodes a seismic image and a seismicity location image separately, and fuses the encoded features with a spatial-channel attention fusion module to improve the learning of important features in both inputs. We design an automated strategy to generate high-quality synthetic training data and labels. To improve the realism of the seismicity location image, we include random seismicity noise and missing seismicity location associated with some of the faults. We validate the model’s efficacy and accuracy using synthetic data examples and two field data examples. Moreover, we show that fine-tuning the trained model with a small, domain-specific dataset enhances its fidelity for field data applications. The results demonstrate that integrating seismicity location and seismic images into a unified framework allows the end-to-end neural network to achieve higher fidelity and accuracy in delineating subsurface faults and their geometrical properties compared with image-only fault detection methods. Our approach offers an adaptive data-driven tool for geological fault characterization and seismic hazard mitigation, bridging the gap between seismicity location and image-based fault detection methods.
地质断层检测和表征对于理解跨尺度的地下动力学至关重要。虽然基于地震活动定位分析或地震图像反射面不连续的断层描绘方法已经建立,但集成这两种数据类型的系统方法仍然缺乏。我们开发了一种新的机器学习模型,将地震反射图像和地震活动定位信息结合起来,自动识别地质断层并表征其几何性质。该模型分别对地震图像和地震活动定位图像进行编码,并利用空间通道注意力融合模块对编码后的特征进行融合,以提高对两个输入中重要特征的学习能力。我们设计了一个自动化的策略来生成高质量的合成训练数据和标签。为了提高地震活动性定位图像的真实感,我们加入了随机地震活动性噪声和与一些断层相关的地震活动性定位缺失。通过综合数据算例和两个现场数据算例验证了模型的有效性和准确性。此外,我们还表明,使用小的、特定领域的数据集对训练模型进行微调可以增强其对现场数据应用的保真度。结果表明,将地震活动定位和地震图像整合到一个统一的框架中,可以使端到端神经网络在圈定地下断层及其几何特征方面获得比仅图像检测方法更高的保真度和精度。我们的方法为地质断层表征和地震灾害减轻提供了自适应数据驱动工具,弥合了地震活动定位和基于图像的断层检测方法之间的差距。
{"title":"Seismicity-constrained fault detection and characterization with a multitask machine learning model","authors":"Kai Gao,&nbsp;Ting Chen","doi":"10.1016/j.cageo.2025.105999","DOIUrl":"10.1016/j.cageo.2025.105999","url":null,"abstract":"<div><div>Geological fault detection and characterization are crucial for understanding subsurface dynamics across scales. While methods for fault delineation based on either seismicity location analysis or seismic image reflector discontinuity are well-established, a systematic approach that integrates both data types remains absent. We develop a novel machine learning model that unifies seismic reflector images and seismicity location information to automatically identify geological faults and characterize their geometrical properties. The model encodes a seismic image and a seismicity location image separately, and fuses the encoded features with a spatial-channel attention fusion module to improve the learning of important features in both inputs. We design an automated strategy to generate high-quality synthetic training data and labels. To improve the realism of the seismicity location image, we include random seismicity noise and missing seismicity location associated with some of the faults. We validate the model’s efficacy and accuracy using synthetic data examples and two field data examples. Moreover, we show that fine-tuning the trained model with a small, domain-specific dataset enhances its fidelity for field data applications. The results demonstrate that integrating seismicity location and seismic images into a unified framework allows the end-to-end neural network to achieve higher fidelity and accuracy in delineating subsurface faults and their geometrical properties compared with image-only fault detection methods. Our approach offers an adaptive data-driven tool for geological fault characterization and seismic hazard mitigation, bridging the gap between seismicity location and image-based fault detection methods.</div></div>","PeriodicalId":55221,"journal":{"name":"Computers & Geosciences","volume":"205 ","pages":"Article 105999"},"PeriodicalIF":4.2,"publicationDate":"2025-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144672224","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Low-code framework for IoT data warehousing and visualization 物联网数据仓库和可视化的低代码框架
IF 4.2 2区 地球科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-07-11 DOI: 10.1016/j.cageo.2025.105998
Victor Lamas, Alejandro Cortiñas, Miguel R. Luaces

Background:

The Internet of Things has revolutionized data collection in geosciences through extensive sensor networks. However, developing web-based data warehousing systems for IoT data remains costly and complex. While studies address sensor variability and data ingestion architectures, they often overlook the critical data warehouse component needed to manage IoT data volume and variability. Additionally, Model-Driven Engineering techniques have been used to create dashboards for urban activities but lack advanced map-based visualizations, which are essential for geospatial data.

Objectives:

This study aims to address the challenges of creating IoT data warehouses for geosciences, encouraging scientists to share sensor data analysis results using a simple, user-friendly, and cost-effective approach.

Methods:

The proposed framework integrates (i) a Domain-Specific Language metamodel to define sensors, dimensions, and measurement parameters, (ii) a Software Product Line for IoT data warehouse creation, and (iii) a low-code platform with command-line and web interfaces. The approach was validated through four case studies: meteorological, traffic and air quality, coastal, and oceanic monitoring systems.

Results:

The framework enables efficient IoT data warehouse creation with customized spatial, temporal, and attribute aggregation. Case studies demonstrate adaptability across domains, supporting real-time data ingestion, sensor mobility, and advanced visualization.

Conclusion:

The study presents a scalable, user-friendly framework for IoT data warehousing in geosciences using SPL and DSL technologies, addressing domain-specific challenges and empowering non-expert users. Future work includes usability assessments and expansion to other domains.
背景:物联网通过广泛的传感器网络彻底改变了地球科学的数据收集。然而,为物联网数据开发基于web的数据仓库系统仍然成本高昂且复杂。虽然研究涉及传感器可变性和数据摄取架构,但它们往往忽略了管理物联网数据量和可变性所需的关键数据仓库组件。此外,模型驱动工程技术已用于为城市活动创建仪表板,但缺乏高级的基于地图的可视化,这对于地理空间数据至关重要。目的:本研究旨在解决为地球科学创建物联网数据仓库的挑战,鼓励科学家使用简单,用户友好且具有成本效益的方法共享传感器数据分析结果。方法:提出的框架集成了(i)一个领域特定语言元模型来定义传感器、尺寸和测量参数,(ii)一个用于物联网数据仓库创建的软件产品线,以及(iii)一个具有命令行和web界面的低代码平台。该方法通过气象、交通和空气质量、海岸和海洋监测系统四个案例研究得到了验证。结果:该框架通过自定义的空间、时间和属性聚合实现了高效的物联网数据仓库创建。案例研究展示了跨领域的适应性,支持实时数据摄取、传感器移动性和高级可视化。结论:该研究提出了一个可扩展的、用户友好的框架,用于使用SPL和DSL技术的地球科学物联网数据仓库,解决了特定领域的挑战,并为非专业用户提供了支持。未来的工作包括可用性评估和扩展到其他领域。
{"title":"Low-code framework for IoT data warehousing and visualization","authors":"Victor Lamas,&nbsp;Alejandro Cortiñas,&nbsp;Miguel R. Luaces","doi":"10.1016/j.cageo.2025.105998","DOIUrl":"10.1016/j.cageo.2025.105998","url":null,"abstract":"<div><h3>Background:</h3><div>The Internet of Things has revolutionized data collection in geosciences through extensive sensor networks. However, developing web-based data warehousing systems for IoT data remains costly and complex. While studies address sensor variability and data ingestion architectures, they often overlook the critical data warehouse component needed to manage IoT data volume and variability. Additionally, Model-Driven Engineering techniques have been used to create dashboards for urban activities but lack advanced map-based visualizations, which are essential for geospatial data.</div></div><div><h3>Objectives:</h3><div>This study aims to address the challenges of creating IoT data warehouses for geosciences, encouraging scientists to share sensor data analysis results using a simple, user-friendly, and cost-effective approach.</div></div><div><h3>Methods:</h3><div>The proposed framework integrates (i) a Domain-Specific Language metamodel to define sensors, dimensions, and measurement parameters, (ii) a Software Product Line for IoT data warehouse creation, and (iii) a low-code platform with command-line and web interfaces. The approach was validated through four case studies: meteorological, traffic and air quality, coastal, and oceanic monitoring systems.</div></div><div><h3>Results:</h3><div>The framework enables efficient IoT data warehouse creation with customized spatial, temporal, and attribute aggregation. Case studies demonstrate adaptability across domains, supporting real-time data ingestion, sensor mobility, and advanced visualization.</div></div><div><h3>Conclusion:</h3><div>The study presents a scalable, user-friendly framework for IoT data warehousing in geosciences using SPL and DSL technologies, addressing domain-specific challenges and empowering non-expert users. Future work includes usability assessments and expansion to other domains.</div></div>","PeriodicalId":55221,"journal":{"name":"Computers & Geosciences","volume":"205 ","pages":"Article 105998"},"PeriodicalIF":4.2,"publicationDate":"2025-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144632362","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Towards an open soil-plant digital twin based on STEMMUS-SCOPE model following open science 基于STEMMUS-SCOPE模型的开放土壤-植物数字孪生模型的研究
IF 4.2 2区 地球科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-07-11 DOI: 10.1016/j.cageo.2025.106013
Yijian Zeng , Fakhereh Alidoost , Bart Schilperoort , Yang Liu , Stefan Verhoeven , Meiert Willem Grootes , Yunfei Wang , Zengjing Song , Danyang Yu , Enting Tang , Qianqian Han , Lianyu Yu , Mostafa Gomaa Daoud , Prajwal Khanal , Yunfei Chen , Christiaan van der Tol , Raúl Zurita-Milla , Serkan Girgin , Bas Retsios , Niels Drost , Zhongbo Su
Droughts and heatwaves jeopardize terrestrial ecosystem services. The development of an open digital twin of the soil-plant system can help monitor and predict the impact of these extreme events on ecosystem functioning. We illustrate how our recently developed STEMMUS-SCOPE model—STEMMUS, Simultaneous Transfer of Energy, Mass and Momentum in Unsaturated Soil; SCOPE, Soil Canopy Observation of Photosynthesis and Energy fluxes—links soil-plant processes to novel satellite observables (e.g. solar-induced chlorophyll fluorescence), contributing to such a digital twin. This soil-plant digital twin allows a mechanistic window for tracking above- and below-ground ecophysiological processes with remote sensing observations. Following Open Science and FAIR (Findable, Accessible, Interoperable, Reusable) principles, both for data and research software, we present the building blocks of the soil-plant digital twin. It emphasizes the importance of FAIR-enabling digital technologies to translate research needs and developments into reproducible and reusable data, software and knowledge.
干旱和热浪危及陆地生态系统服务。开发一个开放的数字孪生土壤-植物系统可以帮助监测和预测这些极端事件对生态系统功能的影响。我们说明了我们最近开发的STEMMUS-SCOPE模型- stemmus,非饱和土壤中能量,质量和动量的同时传递;SCOPE,光合作用和能量通量的土壤冠层观测-将土壤-植物过程与新的卫星观测(例如太阳诱导的叶绿素荧光)联系起来,有助于实现这样的数字孪生。这种土壤-植物数字孪生体为利用遥感观测跟踪地上和地下的生态生理过程提供了一个机制窗口。遵循开放科学和公平(可查找,可访问,可互操作,可重用)原则,数据和研究软件,我们提出了土壤-植物数字双胞胎的构建模块。它强调了促进公平的数字技术将研究需求和发展转化为可复制和可重复使用的数据、软件和知识的重要性。
{"title":"Towards an open soil-plant digital twin based on STEMMUS-SCOPE model following open science","authors":"Yijian Zeng ,&nbsp;Fakhereh Alidoost ,&nbsp;Bart Schilperoort ,&nbsp;Yang Liu ,&nbsp;Stefan Verhoeven ,&nbsp;Meiert Willem Grootes ,&nbsp;Yunfei Wang ,&nbsp;Zengjing Song ,&nbsp;Danyang Yu ,&nbsp;Enting Tang ,&nbsp;Qianqian Han ,&nbsp;Lianyu Yu ,&nbsp;Mostafa Gomaa Daoud ,&nbsp;Prajwal Khanal ,&nbsp;Yunfei Chen ,&nbsp;Christiaan van der Tol ,&nbsp;Raúl Zurita-Milla ,&nbsp;Serkan Girgin ,&nbsp;Bas Retsios ,&nbsp;Niels Drost ,&nbsp;Zhongbo Su","doi":"10.1016/j.cageo.2025.106013","DOIUrl":"10.1016/j.cageo.2025.106013","url":null,"abstract":"<div><div>Droughts and heatwaves jeopardize terrestrial ecosystem services. The development of an open digital twin of the soil-plant system can help monitor and predict the impact of these extreme events on ecosystem functioning. We illustrate how our recently developed STEMMUS-SCOPE model—STEMMUS, Simultaneous Transfer of Energy, Mass and Momentum in Unsaturated Soil; SCOPE, Soil Canopy Observation of Photosynthesis and Energy fluxes—links soil-plant processes to novel satellite observables (e.g. solar-induced chlorophyll fluorescence), contributing to such a digital twin. This soil-plant digital twin allows a mechanistic window for tracking above- and below-ground ecophysiological processes with remote sensing observations. Following Open Science and FAIR (Findable, Accessible, Interoperable, Reusable) principles, both for data and research software, we present the building blocks of the soil-plant digital twin. It emphasizes the importance of FAIR-enabling digital technologies to translate research needs and developments into reproducible and reusable data, software and knowledge.</div></div>","PeriodicalId":55221,"journal":{"name":"Computers & Geosciences","volume":"205 ","pages":"Article 106013"},"PeriodicalIF":4.2,"publicationDate":"2025-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144686901","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DAM-CGNet: Semantic segmentation-based approach for valley-bottom extraction from digital elevation models 基于语义分割的数字高程模型谷底提取方法
IF 4.2 2区 地球科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-07-11 DOI: 10.1016/j.cageo.2025.106012
Yuhan Ren, Hongming Zhang, Liang Dong, Huanyu Yang, Hongyi Li, Lu Du, Qiankun Chen, Songyuan Li
The accurate extraction of valley bottoms from digital elevation models (DEMs) is crucial for hydrological and geomorphological analyses of mountainous landscapes. However, threshold settings rely on manual intervention; roads near valley bottoms resemble valley-bottom features, and thresholds cannot effectively adapt to valleys of various shapes, leading to low extraction accuracy in existing methods, particularly in narrow V-shaped valleys. To address these issues, this study developed a semantic segmentation approach called a Dense-based Attention Merging Context Guided Network (DAM-CGNet). Without relying on thresholds, this method effectively excludes roads on hillslopes and enhances the recognition of steep feature changes at valley boundaries, enabling the extraction of valley bottoms of various shapes. Key improvements include: (1) incorporating the convolutional block attention module (CBAM) to enhance feature reuse in the information flow, employing attention mechanisms to suppress irrelevant feature responses and focus on valley boundary features; (2) using the dense connection strategy of DenseNet to rebuild the feature flow, helping the model keep important valley-bottom details in deep layers and better recognize small and narrow valleys; and (3) addressing the limitations of single-channel DEM representation by evaluating various input combinations, ultimately selecting DEM, topographic position index (TPI), and slope as effective inputs for valley-bottom extraction. Experiments using semantic segmentation models and conventional methods validated the effectiveness of the proposed method. Specifically, DAM-CGNet achieved high accuracy on the test set (MPA: 90.15 %, MIoU: 84.18 %, FWIoU: 92.99 %) and outperformed conventional methods in extracting valley bottoms of various shapes. This method, without a manual threshold setting as in conventional approaches, enhances valley bottom extraction precision and provides a new perspective for subsequent valley bottom width calculations.
从数字高程模型(dem)中准确提取山谷底部对山地景观的水文和地貌分析至关重要。然而,阈值设置依赖于人工干预;靠近谷底的道路与谷底特征相似,阈值不能有效适应各种形状的山谷,导致现有方法的提取精度较低,特别是在窄v型山谷中。为了解决这些问题,本研究开发了一种语义分割方法,称为基于密集的注意合并上下文引导网络(DAM-CGNet)。该方法在不依赖阈值的情况下,有效地排除了山坡上的道路,增强了对山谷边界陡峭特征变化的识别,能够提取各种形状的谷底。主要改进包括:(1)引入卷积块注意模块(CBAM)增强信息流中的特征重用,采用注意机制抑制无关特征响应,关注谷边界特征;(2)利用DenseNet的密集连接策略重建特征流,帮助模型在深层保留重要的谷底细节,更好地识别小而窄的谷;(3)通过评估各种输入组合来解决单通道DEM表示的局限性,最终选择DEM、地形位置指数(TPI)和坡度作为提取谷底的有效输入。使用语义分割模型和传统方法进行的实验验证了该方法的有效性。具体而言,DAM-CGNet在测试集上取得了较高的准确率(MPA: 90.15%, MIoU: 84.18%, FWIoU: 92.99%),在提取各种形状的谷底方面优于传统方法。该方法不需要像传统方法那样手动设置阈值,提高了谷底提取精度,为后续谷底宽度计算提供了新的视角。
{"title":"DAM-CGNet: Semantic segmentation-based approach for valley-bottom extraction from digital elevation models","authors":"Yuhan Ren,&nbsp;Hongming Zhang,&nbsp;Liang Dong,&nbsp;Huanyu Yang,&nbsp;Hongyi Li,&nbsp;Lu Du,&nbsp;Qiankun Chen,&nbsp;Songyuan Li","doi":"10.1016/j.cageo.2025.106012","DOIUrl":"10.1016/j.cageo.2025.106012","url":null,"abstract":"<div><div>The accurate extraction of valley bottoms from digital elevation models (DEMs) is crucial for hydrological and geomorphological analyses of mountainous landscapes. However, threshold settings rely on manual intervention; roads near valley bottoms resemble valley-bottom features, and thresholds cannot effectively adapt to valleys of various shapes, leading to low extraction accuracy in existing methods, particularly in narrow V-shaped valleys. To address these issues, this study developed a semantic segmentation approach called a Dense-based Attention Merging Context Guided Network (DAM-CGNet). Without relying on thresholds, this method effectively excludes roads on hillslopes and enhances the recognition of steep feature changes at valley boundaries, enabling the extraction of valley bottoms of various shapes. Key improvements include: (1) incorporating the convolutional block attention module (CBAM) to enhance feature reuse in the information flow, employing attention mechanisms to suppress irrelevant feature responses and focus on valley boundary features; (2) using the dense connection strategy of DenseNet to rebuild the feature flow, helping the model keep important valley-bottom details in deep layers and better recognize small and narrow valleys; and (3) addressing the limitations of single-channel DEM representation by evaluating various input combinations, ultimately selecting DEM, topographic position index (TPI), and slope as effective inputs for valley-bottom extraction. Experiments using semantic segmentation models and conventional methods validated the effectiveness of the proposed method. Specifically, DAM-CGNet achieved high accuracy on the test set (MPA: 90.15 %, MIoU: 84.18 %, FWIoU: 92.99 %) and outperformed conventional methods in extracting valley bottoms of various shapes. This method, without a manual threshold setting as in conventional approaches, enhances valley bottom extraction precision and provides a new perspective for subsequent valley bottom width calculations.</div></div>","PeriodicalId":55221,"journal":{"name":"Computers & Geosciences","volume":"205 ","pages":"Article 106012"},"PeriodicalIF":4.2,"publicationDate":"2025-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144672332","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Uncertainty-aware methods for enhancing rainfall prediction with deep-learning based post-processing segmentation 基于深度学习的后处理分割增强降雨预测的不确定性感知方法
IF 4.2 2区 地球科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-07-11 DOI: 10.1016/j.cageo.2025.105992
Simone Monaco , Luca Monaco , Daniele Apiletti , Roberto Cremonini , Secondo Barbero
Precipitation forecast is critical in flood management, agricultural planning, water resource allocation, and weather warnings. Despite significant advancements in Numerical Weather Prediction (NWP) models, these systems often exhibit substantial biases and errors, particularly at high spatial and temporal resolutions. To address these limitations, we develop and evaluate uncertainty-aware deep learning ensemble architectures, focusing on characterizing forecast uncertainties while achieving high accuracy and an optimal balance between sharpness and reliability. This study presents SDE U-Net, a novel adaptation of SDE-Net designed specifically for segmentation tasks in precipitation forecasting. We conduct a comprehensive evaluation of state-of-the-art ensemble architectures, including SDE U-Net, and compare their forecast uncertainty against that of a Poor Man’s Ensemble (PME, i.e. NWPs forecast average) across diverse meteorological conditions, ranging from non-intense precipitation patterns to intense weather events. As an example case, we focus on predicting daily cumulative precipitation in northwest Italy, though our approach is broadly generalizable. Our findings demonstrate that all the evaluated probabilistic deep learning models outperform the PME benchmark in terms of median RMSE for both non-intense and intense precipitation events. Among them, SDE U-Net achieves the best overall performance, delivering the lowest RMSE for intense events (2.637×102) and demonstrating a more stable error distribution compared to other models. For non-intense events, SDE U-Net perform comparably to other deep learning models, still notably surpassing the baselines. Moreover, SDE U-Net effectively balances sharpness and reliability, making it particularly suitable for operational forecasting of extreme weather. Integrating uncertainty-aware models like SDE U-Net into forecasting workflows can enhance decision-making and preparedness for weather-related hazards.
降水预报在洪水管理、农业规划、水资源分配和天气预警中至关重要。尽管数值天气预报(NWP)模式取得了重大进展,但这些系统经常表现出严重的偏差和错误,特别是在高空间和时间分辨率下。为了解决这些限制,我们开发并评估了不确定性感知深度学习集成架构,重点是在实现高精度和锐度与可靠性之间的最佳平衡的同时表征预测不确定性。本研究提出了SDE U-Net,这是一种专门为降水预报中的分割任务而设计的SDE- net的新改编。我们对包括SDE U-Net在内的最先进的集合体系结构进行了全面评估,并将其预测不确定性与穷人集合(PME,即NWPs预测平均值)在不同气象条件下(从非强烈降水模式到强烈天气事件)的预测不确定性进行了比较。作为一个例子,我们专注于预测意大利西北部的日累积降水量,尽管我们的方法是广泛推广的。我们的研究结果表明,就非强烈和强烈降水事件的中位数RMSE而言,所有评估的概率深度学习模型都优于PME基准。其中,SDE U-Net实现了最佳的综合性能,对强烈事件提供了最低的RMSE (2.637×10−2),并且与其他模型相比显示出更稳定的误差分布。对于非激烈事件,SDE U-Net的表现与其他深度学习模型相当,仍然明显超过基线。此外,SDE U-Net有效地平衡了清晰度和可靠性,使其特别适合极端天气的业务预报。将像SDE U-Net这样的不确定性感知模型集成到预报工作流程中,可以加强对天气相关灾害的决策和准备。
{"title":"Uncertainty-aware methods for enhancing rainfall prediction with deep-learning based post-processing segmentation","authors":"Simone Monaco ,&nbsp;Luca Monaco ,&nbsp;Daniele Apiletti ,&nbsp;Roberto Cremonini ,&nbsp;Secondo Barbero","doi":"10.1016/j.cageo.2025.105992","DOIUrl":"10.1016/j.cageo.2025.105992","url":null,"abstract":"<div><div>Precipitation forecast is critical in flood management, agricultural planning, water resource allocation, and weather warnings. Despite significant advancements in Numerical Weather Prediction (NWP) models, these systems often exhibit substantial biases and errors, particularly at high spatial and temporal resolutions. To address these limitations, we develop and evaluate uncertainty-aware deep learning ensemble architectures, focusing on characterizing forecast uncertainties while achieving high accuracy and an optimal balance between sharpness and reliability. This study presents SDE U-Net, a novel adaptation of SDE-Net designed specifically for segmentation tasks in precipitation forecasting. We conduct a comprehensive evaluation of state-of-the-art ensemble architectures, including SDE U-Net, and compare their forecast uncertainty against that of a Poor Man’s Ensemble (PME, i.e. NWPs forecast average) across diverse meteorological conditions, ranging from non-intense precipitation patterns to intense weather events. As an example case, we focus on predicting daily cumulative precipitation in northwest Italy, though our approach is broadly generalizable. Our findings demonstrate that all the evaluated probabilistic deep learning models outperform the PME benchmark in terms of median RMSE for both non-intense and intense precipitation events. Among them, SDE U-Net achieves the best overall performance, delivering the lowest RMSE for intense events (<span><math><mrow><mn>2</mn><mo>.</mo><mn>637</mn><mo>×</mo><mn>1</mn><msup><mrow><mn>0</mn></mrow><mrow><mo>−</mo><mn>2</mn></mrow></msup></mrow></math></span>) and demonstrating a more stable error distribution compared to other models. For non-intense events, SDE U-Net perform comparably to other deep learning models, still notably surpassing the baselines. Moreover, SDE U-Net effectively balances sharpness and reliability, making it particularly suitable for operational forecasting of extreme weather. Integrating uncertainty-aware models like SDE U-Net into forecasting workflows can enhance decision-making and preparedness for weather-related hazards.</div></div>","PeriodicalId":55221,"journal":{"name":"Computers & Geosciences","volume":"205 ","pages":"Article 105992"},"PeriodicalIF":4.2,"publicationDate":"2025-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144614778","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Computers & Geosciences
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1