首页 > 最新文献

Computers & Geosciences最新文献

英文 中文
Fast ground penetrating radar dual-parameter full waveform inversion method accelerated by hybrid compilation of CUDA kernel function and PyTorch 基于CUDA内核函数和PyTorch混合编译的快速探地雷达双参数全波形反演方法
IF 4.4 2区 地球科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-01-07 DOI: 10.1016/j.cageo.2025.106101
Lei Liu, Chao Song, Liangsheng He, Silin Wang, Xuan Feng, Cai Liu
This study presents a fast and flexible three-dimensional dual-parameter full waveform inversion (FWI) framework for ground penetrating radar (GPR), enabled by a hybrid compilation strategy that integrates custom CUDA kernel functions with the PyTorch automatic differentiation ecosystem. In the proposed workflow, computationally intensive operations are executed by highly optimized CUDA kernels, while PyTorch is employed only for lightweight tasks. This selective integration substantially reduces memory usage and avoids the runtime bottlenecks often encountered in GPR FWI, achieving an effective balance between efficiency and algorithmic adaptability. The framework supports simultaneous inversion of relative permittivity and electrical conductivity in large-scale 3D domains, providing a practical solution for multi-parameter GPR imaging. Its modular Python-based architecture further allows users to easily customize loss functions, regularization schemes, and optimization settings without modifying code, making the method suitable for rapid prototyping and methodological development. Numerical experiments on 2D and 3D models demonstrate excellent scalability and stable reconstruction performance, while a field-data example confirms that the method can reliably detect subsurface anomalies even under challenging zero-offset acquisition conditions. Overall, the proposed CUDA-PyTorch hybrid framework advances the state of the art in GPR FWI by combining high-performance GPU computing with the flexibility of modern deep-learning toolchains, offering a practical and extensible platform for future GPR FWI research.
本研究提出了一种用于探地雷达(GPR)的快速灵活的三维双参数全波形反演(FWI)框架,该框架通过混合编译策略实现,该策略将自定义CUDA内核函数与PyTorch自动分化生态系统集成在一起。在提议的工作流中,计算密集型操作由高度优化的CUDA内核执行,而PyTorch仅用于轻量级任务。这种选择性集成大大减少了内存使用,避免了GPR FWI中经常遇到的运行时瓶颈,实现了效率和算法适应性之间的有效平衡。该框架支持大规模三维域相对介电常数和电导率的同时反演,为多参数探地雷达成像提供了实用的解决方案。其基于python的模块化架构进一步允许用户轻松定制损失函数、正则化方案和优化设置,而无需修改代码,使该方法适合快速原型和方法开发。在2D和3D模型上进行的数值实验表明,该方法具有出色的可扩展性和稳定的重建性能,而现场数据实例证实,即使在具有挑战性的零偏移采集条件下,该方法也可以可靠地检测地下异常。总体而言,提出的CUDA-PyTorch混合框架通过将高性能GPU计算与现代深度学习工具链的灵活性相结合,推进了GPR FWI的最新技术,为未来的GPR FWI研究提供了实用且可扩展的平台。
{"title":"Fast ground penetrating radar dual-parameter full waveform inversion method accelerated by hybrid compilation of CUDA kernel function and PyTorch","authors":"Lei Liu,&nbsp;Chao Song,&nbsp;Liangsheng He,&nbsp;Silin Wang,&nbsp;Xuan Feng,&nbsp;Cai Liu","doi":"10.1016/j.cageo.2025.106101","DOIUrl":"10.1016/j.cageo.2025.106101","url":null,"abstract":"<div><div>This study presents a fast and flexible three-dimensional dual-parameter full waveform inversion (FWI) framework for ground penetrating radar (GPR), enabled by a hybrid compilation strategy that integrates custom CUDA kernel functions with the PyTorch automatic differentiation ecosystem. In the proposed workflow, computationally intensive operations are executed by highly optimized CUDA kernels, while PyTorch is employed only for lightweight tasks. This selective integration substantially reduces memory usage and avoids the runtime bottlenecks often encountered in GPR FWI, achieving an effective balance between efficiency and algorithmic adaptability. The framework supports simultaneous inversion of relative permittivity and electrical conductivity in large-scale 3D domains, providing a practical solution for multi-parameter GPR imaging. Its modular Python-based architecture further allows users to easily customize loss functions, regularization schemes, and optimization settings without modifying code, making the method suitable for rapid prototyping and methodological development. Numerical experiments on 2D and 3D models demonstrate excellent scalability and stable reconstruction performance, while a field-data example confirms that the method can reliably detect subsurface anomalies even under challenging zero-offset acquisition conditions. Overall, the proposed CUDA-PyTorch hybrid framework advances the state of the art in GPR FWI by combining high-performance GPU computing with the flexibility of modern deep-learning toolchains, offering a practical and extensible platform for future GPR FWI research.</div></div>","PeriodicalId":55221,"journal":{"name":"Computers & Geosciences","volume":"209 ","pages":"Article 106101"},"PeriodicalIF":4.4,"publicationDate":"2026-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146038649","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Adaptive variational fusion of multi-source lunar digital elevation models based on curvature regularization 基于曲率正则化的多源月球数字高程模型自适应变分融合
IF 4.4 2区 地球科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-01-06 DOI: 10.1016/j.cageo.2026.106107
Siyi Qiu , Zhen Ye , Yusheng Xu , Jia Qian , Rong Huang , Tao Tao , Huan Xie , Xiaohua Tong
A good lunar Digital Elevation Model (DEM) is critical for successful lunar landings and exploration missions, providing essential terrain data for landing site selection and mission planning. However, existing lunar DEMs face significant limitations, such as insufficient resolution, incomplete data coverage, and errors arising from observational and processing methods. These limitations hinder their effectiveness in high-precision terrain analysis and practical applications. To address these challenges, this study proposes an adaptive regularized variational model with curvature smoothing constraints, aiming to robustly fuse multi-source DEMs with varying resolutions, noise levels, and data voids. The proposed model incorporates two key innovations: a weighted data fidelity term that dynamically adjusts based on slope anomaly detection and ensures accuracy consistency across multi-scale DEMs, and a curvature-constrained total variation regularization term that suppresses noise and artifacts while preserving terrain details. The variational problem is solved iteratively using the Alternating Direction Method of Multipliers, enabling the seamless integration of multi-source DEMs. Experimental validations on simulated and real lunar south pole datasets demonstrate the superior performance of the proposed method, achieving significant improvements in noise suppression, artifact removal, void filling, and terrain detail preservation. The results indicate that the proposed method is capable of generating high-quality, seamless DEMs with enhanced spatial consistency and accuracy.
一个好的月球数字高程模型(DEM)是成功的月球着陆和探测任务的关键,为着陆点选择和任务规划提供必要的地形数据。然而,现有的月球dem面临着明显的局限性,如分辨率不足、数据覆盖不完整以及观测和处理方法引起的误差。这些限制阻碍了它们在高精度地形分析和实际应用中的有效性。为了解决这些挑战,本研究提出了一种具有曲率平滑约束的自适应正则化变分模型,旨在鲁棒融合具有不同分辨率、噪声水平和数据空洞的多源dem。该模型包含两个关键创新:基于坡度异常检测动态调整的加权数据保真度项,确保跨多尺度dem的精度一致性;曲率约束的总变差正则化项,在保留地形细节的同时抑制噪声和伪影。采用乘法器交替方向法迭代求解变分问题,实现了多源dem的无缝集成。在模拟和真实月球南极数据集上的实验验证表明,该方法在噪声抑制、伪影去除、空隙填充和地形细节保存等方面取得了显著的进步。结果表明,该方法能够生成高质量、无缝的dem,并提高了空间一致性和精度。
{"title":"Adaptive variational fusion of multi-source lunar digital elevation models based on curvature regularization","authors":"Siyi Qiu ,&nbsp;Zhen Ye ,&nbsp;Yusheng Xu ,&nbsp;Jia Qian ,&nbsp;Rong Huang ,&nbsp;Tao Tao ,&nbsp;Huan Xie ,&nbsp;Xiaohua Tong","doi":"10.1016/j.cageo.2026.106107","DOIUrl":"10.1016/j.cageo.2026.106107","url":null,"abstract":"<div><div>A good lunar Digital Elevation Model (DEM) is critical for successful lunar landings and exploration missions, providing essential terrain data for landing site selection and mission planning. However, existing lunar DEMs face significant limitations, such as insufficient resolution, incomplete data coverage, and errors arising from observational and processing methods. These limitations hinder their effectiveness in high-precision terrain analysis and practical applications. To address these challenges, this study proposes an adaptive regularized variational model with curvature smoothing constraints, aiming to robustly fuse multi-source DEMs with varying resolutions, noise levels, and data voids. The proposed model incorporates two key innovations: a weighted data fidelity term that dynamically adjusts based on slope anomaly detection and ensures accuracy consistency across multi-scale DEMs, and a curvature-constrained total variation regularization term that suppresses noise and artifacts while preserving terrain details. The variational problem is solved iteratively using the Alternating Direction Method of Multipliers, enabling the seamless integration of multi-source DEMs. Experimental validations on simulated and real lunar south pole datasets demonstrate the superior performance of the proposed method, achieving significant improvements in noise suppression, artifact removal, void filling, and terrain detail preservation. The results indicate that the proposed method is capable of generating high-quality, seamless DEMs with enhanced spatial consistency and accuracy.</div></div>","PeriodicalId":55221,"journal":{"name":"Computers & Geosciences","volume":"209 ","pages":"Article 106107"},"PeriodicalIF":4.4,"publicationDate":"2026-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145980490","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep-learning-based hybrid model with iterative lithology constraints for the enhanced prediction of missing well-logs 基于迭代岩性约束的深度学习混合模型增强缺失测井预测
IF 4.4 2区 地球科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-01-05 DOI: 10.1016/j.cageo.2026.106106
Jaesung Park , Jina Jeong , Eun-Jung Holden
Accurate reconstruction of missing well-log data is essential for subsurface characterization and reservoir modeling but remains challenging under conditions of stratigraphic heterogeneity and multi-log incompleteness. This study introduces a deep learning framework that enhance missing log prediction by embedding lithological information as a contextual constraint. The proposed framework integrates a Conditional Variational Autoencoder (CVAE) with a Long Short-Term Memory (LSTM)-based lithology predictor, namely the Iterative Lithology-Constrained Hybrid CVAE–LSTM Network (ILCH-Net), in an iterative refinement process. The model was trained and validated on 45,809 samples from six wells in the Volve oil field, Norwegian North Sea, comprising five commonly acquired logs (GR, RHOB, NPHI, DTC, DTS) across three lithologies (claystone, sandstone, limestone). Quantitative evaluation demonstrates that ILCH-Net surpasses baseline approaches (Autoencoder, Iteratively refined autoencoder, LSTM), achieving lower root mean squared error (10.43 vs. 12.84 for LSTM) and improved distributional similarity (median Kolmogorov–Smirnov statistic of 0.15 with an interquartile range of 0.09 across the six test wells). Lithology-specific analysis further shows that reconstruction accuracy is highest for limestone and claystone, reflecting their distinct well-log responses, while sandstone exhibits greater variability due to depth-dependent compaction effects. These results confirm that lithological constraints not only enhance accuracy but also reduce inter-well variability, thereby yielding geologically consistent reconstructions. By embedding geological priors within a data-driven framework, ILCH-Net provides a robust and scalable solution for applications in reservoir characterization, digital rock modeling, and geomechanical analysis where incomplete or irregular logs are prevalent.
准确重建缺失的测井数据对于地下表征和储层建模至关重要,但在地层非均质性和多测井不完整的情况下仍然具有挑战性。本研究引入了一个深度学习框架,通过嵌入岩性信息作为上下文约束来增强缺失测井预测。该框架将条件变分自编码器(CVAE)与基于长短期记忆(LSTM)的岩性预测器(即迭代岩性约束混合CVAE - LSTM网络(ILCH-Net))集成在迭代改进过程中。该模型在挪威北海Volve油田6口井的45809个样本上进行了训练和验证,包括5种常用测井(GR、RHOB、NPHI、DTC、DTS),涵盖3种岩性(粘土岩、砂岩、石灰岩)。定量评价表明,ILCH-Net优于基线方法(Autoencoder、iterative refined Autoencoder、LSTM),实现了更低的均方根误差(10.43 vs. 12.84 LSTM),并改善了分布相似性(6口测试井的Kolmogorov-Smirnov统计量中位数为0.15,四分位数区间为0.09)。岩性分析进一步表明,石灰岩和粘土岩的重建精度最高,反映了它们独特的测井响应,而砂岩由于深度相关的压实作用而表现出更大的变异性。这些结果证实,岩性约束不仅提高了精度,还减少了井间的变异性,从而产生了地质一致性的重建。通过将地质先验嵌入到数据驱动的框架中,ILCH-Net为储层表征、数字岩石建模和地质力学分析等应用提供了强大且可扩展的解决方案,这些应用普遍存在测井不完整或不规则的情况。
{"title":"Deep-learning-based hybrid model with iterative lithology constraints for the enhanced prediction of missing well-logs","authors":"Jaesung Park ,&nbsp;Jina Jeong ,&nbsp;Eun-Jung Holden","doi":"10.1016/j.cageo.2026.106106","DOIUrl":"10.1016/j.cageo.2026.106106","url":null,"abstract":"<div><div>Accurate reconstruction of missing well-log data is essential for subsurface characterization and reservoir modeling but remains challenging under conditions of stratigraphic heterogeneity and multi-log incompleteness. This study introduces a deep learning framework that enhance missing log prediction by embedding lithological information as a contextual constraint. The proposed framework integrates a Conditional Variational Autoencoder (CVAE) with a Long Short-Term Memory (LSTM)-based lithology predictor, namely the Iterative Lithology-Constrained Hybrid CVAE–LSTM Network (ILCH-Net), in an iterative refinement process. The model was trained and validated on 45,809 samples from six wells in the Volve oil field, Norwegian North Sea, comprising five commonly acquired logs (GR, RHOB, NPHI, DTC, DTS) across three lithologies (claystone, sandstone, limestone). Quantitative evaluation demonstrates that ILCH-Net surpasses baseline approaches (Autoencoder, Iteratively refined autoencoder, LSTM), achieving lower root mean squared error (10.43 vs. 12.84 for LSTM) and improved distributional similarity (median Kolmogorov–Smirnov statistic of 0.15 with an interquartile range of 0.09 across the six test wells). Lithology-specific analysis further shows that reconstruction accuracy is highest for limestone and claystone, reflecting their distinct well-log responses, while sandstone exhibits greater variability due to depth-dependent compaction effects. These results confirm that lithological constraints not only enhance accuracy but also reduce inter-well variability, thereby yielding geologically consistent reconstructions. By embedding geological priors within a data-driven framework, ILCH-Net provides a robust and scalable solution for applications in reservoir characterization, digital rock modeling, and geomechanical analysis where incomplete or irregular logs are prevalent.</div></div>","PeriodicalId":55221,"journal":{"name":"Computers & Geosciences","volume":"209 ","pages":"Article 106106"},"PeriodicalIF":4.4,"publicationDate":"2026-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145928790","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
EWoPe: A light Embeddable WOrkflow PErsistence tool for geoscientific pipeline reproducibility EWoPe:用于地球科学管道再现的轻量级可嵌入工作流持久性工具
IF 4.4 2区 地球科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-01-03 DOI: 10.1016/j.cageo.2025.106099
Marianna Miola , Daniela Cabiddu , Simone Pittaluga , Micaela Raviola , Marino Vetuschi Zuccolini
Scientific workflows are essential in modern geoscientific research, where complex, multi-stage computational pipelines are used to analyze heterogeneous environmental data. Ensuring the reproducibility and traceability of these workflows is critical but often challenging due to their intricacy and evolving structure. We introduce EWoPe (Embeddable Workflow Persistence), a lightweight and embeddable methodology and C++ library designed to persist and reconstruct scientific workflows over time. Unlike existing workflow systems and provenance tools, our method adds minimal overhead to workflow execution: it does not automate or optimize processes, but instead ensures persistence through lightweight and structured metadata. EWoPe models workflows as directed acyclic graphs (DAGs), in which each data node is linked to computational tasks through metadata. The latter captures input–output dependencies, algorithm parameters, execution commands, and intermediate results, supporting full traceability and reproducibility of computational histories. EWoPe offers dual usability: as a standalone command-line tool or as an embeddable component within larger applications. We show its flexibility and applicability through a case study involving a complex workflow leading to subsurface reaction-transport modeling, starting from boreholes data. The EWoPe library is publicly available and designed to be extensible, making it suitable for a wide range of scientific domains, including geochemistry, geophysics, environmental engineering, and any other fields where transparency and data integrity are critical.
科学工作流程在现代地球科学研究中是必不可少的,在现代地球科学研究中,复杂的、多阶段的计算管道被用来分析异构环境数据。确保这些工作流的可再现性和可追溯性至关重要,但由于其复杂性和不断发展的结构,这往往具有挑战性。我们介绍了EWoPe(可嵌入工作流持久性),这是一种轻量级的可嵌入方法和c++库,旨在随着时间的推移持久化和重构科学工作流。与现有的工作流系统和来源工具不同,我们的方法为工作流的执行增加了最小的开销:它不自动化或优化流程,而是通过轻量级和结构化的元数据确保持久性。EWoPe将工作流建模为有向无环图(dag),其中每个数据节点通过元数据链接到计算任务。后者捕获输入-输出依赖关系、算法参数、执行命令和中间结果,支持计算历史的完全可跟踪性和可再现性。EWoPe提供了双重可用性:作为独立的命令行工具或作为大型应用程序中的可嵌入组件。我们通过一个案例研究展示了它的灵活性和适用性,该案例研究涉及一个复杂的工作流程,从钻孔数据开始,导致地下反应传输建模。EWoPe库是公开可用的,设计为可扩展的,使其适用于广泛的科学领域,包括地球化学,地球物理,环境工程,以及其他透明度和数据完整性至关重要的领域。
{"title":"EWoPe: A light Embeddable WOrkflow PErsistence tool for geoscientific pipeline reproducibility","authors":"Marianna Miola ,&nbsp;Daniela Cabiddu ,&nbsp;Simone Pittaluga ,&nbsp;Micaela Raviola ,&nbsp;Marino Vetuschi Zuccolini","doi":"10.1016/j.cageo.2025.106099","DOIUrl":"10.1016/j.cageo.2025.106099","url":null,"abstract":"<div><div>Scientific workflows are essential in modern geoscientific research, where complex, multi-stage computational pipelines are used to analyze heterogeneous environmental data. Ensuring the reproducibility and traceability of these workflows is critical but often challenging due to their intricacy and evolving structure. We introduce <span>EWoPe</span> (Embeddable Workflow Persistence), a lightweight and embeddable methodology and C++ library designed to persist and reconstruct scientific workflows over time. Unlike existing workflow systems and provenance tools, our method adds minimal overhead to workflow execution: it does not automate or optimize processes, but instead ensures persistence through lightweight and structured metadata. <span>EWoPe</span> models workflows as directed acyclic graphs (DAGs), in which each data node is linked to computational tasks through metadata. The latter captures input–output dependencies, algorithm parameters, execution commands, and intermediate results, supporting full traceability and reproducibility of computational histories. <span>EWoPe</span> offers dual usability: as a standalone command-line tool or as an embeddable component within larger applications. We show its flexibility and applicability through a case study involving a complex workflow leading to subsurface reaction-transport modeling, starting from boreholes data. The <span>EWoPe</span> library is publicly available and designed to be extensible, making it suitable for a wide range of scientific domains, including geochemistry, geophysics, environmental engineering, and any other fields where transparency and data integrity are critical.</div></div>","PeriodicalId":55221,"journal":{"name":"Computers & Geosciences","volume":"209 ","pages":"Article 106099"},"PeriodicalIF":4.4,"publicationDate":"2026-01-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145908904","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
HSDL: A novel and practical method to refine automatic earthquake catalog using hybrid shallow and deep learning HSDL:一种新颖实用的基于浅层和深度混合学习的自动地震目录细化方法
IF 4.4 2区 地球科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-01-03 DOI: 10.1016/j.cageo.2025.106103
Daniel Siervo, Yangkang Chen
Most earthquake detection workflows are based on an optimized short-term-average/long-term-average (STA/LTA) ratio, especially in regions with relatively sparse station geometry and poor velocity models. As the magnitude threshold is lowered to enable more complete earthquake analysis, more false alarms are occurring daily in earthquake monitoring. Here, we propose a high-fidelity approach, called hybrid shallow and deep learning (HSDL), to automatically classify potential earthquakes detected by an optimized STA/LTA workflow as true positives or false positives. To facilitate classification, we leverage an advanced deep learning phase picker, the earthquake compact convolutional transformer (EQCCT), which provides several classification features. These features include the counts of P&S picks, the average, minimum, maximum, and standard deviation of P&S probabilities, and the S/P pick count ratios. On a moderate dataset containing 200 real earthquakes and 200 fake earthquake waveforms, we achieve 100% accuracy across all metrics for both the random forest and XGBoost methods. On a larger dataset of 1500 events, we still achieve a precision of 1.0, a recall above 0.99, and an F1 score above 0.99 for both the random forest and XGBoost methods, with XGBoost achieving slightly higher accuracy. We also analyzed the feature importance and found that the maximum S-pick probability and the S/P pick count ratio play the most critical roles in classification. The proposed method provides a highly effective and efficient approach for fine-tuning the automatic earthquake catalog using the optimized STA/LTA method, leveraging existing tools such as the deep-learning-based phase picker and XGBoost.
大多数地震探测工作流程都是基于优化的短期平均/长期平均(STA/LTA)比率,特别是在台站几何结构相对稀疏和速度模型较差的地区。随着震级阈值的降低,使地震分析更加完整,地震监测中每天都有更多的误报发生。在这里,我们提出了一种高保真度的方法,称为混合浅层和深度学习(HSDL),用于自动将优化的STA/LTA工作流检测到的潜在地震分类为真阳性或假阳性。为了便于分类,我们利用了一种先进的深度学习相位选择器,地震紧凑型卷积变压器(EQCCT),它提供了几个分类特征。这些特征包括P&;S选择的计数,P&;S概率的平均值,最小值,最大值和标准差,以及S/P选择计数比率。在一个包含200次真实地震和200次假地震波形的中等数据集上,我们在随机森林和XGBoost方法的所有指标上都实现了100%的准确性。在包含1500个事件的更大的数据集上,随机森林和XGBoost方法的精度仍然达到1.0,召回率高于0.99,F1分数高于0.99,XGBoost方法的精度略高。我们还分析了特征重要性,发现最大S-pick概率和S/P pick计数比在分类中起着最关键的作用。该方法利用基于深度学习的相位选择器和XGBoost等现有工具,利用优化的STA/LTA方法对自动地震目录进行微调,提供了一种高效的方法。
{"title":"HSDL: A novel and practical method to refine automatic earthquake catalog using hybrid shallow and deep learning","authors":"Daniel Siervo,&nbsp;Yangkang Chen","doi":"10.1016/j.cageo.2025.106103","DOIUrl":"10.1016/j.cageo.2025.106103","url":null,"abstract":"<div><div>Most earthquake detection workflows are based on an optimized short-term-average/long-term-average (STA/LTA) ratio, especially in regions with relatively sparse station geometry and poor velocity models. As the magnitude threshold is lowered to enable more complete earthquake analysis, more false alarms are occurring daily in earthquake monitoring. Here, we propose a high-fidelity approach, called hybrid shallow and deep learning (HSDL), to automatically classify potential earthquakes detected by an optimized STA/LTA workflow as true positives or false positives. To facilitate classification, we leverage an advanced deep learning phase picker, the earthquake compact convolutional transformer (EQCCT), which provides several classification features. These features include the counts of P&amp;S picks, the average, minimum, maximum, and standard deviation of P&amp;S probabilities, and the S/P pick count ratios. On a moderate dataset containing 200 real earthquakes and 200 fake earthquake waveforms, we achieve 100% accuracy across all metrics for both the random forest and XGBoost methods. On a larger dataset of 1500 events, we still achieve a precision of 1.0, a recall above 0.99, and an F1 score above 0.99 for both the random forest and XGBoost methods, with XGBoost achieving slightly higher accuracy. We also analyzed the feature importance and found that the maximum S-pick probability and the S/P pick count ratio play the most critical roles in classification. The proposed method provides a highly effective and efficient approach for fine-tuning the automatic earthquake catalog using the optimized STA/LTA method, leveraging existing tools such as the deep-learning-based phase picker and XGBoost.</div></div>","PeriodicalId":55221,"journal":{"name":"Computers & Geosciences","volume":"209 ","pages":"Article 106103"},"PeriodicalIF":4.4,"publicationDate":"2026-01-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145928789","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An acceleration method for elastic wave forward modeling based on 1D convolution operator 基于一维卷积算子的弹性波正演加速方法
IF 4.4 2区 地球科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-12-29 DOI: 10.1016/j.cageo.2025.106100
Bohan Chen , Yan Zhang , Liangliang Yao , Haichao Wang , Qifeng Chen , Miaomiao Wang
The finite-difference time-domain (FDTD) method incurs significant computational overhead in high-precision, long sampling, and large-scale elastic wave forward numerical simulations, due to its inherent drawbacks, such as the need for repeated wave field iteration and the large number of grid points. To address this problem, this paper proposes a 1D convolution operator elastic wave forward acceleration method based on FDTD theory, the staggered grids, and the first-order velocity-stress equation while maintaining the original wave field iteration and grid point count. The method takes the finite difference coefficients as the 1D convolution kernel weights, transforms the finite difference partial derivative computation into the form of convolution operation, and makes full use of the high arithmetic intensity and parallel characteristics of convolution computation in GPUs to achieve efficient solving of spatial first-order derivatives. The matrix transposition optimization strategy is introduced to reorganise the storage layout of column direction data to improve the efficiency of reading column direction data and maximize the performance of the convolution operation. At the same time, a parallel matrix multiplication mechanism is designed to further improve the performance of convolutional computation. The proposed method achieves comparable numerical simulation accuracy to the FDTD method of the same order. It show a time efficiency improvement of 62.82 % in high-precision imaging, 58.48 % in long sampling, and 44.03 % in large-scale models.
时域有限差分法(finite-差分time-domain, FDTD)由于需要重复波场迭代和网格点数量多等缺点,在高精度、长采样、大规模弹性波正演数值模拟中造成了巨大的计算开销。针对这一问题,本文提出了一种基于时域有限差分理论、交错网格和一阶速度-应力方程的一维卷积算子弹性波正演加速方法,同时保持了原始波场迭代和网格点数。该方法以有限差分系数作为一维卷积核权值,将有限差分偏导数计算转化为卷积运算的形式,并充分利用卷积计算在gpu中的高运算强度和并行特性,实现空间一阶导数的高效求解。引入矩阵转置优化策略对列方向数据的存储布局进行重组,以提高读取列方向数据的效率,最大化卷积运算的性能。同时,设计了并行矩阵乘法机制,进一步提高了卷积计算的性能。该方法的数值模拟精度与同阶时域有限差分法相当。在高精度成像、长时间采样和大尺度模型中,时间效率分别提高了62.82%、58.48%和44.03%。
{"title":"An acceleration method for elastic wave forward modeling based on 1D convolution operator","authors":"Bohan Chen ,&nbsp;Yan Zhang ,&nbsp;Liangliang Yao ,&nbsp;Haichao Wang ,&nbsp;Qifeng Chen ,&nbsp;Miaomiao Wang","doi":"10.1016/j.cageo.2025.106100","DOIUrl":"10.1016/j.cageo.2025.106100","url":null,"abstract":"<div><div>The finite-difference time-domain (FDTD) method incurs significant computational overhead in high-precision, long sampling, and large-scale elastic wave forward numerical simulations, due to its inherent drawbacks, such as the need for repeated wave field iteration and the large number of grid points. To address this problem, this paper proposes a 1D convolution operator elastic wave forward acceleration method based on FDTD theory, the staggered grids, and the first-order velocity-stress equation while maintaining the original wave field iteration and grid point count. The method takes the finite difference coefficients as the 1D convolution kernel weights, transforms the finite difference partial derivative computation into the form of convolution operation, and makes full use of the high arithmetic intensity and parallel characteristics of convolution computation in GPUs to achieve efficient solving of spatial first-order derivatives. The matrix transposition optimization strategy is introduced to reorganise the storage layout of column direction data to improve the efficiency of reading column direction data and maximize the performance of the convolution operation. At the same time, a parallel matrix multiplication mechanism is designed to further improve the performance of convolutional computation. The proposed method achieves comparable numerical simulation accuracy to the FDTD method of the same order. It show a time efficiency improvement of 62.82 % in high-precision imaging, 58.48 % in long sampling, and 44.03 % in large-scale models.</div></div>","PeriodicalId":55221,"journal":{"name":"Computers & Geosciences","volume":"208 ","pages":"Article 106100"},"PeriodicalIF":4.4,"publicationDate":"2025-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145885238","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
PRT-DeepONet: Geometry-aware neural operator for efficient prediction of pore-scale concentration fields PRT-DeepONet:用于孔隙尺度浓度场有效预测的几何感知神经算子
IF 4.4 2区 地球科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-12-21 DOI: 10.1016/j.cageo.2025.106098
Yehoon Kim , Ho-rim Kim , Heewon Jung
The computational challenge of predicting reactive transport in heterogeneous porous media originates from resolving complex pore-scale geometries and sharp concentration gradients near solid-fluid interfaces. This study introduces PRT-DeepONet (Pore-scale Reaction Transport Deep Operator Network), a geometry-aware neural operator that predicts local concentration fields in porous media with linear and nonlinear reactions. The architecture comprises two branch networks—a CNN branch for encoding the spatial patterns of binary porous media and an FNN branch for extracting parametric controls in partial differential equations—and a trunk network augmented with a geodesic distance function. This geodesic encoding addresses the inherent limitation of convolutional neural networks in maintaining geometric fidelity at solid-fluid interfaces by providing explicit transport pathway information absent in conventional architectures. PRT-DeepONet was trained on lattice Boltzmann simulations spanning various Péclet and Damköhler numbers, encompassing diffusion-to advection-dominated transport and slow to fast reaction kinetics. For steady-state predictions, PRT-DeepONet achieves an average RMSE below 0.04 while preserving complex grain geometries that baseline models fail to capture, with computational speedups of 3–5 orders of magnitude—reducing simulation times from minutes to milliseconds. The architecture successfully extends to transient problems, accurately predicting temporal concentration evolution for both reversible sorption and Monod kinetics. PRT-DeepONet demonstrates robust interpolation for unseen parametric conditions and time points, with performance improving with denser sampling of training data. These capabilities position PRT-DeepONet as an efficient tool for subsurface applications requiring rapid evaluation of reactive transport, including groundwater contamination assessment, CO2 sequestration modeling, and nuclear waste disposal safety analysis.
预测非均质多孔介质中反应输运的计算挑战来自于解析复杂的孔隙尺度几何形状和固液界面附近的急剧浓度梯度。该研究引入了PRT-DeepONet(孔隙尺度反应传输深度算子网络),这是一种几何感知的神经算子,用于预测具有线性和非线性反应的多孔介质中的局部浓度场。该体系结构包括两个分支网络——一个用于编码二元多孔介质空间模式的CNN分支和一个用于提取偏微分方程参数控制的FNN分支——以及一个带有测地距离函数的主干网络。这种测地线编码解决了卷积神经网络在保持固流界面几何保真度方面的固有局限性,提供了传统架构中缺乏的明确传输路径信息。PRT-DeepONet在晶格玻尔兹曼模拟中进行了训练,模拟涵盖了各种psamclet和Damköhler数字,包括扩散到平流主导的运输以及慢速到快速的反应动力学。对于稳态预测,PRT-DeepONet实现了低于0.04的平均RMSE,同时保留了基线模型无法捕获的复杂谷物几何形状,计算速度将3-5个数量级的模拟时间从几分钟减少到几毫秒。该体系结构成功地扩展到瞬态问题,准确地预测了可逆吸附和莫诺动力学的时间浓度演变。PRT-DeepONet展示了对未知参数条件和时间点的鲁棒插值,随着训练数据采样密度的提高,性能得到改善。这些功能将PRT-DeepONet定位为需要快速评估反应输送的地下应用的有效工具,包括地下水污染评估、二氧化碳封存建模和核废料处理安全分析。
{"title":"PRT-DeepONet: Geometry-aware neural operator for efficient prediction of pore-scale concentration fields","authors":"Yehoon Kim ,&nbsp;Ho-rim Kim ,&nbsp;Heewon Jung","doi":"10.1016/j.cageo.2025.106098","DOIUrl":"10.1016/j.cageo.2025.106098","url":null,"abstract":"<div><div>The computational challenge of predicting reactive transport in heterogeneous porous media originates from resolving complex pore-scale geometries and sharp concentration gradients near solid-fluid interfaces. This study introduces PRT-DeepONet (Pore-scale Reaction Transport Deep Operator Network), a geometry-aware neural operator that predicts local concentration fields in porous media with linear and nonlinear reactions. The architecture comprises two branch networks—a CNN branch for encoding the spatial patterns of binary porous media and an FNN branch for extracting parametric controls in partial differential equations—and a trunk network augmented with a geodesic distance function. This geodesic encoding addresses the inherent limitation of convolutional neural networks in maintaining geometric fidelity at solid-fluid interfaces by providing explicit transport pathway information absent in conventional architectures. PRT-DeepONet was trained on lattice Boltzmann simulations spanning various Péclet and Damköhler numbers, encompassing diffusion-to advection-dominated transport and slow to fast reaction kinetics. For steady-state predictions, PRT-DeepONet achieves an average RMSE below 0.04 while preserving complex grain geometries that baseline models fail to capture, with computational speedups of 3–5 orders of magnitude—reducing simulation times from minutes to milliseconds. The architecture successfully extends to transient problems, accurately predicting temporal concentration evolution for both reversible sorption and Monod kinetics. PRT-DeepONet demonstrates robust interpolation for unseen parametric conditions and time points, with performance improving with denser sampling of training data. These capabilities position PRT-DeepONet as an efficient tool for subsurface applications requiring rapid evaluation of reactive transport, including groundwater contamination assessment, CO<sub>2</sub> sequestration modeling, and nuclear waste disposal safety analysis.</div></div>","PeriodicalId":55221,"journal":{"name":"Computers & Geosciences","volume":"208 ","pages":"Article 106098"},"PeriodicalIF":4.4,"publicationDate":"2025-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145885240","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MJOFormer: An adaptive land-ocean spatio-temporal transformer for Madden–Julian Oscillation forecasting MJOFormer:用于Madden-Julian振荡预报的自适应陆-海时空转换器
IF 4.4 2区 地球科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-12-20 DOI: 10.1016/j.cageo.2025.106097
Hongliang Li , Zhewen Xu , Zelong Fang , Nong Zhang , Changzheng Liu , Renge Zhou , Xiaohui Wei
The Madden–Julian Oscillation (MJO) represents the predominant driver of sub-seasonal variability within tropical regions. In deep learning of weather forecasting, achieving reliable accuracy in MJO prediction remains challenging, making sub-seasonal forecasts generally probabilistic. In this paper, we reveal the challenges that impede MJO forecasts, including distributional drift when the circulation passes from the Indian Ocean across Australia, hindered by obstacles like lands or shoals. In addition, we find it non-trivial to extract the sophisticated spatio-temporal relationships in climate data. To address these issues, we propose MJOFormer, an adaptive land-ocean spatio-temporal transformer, with (1) a land-ocean sampler to address distributional drift by adaptively partitioning across terrain; (2) a dynamic attention mechanism to compensate for the absence of spatial features by adaptively tackling the spatio-temporal correlation; (3) a cuboid method to improve efficiency by parallel training. Comprehensive experiments exhibit that MJOFormer possesses competitive, outperforming existing methods with better accuracy, stability, and efficiency.
麦登-朱利安涛动(MJO)是热带地区亚季节变率的主要驱动因素。在天气预报的深度学习中,实现MJO预测的可靠准确性仍然具有挑战性,使得分季节预报通常是概率性的。在本文中,我们揭示了阻碍MJO预测的挑战,包括环流从印度洋经过澳大利亚时的分布漂移,受到陆地或浅滩等障碍物的阻碍。此外,我们发现提取气候数据中复杂的时空关系并非易事。为了解决这些问题,我们提出了MJOFormer,一个自适应陆地-海洋时空转换器,它具有(1)陆地-海洋采样器,通过自适应跨地形划分来解决分布漂移;(2)动态注意机制,通过自适应处理时空相关性来补偿空间特征缺失;(3)长方体法通过并行训练提高效率。综合实验表明,MJOFormer具有竞争力,优于现有方法,具有更好的准确性,稳定性和效率。
{"title":"MJOFormer: An adaptive land-ocean spatio-temporal transformer for Madden–Julian Oscillation forecasting","authors":"Hongliang Li ,&nbsp;Zhewen Xu ,&nbsp;Zelong Fang ,&nbsp;Nong Zhang ,&nbsp;Changzheng Liu ,&nbsp;Renge Zhou ,&nbsp;Xiaohui Wei","doi":"10.1016/j.cageo.2025.106097","DOIUrl":"10.1016/j.cageo.2025.106097","url":null,"abstract":"<div><div>The Madden–Julian Oscillation (MJO) represents the predominant driver of sub-seasonal variability within tropical regions. In deep learning of weather forecasting, achieving reliable accuracy in MJO prediction remains challenging, making sub-seasonal forecasts generally probabilistic. In this paper, we reveal the challenges that impede MJO forecasts, including distributional drift when the circulation passes from the Indian Ocean across Australia, hindered by obstacles like lands or shoals. In addition, we find it non-trivial to extract the sophisticated spatio-temporal relationships in climate data. To address these issues, we propose MJOFormer, an adaptive land-ocean spatio-temporal transformer, with (1) a land-ocean sampler to address distributional drift by adaptively partitioning across terrain; (2) a dynamic attention mechanism to compensate for the absence of spatial features by adaptively tackling the spatio-temporal correlation; (3) a cuboid method to improve efficiency by parallel training. Comprehensive experiments exhibit that MJOFormer possesses competitive, outperforming existing methods with better accuracy, stability, and efficiency.</div></div>","PeriodicalId":55221,"journal":{"name":"Computers & Geosciences","volume":"208 ","pages":"Article 106097"},"PeriodicalIF":4.4,"publicationDate":"2025-12-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145885239","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A mixture of experts model for shallow crustal earthquake ground motion prediction in Japan 日本浅层地壳地震地面运动预测的混合专家模型
IF 4.4 2区 地球科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-12-13 DOI: 10.1016/j.cageo.2025.106095
Zifa Wang , Jinfeng Dai , Dengke Zhao , Xiangying Wang , Jianming Wang , Zhaoyan Li , Yongcheng Feng , Zhaodong Wang
Ground motion prediction is central to earthquake engineering and disaster assessment, but traditional ground motion prediction models (GMPMs) struggle to capture the complex nature of seismic wave propagation. GMPMs based on a single machine learning algorithm also exhibit unsatisfactory performance when handling high-dimensional nonlinear relationships and large datasets. This study proposes a novel ground motion prediction model, MoE-XGB, which combines the Mixture of Experts (MoE) architecture with the XGBoost algorithm. Through a gating network, it dynamically assigns weights to adaptively handle the heterogeneity of earthquake data. The model innovatively integrates latitude and longitude features of stations and seismic sources, primarily acting as proxies for relative positions between stations and epicenters. The model was trained and validated using a strong-motion database of shallow crustal earthquakes in Japan and tested for cross-regional generalization with a New Zealand earthquake dataset. Results show that the MoE-XGB model, trained on a 19972019 strong-motion dataset, improves the mean squared error (MSE) by 39.2 %, reduces the standard deviation by 22.0 %, and increases the correlation coefficient by 4.8 % compared to the XGBoost-SC model, which is a regression model based on XGBoost and specifically designed for predicting seismic motions in the shallow crust region (SC). The inclusion of latitude and longitude features, primarily acting as proxies for relative positions between stations and epicenters, significantly enhances prediction accuracy. Cross-regional testing in New Zealand confirms the model's robust generalization to earthquake events in other regions. By efficiently integrating spatial features and a dynamic expert mechanism, the MoE-XGB model provides a high-precision, highly generalizable solution for ground motion prediction.
地震动预测是地震工程和灾害评估的核心,但传统的地震动预测模型(GMPMs)难以捕捉地震波传播的复杂性。基于单一机器学习算法的gmpm在处理高维非线性关系和大型数据集时也表现出令人不满意的性能。本文提出了一种新的地震动预测模型MoE- xgb,该模型将混合专家(MoE)架构与XGBoost算法相结合。通过门控网络动态分配权重,自适应处理地震数据的异质性。该模型创新地整合了台站和震源的纬度和经度特征,主要作为台站和震中之间相对位置的代理。使用日本浅层地壳强震数据库对模型进行了训练和验证,并使用新西兰地震数据集进行了跨区域推广测试。结果表明,与XGBoost-SC模型相比,在1997-2019年强震数据集上训练的MoE-XGB模型的均方误差(MSE)提高了39.2%,标准差降低了22.0%,相关系数提高了4.8%。XGBoost-SC模型是一种基于XGBoost的回归模型,专门用于预测地壳浅层地震运动。包括纬度和经度特征,主要作为台站和震中之间相对位置的代理,显著提高了预测精度。在新西兰进行的跨区域测试证实了该模型对其他地区地震事件的稳健推广。通过有效地整合空间特征和动态专家机制,MoE-XGB模型为地震动预测提供了高精度、高度通用性的解决方案。
{"title":"A mixture of experts model for shallow crustal earthquake ground motion prediction in Japan","authors":"Zifa Wang ,&nbsp;Jinfeng Dai ,&nbsp;Dengke Zhao ,&nbsp;Xiangying Wang ,&nbsp;Jianming Wang ,&nbsp;Zhaoyan Li ,&nbsp;Yongcheng Feng ,&nbsp;Zhaodong Wang","doi":"10.1016/j.cageo.2025.106095","DOIUrl":"10.1016/j.cageo.2025.106095","url":null,"abstract":"<div><div>Ground motion prediction is central to earthquake engineering and disaster assessment, but traditional ground motion prediction models (GMPMs) struggle to capture the complex nature of seismic wave propagation. GMPMs based on a single machine learning algorithm also exhibit unsatisfactory performance when handling high-dimensional nonlinear relationships and large datasets. This study proposes a novel ground motion prediction model, MoE-XGB, which combines the Mixture of Experts (MoE) architecture with the XGBoost algorithm. Through a gating network, it dynamically assigns weights to adaptively handle the heterogeneity of earthquake data. The model innovatively integrates latitude and longitude features of stations and seismic sources, primarily acting as proxies for relative positions between stations and epicenters. The model was trained and validated using a strong-motion database of shallow crustal earthquakes in Japan and tested for cross-regional generalization with a New Zealand earthquake dataset. Results show that the MoE-XGB model, trained on a 1997<strong>–</strong>2019 strong-motion dataset, improves the mean squared error (MSE) by 39.2 %, reduces the standard deviation by 22.0 %, and increases the correlation coefficient by 4.8 % compared to the XGBoost-SC model, which is a regression model based on XGBoost and specifically designed for predicting seismic motions in the shallow crust region (SC). The inclusion of latitude and longitude features, primarily acting as proxies for relative positions between stations and epicenters, significantly enhances prediction accuracy. Cross-regional testing in New Zealand confirms the model's robust generalization to earthquake events in other regions. By efficiently integrating spatial features and a dynamic expert mechanism, the MoE-XGB model provides a high-precision, highly generalizable solution for ground motion prediction.</div></div>","PeriodicalId":55221,"journal":{"name":"Computers & Geosciences","volume":"208 ","pages":"Article 106095"},"PeriodicalIF":4.4,"publicationDate":"2025-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145791802","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Raising the bar: Deep learning on comprehensive database sets new benchmark for automated fission-track detection 提高标准:综合数据库的深度学习为自动裂变轨迹检测设定了新的基准
IF 4.4 2区 地球科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-12-13 DOI: 10.1016/j.cageo.2025.106096
Samuel C. Boone , Ling Chung , Noel Faux , Usha Nattala , Thomas Church , Chenghao Jiang , Malcolm McMillan , Sean Jones , David Liu , Han Jiang , Kris Ehinger , Tom Drummond , Barry Kohn , Andrew Gleadow
Apatite fission-track (FT) thermochronology is widely used for constraining the thermal evolution of crustal rocks. However, manual FT identification is time-intensive and subjective. Although recent AI-based approaches have shown promise, performance often declines for complex, natural samples due to limited and overly idealised training data.
We introduce two open-access convolutional neural networks (CNNs) for automatic detection of surface-intersecting FTs in apatite and mica. The first, HALtracks 2D, uses paired reflected- and transmitted-light surface images, while HALtracks 3D incorporates an additional 3D stack of transmitted light images. HALtracks 2D exhibits mean accuracies (94.2–91.6 %) that are as good or better than both HALtracks 3D and all previous FT algorithms across a broad range of apatite fission track densities (up to 8.54 × 106 tracks/cm2) on expert-curated reference data. This improvement is due to a comprehensive training dataset comprising a wider range of track densities and etch-pit morphologies.
Unexpectedly, HALtracks3D performed worse (91.5–80.1 %), likely because reflected-light information—critical for recognising track openings—became underrepresented among multiple transmitted-light inputs during CNN training. At very high track densities (>8.54 × 106 tracks/cm2) pushing the analytical boundaries of optical fission-track counting, Coincidence Mapping (Gleadow et al., 2009) remains more accurate than HALtracks 2D. Thermochronologists might therefore consider utilising a combination of automated fission-track algorithms depending on FT density.
Future work could expand the open-access training dataset to include a broader range of apatite FT specimens, and increased metadata for targeted CNN training on spurious features such as surface imperfections and dislocations, which are misidentified as fission tracks by existing algorithms. The open-access testing dataset presented here provides a benchmark for evaluating future FT algorithms.
Nevertheless, HALtracks 2D's enhanced accuracy brings apatite FT analysis significantly closer to full automation, with the potential to mitigate observer bias, reduce inter-laboratory variability, and broaden the accessibility of the technique to the wider geoscience community.
磷灰石裂变径迹(FT)热年代学被广泛用于约束地壳岩石的热演化。然而,手动FT识别是费时且主观的。尽管最近基于人工智能的方法显示出了希望,但由于有限和过度理想化的训练数据,复杂的自然样本的性能往往会下降。本文介绍了两种开放存取卷积神经网络(cnn),用于自动检测磷灰石和云母中表面相交的傅里叶变换。第一种是HALtracks 2D,使用成对的反射光和透射光表面图像,而HALtracks 3D则结合了透射光图像的额外3D堆栈。HALtracks 2D展示的平均精度(94.2 - 91.6%)在专家管理的参考数据上,在广泛的磷灰石裂变径迹密度(高达8.54 × 106径迹/cm2)范围内,与HALtracks 3D和所有以前的FT算法一样好或更好。这种改进是由于一个全面的训练数据集,包括更广泛的轨道密度和蚀刻坑形态。出乎意料的是,HALtracks3D的表现更差(91.5% - 80.1%),可能是因为在CNN训练期间,反射光信息在多个透射光输入中被低估了,而反射光信息是识别轨道开口的关键。在非常高的轨道密度(>;8.54 × 106轨道/cm2)推动光学裂变轨道计数的分析边界时,重合映射(Gleadow et al., 2009)仍然比HALtracks 2D更准确。因此,热年代学家可能会考虑根据FT密度使用自动裂变径迹算法的组合。未来的工作可以扩展开放获取的训练数据集,包括更广泛的磷灰石FT样本,并增加针对虚假特征(如表面缺陷和位错)的目标CNN训练的元数据,这些虚假特征被现有算法错误地识别为裂变轨迹。本文提供的开放获取测试数据集为评估未来的FT算法提供了基准。尽管如此,HALtracks 2D的提高精度使磷灰石FT分析更接近于完全自动化,有可能减轻观察者的偏见,减少实验室间的差异,并扩大该技术在更广泛的地球科学领域的可及性。
{"title":"Raising the bar: Deep learning on comprehensive database sets new benchmark for automated fission-track detection","authors":"Samuel C. Boone ,&nbsp;Ling Chung ,&nbsp;Noel Faux ,&nbsp;Usha Nattala ,&nbsp;Thomas Church ,&nbsp;Chenghao Jiang ,&nbsp;Malcolm McMillan ,&nbsp;Sean Jones ,&nbsp;David Liu ,&nbsp;Han Jiang ,&nbsp;Kris Ehinger ,&nbsp;Tom Drummond ,&nbsp;Barry Kohn ,&nbsp;Andrew Gleadow","doi":"10.1016/j.cageo.2025.106096","DOIUrl":"10.1016/j.cageo.2025.106096","url":null,"abstract":"<div><div>Apatite fission-track (FT) thermochronology is widely used for constraining the thermal evolution of crustal rocks. However, manual FT identification is time-intensive and subjective. Although recent AI-based approaches have shown promise, performance often declines for complex, natural samples due to limited and overly idealised training data.</div><div>We introduce two open-access convolutional neural networks (CNNs) for automatic detection of surface-intersecting FTs in apatite and mica. The first, <em>HALtracks 2D</em>, uses paired reflected- and transmitted-light surface images, while <em>HALtracks 3D</em> incorporates an additional 3D stack of transmitted light images. <em>HALtracks 2D</em> exhibits mean accuracies (94.2–91.6 %) that are as good or better than both <em>HALtracks 3D</em> and all previous FT algorithms across a broad range of apatite fission track densities (up to 8.54 × 10<sup>6</sup> tracks/cm<sup>2</sup>) on expert-curated reference data. This improvement is due to a comprehensive training dataset comprising a wider range of track densities and etch-pit morphologies.</div><div>Unexpectedly, <em>HALtracks3D</em> performed worse (91.5–80.1 %), likely because reflected-light information—critical for recognising track openings—became underrepresented among multiple transmitted-light inputs during CNN training. At very high track densities (&gt;8.54 × 10<sup>6</sup> tracks/cm<sup>2</sup>) pushing the analytical boundaries of optical fission-track counting, <em>Coincidence Mapping</em> (Gleadow et al., 2009) remains more accurate than <em>HALtracks 2D</em>. Thermochronologists might therefore consider utilising a combination of automated fission-track algorithms depending on FT density.</div><div>Future work could expand the open-access training dataset to include a broader range of apatite FT specimens, and increased metadata for targeted CNN training on spurious features such as surface imperfections and dislocations, which are misidentified as fission tracks by existing algorithms. The open-access testing dataset presented here provides a benchmark for evaluating future FT algorithms.</div><div>Nevertheless, HALtracks 2D's enhanced accuracy brings apatite FT analysis significantly closer to full automation, with the potential to mitigate observer bias, reduce inter-laboratory variability, and broaden the accessibility of the technique to the wider geoscience community.</div></div>","PeriodicalId":55221,"journal":{"name":"Computers & Geosciences","volume":"208 ","pages":"Article 106096"},"PeriodicalIF":4.4,"publicationDate":"2025-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145791271","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Computers & Geosciences
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1