首页 > 最新文献

Computers & Geosciences最新文献

英文 中文
PIKANs: Physics-informed Kolmogorov–Arnold networks for landslide time-to-failure prediction 基于物理的滑坡失效时间预测Kolmogorov-Arnold网络
IF 4.4 2区 地球科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-12-12 DOI: 10.1016/j.cageo.2025.106094
Jiashan Wan , Liangjun Wen , Ziheng Jian , Jinhua Wu , Jingyang Li , Mengqi Lian , Kai Wang
Slope deformation is characterized by pronounced time variability and complexity. Although ground-based synthetic aperture radar (GB-SAR) provides high-frequency, broad monitoring, its strong oscillations and large fluctuations can impair predictive performance. To address this, the raw displacement sequence is first smoothed via misaligned subtraction to suppress high-frequency noise and highlight key deformation trends. A dynamic confidence boundary is then established on the inverse-velocity curve to robustly identify the acceleration start point. Building on prior work on physics-informed Kolmogorov–Arnold networks (PIKANs), we apply a PIKANs framework to landslide early warning, embedding the displacement-time evolution constraint into the basis-function space of Kolmogorov–Arnold network (KAN) to unify nonlinear deformation dynamics with governing physical laws. During model training, an alternating optimization scheme combining Adam and the L-BFGS algorithm accelerates convergence and enhances predictive accuracy. Comparative experiments on field GB-SAR datasets demonstrate that compared with an improved KAN baseline and a physics-informed neural network benchmark, PIKANs reduce the relative error in landslide failure-time prediction by 38.42% and 20.44%, respectively. These results confirm that integrating physical equation constraints into neural network parameter updates substantially improves the precision and efficiency of real-time landslide early warning.
边坡变形具有明显的时变性和复杂性。虽然地面合成孔径雷达(GB-SAR)提供高频、宽范围的监测,但其强烈的振荡和较大的波动会影响预测性能。为了解决这个问题,首先通过错位减法对原始位移序列进行平滑,以抑制高频噪声并突出关键变形趋势。然后在逆速度曲线上建立动态置信边界,鲁棒地识别加速度起点。在前人基于物理信息的Kolmogorov-Arnold网络(PIKANs)的基础上,我们将PIKANs框架应用于滑坡预警,将位移-时间演化约束嵌入到Kolmogorov-Arnold网络(KAN)的基函数空间中,以统一非线性变形动力学与控制物理定律。在模型训练过程中,采用Adam与L-BFGS算法相结合的交替优化方案,加快了收敛速度,提高了预测精度。在现场GB-SAR数据集上的对比实验表明,与改进的KAN基线和基于物理信息的神经网络基准相比,PIKANs在滑坡破坏时间预测中的相对误差分别降低了38.42%和20.44%。结果表明,将物理方程约束整合到神经网络参数更新中,大大提高了滑坡实时预警的精度和效率。
{"title":"PIKANs: Physics-informed Kolmogorov–Arnold networks for landslide time-to-failure prediction","authors":"Jiashan Wan ,&nbsp;Liangjun Wen ,&nbsp;Ziheng Jian ,&nbsp;Jinhua Wu ,&nbsp;Jingyang Li ,&nbsp;Mengqi Lian ,&nbsp;Kai Wang","doi":"10.1016/j.cageo.2025.106094","DOIUrl":"10.1016/j.cageo.2025.106094","url":null,"abstract":"<div><div>Slope deformation is characterized by pronounced time variability and complexity. Although ground-based synthetic aperture radar (GB-SAR) provides high-frequency, broad monitoring, its strong oscillations and large fluctuations can impair predictive performance. To address this, the raw displacement sequence is first smoothed via misaligned subtraction to suppress high-frequency noise and highlight key deformation trends. A dynamic confidence boundary is then established on the inverse-velocity curve to robustly identify the acceleration start point. Building on prior work on physics-informed Kolmogorov–Arnold networks (PIKANs), we apply a PIKANs framework to landslide early warning, embedding the displacement-time evolution constraint into the basis-function space of Kolmogorov–Arnold network (KAN) to unify nonlinear deformation dynamics with governing physical laws. During model training, an alternating optimization scheme combining Adam and the L-BFGS algorithm accelerates convergence and enhances predictive accuracy. Comparative experiments on field GB-SAR datasets demonstrate that compared with an improved KAN baseline and a physics-informed neural network benchmark, PIKANs reduce the relative error in landslide failure-time prediction by 38.42% and 20.44%, respectively. These results confirm that integrating physical equation constraints into neural network parameter updates substantially improves the precision and efficiency of real-time landslide early warning.</div></div>","PeriodicalId":55221,"journal":{"name":"Computers & Geosciences","volume":"208 ","pages":"Article 106094"},"PeriodicalIF":4.4,"publicationDate":"2025-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145791273","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
RockSDM: High-fidelity 2D rock image generation via semantic diffusion for digital rock applications RockSDM:通过语义扩散为数字岩石应用生成高保真2D岩石图像
IF 4.4 2区 地球科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-12-12 DOI: 10.1016/j.cageo.2025.106093
Yunlei Sun , Danning Qi , Tiancheng Chen , Ke Xu , Pengxiao Shi , Yongfei Yang
Digital rock technology is a critical technique for precise reservoir characterization and the optimization of oil and gas extraction. However, the high cost of rock sample acquisition and labor-intensive manual labeling lead to data scarcity, significantly hindering deep learning applications in geosciences and petroleum engineering. Existing rock image generation methods often suffer from limited fidelity and lack semantic control, inadequate for high-precision analysis. To address these challenges, we propose the Rock Image Semantic Diffusion Generative Model (RockSDM), a novel diffusion-based generative framework that introduces semantic control into 2D rock image generation for the first time. RockSDM overcomes data scarcity by generating high-quality rock images and pixel-level masks in a coordinated manner, ensuring both microstructural consistency and geological realism. Experimental results demonstrate that RockSDM significantly outperforms existing models in FID and KID. Moreover, the synthetic data generated by RockSDM substantially enhances segmentation performance in data-constrained scenarios. On the TriBSE dataset, RockSDM improves the mIoU by 17.9%, with the IoU of low-frequency categories increasing by up to 71.9%. This effectively mitigates data imbalance and improves model generalization. By reducing the cost of rock sample acquisition and manual annotation, RockSDM offers a powerful data augmentation tool, potentially accelerating deep learning adoption in geosciences and petroleum engineering.
数字岩石技术是精确表征储层和优化油气开采的关键技术。然而,岩石样本采集的高成本和劳动密集型的人工标记导致数据稀缺,严重阻碍了深度学习在地球科学和石油工程中的应用。现有的岩石图像生成方法往往保真度有限,缺乏语义控制,不足以进行高精度分析。为了解决这些挑战,我们提出了岩石图像语义扩散生成模型(RockSDM),这是一种新的基于扩散的生成框架,首次将语义控制引入到二维岩石图像生成中。RockSDM以协调的方式生成高质量的岩石图像和像素级掩模,从而克服了数据稀缺的问题,确保了微观结构的一致性和地质的真实感。实验结果表明,RockSDM在FID和KID方面明显优于现有模型。此外,RockSDM生成的合成数据大大提高了数据约束场景下的分割性能。在TriBSE数据集上,RockSDM将mIoU提高了17.9%,其中低频类别的IoU提高了71.9%。这有效地缓解了数据的不平衡,提高了模型的泛化。通过降低岩石样本采集和人工注释的成本,RockSDM提供了一个强大的数据增强工具,有可能加速深度学习在地球科学和石油工程中的应用。
{"title":"RockSDM: High-fidelity 2D rock image generation via semantic diffusion for digital rock applications","authors":"Yunlei Sun ,&nbsp;Danning Qi ,&nbsp;Tiancheng Chen ,&nbsp;Ke Xu ,&nbsp;Pengxiao Shi ,&nbsp;Yongfei Yang","doi":"10.1016/j.cageo.2025.106093","DOIUrl":"10.1016/j.cageo.2025.106093","url":null,"abstract":"<div><div>Digital rock technology is a critical technique for precise reservoir characterization and the optimization of oil and gas extraction. However, the high cost of rock sample acquisition and labor-intensive manual labeling lead to data scarcity, significantly hindering deep learning applications in geosciences and petroleum engineering. Existing rock image generation methods often suffer from limited fidelity and lack semantic control, inadequate for high-precision analysis. To address these challenges, we propose the Rock Image Semantic Diffusion Generative Model (RockSDM), a novel diffusion-based generative framework that introduces semantic control into 2D rock image generation for the first time. RockSDM overcomes data scarcity by generating high-quality rock images and pixel-level masks in a coordinated manner, ensuring both microstructural consistency and geological realism. Experimental results demonstrate that RockSDM significantly outperforms existing models in FID and KID. Moreover, the synthetic data generated by RockSDM substantially enhances segmentation performance in data-constrained scenarios. On the TriBSE dataset, RockSDM improves the mIoU by 17.9%, with the IoU of low-frequency categories increasing by up to 71.9%. This effectively mitigates data imbalance and improves model generalization. By reducing the cost of rock sample acquisition and manual annotation, RockSDM offers a powerful data augmentation tool, potentially accelerating deep learning adoption in geosciences and petroleum engineering.</div></div>","PeriodicalId":55221,"journal":{"name":"Computers & Geosciences","volume":"208 ","pages":"Article 106093"},"PeriodicalIF":4.4,"publicationDate":"2025-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145791270","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A comparative study of data- and image- domain LSRTM under velocity–impedance parametrization 速度-阻抗参数化下数据域与图像域LSRTM的比较研究
IF 4.4 2区 地球科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-12-11 DOI: 10.1016/j.cageo.2025.106091
Pengliang Yang, Zhengyu Ji
Least-squares reverse time migration (LSRTM) is one of the classic seismic imaging methods to reconstruct model perturbations within a known reference medium. It can be computed in either data or image domain using different methods by solving a linear inverse problem, whereas a careful comparison analysis of them is lacking in the literature. In this article, we present a comparative study for multiparameter LSRTM in data- and image- domain in the framework of SMIwiz open software. Different from conventional LSRTM for recovering only velocity perturbation with variable density, we focus on simultaneous reconstruction of velocity and impedance perturbations after logarithmic scaling, using the first-order velocity–pressure formulation of acoustic wave equation. The first 3D data-domain LSRTM example has been performed to validate our implementation, involving expensive repetition of Born modeling and migration over a number of iterations. As a more cost-effective alternative, the image-domain LSRTM is implemented using point spread function (PSF) and nonstationary deblurring filter. Dramatic distinctions between data and image domain methods are discovered with 2D Marmousi test: (1) The data-domain multiparameter inversion provides much better reconstruction of reflectivity images than image-domain approaches, thanks to the complete use of Hessian in Krylov space; (2) The poor multiparameter image-domain inversion highlights the limitation of incomplete Hessian sampling and strong parameter crosstalks, making it difficult to work in practice; (3) In contrast, monoparameter image-domain inversion for seismic impedance is found to work well. These observations have been further validated on Viking Graben Line 12 dataset.
最小二乘逆时偏移(LSRTM)是在已知参考介质中重建模型扰动的经典地震成像方法之一。通过求解线性逆问题,可以使用不同的方法在数据域或图像域计算它,但文献中缺乏对它们的仔细比较分析。本文在SMIwiz开放软件框架下,对数据域和图像域的多参数LSRTM进行了比较研究。与仅恢复变密度速度扰动的传统LSRTM不同,我们着重于利用声波方程的一阶速度-压力公式,在对数标度后同时重建速度和阻抗扰动。已经执行了第一个3D数据域LSRTM示例来验证我们的实现,这涉及到在许多迭代中重复Born建模和迁移的昂贵操作。作为一种更具成本效益的替代方案,图像域LSRTM采用点扩展函数(PSF)和非平稳去模糊滤波器实现。通过二维Marmousi检验,发现了数据域和图像域方法之间的显著区别:(1)数据域多参数反演比图像域方法提供了更好的反射率图像重建,这得益于在Krylov空间中完全使用了Hessian;(2)多参数图像域反演较差,突出了Hessian采样不完全和参数串扰较强的局限性,使其难以在实际中应用;(3)相比之下,单参数地震阻抗图像域反演效果较好。这些观测结果在Viking地堑Line 12数据集上得到了进一步验证。
{"title":"A comparative study of data- and image- domain LSRTM under velocity–impedance parametrization","authors":"Pengliang Yang,&nbsp;Zhengyu Ji","doi":"10.1016/j.cageo.2025.106091","DOIUrl":"10.1016/j.cageo.2025.106091","url":null,"abstract":"<div><div>Least-squares reverse time migration (LSRTM) is one of the classic seismic imaging methods to reconstruct model perturbations within a known reference medium. It can be computed in either data or image domain using different methods by solving a linear inverse problem, whereas a careful comparison analysis of them is lacking in the literature. In this article, we present a comparative study for multiparameter LSRTM in data- and image- domain in the framework of SMIwiz open software. Different from conventional LSRTM for recovering only velocity perturbation with variable density, we focus on simultaneous reconstruction of velocity and impedance perturbations after logarithmic scaling, using the first-order velocity–pressure formulation of acoustic wave equation. The first 3D data-domain LSRTM example has been performed to validate our implementation, involving expensive repetition of Born modeling and migration over a number of iterations. As a more cost-effective alternative, the image-domain LSRTM is implemented using point spread function (PSF) and nonstationary deblurring filter. Dramatic distinctions between data and image domain methods are discovered with 2D Marmousi test: (1) The data-domain multiparameter inversion provides much better reconstruction of reflectivity images than image-domain approaches, thanks to the complete use of Hessian in Krylov space; (2) The poor multiparameter image-domain inversion highlights the limitation of incomplete Hessian sampling and strong parameter crosstalks, making it difficult to work in practice; (3) In contrast, monoparameter image-domain inversion for seismic impedance is found to work well. These observations have been further validated on Viking Graben Line 12 dataset.</div></div>","PeriodicalId":55221,"journal":{"name":"Computers & Geosciences","volume":"208 ","pages":"Article 106091"},"PeriodicalIF":4.4,"publicationDate":"2025-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145738769","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimising the computational performance of high degree lithospheric field models 高阶岩石圈场模型的计算性能优化
IF 4.4 2区 地球科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-12-10 DOI: 10.1016/j.cageo.2025.106092
Michael Bareford , William Brown , Ciarán Beggan , Callum Watson , Mark Bull
The British Geological Survey (BGS) World Magnetic Anomaly Model (WMAM) code calculates spherical harmonic models of the natural magnetisation of the rocks of Earth’s crust. These models allow us to estimate the value of the full magnetic field vector at any location, based on scattered pointwise marine or aero-magnetic measurements of only the scalar magnetic field. Modelling the magnetic field in this way serves many important purposes, such as geological research, navigation and safe resource extraction. Global spherical harmonic models of degree and order 1440 (28 km spatial resolution) have been successfully computed on the HPC facilities local to BGS, but such runs require nearly the full compute capacity for multiple days. Further, the available resolution of the scalar field measurements is too high to be fully exploited by the WMAM code, limiting models of the crustal magnetic field to a resolution of 28 km. To overcome these issues, we refactored the WMAM code such that models of spherical harmonic degree 1440 and 2000 (20 km resolution) can be produced in hours rather than days. For example, a degree 2000 model was calculated using 64 HPE Cray EX nodes (8 192 cores) in 3 h and 44 mins. The resulting model power spectra and magnetic field maps showed excellent agreement with the existing degree 1440 model and the original input data. The performance of the WMAM code was further improved via offloading to GPU. We show the improvements due to GPU acceleration in terms of energy consumption as well as runtime. This fruitful collaboration between experts in the fields of Geoscience (BGS) and HPC (EPCC) has created the opportunity for the WMAM code to be used to gain new knowledge about crustal magnetic fields.
英国地质调查局(BGS)的世界磁异常模型(WMAM)代码计算了地壳岩石自然磁化的球谐模型。这些模型使我们能够估计在任何位置的全磁场矢量的值,基于散射的点对标量磁场的海洋或航空磁测量。以这种方式模拟磁场有许多重要用途,如地质研究、航海和安全资源开采。1440度和1440阶(~ 28 km空间分辨率)的全球球面调和模型已经在BGS本地的高性能计算设备上成功计算,但这样的运行几乎需要数天的全部计算能力。此外,标量场测量的可用分辨率太高,无法被WMAM代码充分利用,将地壳磁场模型限制在28公里的分辨率。为了克服这些问题,我们重构了WMAM代码,使球谐度1440和2000(~ 20公里分辨率)的模型可以在几小时内产生,而不是几天。例如,使用64个HPE Cray EX节点(8 192个内核)在3小时44分钟内计算了2000度模型。所得模型功率谱和磁场图与现有1440度模型和原始输入数据吻合良好。WMAM代码的性能通过卸载到GPU进一步提高。我们展示了由于GPU加速在能耗和运行时间方面的改进。地球科学(BGS)和HPC (EPCC)领域专家之间富有成效的合作为WMAM代码用于获取有关地壳磁场的新知识创造了机会。
{"title":"Optimising the computational performance of high degree lithospheric field models","authors":"Michael Bareford ,&nbsp;William Brown ,&nbsp;Ciarán Beggan ,&nbsp;Callum Watson ,&nbsp;Mark Bull","doi":"10.1016/j.cageo.2025.106092","DOIUrl":"10.1016/j.cageo.2025.106092","url":null,"abstract":"<div><div>The British Geological Survey (BGS) World Magnetic Anomaly Model (WMAM) code calculates spherical harmonic models of the natural magnetisation of the rocks of Earth’s crust. These models allow us to estimate the value of the full magnetic field vector at any location, based on scattered pointwise marine or aero-magnetic measurements of only the scalar magnetic field. Modelling the magnetic field in this way serves many important purposes, such as geological research, navigation and safe resource extraction. Global spherical harmonic models of degree and order 1440 (<span><math><mo>∼</mo></math></span>28 km spatial resolution) have been successfully computed on the HPC facilities local to BGS, but such runs require nearly the full compute capacity for multiple days. Further, the available resolution of the scalar field measurements is too high to be fully exploited by the WMAM code, limiting models of the crustal magnetic field to a resolution of 28 km. To overcome these issues, we refactored the WMAM code such that models of spherical harmonic degree 1440 and 2000 (<span><math><mo>∼</mo></math></span>20 km resolution) can be produced in hours rather than days. For example, a degree 2000 model was calculated using 64 HPE Cray EX nodes (8<!--> <!-->192 cores) in 3 h and 44 mins. The resulting model power spectra and magnetic field maps showed excellent agreement with the existing degree 1440 model and the original input data. The performance of the WMAM code was further improved via offloading to GPU. We show the improvements due to GPU acceleration in terms of energy consumption as well as runtime. This fruitful collaboration between experts in the fields of Geoscience (BGS) and HPC (EPCC) has created the opportunity for the WMAM code to be used to gain new knowledge about crustal magnetic fields.</div></div>","PeriodicalId":55221,"journal":{"name":"Computers & Geosciences","volume":"208 ","pages":"Article 106092"},"PeriodicalIF":4.4,"publicationDate":"2025-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145791272","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An interpretable causal variational autoencoder for geochemical anomalies recognition by regarding ore-controlling factors 考虑控矿因素的地球化学异常识别的可解释因果变分自编码器
IF 4.4 2区 地球科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-12-06 DOI: 10.1016/j.cageo.2025.106090
Fan Ni, Yihui Xiong
Conventional approaches for identifying geochemical anomalies often overlook the causal relationships between geological ore-controlling factors (e.g., fault, granite, and strata) and mineralization. This ignoration can lead to a lack of interpretability and robustness of the deep learning based geochemical anomaly identification methods. This study presents an interpretable causal variational autoencoder, which integrates structural causal models (SCM) with variational autoencoders (VAE) to address this issue. The interpretable causal VAE explicitly defines the causal relationships between geological factors that control mineralization and geochemical elements by utilizing a directed acyclic graph (DAG) and assessing causal effects through counterfactual reasoning. The model employs a semi-supervised training methodology, combining a limited number of labeled samples that reflect established geological control relationships with a large dataset of unlabeled data. Throughout the training process, it continuously updates both the causal graph structure and model parameters through data-driven causal discovery. Experimental results from geochemical datasets in the Nanling region in South China indicate that interpretable causal VAE significantly improves the accuracy of geochemical anomaly identification compared to traditional methods; for example, it necessitates the identification of only 14 % of high anomaly areas to capture all known deposits. Additionally, interpretable causal VAE enhances the geological interpretability of the model's results by clarifying the causal mechanisms through which geological ore-controlling factors affect element enrichment and depletion. This research highlights the considerable potential of merging deep learning and causality with geological knowledge derived from mineral system, thereby providing a novel tool for geochemical exploration.
常规的地球化学异常识别方法往往忽略了地质控矿因素(如断层、花岗岩和地层)与矿化之间的因果关系。这种忽略会导致基于深度学习的地球化学异常识别方法缺乏可解释性和鲁棒性。本研究提出了一种可解释的因果变分自编码器,它将结构因果模型(SCM)与变分自编码器(VAE)相结合来解决这一问题。可解释因果VAE利用有向无环图(DAG)和反事实推理评估因果效应,明确定义了控制矿化的地质因素与地球化学元素之间的因果关系。该模型采用半监督训练方法,将反映已建立的地质控制关系的有限数量的标记样本与未标记数据的大型数据集相结合。在整个训练过程中,它通过数据驱动的因果发现不断更新因果图结构和模型参数。华南南岭地区地球化学数据的实验结果表明,与传统方法相比,可解释因果VAE显著提高了地球化学异常识别的精度;例如,只需要识别14%的高异常区域就可以捕获所有已知的矿床。此外,可解释的因果VAE通过阐明地质控矿因素影响元素富集和衰竭的因果机制,提高了模型结果的地质可解释性。该研究强调了将深度学习和因果关系与来自矿物系统的地质知识相结合的巨大潜力,从而为地球化学勘探提供了一种新的工具。
{"title":"An interpretable causal variational autoencoder for geochemical anomalies recognition by regarding ore-controlling factors","authors":"Fan Ni,&nbsp;Yihui Xiong","doi":"10.1016/j.cageo.2025.106090","DOIUrl":"10.1016/j.cageo.2025.106090","url":null,"abstract":"<div><div>Conventional approaches for identifying geochemical anomalies often overlook the causal relationships between geological ore-controlling factors (e.g., fault, granite, and strata) and mineralization. This ignoration can lead to a lack of interpretability and robustness of the deep learning based geochemical anomaly identification methods. This study presents an interpretable causal variational autoencoder, which integrates structural causal models (SCM) with variational autoencoders (VAE) to address this issue. The interpretable causal VAE explicitly defines the causal relationships between geological factors that control mineralization and geochemical elements by utilizing a directed acyclic graph (DAG) and assessing causal effects through counterfactual reasoning. The model employs a semi-supervised training methodology, combining a limited number of labeled samples that reflect established geological control relationships with a large dataset of unlabeled data. Throughout the training process, it continuously updates both the causal graph structure and model parameters through data-driven causal discovery. Experimental results from geochemical datasets in the Nanling region in South China indicate that interpretable causal VAE significantly improves the accuracy of geochemical anomaly identification compared to traditional methods; for example, it necessitates the identification of only 14 % of high anomaly areas to capture all known deposits. Additionally, interpretable causal VAE enhances the geological interpretability of the model's results by clarifying the causal mechanisms through which geological ore-controlling factors affect element enrichment and depletion. This research highlights the considerable potential of merging deep learning and causality with geological knowledge derived from mineral system, thereby providing a novel tool for geochemical exploration.</div></div>","PeriodicalId":55221,"journal":{"name":"Computers & Geosciences","volume":"208 ","pages":"Article 106090"},"PeriodicalIF":4.4,"publicationDate":"2025-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145738768","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Implicit neural representations for 3D gravity inversion 三维重力反演的隐式神经表示
IF 4.4 2区 地球科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-12-02 DOI: 10.1016/j.cageo.2025.106082
Xiong Li , Jingtao Zhao , Shuai Zhou
Three-dimensional gravity inversion is vital for recovering subsurface density distribution, but resolving sparse geological anomalies with complex geometries from superimposed gravity signals remains difficult due to severe ill-posedness. Existing methods often face limitations in representational flexibility or rely heavily on predefined regularization strategies. We introduce the physics-driven Implicit Gravity Inversion (IGI) method to reconstruct the underground density distribution in geophysical exploration, which employs a neural network to implicitly represent the continuous density field directly from spatial coordinates. Critically, IGI is driven by physical laws embedded within the loss function, optimizing the neural network to predict a density field consistent with observed gravity anomalies, thus eliminating the need for extensive labeled training data typical of deep learning. Results show that IGI provides comparable anomaly distributions to those obtained using deep learning inversion. Ablation studies reveal that parameter coupling provides effective depth weighting, indicating that architectural choices control regularization strength within well-defined optimal ranges. Extensive experiments on synthetic benchmarks and the real-world San Nicolas deposit dataset demonstrate that IGI robustly recovers sparse and complex density models with high accuracy, effectively delineating intricate structures such as disconnected ore bodies and sharp contacts, surpassing traditional methods in these aspects.
三维重力反演对于恢复地下密度分布至关重要,但由于严重的病态性,从叠加重力信号中解决具有复杂几何形状的稀疏地质异常仍然很困难。现有的方法往往面临表示灵活性的限制,或者严重依赖于预定义的正则化策略。引入物理驱动的隐式重力反演(IGI)方法,利用神经网络直接从空间坐标隐式表示连续的密度场,重建地球物理勘探中的地下密度分布。关键是,IGI由嵌入损失函数中的物理定律驱动,优化神经网络以预测与观测到的重力异常一致的密度场,从而消除了对深度学习典型的大量标记训练数据的需求。结果表明,IGI提供的异常分布与使用深度学习反演获得的异常分布相当。消融研究表明,参数耦合提供了有效的深度加权,表明建筑选择将正则化强度控制在明确的最佳范围内。在合成基准和真实的San Nicolas矿床数据集上进行的大量实验表明,IGI能够以高精度鲁棒地恢复稀疏和复杂的密度模型,有效地描绘出分离矿体和尖锐接触等复杂结构,在这些方面超越了传统方法。
{"title":"Implicit neural representations for 3D gravity inversion","authors":"Xiong Li ,&nbsp;Jingtao Zhao ,&nbsp;Shuai Zhou","doi":"10.1016/j.cageo.2025.106082","DOIUrl":"10.1016/j.cageo.2025.106082","url":null,"abstract":"<div><div>Three-dimensional gravity inversion is vital for recovering subsurface density distribution, but resolving sparse geological anomalies with complex geometries from superimposed gravity signals remains difficult due to severe ill-posedness. Existing methods often face limitations in representational flexibility or rely heavily on predefined regularization strategies. We introduce the physics-driven Implicit Gravity Inversion (IGI) method to reconstruct the underground density distribution in geophysical exploration, which employs a neural network to implicitly represent the continuous density field directly from spatial coordinates. Critically, IGI is driven by physical laws embedded within the loss function, optimizing the neural network to predict a density field consistent with observed gravity anomalies, thus eliminating the need for extensive labeled training data typical of deep learning. Results show that IGI provides comparable anomaly distributions to those obtained using deep learning inversion. Ablation studies reveal that parameter coupling provides effective depth weighting, indicating that architectural choices control regularization strength within well-defined optimal ranges. Extensive experiments on synthetic benchmarks and the real-world San Nicolas deposit dataset demonstrate that IGI robustly recovers sparse and complex density models with high accuracy, effectively delineating intricate structures such as disconnected ore bodies and sharp contacts, surpassing traditional methods in these aspects.</div></div>","PeriodicalId":55221,"journal":{"name":"Computers & Geosciences","volume":"208 ","pages":"Article 106082"},"PeriodicalIF":4.4,"publicationDate":"2025-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145694905","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An effective deep domain adaptation approach for least squares migration 一种有效的深度域自适应最小二乘迁移方法
IF 4.4 2区 地球科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-11-21 DOI: 10.1016/j.cageo.2025.106081
Wenjun Ni , Shaoyong Liu , Zhikang Zhou , Hanming Gu , Bin Zhang
Seismic migration is an essential process in geophysical exploration for precisely imaging subsurface structures. Traditional image-domain least-squares migration (ID-LSM) is an effective tool for enhancing resolution; but it is often limited by high computational costs and its reliance on linear Point Spread Function (PSF) deconvolution, which struggles to capture complex nonlinear imaging effects. To address these limitations, deep learning (DL) provides a powerful framework capable of learning the complex, nonlinear mapping between conventional migration image to high-resolution reflectivity models. However, a critical defect of standard DL methods is their poor generalization: networks trained exclusively on synthetic data (source domain) often fail to generalize to field data (target domain) owing to inherent feature discrepancies. To bridge this domain gap, we propose a novel domain-adaptive mage-domain least-squares migration (DA-ID-LSM) approach. The proposed method employs a U-Net-based convolutional neural network combined with a Maximum Mean Discrepancy (MMD) loss function to minimize the distribution gap between the source and target domains, thereby enabling the model to learn domain-invariant features and generalize effectively. Numerical experiments on both synthetic and field datasets demonstrate that the proposed DA-ID-LSM approach outperforms conventional methods. It achieves improved resolution and enhanced lateral continuity. The incorporation of the MMD constraint notably improves imaging resolution and robustness without compromising training stability.
地震偏移是地球物理勘探中精确成像地下构造的一个重要过程。传统的图像域最小二乘迁移(ID-LSM)是提高分辨率的有效工具;但它通常受到计算成本高和依赖线性点扩散函数(PSF)反卷积的限制,这很难捕捉复杂的非线性成像效果。为了解决这些限制,深度学习(DL)提供了一个强大的框架,能够学习传统迁移图像与高分辨率反射率模型之间复杂的非线性映射。然而,标准深度学习方法的一个关键缺陷是泛化能力差:由于固有的特征差异,仅在合成数据(源域)上训练的网络往往无法泛化到现场数据(目标域)。为了弥补这一领域差距,我们提出了一种新的领域自适应图像-领域最小二乘迁移(DA-ID-LSM)方法。该方法采用基于u - net的卷积神经网络,结合最大平均差异(MMD)损失函数,使源域和目标域之间的分布差距最小化,从而使模型能够学习域不变特征并有效地进行泛化。在综合数据集和现场数据集上的数值实验表明,本文提出的DA-ID-LSM方法优于传统方法。它提高了分辨率,增强了横向连续性。MMD约束的结合显著提高了成像分辨率和鲁棒性,而不影响训练稳定性。
{"title":"An effective deep domain adaptation approach for least squares migration","authors":"Wenjun Ni ,&nbsp;Shaoyong Liu ,&nbsp;Zhikang Zhou ,&nbsp;Hanming Gu ,&nbsp;Bin Zhang","doi":"10.1016/j.cageo.2025.106081","DOIUrl":"10.1016/j.cageo.2025.106081","url":null,"abstract":"<div><div>Seismic migration is an essential process in geophysical exploration for precisely imaging subsurface structures. Traditional image-domain least-squares migration (ID-LSM) is an effective tool for enhancing resolution; but it is often limited by high computational costs and its reliance on linear Point Spread Function (PSF) deconvolution, which struggles to capture complex nonlinear imaging effects. To address these limitations, deep learning (DL) provides a powerful framework capable of learning the complex, nonlinear mapping between conventional migration image to high-resolution reflectivity models. However, a critical defect of standard DL methods is their poor generalization: networks trained exclusively on synthetic data (source domain) often fail to generalize to field data (target domain) owing to inherent feature discrepancies. To bridge this domain gap, we propose a novel domain-adaptive mage-domain least-squares migration (DA-ID-LSM) approach. The proposed method employs a U-Net-based convolutional neural network combined with a Maximum Mean Discrepancy (MMD) loss function to minimize the distribution gap between the source and target domains, thereby enabling the model to learn domain-invariant features and generalize effectively. Numerical experiments on both synthetic and field datasets demonstrate that the proposed DA-ID-LSM approach outperforms conventional methods. It achieves improved resolution and enhanced lateral continuity. The incorporation of the MMD constraint notably improves imaging resolution and robustness without compromising training stability.</div></div>","PeriodicalId":55221,"journal":{"name":"Computers & Geosciences","volume":"207 ","pages":"Article 106081"},"PeriodicalIF":4.4,"publicationDate":"2025-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145617889","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SeismoDual: A dual-domain deep learning framework for robust seismic phase picking SeismoDual:一个用于鲁棒地震相位提取的双域深度学习框架
IF 4.4 2区 地球科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-11-19 DOI: 10.1016/j.cageo.2025.106080
Kuan-Wei Tang, Kuan-Yu Chen
We present SeismoDual, a dual-domain deep learning framework for robust seismic phase picking that integrates both time-domain waveforms and time–frequency spectrograms. Unlike conventional pickers or recent deep learning-based models that usually operate solely in the time domain, SeismoDual captures complementary temporal and spectral features, enhancing resilience to strong background noise and low-SNR conditions. The framework adopts a Conformer-based encoder for both local and long-range time-domain modeling, a meticulous design encoder for distilling and encapsulating time–frequency information into feature representations, and a domain-aware decoder for effective fusion of heterogeneous seismic features. Extensive experiments on three benchmark datasets – STEAD, INSTANCE, and CWA – demonstrate SeismoDual’s superior accuracy and generalization capability across diverse scenarios. Compared to advanced methods, including PhaseNet, EQTransformer, and RED-PAN, SeismoDual achieves consistently higher F1 scores, particularly under challenging noise and sensor variability, highlighting its potential for operational deployment in real-time seismic monitoring. Furthermore, we integrate SeismoDual into a real-time earthquake early warning system and demonstrate its capability to reduce false picks significantly while maintaining low latency.
我们提出了SeismoDual,这是一个用于鲁棒地震相位采集的双域深度学习框架,集成了时域波形和时频谱图。与传统的拾取器或最近基于深度学习的模型(通常只在时域中工作)不同,SeismoDual捕获互补的时间和频谱特征,增强了对强背景噪声和低信噪比条件的恢复能力。该框架采用基于conformer的编码器进行局部和远程时域建模,采用精心设计的编码器将时频信息提取并封装为特征表示,采用域感知解码器进行非均质地震特征的有效融合。在三个基准数据集(STEAD、INSTANCE和CWA)上进行的大量实验表明,SeismoDual在不同场景下具有卓越的精度和泛化能力。与PhaseNet、EQTransformer和RED-PAN等先进方法相比,SeismoDual的F1分数一直较高,特别是在噪声和传感器变化较大的情况下,这凸显了其在实时地震监测中的应用潜力。此外,我们将SeismoDual集成到实时地震预警系统中,并证明了其在保持低延迟的同时显著减少误取的能力。
{"title":"SeismoDual: A dual-domain deep learning framework for robust seismic phase picking","authors":"Kuan-Wei Tang,&nbsp;Kuan-Yu Chen","doi":"10.1016/j.cageo.2025.106080","DOIUrl":"10.1016/j.cageo.2025.106080","url":null,"abstract":"<div><div>We present SeismoDual, a dual-domain deep learning framework for robust seismic phase picking that integrates both time-domain waveforms and time–frequency spectrograms. Unlike conventional pickers or recent deep learning-based models that usually operate solely in the time domain, SeismoDual captures complementary temporal and spectral features, enhancing resilience to strong background noise and low-SNR conditions. The framework adopts a Conformer-based encoder for both local and long-range time-domain modeling, a meticulous design encoder for distilling and encapsulating time–frequency information into feature representations, and a domain-aware decoder for effective fusion of heterogeneous seismic features. Extensive experiments on three benchmark datasets – STEAD, INSTANCE, and CWA – demonstrate SeismoDual’s superior accuracy and generalization capability across diverse scenarios. Compared to advanced methods, including PhaseNet, EQTransformer, and RED-PAN, SeismoDual achieves consistently higher F1 scores, particularly under challenging noise and sensor variability, highlighting its potential for operational deployment in real-time seismic monitoring. Furthermore, we integrate SeismoDual into a real-time earthquake early warning system and demonstrate its capability to reduce false picks significantly while maintaining low latency.</div></div>","PeriodicalId":55221,"journal":{"name":"Computers & Geosciences","volume":"207 ","pages":"Article 106080"},"PeriodicalIF":4.4,"publicationDate":"2025-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145571680","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Diffusion models for multivariate subsurface generation and efficient probabilistic inversion 多变量地下生成扩散模型及有效概率反演
IF 4.4 2区 地球科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-11-17 DOI: 10.1016/j.cageo.2025.106076
Roberto Miele, Niklas Linde
Diffusion models offer stable training and state-of-the-art performance for deep generative modeling tasks. Here, we consider their use in the context of multivariate subsurface modeling and probabilistic inversion. We first demonstrate that diffusion models enhance multivariate modeling capabilities compared to variational autoencoders and generative adversarial networks. In diffusion modeling, the generative process involves a comparatively large number of generative steps with update rules that can be modified to account for conditioning data. We propose different corrections to the popular Diffusion Posterior Sampling approach by Chung et al. (2023). In particular, we introduce a likelihood approximation accounting for the noise-contamination that is inherent in diffusion modeling. We assess performance in a multivariate geological scenario involving facies and correlated acoustic impedances. Conditional modeling is demonstrated using both local hard data (well logs) and nonlinear geophysics (fullstack seismic data). Our tests show significantly improved statistical robustness, enhanced sampling of the posterior probability density function and reduced computational costs, compared to the original approach. The method can be used with both hard and indirect conditioning data, individually or simultaneously. As the inversion is included within the diffusion process, it is faster than other methods requiring an outer-loop around the generative model, such as Markov chain Monte Carlo.
扩散模型为深度生成建模任务提供了稳定的训练和最先进的性能。在这里,我们考虑它们在多元地下建模和概率反演中的应用。我们首先证明,与变分自编码器和生成对抗网络相比,扩散模型增强了多变量建模能力。在扩散建模中,生成过程涉及相对大量的生成步骤和更新规则,这些规则可以修改以解释条件数据。我们对Chung等人(2023)提出的流行的扩散后验抽样方法提出了不同的修正。特别地,我们引入了一个似然近似来解释扩散建模中固有的噪声污染。我们在涉及相和相关声阻抗的多元地质情况下评估性能。使用本地硬数据(测井)和非线性地球物理(全叠地震数据)验证了条件建模。我们的测试表明,与原始方法相比,该方法显著提高了统计稳健性,增强了后验概率密度函数的采样,降低了计算成本。该方法可单独或同时用于硬条件和间接条件数据。由于反演包含在扩散过程中,因此它比其他需要在生成模型周围建立外环的方法(如马尔可夫链蒙特卡罗)要快。
{"title":"Diffusion models for multivariate subsurface generation and efficient probabilistic inversion","authors":"Roberto Miele,&nbsp;Niklas Linde","doi":"10.1016/j.cageo.2025.106076","DOIUrl":"10.1016/j.cageo.2025.106076","url":null,"abstract":"<div><div>Diffusion models offer stable training and state-of-the-art performance for deep generative modeling tasks. Here, we consider their use in the context of multivariate subsurface modeling and probabilistic inversion. We first demonstrate that diffusion models enhance multivariate modeling capabilities compared to variational autoencoders and generative adversarial networks. In diffusion modeling, the generative process involves a comparatively large number of generative steps with update rules that can be modified to account for conditioning data. We propose different corrections to the popular Diffusion Posterior Sampling approach by Chung et al. (2023). In particular, we introduce a likelihood approximation accounting for the noise-contamination that is inherent in diffusion modeling. We assess performance in a multivariate geological scenario involving facies and correlated acoustic impedances. Conditional modeling is demonstrated using both local hard data (well logs) and nonlinear geophysics (fullstack seismic data). Our tests show significantly improved statistical robustness, enhanced sampling of the posterior probability density function and reduced computational costs, compared to the original approach. The method can be used with both hard and indirect conditioning data, individually or simultaneously. As the inversion is included within the diffusion process, it is faster than other methods requiring an outer-loop around the generative model, such as Markov chain Monte Carlo.</div></div>","PeriodicalId":55221,"journal":{"name":"Computers & Geosciences","volume":"207 ","pages":"Article 106076"},"PeriodicalIF":4.4,"publicationDate":"2025-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145571681","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A neural network architecture based on attention gate mechanism for 3D magnetotelluric forward modeling 基于注意门机制的三维大地电磁正演建模神经网络体系结构
IF 4.4 2区 地球科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-11-15 DOI: 10.1016/j.cageo.2025.106079
Xin Zhong , Weiwei Ling , Kejia Pan , Chaofei Liu , Pinxia Wu , Jiajing Zhang , Zhiliang Zhan , Wenbo Xiao
Traditional methods of three-dimensional (3D) magnetotelluric (MT) numerical forward modeling, such as the finite element method (FEM) and finite volume method (FVM), suffer from high computational costs and low efficiency due to limitations in mesh refinement and computational resources. We propose a novel neural network architecture named MTAGU-Net, which integrates an attention gating mechanism for 3D MT forward modeling. Specifically, a dual-path attention gating module is designed based on data images of the apparent resistivity and phase and embedded in the skip connections between the encoder and decoder. This module enables the fusion of critical anomaly information from shallow feature maps during the decoding of deep feature maps, significantly enhancing the network’s capability to extract features from anomalous regions. Numerical experiments show that the forward predictions of MTAGU-Net closely match the numerical solutions obtained through FEM. Compared to leading network models such as 3D U-Net and Swin-UNETR, MTAGU-Net not only achieves superior prediction accuracy but also demonstrates robust generalization capabilities when handling model samples that were not part of the training dataset. For medium-scale grid forward modeling, MTAGU-Net delivers a prediction speed more than a hundred times faster than FEM. This remarkable computational efficiency makes MTAGU-Net a highly promising core engine for inversion algorithms, significantly enhancing the computational performance of 3D MT inversion.
传统的三维大地电磁数值正演方法,如有限元法(FEM)和有限体积法(FVM),由于网格细化和计算资源的限制,计算成本高,效率低。我们提出了一种新的神经网络体系结构mtaguu - net,它集成了一个用于三维MT正演建模的注意力门控机制。具体而言,设计了基于视电阻率和相位数据图像的双路注意门控模块,并将其嵌入到编码器和解码器之间的跳线连接中。该模块实现了深度特征图解码过程中浅层特征图关键异常信息的融合,显著提高了网络对异常区域特征的提取能力。数值实验表明,mtaguu - net的正演预测结果与有限元数值解吻合较好。与3D U-Net和swan - unetr等领先的网络模型相比,mtaguu - net不仅实现了更高的预测精度,而且在处理非训练数据集的模型样本时表现出强大的泛化能力。对于中等尺度网格正演建模,mtague - net提供的预测速度比FEM快100倍以上。这种卓越的计算效率使mtaguu - net成为极具前景的反演算法核心引擎,显著提高了三维大地电磁反演的计算性能。
{"title":"A neural network architecture based on attention gate mechanism for 3D magnetotelluric forward modeling","authors":"Xin Zhong ,&nbsp;Weiwei Ling ,&nbsp;Kejia Pan ,&nbsp;Chaofei Liu ,&nbsp;Pinxia Wu ,&nbsp;Jiajing Zhang ,&nbsp;Zhiliang Zhan ,&nbsp;Wenbo Xiao","doi":"10.1016/j.cageo.2025.106079","DOIUrl":"10.1016/j.cageo.2025.106079","url":null,"abstract":"<div><div>Traditional methods of three-dimensional (3D) magnetotelluric (MT) numerical forward modeling, such as the finite element method (FEM) and finite volume method (FVM), suffer from high computational costs and low efficiency due to limitations in mesh refinement and computational resources. We propose a novel neural network architecture named MTAGU-Net, which integrates an attention gating mechanism for 3D MT forward modeling. Specifically, a dual-path attention gating module is designed based on data images of the apparent resistivity and phase and embedded in the skip connections between the encoder and decoder. This module enables the fusion of critical anomaly information from shallow feature maps during the decoding of deep feature maps, significantly enhancing the network’s capability to extract features from anomalous regions. Numerical experiments show that the forward predictions of MTAGU-Net closely match the numerical solutions obtained through FEM. Compared to leading network models such as 3D U-Net and Swin-UNETR, MTAGU-Net not only achieves superior prediction accuracy but also demonstrates robust generalization capabilities when handling model samples that were not part of the training dataset. For medium-scale grid forward modeling, MTAGU-Net delivers a prediction speed more than a hundred times faster than FEM. This remarkable computational efficiency makes MTAGU-Net a highly promising core engine for inversion algorithms, significantly enhancing the computational performance of 3D MT inversion.</div></div>","PeriodicalId":55221,"journal":{"name":"Computers & Geosciences","volume":"207 ","pages":"Article 106079"},"PeriodicalIF":4.4,"publicationDate":"2025-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145571755","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Computers & Geosciences
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1