首页 > 最新文献

Applied Computing and Geosciences最新文献

英文 中文
Corrigendum to “Enhancing neuro-symbolic AI for mineral prediction via LLM-guided knowledge integration” [Appl. Comput. Geosci. 29 (2026) 100310] “通过llm引导的知识整合增强矿物预测的神经符号人工智能”的勘误表[应用程序]。第一版。地球科学进展,29 (2026)100310 [j]
IF 3.2 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-01-20 DOI: 10.1016/j.acags.2026.100321
Weilin Chen, Jiyin Zhang, Chenhao Li, Xiaogang Ma
{"title":"Corrigendum to “Enhancing neuro-symbolic AI for mineral prediction via LLM-guided knowledge integration” [Appl. Comput. Geosci. 29 (2026) 100310]","authors":"Weilin Chen, Jiyin Zhang, Chenhao Li, Xiaogang Ma","doi":"10.1016/j.acags.2026.100321","DOIUrl":"10.1016/j.acags.2026.100321","url":null,"abstract":"","PeriodicalId":33804,"journal":{"name":"Applied Computing and Geosciences","volume":"29 ","pages":"Article 100321"},"PeriodicalIF":3.2,"publicationDate":"2026-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146037979","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comparison of deep learning models for 1D magnetotelluric inversion 一维大地电磁反演深度学习模型比较
IF 3.2 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-01-16 DOI: 10.1016/j.acags.2026.100320
Hakim Saibi , Abdelhadi Hireche , Takeshi Tsuji , Mohammed Y. Ali , Ahmad B. Ahmad
This paper presents a comparative study of three deep learning (DL) architectures for one-dimensional magnetotelluric (MT) data inversion: long short-term memory (LSTM), gated recurrent unit (GRU), and Informer models. We developed a comprehensive framework for generating realistic synthetic MT data, training the models, and evaluating their performance through multiple quantitative metrics. Our synthetic dataset comprised 100,000 samples with 25 periods spanning 10−3 to 103 seconds, created using statistical parameters derived from real MT data. Each model was trained on apparent resistivity and phase responses to recover subsurface resistivity profiles. The results show that the recurrent neural network architectures (LSTM and GRU) slightly outperform the attention-based Informer model, with the LSTM achieving the best performance (MSE of 0.06455 Ωm2, R2 of 0.45234). Despite their differing architectures, all the models successfully captured the major subsurface resistivity contrasts. When applied to real MT data from the UAE, the tested models showed promising results in terms of reconstructing subsurface structures. Overall, this study demonstrates the viability of DL approaches for MT inversion, with potential applications in efficient field-based subsurface imaging.
本文介绍了用于一维大地电磁数据反演的三种深度学习(DL)体系结构:长短期记忆(LSTM)、门控循环单元(GRU)和Informer模型的比较研究。我们开发了一个综合框架,用于生成真实的合成MT数据,训练模型,并通过多个定量指标评估其性能。我们的合成数据集包括100,000个样本,25个周期跨度为10−3到103秒,使用来自真实MT数据的统计参数创建。每个模型都经过视电阻率和相位响应的训练,以恢复地下电阻率剖面。结果表明,递归神经网络架构(LSTM和GRU)的性能略优于基于注意力的Informer模型,其中LSTM的性能最佳(MSE为0.06455 Ωm2, R2为0.45234)。尽管它们的结构不同,但所有模型都成功捕获了主要的地下电阻率对比。当应用于来自阿联酋的真实MT数据时,测试模型在重建地下结构方面显示出令人鼓舞的结果。总的来说,这项研究证明了深度学习方法在大地电磁学反演中的可行性,在高效的基于现场的地下成像中具有潜在的应用前景。
{"title":"Comparison of deep learning models for 1D magnetotelluric inversion","authors":"Hakim Saibi ,&nbsp;Abdelhadi Hireche ,&nbsp;Takeshi Tsuji ,&nbsp;Mohammed Y. Ali ,&nbsp;Ahmad B. Ahmad","doi":"10.1016/j.acags.2026.100320","DOIUrl":"10.1016/j.acags.2026.100320","url":null,"abstract":"<div><div>This paper presents a comparative study of three deep learning (DL) architectures for one-dimensional magnetotelluric (MT) data inversion: long short-term memory (LSTM), gated recurrent unit (GRU), and Informer models. We developed a comprehensive framework for generating realistic synthetic MT data, training the models, and evaluating their performance through multiple quantitative metrics. Our synthetic dataset comprised 100,000 samples with 25 periods spanning 10<sup>−3</sup> to 10<span><math><msup><mrow></mrow><mrow><mn>3</mn></mrow></msup></math></span> seconds, created using statistical parameters derived from real MT data. Each model was trained on apparent resistivity and phase responses to recover subsurface resistivity profiles. The results show that the recurrent neural network architectures (LSTM and GRU) slightly outperform the attention-based Informer model, with the LSTM achieving the best performance (MSE of 0.06455 <span><math><mrow><mi>Ω</mi><mspace></mspace><msup><mrow><mtext>m</mtext></mrow><mrow><mn>2</mn></mrow></msup></mrow></math></span>, R<span><math><msup><mrow></mrow><mrow><mn>2</mn></mrow></msup></math></span> of 0.45234). Despite their differing architectures, all the models successfully captured the major subsurface resistivity contrasts. When applied to real MT data from the UAE, the tested models showed promising results in terms of reconstructing subsurface structures. Overall, this study demonstrates the viability of DL approaches for MT inversion, with potential applications in efficient field-based subsurface imaging.</div></div>","PeriodicalId":33804,"journal":{"name":"Applied Computing and Geosciences","volume":"29 ","pages":"Article 100320"},"PeriodicalIF":3.2,"publicationDate":"2026-01-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145977262","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Simulating tectonic stress fields in mineralization dynamics using a deep neural networks-based surrogate model 利用基于深度神经网络的替代模型模拟成矿动力学中的构造应力场
IF 3.2 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-01-09 DOI: 10.1016/j.acags.2026.100318
Fan Xiao , Ao Tang , Dunhui Xiao , Shu Jiang , Qiuming Cheng
Tectonically induced fault systems create favorable environments for hydrothermal fluid migration and mineral precipitation, ultimately leading to the formation of certain deposits. Therefore, the numerical simulation of tectonic stress fields in mineralizing dynamic systems is crucial for understanding mineralization processes and for predictive modeling of ore prospectivity related to structure-controlled deposits. However, uncertainties in rock-physical parameters can undermine the reliability of simulation results. Although trial-and-error methods may seem to offer a solution to this issue, they typically necessitate extensive numerical simulations, leading to high computational costs. In this study, we developed a deep learning-based surrogate modeling method, using the Fankou lead-zinc deposit as a case study. This method enables rapid and accurate calculations of complex stress field distributions within mineralizing dynamic systems. We employed the Sobol method to analyze the sensitivity of the first, second and third principal stresses, to critial rock-physical parameters, including density, Poisson's ratio, and elastic modulus. This analysis allowed us to identify the critical parameters for the mineralization dynamics model. Subsequently, we utilized Latin hypercube sampling (LHS) to generate parameter sets within the key parameter space for numerical simulations, resulting in high-fidelity datasets of three distinct stress field distributions. Finally, we constructed an end-to-end surrogate model for each of the three stress fields based on deep neural networks, using the rock-physical parameters generated by LHS as input variables and the high-fidelity datasets obtained from numerical simulations as prediction variables. Evaluation metrics in the test dataset, including the correlation coefficient, root mean square error, mean relative error, and mean absolute error, indicate that the surrogate models perform well, effectively capturing the spatial distribution of the tectonic stress field within the mineralizing dynamic system. Our results also demonstrate that the average computational efficiency of the data-driven surrogate models is approximately 200 times greater than that of numerical simulation methods, providing an effective and rapid computational framework for parameter inversion and optimization related to the tectonic stress field in complex mineralizing dynamic systems.
构造断裂系统为热液运移和矿物沉淀创造了有利的环境,最终导致了某些矿床的形成。因此,成矿动力系统中构造应力场的数值模拟对于认识成矿过程和构造控矿找矿预测建模具有重要意义。然而,岩石物理参数的不确定性会影响模拟结果的可靠性。虽然试错法似乎可以解决这个问题,但它们通常需要大量的数值模拟,从而导致高计算成本。本文以凡口铅锌矿为例,开发了一种基于深度学习的代理建模方法。该方法能够快速准确地计算矿化动态系统中的复杂应力场分布。我们采用Sobol方法分析了第一、第二和第三主应力对关键岩石物理参数(包括密度、泊松比和弹性模量)的敏感性。这一分析使我们能够确定成矿动力学模型的关键参数。随后,我们利用拉丁超立方体采样(LHS)在数值模拟的关键参数空间内生成参数集,得到三种不同应力场分布的高保真数据集。最后,以LHS生成的岩石物性参数为输入变量,以数值模拟获得的高保真数据集为预测变量,基于深度神经网络构建了三个应力场的端到端代理模型。测试数据集的相关系数、均方根误差、平均相对误差和平均绝对误差等评价指标表明,替代模型表现良好,有效捕捉了成矿动力系统内构造应力场的空间分布。数据驱动替代模型的平均计算效率约为数值模拟方法的200倍,为复杂矿化动力系统构造应力场参数反演与优化提供了有效、快速的计算框架。
{"title":"Simulating tectonic stress fields in mineralization dynamics using a deep neural networks-based surrogate model","authors":"Fan Xiao ,&nbsp;Ao Tang ,&nbsp;Dunhui Xiao ,&nbsp;Shu Jiang ,&nbsp;Qiuming Cheng","doi":"10.1016/j.acags.2026.100318","DOIUrl":"10.1016/j.acags.2026.100318","url":null,"abstract":"<div><div>Tectonically induced fault systems create favorable environments for hydrothermal fluid migration and mineral precipitation, ultimately leading to the formation of certain deposits. Therefore, the numerical simulation of tectonic stress fields in mineralizing dynamic systems is crucial for understanding mineralization processes and for predictive modeling of ore prospectivity related to structure-controlled deposits. However, uncertainties in rock-physical parameters can undermine the reliability of simulation results. Although trial-and-error methods may seem to offer a solution to this issue, they typically necessitate extensive numerical simulations, leading to high computational costs. In this study, we developed a deep learning-based surrogate modeling method, using the Fankou lead-zinc deposit as a case study. This method enables rapid and accurate calculations of complex stress field distributions within mineralizing dynamic systems. We employed the Sobol method to analyze the sensitivity of the first, second and third principal stresses, to critial rock-physical parameters, including density, Poisson's ratio, and elastic modulus. This analysis allowed us to identify the critical parameters for the mineralization dynamics model. Subsequently, we utilized Latin hypercube sampling (LHS) to generate parameter sets within the key parameter space for numerical simulations, resulting in high-fidelity datasets of three distinct stress field distributions. Finally, we constructed an end-to-end surrogate model for each of the three stress fields based on deep neural networks, using the rock-physical parameters generated by LHS as input variables and the high-fidelity datasets obtained from numerical simulations as prediction variables. Evaluation metrics in the test dataset, including the correlation coefficient, root mean square error, mean relative error, and mean absolute error, indicate that the surrogate models perform well, effectively capturing the spatial distribution of the tectonic stress field within the mineralizing dynamic system. Our results also demonstrate that the average computational efficiency of the data-driven surrogate models is approximately 200 times greater than that of numerical simulation methods, providing an effective and rapid computational framework for parameter inversion and optimization related to the tectonic stress field in complex mineralizing dynamic systems.</div></div>","PeriodicalId":33804,"journal":{"name":"Applied Computing and Geosciences","volume":"29 ","pages":"Article 100318"},"PeriodicalIF":3.2,"publicationDate":"2026-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145977263","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhanced anomaly detection in well log data through the application of ensemble GANs 集成gan的应用增强了对测井数据的异常检测
IF 3.2 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-12-31 DOI: 10.1016/j.acags.2025.100316
Abdulrahman Al-Fakih , Ardiansyah Koeshidayatullah , Tapan Mukerji , SanLinn I. Kaka
Detecting subtle anomalies in well log data can significantly enhance subsurface characterization and inform reservoir decision-making. Well logs provide detailed records of rock and fluid properties. However, identifying anomalous patterns in these data remains a persistent challenge due to the complexity of geological systems and limitations in traditional statistical models. Classical anomaly detection approaches, such as Gaussian mixture models (GMMs), tend to oversimplify the intricacies of well log responses and often misinterpret geological heterogeneities as tool noise. This misclassification leads to high false positive rates and unreliable anomaly identification. In this context, generative models, particularly generative adversarial networks (GANs), have shown promise in structured data domains; however, they remain underutilized in geoscience applications. This study aims to benchmark the performance of ensemble GANs (EGANs) against GMMs for anomaly detection in well log data. The proposed EGANs framework aggregates multiple independently trained GANs to enhance model stability and robustness. Anomalies are detected based on discriminator scoring, while performance is evaluated using precision, recall, and F1-score across four key logs: gamma ray, travel time, bulk density, and neutron porosity. The results demonstrate that EGANs consistently outperform GMMs across all logs, achieving higher precision (up to 0.70) and F1-scores (up to 0.79), with statistically significant improvements confirmed via paired t-tests. These findings highlight the ability of EGANs to model complex subsurface patterns and detect subtle deviations more effectively than conventional probabilistic methods. This study introduces the first application of EGANs to petrophysical anomaly detection, bridging deep learning with geoscience workflows. It offers a scalable framework for integrating data-driven anomaly detection into reservoir modeling, quality control, and near-real-time decision-making in drilling operations. Future work will focus on multivariate analysis, cross-basin validation, and real-time deployment, advancing toward a more intelligent, adaptive reservoir monitoring system.
发现测井数据中的细微异常可以显著增强地下特征,并为储层决策提供信息。测井可以提供岩石和流体性质的详细记录。然而,由于地质系统的复杂性和传统统计模型的局限性,识别这些数据中的异常模式仍然是一个持续的挑战。经典的异常检测方法,如高斯混合模型(gmm),往往会过度简化测井响应的复杂性,并经常将地质非均质性误解为工具噪声。这种错误的分类导致高假阳性率和不可靠的异常识别。在这种情况下,生成模型,特别是生成对抗网络(gan),在结构化数据领域显示出前景;然而,它们在地球科学应用中仍未得到充分利用。本研究旨在对集成gan (EGANs)与GMMs在测井数据异常检测方面的性能进行基准测试。提出的EGANs框架聚合了多个独立训练的gan,以增强模型的稳定性和鲁棒性。异常检测基于判别评分,而性能评估使用精度、召回率和f1评分在四个关键日志:伽马射线、旅行时间、体积密度和中子孔隙度。结果表明,EGANs在所有日志中始终优于GMMs,获得更高的精度(高达0.70)和f1分数(高达0.79),通过配对t检验证实了统计学上显著的改进。这些发现强调了EGANs比传统的概率方法更有效地模拟复杂的地下模式和检测细微偏差的能力。本研究首次介绍了EGANs在岩石物理异常检测中的应用,将深度学习与地球科学工作流程相结合。它提供了一个可扩展的框架,将数据驱动的异常检测集成到油藏建模、质量控制和钻井作业中的近实时决策中。未来的工作将集中在多变量分析、跨盆地验证和实时部署上,朝着更智能、更自适应的油藏监测系统发展。
{"title":"Enhanced anomaly detection in well log data through the application of ensemble GANs","authors":"Abdulrahman Al-Fakih ,&nbsp;Ardiansyah Koeshidayatullah ,&nbsp;Tapan Mukerji ,&nbsp;SanLinn I. Kaka","doi":"10.1016/j.acags.2025.100316","DOIUrl":"10.1016/j.acags.2025.100316","url":null,"abstract":"<div><div>Detecting subtle anomalies in well log data can significantly enhance subsurface characterization and inform reservoir decision-making. Well logs provide detailed records of rock and fluid properties. However, identifying anomalous patterns in these data remains a persistent challenge due to the complexity of geological systems and limitations in traditional statistical models. Classical anomaly detection approaches, such as Gaussian mixture models (GMMs), tend to oversimplify the intricacies of well log responses and often misinterpret geological heterogeneities as tool noise. This misclassification leads to high false positive rates and unreliable anomaly identification. In this context, generative models, particularly generative adversarial networks (GANs), have shown promise in structured data domains; however, they remain underutilized in geoscience applications. This study aims to benchmark the performance of ensemble GANs (EGANs) against GMMs for anomaly detection in well log data. The proposed EGANs framework aggregates multiple independently trained GANs to enhance model stability and robustness. Anomalies are detected based on discriminator scoring, while performance is evaluated using precision, recall, and F1-score across four key logs: gamma ray, travel time, bulk density, and neutron porosity. The results demonstrate that EGANs consistently outperform GMMs across all logs, achieving higher precision (up to 0.70) and F1-scores (up to 0.79), with statistically significant improvements confirmed via paired t-tests. These findings highlight the ability of EGANs to model complex subsurface patterns and detect subtle deviations more effectively than conventional probabilistic methods. This study introduces the first application of EGANs to petrophysical anomaly detection, bridging deep learning with geoscience workflows. It offers a scalable framework for integrating data-driven anomaly detection into reservoir modeling, quality control, and near-real-time decision-making in drilling operations. Future work will focus on multivariate analysis, cross-basin validation, and real-time deployment, advancing toward a more intelligent, adaptive reservoir monitoring system.</div></div>","PeriodicalId":33804,"journal":{"name":"Applied Computing and Geosciences","volume":"29 ","pages":"Article 100316"},"PeriodicalIF":3.2,"publicationDate":"2025-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145977264","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Synergizing Radon transform and DINOv2 for artifact-resilient digital rock segmentation 协同Radon变换和DINOv2的伪影弹性数字岩石分割
IF 3.2 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-12-23 DOI: 10.1016/j.acags.2025.100315
Shuai Hou , Danping Cao , Zhiyu Hou , Yali Zhu , Ziqiang Wang
Precise segmentation of core images acquired by X-ray computed tomography (X-CT) is fundamental for subsequent digital rock physics analysis. However, in cores with abundant high-density minerals, insufficient X-ray penetration often leads to distorted pore structures and severe artifacts, which compromise segmentation accuracy. To address this challenge, we propose a hybrid method that synergizes Radon transform with the DINOv2 vision foundation model. Our approach employs two stage strategy: first, a ‘hard correction’ via Radon transform to leverage the directional features of artifacts and suppress directional artifact patterns in the sinogram domain; second, a “soft learning” mechanism, where the fine-tuned DINOv2 model leverages its global semantic priors to discern and rectify residual artifact interference in the feature space. This hybrid approach not only suppresses artifact interference but also preserves the continuity and visual plausibility of pore structures. Experimental results demonstrate that, on synthetic artifact-contaminated samples, the model achieves incremental three-phase segmentation performance (mIoU = 0.755, F1 = 92.15 %). While Appling to real core images, with heavy mineral artifacts, the model demonstrates robust error-correcting capabilities, yielding segmentations that align more closely with the expected rock microstructure than the original ground truth annotations. This study provides an effective and validated framework for artifact resilient digital rock segmentation, offering substantial improvements for quantitative DRP studies.
x射线计算机断层扫描(X-CT)获得的岩心图像的精确分割是后续数字岩石物理分析的基础。然而,在富含高密度矿物的岩心中,x射线穿透不足往往会导致孔隙结构畸变和严重的伪影,从而影响分割精度。为了解决这一挑战,我们提出了一种将Radon变换与DINOv2视觉基础模型协同的混合方法。我们的方法采用两阶段策略:首先,通过Radon变换进行“硬校正”,以利用伪影的方向性特征并抑制sinogram域中的方向性伪影模式;第二,“软学习”机制,其中微调的DINOv2模型利用其全局语义先验来识别和纠正特征空间中的残余人工干扰。这种混合方法不仅抑制了伪影干扰,而且保持了孔隙结构的连续性和视觉上的合理性。实验结果表明,该模型在人工制品污染样品上取得了增量式三相分割效果(mIoU = 0.755, F1 = 92.15%)。当应用于真实岩心图像时,该模型显示出强大的纠错能力,产生的分割比原始的地面真实注释更接近预期的岩石微观结构。该研究为伪影弹性数字岩石分割提供了一个有效且经过验证的框架,为定量DRP研究提供了实质性的改进。
{"title":"Synergizing Radon transform and DINOv2 for artifact-resilient digital rock segmentation","authors":"Shuai Hou ,&nbsp;Danping Cao ,&nbsp;Zhiyu Hou ,&nbsp;Yali Zhu ,&nbsp;Ziqiang Wang","doi":"10.1016/j.acags.2025.100315","DOIUrl":"10.1016/j.acags.2025.100315","url":null,"abstract":"<div><div>Precise segmentation of core images acquired by X-ray computed tomography (X-CT) is fundamental for subsequent digital rock physics analysis. However, in cores with abundant high-density minerals, insufficient X-ray penetration often leads to distorted pore structures and severe artifacts, which compromise segmentation accuracy. To address this challenge, we propose a hybrid method that synergizes Radon transform with the DINOv2 vision foundation model. Our approach employs two stage strategy: first, a ‘hard correction’ via Radon transform to leverage the directional features of artifacts and suppress directional artifact patterns in the sinogram domain; second, a “soft learning” mechanism, where the fine-tuned DINOv2 model leverages its global semantic priors to discern and rectify residual artifact interference in the feature space. This hybrid approach not only suppresses artifact interference but also preserves the continuity and visual plausibility of pore structures. Experimental results demonstrate that, on synthetic artifact-contaminated samples, the model achieves incremental three-phase segmentation performance (mIoU = 0.755, F1 = 92.15 %). While Appling to real core images, with heavy mineral artifacts, the model demonstrates robust error-correcting capabilities, yielding segmentations that align more closely with the expected rock microstructure than the original ground truth annotations. This study provides an effective and validated framework for artifact resilient digital rock segmentation, offering substantial improvements for quantitative DRP studies.</div></div>","PeriodicalId":33804,"journal":{"name":"Applied Computing and Geosciences","volume":"29 ","pages":"Article 100315"},"PeriodicalIF":3.2,"publicationDate":"2025-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145977261","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
PetroMind: A multimodal petrographic model for rock image classification and lithological description generation PetroMind:用于岩石图像分类和岩性描述生成的多模态岩石学模型
IF 3.2 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-12-16 DOI: 10.1016/j.acags.2025.100314
Zhongliang Chen , Chaojie Zheng , Mingming Zhang , Zhaoqi Hu , Jianchao Duan
Recent studies have revealed the remarkable capabilities of multimodal large models (MLLMs) in general vision-and-language tasks, generating increasing interest in their application to geoscientific domains. Rock image recognition and lithological description constitute fundamental skills for geologists and represent one of the earliest application scenarios in geosciences where artificial intelligence technologies have been actively explored. As rock image recognition and description are inherently multimodal tasks, involving both visual and textual data, models are required to understand the interactions between modalities and how these interactions affect information transfer. However, the simultaneous comprehension of rock images and their corresponding lithological descriptions remains a significant challenge for current machine learning frameworks. The impact of multimodal co-training on downstream geoscientific performance is still unconfirmed. This study presents PetroMind, a domain-adapted multimodal petrographic foundation model built upon Qwen2.5-VL, together with SA-Rock, a novel LLM-based metric designed to assess the semantic accuracy of rock image generation descriptions to evaluate the capabilities of MLLMs in petrographic multimodal tasks. In the rock image classification task, PetroMind achieves performance comparable to ViT-Base-Patch16-224, with accuracy and Macro-F1 scores exceeding or approaching 97 %. This demonstrates PetroMind's strong capability in long-tailed image learning and highlights its effectiveness in substantially improving classification accuracy for few-shot rock image categories. In the lithological description generation task, PetroMind attains BLEU-4 and SA-Rock scores of 0.644 and 7.864, respectively, indicating good few-shot learning performance. The SA-Rock metric shows that the model produces highly accurate descriptions of rock structure, texture, and colour, while leaving considerable scope for improvement in the description of mineral composition. Ablation experiments further indicate that task-specific LoRA adapters are more effective for high-resource tasks such as image classification, whereas a single shared LoRA adapter demonstrates superior multi-task interaction in low-resource captioning scenarios. This study demonstrates the potential of MLLM-based architectures in jointly understanding rock images and their associated geological descriptions.
最近的研究揭示了多模态大模型(mllm)在一般视觉和语言任务中的卓越能力,引起了人们对其在地球科学领域应用的越来越多的兴趣。岩石图像识别和岩性描述是地质学家的基本技能,也是人工智能技术在地球科学中最早应用的领域之一。由于岩石图像识别和描述本质上是多模态任务,涉及视觉和文本数据,因此需要模型来理解模态之间的相互作用以及这些相互作用如何影响信息传递。然而,同时理解岩石图像及其相应的岩性描述仍然是当前机器学习框架面临的一个重大挑战。多模式协同训练对下游地球科学性能的影响尚未得到证实。本研究提出了PetroMind,一个基于Qwen2.5-VL的自适应多模态岩石学基础模型,以及SA-Rock,一个新的基于llm的度量,旨在评估岩石图像生成描述的语义准确性,以评估mllm在岩石学多模态任务中的能力。在岩石图像分类任务中,PetroMind实现了与viti - base - patch16 -224相当的性能,准确率和Macro-F1得分超过或接近97%。这证明了PetroMind在长尾图像学习方面的强大能力,并突出了其在大幅提高少量岩石图像类别分类精度方面的有效性。在岩性描述生成任务中,PetroMind的BLEU-4和SA-Rock得分分别为0.644和7.864,显示了良好的少拍学习性能。SA-Rock度量表明,该模型对岩石结构、纹理和颜色的描述非常准确,同时在矿物成分的描述方面留下了相当大的改进空间。消融实验进一步表明,特定任务的LoRA适配器在高资源任务(如图像分类)中更有效,而单个共享的LoRA适配器在低资源字幕场景中表现出更好的多任务交互。这项研究证明了基于mllm的体系结构在共同理解岩石图像及其相关地质描述方面的潜力。
{"title":"PetroMind: A multimodal petrographic model for rock image classification and lithological description generation","authors":"Zhongliang Chen ,&nbsp;Chaojie Zheng ,&nbsp;Mingming Zhang ,&nbsp;Zhaoqi Hu ,&nbsp;Jianchao Duan","doi":"10.1016/j.acags.2025.100314","DOIUrl":"10.1016/j.acags.2025.100314","url":null,"abstract":"<div><div>Recent studies have revealed the remarkable capabilities of multimodal large models (MLLMs) in general vision-and-language tasks, generating increasing interest in their application to geoscientific domains. Rock image recognition and lithological description constitute fundamental skills for geologists and represent one of the earliest application scenarios in geosciences where artificial intelligence technologies have been actively explored. As rock image recognition and description are inherently multimodal tasks, involving both visual and textual data, models are required to understand the interactions between modalities and how these interactions affect information transfer. However, the simultaneous comprehension of rock images and their corresponding lithological descriptions remains a significant challenge for current machine learning frameworks. The impact of multimodal co-training on downstream geoscientific performance is still unconfirmed. This study presents PetroMind, a domain-adapted multimodal petrographic foundation model built upon Qwen2.5-VL, together with SA-Rock, a novel LLM-based metric designed to assess the semantic accuracy of rock image generation descriptions to evaluate the capabilities of MLLMs in petrographic multimodal tasks. In the rock image classification task, PetroMind achieves performance comparable to ViT-Base-Patch16-224, with accuracy and Macro-F1 scores exceeding or approaching 97 %. This demonstrates PetroMind's strong capability in long-tailed image learning and highlights its effectiveness in substantially improving classification accuracy for few-shot rock image categories. In the lithological description generation task, PetroMind attains BLEU-4 and SA-Rock scores of 0.644 and 7.864, respectively, indicating good few-shot learning performance. The SA-Rock metric shows that the model produces highly accurate descriptions of rock structure, texture, and colour, while leaving considerable scope for improvement in the description of mineral composition. Ablation experiments further indicate that task-specific LoRA adapters are more effective for high-resource tasks such as image classification, whereas a single shared LoRA adapter demonstrates superior multi-task interaction in low-resource captioning scenarios. This study demonstrates the potential of MLLM-based architectures in jointly understanding rock images and their associated geological descriptions.</div></div>","PeriodicalId":33804,"journal":{"name":"Applied Computing and Geosciences","volume":"29 ","pages":"Article 100314"},"PeriodicalIF":3.2,"publicationDate":"2025-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145977530","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhancing neuro-symbolic AI for mineral prediction via LLM-guided knowledge integration 通过llm引导的知识整合增强矿物预测的神经符号人工智能
IF 3.2 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-12-03 DOI: 10.1016/j.acags.2025.100310
Weilin Chen, Jiyin Zhang, Chenhao Li, Xiaogang Ma
Integrating domain knowledge into machine learning (ML) models is critical for achieving reliable and interpretable predictions in complex scientific fields such as geoscience. In several recent studies centered on the so-called Neuro-Symbolic AI (NSAI) frameworks, symbolic geological knowledge was successfully combined with traditional ML algorithms to improve the prediction of mineral deposit types. The fast development of Large Language Model (LLM) brings new opportunities to further enhance the NSAI applications. In this study, to construct the symbolic component of NSAI, we used an LLM to automatically extract, structure, and transform descriptive knowledge from authoritative geoscience textbooks into a machine-readable format. The result captures geochemical signatures, lithological settings, and alteration features associated with various mineral systems. The structured knowledge was integrated into a decision tree classifier by embedding each sample with a vectorized representation of its corresponding deposit type. Compared to conventional ML models trained solely on geochemical data, our NSAI model achieved significantly higher accuracy on the test sets, indicating improved generalization. Moreover, the NSAI model demonstrated consistent performance across a broader set of deposit types, including those with extremely limited training samples. In particular, the NSAI framework improved predictive stability and accuracy even for minority classes with only 3 to 5 samples, where traditional ML models tend to overfit or fail. This robustness underscores the value of incorporating expert-level geological knowledge into data-driven pipelines. In our result assessment, the SHAP (SHapley Additive exPlanations) analysis further revealed that symbolic knowledge vectors contributed substantially to the model's decision-making process, confirming their importance in enhancing interpretability and predictive power. Our work demonstrates that LLM-guided knowledge extraction offers an effective and scalable way to integrate structured domain knowledge into mineral prediction tasks. We hope the work can also provide insights for other geoscientific applications of NSAI.
将领域知识集成到机器学习(ML)模型中,对于在地球科学等复杂科学领域实现可靠和可解释的预测至关重要。在最近几项以所谓的神经符号人工智能(NSAI)框架为中心的研究中,符号地质知识成功地与传统的ML算法相结合,以提高对矿床类型的预测。大型语言模型(LLM)的快速发展为进一步增强NSAI的应用带来了新的机遇。在本研究中,为了构建NSAI的符号成分,我们使用LLM自动提取、结构和转换来自权威地学教科书的描述性知识为机器可读格式。结果捕获了与各种矿物系统相关的地球化学特征、岩性背景和蚀变特征。通过将每个样本嵌入其相应沉积类型的矢量化表示,将结构化知识集成到决策树分类器中。与仅在地球化学数据上训练的传统ML模型相比,我们的NSAI模型在测试集上取得了更高的精度,表明泛化能力得到了提高。此外,NSAI模型在更广泛的储层类型中表现出一致的性能,包括那些训练样本极其有限的储层。特别是,NSAI框架提高了预测的稳定性和准确性,即使只有3到5个样本的少数类别,传统的ML模型往往会过拟合或失败。这种稳健性强调了将专家级地质知识整合到数据驱动的管道中的价值。在我们的结果评估中,SHAP (SHapley Additive exPlanations)分析进一步揭示了符号知识向量对模型决策过程的重要贡献,证实了它们在提高可解释性和预测能力方面的重要性。我们的工作表明,法学硕士指导下的知识提取提供了一种有效且可扩展的方法,将结构化领域知识集成到矿物预测任务中。我们希望这项工作也可以为NSAI的其他地球科学应用提供见解。
{"title":"Enhancing neuro-symbolic AI for mineral prediction via LLM-guided knowledge integration","authors":"Weilin Chen,&nbsp;Jiyin Zhang,&nbsp;Chenhao Li,&nbsp;Xiaogang Ma","doi":"10.1016/j.acags.2025.100310","DOIUrl":"10.1016/j.acags.2025.100310","url":null,"abstract":"<div><div>Integrating domain knowledge into machine learning (ML) models is critical for achieving reliable and interpretable predictions in complex scientific fields such as geoscience. In several recent studies centered on the so-called Neuro-Symbolic AI (NSAI) frameworks, symbolic geological knowledge was successfully combined with traditional ML algorithms to improve the prediction of mineral deposit types. The fast development of Large Language Model (LLM) brings new opportunities to further enhance the NSAI applications. In this study, to construct the symbolic component of NSAI, we used an LLM to automatically extract, structure, and transform descriptive knowledge from authoritative geoscience textbooks into a machine-readable format. The result captures geochemical signatures, lithological settings, and alteration features associated with various mineral systems. The structured knowledge was integrated into a decision tree classifier by embedding each sample with a vectorized representation of its corresponding deposit type. Compared to conventional ML models trained solely on geochemical data, our NSAI model achieved significantly higher accuracy on the test sets, indicating improved generalization. Moreover, the NSAI model demonstrated consistent performance across a broader set of deposit types, including those with extremely limited training samples. In particular, the NSAI framework improved predictive stability and accuracy even for minority classes with only 3 to 5 samples, where traditional ML models tend to overfit or fail. This robustness underscores the value of incorporating expert-level geological knowledge into data-driven pipelines. In our result assessment, the SHAP (SHapley Additive exPlanations) analysis further revealed that symbolic knowledge vectors contributed substantially to the model's decision-making process, confirming their importance in enhancing interpretability and predictive power. Our work demonstrates that LLM-guided knowledge extraction offers an effective and scalable way to integrate structured domain knowledge into mineral prediction tasks. We hope the work can also provide insights for other geoscientific applications of NSAI.</div></div>","PeriodicalId":33804,"journal":{"name":"Applied Computing and Geosciences","volume":"29 ","pages":"Article 100310"},"PeriodicalIF":3.2,"publicationDate":"2025-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145705705","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Separation of P- and S-waves on shallow subsurface using transfer learning 利用迁移学习分离浅层地下纵波和横波
IF 3.2 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-12-01 DOI: 10.1016/j.acags.2025.100307
Jian Li, Jinchao Xing, Lujun Wei, Chuankun Li, Xiaoyan Li
Accurate separation of P- and S-waves is crucial for multi-component seismic imaging in shallow subsurface studies. We proposed a transfer learning framework based on the U-FCN architecture that combines multiscale feature fusion from U-Net with a fully convolutional neural network. Synthetic datasets were generated using finite-difference simulation and Helmholtz decomposition to train the network in a data-driven manner. For field data adaptation, we employed a transfer learning strategy involving: (1) freezing early feature extraction layers, (2) fine-tuning the final some layers, and (3) incorporating dilated convolutions to enhance feature extraction. Numerical simulations and field experiments demonstrate that the proposed approach achieves accurate P- and S-wave separation. On the field seismic data acquired from the Loess Plateau, the separated P-waves achieved R2=0.952, SSIM=0.906, and PSNR=29.197 dB, while the S-waves reached R2=0.938, SSIM=0.885, and PSNR=28.846 dB, with an inference time of only 1.3 s. Compared with conventional methods, our approach achieves cleaner P- and S-wave separation with significantly higher computational efficiency and without relying on prior model parameters. The results confirm the robustness and scalability of the method for real-world seismic applications, effectively bridging the gap between synthetic-based training and field data interpretation.
纵波和横波的准确分离是浅层地下多分量地震成像的关键。我们提出了一种基于U-FCN架构的迁移学习框架,该框架将U-Net的多尺度特征融合与全卷积神经网络相结合。利用有限差分模拟和亥姆霍兹分解生成合成数据集,以数据驱动的方式训练网络。为了适应现场数据,我们采用了一种迁移学习策略,包括:(1)冻结早期的特征提取层,(2)微调最后的一些层,以及(3)结合扩展卷积来增强特征提取。数值模拟和现场实验表明,该方法可以实现准确的纵波和横波分离。在黄土高原实测地震资料中,分离得到的p波R2=0.952, SSIM=0.906, PSNR=29.197 dB, s波R2=0.938, SSIM=0.885, PSNR=28.846 dB,推断时间仅为1.3 s。与传统方法相比,我们的方法实现了更清晰的P波和s波分离,计算效率显著提高,并且不依赖于先前的模型参数。结果证实了该方法在实际地震应用中的鲁棒性和可扩展性,有效地弥合了基于合成的训练和现场数据解释之间的差距。
{"title":"Separation of P- and S-waves on shallow subsurface using transfer learning","authors":"Jian Li,&nbsp;Jinchao Xing,&nbsp;Lujun Wei,&nbsp;Chuankun Li,&nbsp;Xiaoyan Li","doi":"10.1016/j.acags.2025.100307","DOIUrl":"10.1016/j.acags.2025.100307","url":null,"abstract":"<div><div>Accurate separation of P- and S-waves is crucial for multi-component seismic imaging in shallow subsurface studies. We proposed a transfer learning framework based on the U-FCN architecture that combines multiscale feature fusion from U-Net with a fully convolutional neural network. Synthetic datasets were generated using finite-difference simulation and Helmholtz decomposition to train the network in a data-driven manner. For field data adaptation, we employed a transfer learning strategy involving: (1) freezing early feature extraction layers, (2) fine-tuning the final some layers, and (3) incorporating dilated convolutions to enhance feature extraction. Numerical simulations and field experiments demonstrate that the proposed approach achieves accurate P- and S-wave separation. On the field seismic data acquired from the Loess Plateau, the separated P-waves achieved R<span><math><msup><mrow></mrow><mrow><mn>2</mn></mrow></msup></math></span>=0.952, SSIM=0.906, and PSNR=29.197 dB, while the S-waves reached R<span><math><msup><mrow></mrow><mrow><mn>2</mn></mrow></msup></math></span>=0.938, SSIM=0.885, and PSNR=28.846 dB, with an inference time of only 1.3 s. Compared with conventional methods, our approach achieves cleaner P- and S-wave separation with significantly higher computational efficiency and without relying on prior model parameters. The results confirm the robustness and scalability of the method for real-world seismic applications, effectively bridging the gap between synthetic-based training and field data interpretation.</div></div>","PeriodicalId":33804,"journal":{"name":"Applied Computing and Geosciences","volume":"28 ","pages":"Article 100307"},"PeriodicalIF":3.2,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145624028","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Graph-based evidence accumulation for clustering 3D orientation measurements in planetary surface mapping under relational constraints 关系约束下行星表面制图中聚类三维方位测量的基于图的证据积累
IF 3.2 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-12-01 DOI: 10.1016/j.acags.2025.100309
Orhun Aydin
Groups of structural measurements based on orientation similarity are indicative of deformation mechanisms and are important measures to infer the deformation history of planetary surfaces. Despite methods for defining groups based only on orientation or spatial proximity, a general framework for defining orientation-based groups under multiple constraints is lacking. A second challenge pertains to the computational challenge of clustering large structural data due to the large volume and velocity of the data collected as a part of field-based, earth-observing, and planetary missions. In this paper, we propose a general clustering framework for defining groups in angular data based on orientation similarity that can be constrained with relational constraints, such as spatial proximity or prior knowledge of geologic units. We represent the similarity of geologic measurements with a similarity graph where similarity links (graph edges) are defined via the clustering evidence accumulated by re-clustering of data with varying parameters. We showcase the use of a spectral gap measure to define the optimal number of clusters for the evidence graph. We apply the proposed method to define groups of compaction bands using field data collected from the Valley of Fire, NV. We compare our results to a state-of-the-art Bingham mixture model. Results indicate the realism of the proposed method in terms of mapping distinct structural groups under different spatial proximity constraints.
基于取向相似性的构造测量组是指示变形机制的重要手段,是推断行星表面变形历史的重要手段。尽管有仅基于取向或空间接近度定义群体的方法,但缺乏在多种约束下定义基于取向的群体的通用框架。第二个挑战涉及聚类大型结构数据的计算挑战,因为作为实地、地球观测和行星任务的一部分收集的数据量大、速度快。在本文中,我们提出了一个通用的聚类框架,用于基于方向相似性来定义角度数据中的组,该框架可以受到关系约束的约束,例如空间邻近性或地质单元的先验知识。我们用相似图来表示地质测量的相似性,其中相似链接(图边)是通过对不同参数的数据重新聚类而积累的聚类证据来定义的。我们展示了使用谱间隙度量来定义证据图的最佳簇数。我们利用从内华达州火谷收集的现场数据,应用所提出的方法来定义压实带组。我们将结果与最先进的Bingham混合模型进行了比较。结果表明,该方法在不同空间接近度约束下映射出不同的结构群是可行的。
{"title":"Graph-based evidence accumulation for clustering 3D orientation measurements in planetary surface mapping under relational constraints","authors":"Orhun Aydin","doi":"10.1016/j.acags.2025.100309","DOIUrl":"10.1016/j.acags.2025.100309","url":null,"abstract":"<div><div>Groups of structural measurements based on orientation similarity are indicative of deformation mechanisms and are important measures to infer the deformation history of planetary surfaces. Despite methods for defining groups based only on orientation or spatial proximity, a general framework for defining orientation-based groups under multiple constraints is lacking. A second challenge pertains to the computational challenge of clustering large structural data due to the large volume and velocity of the data collected as a part of field-based, earth-observing, and planetary missions. In this paper, we propose a general clustering framework for defining groups in angular data based on orientation similarity that can be constrained with relational constraints, such as spatial proximity or prior knowledge of geologic units. We represent the similarity of geologic measurements with a similarity graph where similarity links (graph edges) are defined via the clustering evidence accumulated by re-clustering of data with varying parameters. We showcase the use of a spectral gap measure to define the optimal number of clusters for the evidence graph. We apply the proposed method to define groups of compaction bands using field data collected from the Valley of Fire, NV. We compare our results to a state-of-the-art Bingham mixture model. Results indicate the realism of the proposed method in terms of mapping distinct structural groups under different spatial proximity constraints.</div></div>","PeriodicalId":33804,"journal":{"name":"Applied Computing and Geosciences","volume":"28 ","pages":"Article 100309"},"PeriodicalIF":3.2,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145690945","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Analyzing land use land cover changes in Mysuru taluk, Karnataka state, India using vision transformers 使用视觉变压器分析印度卡纳塔克邦Mysuru taluk的土地利用和土地覆盖变化
IF 3.2 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-12-01 DOI: 10.1016/j.acags.2025.100308
H.N. Mahendra , V. Pushpalatha , S. Mallikarjunaswamy , S. Rama Subramoniam , Arjun Sunil Rao , N.C. Sanjay Shekar
This study presents an analysis of land use and land cover changes in Mysuru Taluk, Karnataka, India, over a two-decade period (2004–2014 and 2014–2024), using Vision Transformers (ViTs) to enhance classification accuracy. Using Linear Imaging Self-Scanning Sensor (LISS-III) remote sensing data, our approach combines the powerful feature extraction capabilities of ViTs to address the complexities inherent in multi-temporal satellite data. Traditional methods for LULC mapping face challenges due to variability in land features and temporal changes, which impact classification accuracy. By employing ViTs, we aim to overcome these limitations through their self-attention approach, which can capture long-range dependencies in the data, thus offering a more refined classification process. Our study results show improved overall classification accuracy across the assessed years, achieving 95.07 % in 2004, 95.79 % in 2014, and reaching 96.74 % in 2024. These progressive results highlight the efficiency of ViTs in accurately classifying and detecting subtle land cover changes over time. Further, change detection analysis results show that the built-up area increased by 17.25 %, and agricultural land decreased by 16.24 % over two decades. The findings will assist policymakers and urban planners develop strategies to manage urbanization effectively while minimizing environmental impacts.
本研究分析了印度卡纳塔克邦Mysuru Taluk 20年间(2004-2014和2014-2024)的土地利用和土地覆盖变化,使用视觉变换(ViTs)来提高分类精度。利用线性成像自扫描传感器(LISS-III)遥感数据,我们的方法结合了ViTs强大的特征提取能力,以解决多时相卫星数据固有的复杂性。由于地物的多变性和时间的变化,传统的LULC制图方法面临挑战,从而影响分类精度。通过使用vit,我们的目标是通过它们的自关注方法来克服这些限制,这种方法可以捕获数据中的长期依赖关系,从而提供更精细的分类过程。我们的研究结果表明,在评估年份中,总体分类准确率有所提高,2004年达到95.07%,2014年达到95.79%,2024年达到96.74%。这些进步的结果突出了ViTs在准确分类和检测土地覆盖随时间变化的细微变化方面的效率。此外,变化检测分析结果表明,20 a来,建成区面积增加了17.25%,农业用地减少了16.24%。研究结果将有助于决策者和城市规划者制定有效管理城市化的战略,同时尽量减少对环境的影响。
{"title":"Analyzing land use land cover changes in Mysuru taluk, Karnataka state, India using vision transformers","authors":"H.N. Mahendra ,&nbsp;V. Pushpalatha ,&nbsp;S. Mallikarjunaswamy ,&nbsp;S. Rama Subramoniam ,&nbsp;Arjun Sunil Rao ,&nbsp;N.C. Sanjay Shekar","doi":"10.1016/j.acags.2025.100308","DOIUrl":"10.1016/j.acags.2025.100308","url":null,"abstract":"<div><div>This study presents an analysis of land use and land cover changes in Mysuru Taluk, Karnataka, India, over a two-decade period (2004–2014 and 2014–2024), using Vision Transformers (ViTs) to enhance classification accuracy. Using Linear Imaging Self-Scanning Sensor (LISS-III) remote sensing data, our approach combines the powerful feature extraction capabilities of ViTs to address the complexities inherent in multi-temporal satellite data. Traditional methods for LULC mapping face challenges due to variability in land features and temporal changes, which impact classification accuracy. By employing ViTs, we aim to overcome these limitations through their self-attention approach, which can capture long-range dependencies in the data, thus offering a more refined classification process. Our study results show improved overall classification accuracy across the assessed years, achieving 95.07 % in 2004, 95.79 % in 2014, and reaching 96.74 % in 2024. These progressive results highlight the efficiency of ViTs in accurately classifying and detecting subtle land cover changes over time. Further, change detection analysis results show that the built-up area increased by 17.25 %, and agricultural land decreased by 16.24 % over two decades. The findings will assist policymakers and urban planners develop strategies to manage urbanization effectively while minimizing environmental impacts.</div></div>","PeriodicalId":33804,"journal":{"name":"Applied Computing and Geosciences","volume":"28 ","pages":"Article 100308"},"PeriodicalIF":3.2,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145690943","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Applied Computing and Geosciences
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1