Applied Computing and Geosciences最新文献_第4页

Evaluating the performances of SVR and XGBoost for short-range forecasting of heatwaves across different temperature zones of India 评估 SVR 和 XGBoost 在印度不同温度带热浪短程预报中的性能

IF 2.6 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Applied Computing and Geosciences

Pub Date : 2024-10-23 DOI: 10.1016/j.acags.2024.100204

Srikanth Bhoopathi, Nitish Kumar, Somesh, Manali Pal

This research aims to forecast maximum temperatures and the frequency of heatwave days across four different temperature zones (Zone 1, 2, 3 and 4) in India. These four zones are categorized based on the 30-year average maximum temperatures (T_30AMT) during the summer months of April, May, and June (AMJ). Two Machine Learning (ML) algorithms eXtreme Gradient Boosting (XGBoost) and Support Vector Regression (SVR) are employed to achieve this goal. The study utilizes nine key atmospheric variables namely air temperature, geopotential height, relative humidity, U-wind, V-wind, soil moisture, solar radiation, sea surface temperature, and mean sea level pressure at a daily scale spanning from 1991 to 2020 for the months of March, April, May, and June as predictors. The India Meteorological Department daily maximum temperature data spanning from 1991 to 2020 for the months of AMJ serves as the predictands. ML models are developed using spatially averaged atmospheric variables and daily maximum temperature across the grids falling within each temperature zone. Results indicate that for a 7-day lead time, SVR outperforms XGBoost in Zone-1 (T_30AMT > 38 °C) and Zone-2 (T_30AMT: 35.01 °C–38 °C) by more accurately capturing peak temperatures during training and testing. Conversely, for a 15-day lead time in Zone-1, XGBoost better predicts temperature peaks in both phases. In Zone-3 (T_30AMT: 30 °C–35 °C) and Zone-4 (T_30AMT < 30 °C) for both lead times, the performance of both models decline, indicating models and input variables are more effective in predicting higher temperatures typical of Zone-1 and 2 but less so in Zone-3 and 4. In a nutshell, the study attempts to highlight the capability of advanced ML techniques combined with spatial climate data to enhance the prediction of extreme heatwave events. These insights can aid in heatwave preparedness, climate management, and adaptation strategies for different Indian temperature zones.

本研究旨在预测印度四个不同温度区（1、2、3 和 4 区）的最高气温和热浪日频率。这四个区域是根据四月、五月和六月（AMJ）夏季的 30 年平均最高气温（T30AMT）划分的。为实现这一目标，采用了两种机器学习（ML）算法：梯度提升算法（XGBoost）和支持向量回归算法（SVR）。该研究采用了九个关键大气变量作为预测因子，即 1991 年至 2020 年 3 月、4 月、5 月和 6 月每日的气温、位势高度、相对湿度、U 风、V 风、土壤湿度、太阳辐射、海面温度和平均海平面气压。印度气象局 1991 年至 2020 年 AMJ 月份的日最高气温数据作为预测因子。利用空间平均大气变量和每个气温区域内网格的日最高气温开发了 ML 模型。结果表明，在 7 天的前导时间内，SVR 在 1 区（T30AMT > 38 ℃）和 2 区（T30AMT: 35.01 ℃-38 ℃）的表现优于 XGBoost，因为 SVR 能更准确地捕捉到训练和测试期间的峰值温度。相反，在前导时间为 15 天的 1 区，XGBoost 能更好地预测两个阶段的温度峰值。在第 3 区（T30AMT：30 °C-35 °C）和第 4 区（T30AMT < 30 °C），在两个提前期，两个模型的性能都有所下降，表明模型和输入变量在预测第 1 区和第 2 区典型的较高温度时更为有效，但在第 3 区和第 4 区则效果较差。总之，这项研究试图强调先进的 ML 技术与空间气候数据相结合的能力，以加强对极端热浪事件的预测。这些见解有助于印度不同温区的热浪防范、气候管理和适应战略。

{"title":"Evaluating the performances of SVR and XGBoost for short-range forecasting of heatwaves across different temperature zones of India","authors":"Srikanth Bhoopathi, Nitish Kumar, Somesh, Manali Pal","doi":"10.1016/j.acags.2024.100204","DOIUrl":"10.1016/j.acags.2024.100204","url":null,"abstract":"<div><div>This research aims to forecast maximum temperatures and the frequency of heatwave days across four different temperature zones (Zone 1, 2, 3 and 4) in India. These four zones are categorized based on the 30-year average maximum temperatures (T<sub>30AMT</sub>) during the summer months of April, May, and June (AMJ). Two Machine Learning (ML) algorithms eXtreme Gradient Boosting (XGBoost) and Support Vector Regression (SVR) are employed to achieve this goal. The study utilizes nine key atmospheric variables namely air temperature, geopotential height, relative humidity, U-wind, V-wind, soil moisture, solar radiation, sea surface temperature, and mean sea level pressure at a daily scale spanning from 1991 to 2020 for the months of March, April, May, and June as predictors. The India Meteorological Department daily maximum temperature data spanning from 1991 to 2020 for the months of AMJ serves as the predictands. ML models are developed using spatially averaged atmospheric variables and daily maximum temperature across the grids falling within each temperature zone. Results indicate that for a 7-day lead time, SVR outperforms XGBoost in Zone-1 (T<sub>30AMT</sub> > 38 °C) and Zone-2 (T<sub>30AMT</sub>: 35.01 °C–38 °C) by more accurately capturing peak temperatures during training and testing. Conversely, for a 15-day lead time in Zone-1, XGBoost better predicts temperature peaks in both phases. In Zone-3 (T<sub>30AMT</sub>: 30 °C–35 °C) and Zone-4 (T<sub>30AMT</sub> < 30 °C) for both lead times, the performance of both models decline, indicating models and input variables are more effective in predicting higher temperatures typical of Zone-1 and 2 but less so in Zone-3 and 4. In a nutshell, the study attempts to highlight the capability of advanced ML techniques combined with spatial climate data to enhance the prediction of extreme heatwave events. These insights can aid in heatwave preparedness, climate management, and adaptation strategies for different Indian temperature zones.</div></div>","PeriodicalId":33804,"journal":{"name":"Applied Computing and Geosciences","volume":"24 ","pages":"Article 100204"},"PeriodicalIF":2.6,"publicationDate":"2024-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142534341","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Current progress in subseasonal-to-decadal prediction based on machine learning 基于机器学习的副季节至十年期预测的最新进展

IF 2.6 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Applied Computing and Geosciences

Pub Date : 2024-10-22 DOI: 10.1016/j.acags.2024.100201

Zixiong Shen , Qiming Sun , Xinyu Lu , Fenghua Ling , Yue Li , Jiye Wu , Jing-Jia Luo , Chaoxia Yuan

The application of machine learning (ML) techniques to climate science has received significant attention, particularly in the field of climate predictions, ranging from sub-seasonal to decadal time scales. This paper reviews recent progress of ML techniques employed in climate phenomena prediction and the enhancement of dynamic forecast models, which provide valuable insights into the great potentials of ML techniques to improve climate prediction capabilities with reduced computational time and resource consumption. This paper also discusses several major challenges in the application of ML to climate prediction, including the scarcity of datasets, physical inconsistency, and lack of model transparency and interpretability. Additionally, this paper sheds light on how climate change impacts ML model training and prediction, and explores three key areas with potential breakthroughs: large-scale climate models, knowledge discovery driven by ML, and hybrid dynamical-statistical models, underscoring the important role of the integration of “ML and dynamical models” in building a bridge between the artificial intelligence and climate science.

机器学习（ML）技术在气候科学中的应用，特别是在气候预测领域（从亚季节到十年时间尺度）的应用，已受到极大关注。本文回顾了机器学习技术在气候现象预测和动态预报模式增强方面的最新进展，这些进展提供了宝贵的见解，说明了机器学习技术在减少计算时间和资源消耗、提高气候预测能力方面的巨大潜力。本文还讨论了将 ML 应用于气候预测的几个主要挑战，包括数据集稀缺、物理不一致性以及模型缺乏透明度和可解释性。此外，本文还揭示了气候变化对 ML 模型训练和预测的影响，并探讨了可能取得突破的三个关键领域：大规模气候模型、ML 驱动的知识发现以及动态-统计混合模型，强调了 "ML 与动态模型 "的集成在搭建人工智能与气候科学之间桥梁的重要作用。

{"title":"Current progress in subseasonal-to-decadal prediction based on machine learning","authors":"Zixiong Shen , Qiming Sun , Xinyu Lu , Fenghua Ling , Yue Li , Jiye Wu , Jing-Jia Luo , Chaoxia Yuan","doi":"10.1016/j.acags.2024.100201","DOIUrl":"10.1016/j.acags.2024.100201","url":null,"abstract":"<div><div>The application of machine learning (ML) techniques to climate science has received significant attention, particularly in the field of climate predictions, ranging from sub-seasonal to decadal time scales. This paper reviews recent progress of ML techniques employed in climate phenomena prediction and the enhancement of dynamic forecast models, which provide valuable insights into the great potentials of ML techniques to improve climate prediction capabilities with reduced computational time and resource consumption. This paper also discusses several major challenges in the application of ML to climate prediction, including the scarcity of datasets, physical inconsistency, and lack of model transparency and interpretability. Additionally, this paper sheds light on how climate change impacts ML model training and prediction, and explores three key areas with potential breakthroughs: large-scale climate models, knowledge discovery driven by ML, and hybrid dynamical-statistical models, underscoring the important role of the integration of “ML and dynamical models” in building a bridge between the artificial intelligence and climate science.</div></div>","PeriodicalId":33804,"journal":{"name":"Applied Computing and Geosciences","volume":"24 ","pages":"Article 100201"},"PeriodicalIF":2.6,"publicationDate":"2024-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142534340","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Novel application of unsupervised machine learning for characterization of subsurface seismicity, tectonic dynamics and stress distribution 无监督机器学习在表征地下地震活动性、构造动力学和应力分布方面的新应用

IF 2.6 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Applied Computing and Geosciences

Pub Date : 2024-10-21 DOI: 10.1016/j.acags.2024.100200

Mohammad Salam, Muhammad Tahir Iqbal, Raja Adnan Habib, Amna Tahir, Aamir Sultan, Talat Iqbal

Our study pioneers an innovative use of unsupervised machine learning, a powerful tool for navigating unclassified data, to unravel the complexities of subsurface seismic activities and extract meaningful patterns. Our central objective is to comprehensively characterize seismicity within an active region by identifying distinct seismic clusters in spatial distribution, thereby gaining a deeper understanding of subsurface stress distribution and tectonic dynamics. Employing a diverse range of clustering algorithms, with particular emphasis on Fuzzy C-Means (FCM), our research meticulously dissects the intricate physical processes that govern a complex tectonic zone. This technique effectively delineates distinct tectonic zones, aligning seamlessly with established seismological knowledge and underscoring the transformative potential of Artificial Intelligence (AI) in analyzing regional subsurface phenomena, even under conditions of data scarcity. Moreover, associating earthquakes with specific seismogenic structures significantly enhances seismic hazard analyses, potentially paving the way for autonomous insights that inform engineering hazard assessments.

我们的研究开创性地使用了无监督机器学习这一用于浏览未分类数据的强大工具，以揭示地下地震活动的复杂性并提取有意义的模式。我们的核心目标是通过识别空间分布中不同的地震群，全面描述活跃区域内的地震活动特征，从而更深入地了解地下应力分布和构造动态。我们的研究采用了多种聚类算法，尤其侧重于模糊 C-Means (FCM)，细致地剖析了支配复杂构造带的错综复杂的物理过程。这项技术有效地划分了不同的构造带，与既有的地震学知识完美地结合在一起，并强调了人工智能（AI）在分析区域地下现象方面的变革潜力，即使在数据匮乏的条件下也是如此。此外，将地震与特定的成震结构联系起来，大大增强了地震灾害分析的效果，有可能为自主洞察力铺平道路，为工程灾害评估提供依据。

{"title":"Novel application of unsupervised machine learning for characterization of subsurface seismicity, tectonic dynamics and stress distribution","authors":"Mohammad Salam, Muhammad Tahir Iqbal, Raja Adnan Habib, Amna Tahir, Aamir Sultan, Talat Iqbal","doi":"10.1016/j.acags.2024.100200","DOIUrl":"10.1016/j.acags.2024.100200","url":null,"abstract":"<div><div>Our study pioneers an innovative use of unsupervised machine learning, a powerful tool for navigating unclassified data, to unravel the complexities of subsurface seismic activities and extract meaningful patterns. Our central objective is to comprehensively characterize seismicity within an active region by identifying distinct seismic clusters in spatial distribution, thereby gaining a deeper understanding of subsurface stress distribution and tectonic dynamics. Employing a diverse range of clustering algorithms, with particular emphasis on Fuzzy C-Means (FCM), our research meticulously dissects the intricate physical processes that govern a complex tectonic zone. This technique effectively delineates distinct tectonic zones, aligning seamlessly with established seismological knowledge and underscoring the transformative potential of Artificial Intelligence (AI) in analyzing regional subsurface phenomena, even under conditions of data scarcity. Moreover, associating earthquakes with specific seismogenic structures significantly enhances seismic hazard analyses, potentially paving the way for autonomous insights that inform engineering hazard assessments.</div></div>","PeriodicalId":33804,"journal":{"name":"Applied Computing and Geosciences","volume":"24 ","pages":"Article 100200"},"PeriodicalIF":2.6,"publicationDate":"2024-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142534339","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A comparative study on machine learning approaches for rock mass classification using drilling data 利用钻探数据进行岩体分类的机器学习方法比较研究

IF 2.6 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Applied Computing and Geosciences

Pub Date : 2024-10-05 DOI: 10.1016/j.acags.2024.100199

Tom F. Hansen , Georg H. Erharter , Zhongqiang Liu , Jim Torresen

Current rock engineering design in drill and blast tunnelling primarily relies on engineers' observational assessments. Measure While Drilling (MWD) data, a high-resolution sensor dataset collected during tunnel excavation, is underutilised, mainly serving for geological visualisation. This study aims to automate the translation of MWD data into actionable metrics for rock engineering. It seeks to link data to specific engineering actions, thus providing critical decision support for geological challenges ahead of the tunnel face. Leveraging a large and geologically diverse dataset of ∼500,000 drillholes from 15 tunnels, the research introduces models for accurate rock mass quality classification in a real-world tunnelling context. Both conventional machine learning and image-based deep learning are explored to classify MWD data into Q-classes and Q-values—examples of metrics describing the stability of the rock mass—using both tabular- and image data. The results indicate that the K-nearest neighbours algorithm in an ensemble with tree-based models using tabular data effectively classifies rock mass quality. It achieves a cross-validated balanced accuracy of 0.86 in classifying rock mass into the Q-classes A, B, C, D, E1, E2, and 0.95 for a binary classification with E versus the rest. Classification using a CNN with MWD-images for each blasting round resulted in a balanced accuracy of 0.82 for binary classification. Regressing the Q-value from tabular MWD-data achieved cross-validated R² and MSE scores of 0.80 and 0.18 for a similar ensemble model as in classification. High performance in regression and classification boosts confidence in automated rock mass assessment. Applying advanced modelling on a unique dataset demonstrates MWD data's value in improving rock mass classification accuracy and advancing data-driven rock engineering design, reducing manual intervention.

目前钻爆法隧道的岩石工程设计主要依靠工程师的观察评估。边钻边测（MWD）数据是一种在隧道开挖过程中收集的高分辨率传感器数据集，但未得到充分利用，主要用于地质可视化。本研究旨在将 MWD 数据自动转化为岩石工程的可操作指标。它旨在将数据与具体的工程行动联系起来，从而为隧道工作面前方的地质挑战提供重要的决策支持。该研究利用来自 15 座隧道的 500,000 个钻孔组成的大型地质多样性数据集，引入了在现实世界隧道工程中对岩体质量进行准确分类的模型。研究探索了传统的机器学习和基于图像的深度学习，利用表格和图像数据将 MWD 数据划分为 Q 类和 Q 值（描述岩体稳定性的指标实例）。结果表明，在使用表格数据的树状模型的集合中，K-近邻算法能有效地对岩体质量进行分类。在将岩体划分为 Q 类 A、B、C、D、E1、E2 时，其交叉验证平衡准确率为 0.86，而将 E 与其他岩体进行二元分类的准确率为 0.95。使用带有每轮爆破的 MWD 图像的 CNN 进行分类，二元分类的均衡准确率为 0.82。通过对表格式 MWD 数据的 Q 值进行回归分析，在与分类类似的集合模型中，交叉验证的 R2 和 MSE 分别为 0.80 和 0.18。回归和分类的高性能增强了对岩体自动评估的信心。在一个独特的数据集上应用先进的建模方法，证明了 MWD 数据在提高岩体分类准确性、推进数据驱动的岩石工程设计、减少人工干预方面的价值。

{"title":"A comparative study on machine learning approaches for rock mass classification using drilling data","authors":"Tom F. Hansen , Georg H. Erharter , Zhongqiang Liu , Jim Torresen","doi":"10.1016/j.acags.2024.100199","DOIUrl":"10.1016/j.acags.2024.100199","url":null,"abstract":"<div><div>Current rock engineering design in drill and blast tunnelling primarily relies on engineers' observational assessments. Measure While Drilling (MWD) data, a high-resolution sensor dataset collected during tunnel excavation, is underutilised, mainly serving for geological visualisation. This study aims to automate the translation of MWD data into actionable metrics for rock engineering. It seeks to link data to specific engineering actions, thus providing critical decision support for geological challenges ahead of the tunnel face. Leveraging a large and geologically diverse dataset of ∼500,000 drillholes from 15 tunnels, the research introduces models for accurate rock mass quality classification in a real-world tunnelling context. Both conventional machine learning and image-based deep learning are explored to classify MWD data into Q-classes and Q-values—examples of metrics describing the stability of the rock mass—using both tabular- and image data. The results indicate that the K-nearest neighbours algorithm in an ensemble with tree-based models using tabular data effectively classifies rock mass quality. It achieves a cross-validated balanced accuracy of 0.86 in classifying rock mass into the Q-classes A, B, C, D, E1, E2, and 0.95 for a binary classification with E versus the rest. Classification using a CNN with MWD-images for each blasting round resulted in a balanced accuracy of 0.82 for binary classification. Regressing the Q-value from tabular MWD-data achieved cross-validated R<sup>2</sup> and MSE scores of 0.80 and 0.18 for a similar ensemble model as in classification. High performance in regression and classification boosts confidence in automated rock mass assessment. Applying advanced modelling on a unique dataset demonstrates MWD data's value in improving rock mass classification accuracy and advancing data-driven rock engineering design, reducing manual intervention.</div></div>","PeriodicalId":33804,"journal":{"name":"Applied Computing and Geosciences","volume":"24 ","pages":"Article 100199"},"PeriodicalIF":2.6,"publicationDate":"2024-10-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142534338","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A generative deep neural network as an alternative to co-kriging 生成式深度神经网络作为协同控制的替代方案

IF 2.6 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Applied Computing and Geosciences

Pub Date : 2024-10-01 DOI: 10.1016/j.acags.2024.100198

Herbert Rakotonirina , Paul Honeine , Olivier Atteia , Antonin Van Exem

In geosciences, kriging is leading spatial interpolation, and co-kriging is the most commonly used method for accomplishing spatial interpolation of a target variable by incorporating information from a secondary variable. Co-kriging relies on the assumption of spatial stationarity, which may not hold true in all geospatial contexts, leading to potential inaccuracies in interpolation. The effectiveness of co-kriging can be compromised in areas with sparse data, impacting the reliability of interpolated results. Moreover, it can be resource-intensive when used for interpolation with a substantial volume of data, especially in the case of 3D interpolation. In this paper, we introduce a new method for spatial interpolation that considers two variables using a generative deep neural network. This approach utilizes a convolutional neural network with an encoder–decoder architecture, featuring a single encoder and two decoders to handle the two variables. Additionally, we introduce a loss function that facilitates the control over the relationships between the two variables. Traditional Deep Learning methods require prior training and labeled data, whereas the proposed approach eliminates this requirement and simplifies the interpolation process. In order to assess the performance of our method, we use two real-world datasets. The first one is a 2D dataset of total soil organic carbon combined with the Normalized Difference Vegetation Index. The second one is a 3D dataset that combines concentrations of Hydrocarbon and Fluoride obtained from hyperspectral analysis of soil cores with very limited number of boreholes. The experimental results demonstrate that the proposed method outperforms ordinary kriging and co-kriging, showing a significant improvement when both variables are used. We also demonstrate how the inclusion of the auxiliary variable serves as a means to mitigate the overfitting of the model.

在地球科学领域，克里金法是空间插值的主要方法，而共克里金法是通过纳入次变量信息来完成目标变量空间插值的最常用方法。共克里金法依赖于空间静止性假设，但这一假设并非在所有地理空间环境中都成立，因此可能导致插值不准确。在数据稀少的地区，协同定位的有效性可能会大打折扣，影响插值结果的可靠性。此外，在使用大量数据进行插值时，特别是在三维插值的情况下，可能会耗费大量资源。在本文中，我们介绍了一种新的空间插值方法，它使用生成式深度神经网络考虑了两个变量。这种方法利用具有编码器-解码器架构的卷积神经网络，通过一个编码器和两个解码器来处理两个变量。此外，我们还引入了一个损失函数，便于控制两个变量之间的关系。传统的深度学习方法需要事先训练和标注数据，而我们提出的方法则消除了这一要求，简化了插值过程。为了评估我们方法的性能，我们使用了两个真实世界的数据集。第一个是土壤有机碳总量与归一化植被指数相结合的二维数据集。第二个数据集是一个三维数据集，结合了通过对钻孔数量非常有限的土壤岩心进行高光谱分析获得的碳氢化合物和氟化物的浓度。实验结果表明，所提出的方法优于普通克里金法和协同克里金法，当同时使用两个变量时，效果显著。我们还证明了加入辅助变量是如何减轻模型过拟合的。

{"title":"A generative deep neural network as an alternative to co-kriging","authors":"Herbert Rakotonirina , Paul Honeine , Olivier Atteia , Antonin Van Exem","doi":"10.1016/j.acags.2024.100198","DOIUrl":"10.1016/j.acags.2024.100198","url":null,"abstract":"<div><div>In geosciences, kriging is leading spatial interpolation, and co-kriging is the most commonly used method for accomplishing spatial interpolation of a target variable by incorporating information from a secondary variable. Co-kriging relies on the assumption of spatial stationarity, which may not hold true in all geospatial contexts, leading to potential inaccuracies in interpolation. The effectiveness of co-kriging can be compromised in areas with sparse data, impacting the reliability of interpolated results. Moreover, it can be resource-intensive when used for interpolation with a substantial volume of data, especially in the case of 3D interpolation. In this paper, we introduce a new method for spatial interpolation that considers two variables using a generative deep neural network. This approach utilizes a convolutional neural network with an encoder–decoder architecture, featuring a single encoder and two decoders to handle the two variables. Additionally, we introduce a loss function that facilitates the control over the relationships between the two variables. Traditional Deep Learning methods require prior training and labeled data, whereas the proposed approach eliminates this requirement and simplifies the interpolation process. In order to assess the performance of our method, we use two real-world datasets. The first one is a 2D dataset of total soil organic carbon combined with the Normalized Difference Vegetation Index. The second one is a 3D dataset that combines concentrations of Hydrocarbon and Fluoride obtained from hyperspectral analysis of soil cores with very limited number of boreholes. The experimental results demonstrate that the proposed method outperforms ordinary kriging and co-kriging, showing a significant improvement when both variables are used. We also demonstrate how the inclusion of the auxiliary variable serves as a means to mitigate the overfitting of the model.</div></div>","PeriodicalId":33804,"journal":{"name":"Applied Computing and Geosciences","volume":"24 ","pages":"Article 100198"},"PeriodicalIF":2.6,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142426818","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

An open-source, QGIS-based solution for digital geological mapping: GEOL-QMAPS 基于 QGIS 的开源数字地质制图解决方案：GEOL-QMAPS

IF 2.6 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Applied Computing and Geosciences

Pub Date : 2024-09-20 DOI: 10.1016/j.acags.2024.100197

Julien Perret, Mark W. Jessell, Eliott Bétend

Digital geological mapping has experienced significant growth over the past three decades due to the advent of commercial geographical information systems, advances in global positioning systems, and the availability of portable hand-held devices, such as mobile personal computers (PCs), smartphones, and tablets. Numerous software packages have been developed to collect, combine, organise, visualise, publish, and share field data with enhanced spatial accuracy and minimal post-field processing. However, many of these tools are not open-source or are not made available to the geoscientific community, remaining specific to given mapping projects or organisations.

In this contribution we introduce GEOL-QMAPS, an open-source, QGIS-based solution promoting digital geological mapping in a harmonised, comprehensive and flexible way. It can be used in the field with a tablet PC or via the QGIS-based QField app on iOS or Android mobile devices, enabling synchronisation with desktop QGIS and the creation of field databases. Designed as a general solution, the GEOL-QMAPS solution consists of a QGIS field data entry template and a custom QGIS plugin, both available on free-access online repositories. The plugin allows for the adaptation of dictionaries (i.e., lists of attributes describing geological features), initially set to international nomenclatures, to the guidelines of different mapping projects. The solution also facilitates the loading and consultation of relevant legacy geodatasets (e.g., preexisting field data, geochemical, geophysical maps or punctual datasets). A fact map, created from field data collected across the Archean Sula-Kangari greenstone belt in Sierra Leone, demonstrates the solution's advantages in terms of post-field processing and raw field data sharing.

由于商业地理信息系统的出现、全球定位系统的进步以及移动个人电脑（PC）、智能手机和平板电脑等便携式手持设备的普及，数字地质制图在过去三十年中经历了重大发展。目前已开发出大量软件包，用于收集、合并、组织、可视化、发布和共享野外数据，以提高空间精度和减少野外后期处理。然而，这些工具中的许多都不是开源的，或者没有向地球科学界开放，仍然是特定测绘项目或组织的专用工具。在本文中，我们将介绍 GEOL-QMAPS，这是一个基于 QGIS 的开源解决方案，以统一、全面和灵活的方式促进数字地质测绘。它可通过平板电脑或 iOS 或 Android 移动设备上基于 QGIS 的 QField 应用程序在野外使用，实现与桌面 QGIS 的同步并创建野外数据库。GEOL-QMAPS 解决方案是一个通用解决方案，包括一个 QGIS 野外数据录入模板和一个定制 QGIS 插件，两者均可从免费访问的在线资源库中获取。该插件允许根据不同制图项目的指导方针调整词典（即描述地质特征的属性列表），词典最初设置为国际术语。该解决方案还便于加载和查阅相关的遗留地质数据集（如已有的野外数据、地球化学、地球物理地图或标点数据集）。根据在塞拉利昂苏拉-康加里奥陶纪绿岩带收集的野外数据绘制的实况地图展示了该解决方案在野外后期处理和原始野外数据共享方面的优势。

{"title":"An open-source, QGIS-based solution for digital geological mapping: GEOL-QMAPS","authors":"Julien Perret, Mark W. Jessell, Eliott Bétend","doi":"10.1016/j.acags.2024.100197","DOIUrl":"10.1016/j.acags.2024.100197","url":null,"abstract":"<div><div>Digital geological mapping has experienced significant growth over the past three decades due to the advent of commercial geographical information systems, advances in global positioning systems, and the availability of portable hand-held devices, such as mobile personal computers (PCs), smartphones, and tablets. Numerous software packages have been developed to collect, combine, organise, visualise, publish, and share field data with enhanced spatial accuracy and minimal post-field processing. However, many of these tools are not open-source or are not made available to the geoscientific community, remaining specific to given mapping projects or organisations.</div><div>In this contribution we introduce GEOL-QMAPS, an open-source, QGIS-based solution promoting digital geological mapping in a harmonised, comprehensive and flexible way. It can be used in the field with a tablet PC or <em>via</em> the QGIS-based QField app on iOS or Android mobile devices, enabling synchronisation with desktop QGIS and the creation of field databases. Designed as a general solution, the GEOL-QMAPS solution consists of a QGIS field data entry template and a custom QGIS plugin, both available on free-access online repositories. The plugin allows for the adaptation of dictionaries (i.e., lists of attributes describing geological features), initially set to international nomenclatures, to the guidelines of different mapping projects. The solution also facilitates the loading and consultation of relevant legacy geodatasets (<em>e.g.,</em> preexisting field data, geochemical, geophysical maps or punctual datasets). A fact map, created from field data collected across the Archean Sula-Kangari greenstone belt in Sierra Leone, demonstrates the solution's advantages in terms of post-field processing and raw field data sharing.</div></div>","PeriodicalId":33804,"journal":{"name":"Applied Computing and Geosciences","volume":"24 ","pages":"Article 100197"},"PeriodicalIF":2.6,"publicationDate":"2024-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142426819","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Deep learning for real-time P-wave detection: A case study in Indonesia's earthquake early warning system 用于实时 P 波检测的深度学习：印度尼西亚地震预警系统案例研究

IF 2.6 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Applied Computing and Geosciences

Pub Date : 2024-09-07 DOI: 10.1016/j.acags.2024.100194

Adi Wibowo , Leni Sophia Heliani , Cecep Pratama , David Prambudi Sahara , Sri Widiyantoro , Dadan Ramdani , Mizan Bustanul Fuady Bisri , Ajat Sudrajat , Sidik Tri Wibowo , Satriawan Rasyid Purnama

Detecting seismic events in real-time for prompt alerts and responses is a challenging task that requires accurately capturing P-wave arrivals. This task becomes even more challenging in regions like Indonesia, where widely spaced seismic stations exist. The wide station spacing makes associating the seismic signals with specific even more difficult. This paper proposes a novel deep learning-based model with three convolutional layers, enriched with dual attention mechanisms—Squeeze, Excitation, and Transformer Encoder (CNN-SE-T) —to refine feature extraction and improve detection sensitivity. We have integrated several post-processing techniques to further bolster the model's robustness against noise. We conducted comprehensive evaluations of our method using three diverse datasets: local earthquake data from East Java, the publicly available Seismic Waveform Data (STEAD), and a continuous waveform dataset spanning 12 h from multiple Indonesian seismic stations. The performance of the CNN-SE-T P-wave detection model yielded exceptionally high F1 scores of 99.10% for East Java, 92.64% for STEAD, and 80% for the 12-h continuous waveforms across Indonesia's network, demonstrating the model's effectiveness and potential for real-world application in earthquake early warning systems.

实时检测地震事件以便及时发出警报和做出反应是一项极具挑战性的任务，需要准确捕捉 P 波到达。在印度尼西亚等地震台站间距较大的地区，这项任务变得更具挑战性。台站间距过大使得将地震信号与具体事件联系起来变得更加困难。本文提出了一种基于深度学习的新型模型，该模型具有三个卷积层，并采用了双重注意机制--挤压、激励和变压器编码器（CNN-SE-T）--以完善特征提取并提高检测灵敏度。我们还集成了几种后处理技术，以进一步增强模型对噪声的鲁棒性。我们使用三个不同的数据集对我们的方法进行了全面评估：东爪哇岛的本地地震数据、公开可用的地震波形数据（STEAD），以及来自多个印尼地震台站、时间跨度达 12 小时的连续波形数据集。CNN-SE-T P 波检测模型在东爪哇的 F1 得分为 99.10%，在 STEAD 的 F1 得分为 92.64%，在印尼网络的 12 小时连续波形的 F1 得分为 80%，表现出该模型在地震预警系统中的有效性和实际应用潜力。

{"title":"Deep learning for real-time P-wave detection: A case study in Indonesia's earthquake early warning system","authors":"Adi Wibowo , Leni Sophia Heliani , Cecep Pratama , David Prambudi Sahara , Sri Widiyantoro , Dadan Ramdani , Mizan Bustanul Fuady Bisri , Ajat Sudrajat , Sidik Tri Wibowo , Satriawan Rasyid Purnama","doi":"10.1016/j.acags.2024.100194","DOIUrl":"10.1016/j.acags.2024.100194","url":null,"abstract":"<div><p>Detecting seismic events in real-time for prompt alerts and responses is a challenging task that requires accurately capturing P-wave arrivals. This task becomes even more challenging in regions like Indonesia, where widely spaced seismic stations exist. The wide station spacing makes associating the seismic signals with specific even more difficult. This paper proposes a novel deep learning-based model with three convolutional layers, enriched with dual attention mechanisms—Squeeze, Excitation, and Transformer Encoder (CNN-SE-T) —to refine feature extraction and improve detection sensitivity. We have integrated several post-processing techniques to further bolster the model's robustness against noise. We conducted comprehensive evaluations of our method using three diverse datasets: local earthquake data from East Java, the publicly available Seismic Waveform Data (STEAD), and a continuous waveform dataset spanning 12 h from multiple Indonesian seismic stations. The performance of the CNN-SE-T P-wave detection model yielded exceptionally high F1 scores of 99.10% for East Java, 92.64% for STEAD, and 80% for the 12-h continuous waveforms across Indonesia's network, demonstrating the model's effectiveness and potential for real-world application in earthquake early warning systems.</p></div>","PeriodicalId":33804,"journal":{"name":"Applied Computing and Geosciences","volume":"24 ","pages":"Article 100194"},"PeriodicalIF":2.6,"publicationDate":"2024-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2590197424000417/pdfft?md5=b20faddd2c5b63b0d46b89310f92cfaf&pid=1-s2.0-S2590197424000417-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142161200","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Integration of multi-temporal SAR data and robust machine learning models for improvement of flood susceptibility assessment in the southwest coast of India 整合多时合成孔径雷达数据和稳健的机器学习模型，改进印度西南海岸的洪水易感性评估

IF 2.6 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Applied Computing and Geosciences

Pub Date : 2024-09-04 DOI: 10.1016/j.acags.2024.100189

Pankaj Prasad , Sourav Mandal , Sahil Sandeep Naik , Victor Joseph Loveson , Simanku Borah , Priyankar Chandra , Karthik Sudheer

The flood hazards in the southwest coastal region of India in 2018 and 2020 resulted in numerous casualties and the displacement of over a million people from their homes. In order to mitigate the loss of life and resources caused by recurrent major and minor flood events, it is imperative to develop a comprehensive spatial flood zonation map of the entire area. Therefore, the main aim of the present study is to prepare a flood susceptible map of the southwest coastal region of India using synthetic-aperture radar (SAR) data and robust machine learning algorithms. Accurate flood and non-flood locations have been identified from the multi-temporal Sentinel-1 images. These flood locations are correlated with sixteen flood conditioning geo-environmental variables. The Boruta algorithm has been applied to determine the importance of each flood conditioning parameter. Six efficient machine learning models, namely support vector machine (SVM), k-nearest neighbors (KNN), artificial neural network (ANN), random forest (RF), partial least squares (PLS) and penalized discriminant analysis (PDA) have been applied to delineate the flood susceptible areas of the study region. The performance of the models has been evaluated using several statistical criteria, including area under curve (AUC), overall accuracy, specificity, sensitivity and kappa index. The results have revealed that all models have performed more than 90% of AUC due to the high precision of radar data. However, the RF and SVM models have outperformed other models in terms of all statistical parameters. The findings have identified approximately 13% of the study region as highly vulnerable to flood hazards, emphasizing the need for proper planning and management in these areas.

2018 年和 2020 年，印度西南沿海地区的洪水灾害造成大量人员伤亡，超过 100 万人背井离乡。为了减轻经常性大小洪水事件造成的生命和资源损失，当务之急是绘制整个地区的综合空间洪水分区图。因此，本研究的主要目的是利用合成孔径雷达（SAR）数据和强大的机器学习算法，绘制印度西南沿海地区易受洪水影响的地图。从多时相 Sentinel-1 图像中确定了准确的洪水和非洪水位置。这些洪水位置与 16 个洪水条件地质环境变量相关联。Boruta 算法用于确定每个洪水调节参数的重要性。六种高效的机器学习模型，即支持向量机 (SVM)、k-近邻 (KNN)、人工神经网络 (ANN)、随机森林 (RF)、偏最小二乘法 (PLS) 和惩罚性判别分析 (PDA)，已被用于划定研究区域的洪水易发区。这些模型的性能采用了多种统计标准进行评估，包括曲线下面积（AUC）、总体准确度、特异性、灵敏度和卡帕指数。结果显示，由于雷达数据精度高，所有模型的 AUC 均超过 90%。不过，RF 和 SVM 模型在所有统计参数方面的表现都优于其他模型。研究结果表明，约有 13% 的研究区域极易受到洪水灾害的影响，强调了在这些区域进行适当规划和管理的必要性。

{"title":"Integration of multi-temporal SAR data and robust machine learning models for improvement of flood susceptibility assessment in the southwest coast of India","authors":"Pankaj Prasad , Sourav Mandal , Sahil Sandeep Naik , Victor Joseph Loveson , Simanku Borah , Priyankar Chandra , Karthik Sudheer","doi":"10.1016/j.acags.2024.100189","DOIUrl":"10.1016/j.acags.2024.100189","url":null,"abstract":"<div><p>The flood hazards in the southwest coastal region of India in 2018 and 2020 resulted in numerous casualties and the displacement of over a million people from their homes. In order to mitigate the loss of life and resources caused by recurrent major and minor flood events, it is imperative to develop a comprehensive spatial flood zonation map of the entire area. Therefore, the main aim of the present study is to prepare a flood susceptible map of the southwest coastal region of India using synthetic-aperture radar (SAR) data and robust machine learning algorithms. Accurate flood and non-flood locations have been identified from the multi-temporal Sentinel-1 images. These flood locations are correlated with sixteen flood conditioning geo-environmental variables. The Boruta algorithm has been applied to determine the importance of each flood conditioning parameter. Six efficient machine learning models, namely support vector machine (SVM), k-nearest neighbors (KNN), artificial neural network (ANN), random forest (RF), partial least squares (PLS) and penalized discriminant analysis (PDA) have been applied to delineate the flood susceptible areas of the study region. The performance of the models has been evaluated using several statistical criteria, including area under curve (AUC), overall accuracy, specificity, sensitivity and kappa index. The results have revealed that all models have performed more than 90% of AUC due to the high precision of radar data. However, the RF and SVM models have outperformed other models in terms of all statistical parameters. The findings have identified approximately 13% of the study region as highly vulnerable to flood hazards, emphasizing the need for proper planning and management in these areas.</p></div>","PeriodicalId":33804,"journal":{"name":"Applied Computing and Geosciences","volume":"24 ","pages":"Article 100189"},"PeriodicalIF":2.6,"publicationDate":"2024-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2590197424000363/pdfft?md5=c335020c63eb9eda70216e7662e23b2d&pid=1-s2.0-S2590197424000363-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142161206","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

POSIT: An automated tool for detecting and characterizing diverse morphological features in raster data - Application to pockmarks, mounds, and craters POSIT：用于检测和描述栅格数据中各种形态特征的自动工具--应用于麻坑、土墩和火山口

IF 2.6 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Applied Computing and Geosciences

Pub Date : 2024-09-01 DOI: 10.1016/j.acags.2024.100190

José J. Alonso del Rosario , Ariadna Canari , Elízabeth Blázquez Gómez , Sara Martínez-Loriente

Accurate detection and characterization of seafloor morphologies are crucial for marine researchers and industries involved in underwater mapping, environmental monitoring, or resource exploration. Although their detection has relied on visual inspection of detailed bathymetries, few efforts to automate the process can be found in the literature. This study presents a novel MatLab computer code called POSIT (Feature Signature Detection) based on the convolution and correlation with a structural element containing the shape to search for. POSIT is successfully tested on both synthetic and real datasets, encompassing marine and terrestrial digital elevation models of different resolution and on a digital image. The centroids of submarine pockmarks and mounds, terrestrial volcanic craters and lunar craters are calculated with zero dispersion and perfect location, and their geometric parameters and confidence are provided.

对于从事水下测绘、环境监测或资源勘探的海洋研究人员和行业来说，准确检测和描述海底形态至关重要。虽然对海底形态的检测一直依赖于对详细水深测量数据的目测，但文献中鲜有将这一过程自动化的尝试。本研究介绍了一种名为 POSIT（特征签名检测）的新型 MatLab 计算机代码，它基于与包含要搜索的形状的结构元素的卷积和相关性。POSIT 成功地在合成数据集和真实数据集上进行了测试，包括不同分辨率的海洋和陆地数字高程模型以及数字图像。计算出的海底麻坑和土墩、陆地火山口和月球陨石坑的中心点具有零分散和完美定位的特点，并提供了它们的几何参数和置信度。

{"title":"POSIT: An automated tool for detecting and characterizing diverse morphological features in raster data - Application to pockmarks, mounds, and craters","authors":"José J. Alonso del Rosario , Ariadna Canari , Elízabeth Blázquez Gómez , Sara Martínez-Loriente","doi":"10.1016/j.acags.2024.100190","DOIUrl":"10.1016/j.acags.2024.100190","url":null,"abstract":"<div><p>Accurate detection and characterization of seafloor morphologies are crucial for marine researchers and industries involved in underwater mapping, environmental monitoring, or resource exploration. Although their detection has relied on visual inspection of detailed bathymetries, few efforts to automate the process can be found in the literature. This study presents a novel MatLab computer code called POSIT (Feature Signature Detection) based on the convolution and correlation with a structural element containing the shape to search for. POSIT is successfully tested on both synthetic and real datasets, encompassing marine and terrestrial digital elevation models of different resolution and on a digital image. The centroids of submarine pockmarks and mounds, terrestrial volcanic craters and lunar craters are calculated with zero dispersion and perfect location, and their geometric parameters and confidence are provided.</p></div>","PeriodicalId":33804,"journal":{"name":"Applied Computing and Geosciences","volume":"23 ","pages":"Article 100190"},"PeriodicalIF":2.6,"publicationDate":"2024-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2590197424000375/pdfft?md5=dc86b3aae122c80855ab41c6633e87ec&pid=1-s2.0-S2590197424000375-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142096878","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Resolution enhancement and segmentation of polar bubbly ice micro CT scans via 3D convolutional neural network 通过三维卷积神经网络提高极地气泡冰微型 CT 扫描的分辨率并进行分割

IF 2.6 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Applied Computing and Geosciences

Pub Date : 2024-09-01 DOI: 10.1016/j.acags.2024.100193

Faramarz Bagherzadeh , Johannes Freitag , Udo Frese , Frank Wilhelms

Accurate segmentation of 3D micro CT scans is a key step in the process of analysis of the microstructure of porous materials. In polar ice core studies, the environmental effects on the firn column could be detected if the microstructure is digitized accurately. The most challenging task is to obtain the microstructure parameters of the bubbly ice section. To identify the minimum, necessary resolution, the bubbly ice micro CT scans with different resolutions (120, 60, 30, 12 $μ m$ ) were compared object-wise via a region pairing algorithm. When the minimum resolution was found to be 60 $μ m$ , for generating the training/validation dataset, 4 ice core samples from a depth range of 96 to 108 meters (bubbly ice) were scanned with 120 $μ m$ (input images) and another time with 4 times higher resolution (30 $μ m$ ) to build ground truth. A specific pipeline was designed with non-rigid image registration to create an accurate ground truth from 4 times higher resolution scans. Then, two SOTA deep learning models (3D-Unet and FCN) were trained and later validated to perform super-resolution segmentation by taking input of $120 μ m$ resolution data and giving the output of binary segmented with two times higher resolution ( $60 μ m$ ). Finally, the outputs of CNN models were compared with traditional rule-based and unsupervised methods on blind test data. It is observed the 3D-Unet can segment low-resolution scans with an accuracy of 96% and an f1-score of 80.8% while preserving microstructure having less than 2% error in porosity and SSA.

准确分割三维微型 CT 扫描图像是分析多孔材料微观结构的关键步骤。在极地冰芯研究中，如果能对微观结构进行精确的数字化，就能检测出环境对枞树柱的影响。最具挑战性的任务是获取气泡冰断面的微观结构参数。为了确定所需的最小分辨率，通过区域配对算法对不同分辨率（120、60、30、12 μm）的气泡冰微型 CT 扫描进行了对象比较。当发现最小分辨率为 60 μm 时，为生成训练/验证数据集，用 120 μm（输入图像）扫描了 96 至 108 米深度范围内的 4 个冰芯样本（气泡冰），并用高 4 倍的分辨率（30 μm）扫描了另一次，以建立基本真相。设计了一个非刚性图像配准的特定流水线，以便从 4 倍更高分辨率的扫描中创建精确的地面实况。然后，对两个 SOTA 深度学习模型（3D-Unet 和 FCN）进行了训练和验证，以执行超分辨率分割，方法是输入 120 微米分辨率的数据，并输出高两倍分辨率（60 微米）的二进制分割结果。最后，在盲测试数据上将 CNN 模型的输出结果与传统的基于规则的方法和无监督方法进行了比较。结果表明，3D-Unet 能以 96% 的准确率和 80.8% 的 f1 分数分割低分辨率扫描数据，同时保留微观结构，在孔隙度和 SSA 方面的误差小于 2%。

{"title":"Resolution enhancement and segmentation of polar bubbly ice micro CT scans via 3D convolutional neural network","authors":"Faramarz Bagherzadeh , Johannes Freitag , Udo Frese , Frank Wilhelms","doi":"10.1016/j.acags.2024.100193","DOIUrl":"10.1016/j.acags.2024.100193","url":null,"abstract":"<div><p>Accurate segmentation of 3D micro CT scans is a key step in the process of analysis of the microstructure of porous materials. In polar ice core studies, the environmental effects on the firn column could be detected if the microstructure is digitized accurately. The most challenging task is to obtain the microstructure parameters of the bubbly ice section. To identify the minimum, necessary resolution, the bubbly ice micro CT scans with different resolutions (120, 60, 30, 12 <span><math><mrow><mi>μ</mi><mi>m</mi></mrow></math></span>) were compared object-wise via a region pairing algorithm. When the minimum resolution was found to be 60 <span><math><mrow><mi>μ</mi><mi>m</mi></mrow></math></span>, for generating the training/validation dataset, 4 ice core samples from a depth range of 96 to 108 meters (bubbly ice) were scanned with 120 <span><math><mrow><mi>μ</mi><mi>m</mi></mrow></math></span> (input images) and another time with 4 times higher resolution (30 <span><math><mrow><mi>μ</mi><mi>m</mi></mrow></math></span>) to build ground truth. A specific pipeline was designed with non-rigid image registration to create an accurate ground truth from 4 times higher resolution scans. Then, two SOTA deep learning models (3D-Unet and FCN) were trained and later validated to perform super-resolution segmentation by taking input of <span><math><mrow><mn>120</mn><mspace></mspace><mi>μ</mi><mi>m</mi></mrow></math></span> resolution data and giving the output of binary segmented with two times higher resolution (<span><math><mrow><mn>60</mn><mspace></mspace><mi>μ</mi><mi>m</mi></mrow></math></span>). Finally, the outputs of CNN models were compared with traditional rule-based and unsupervised methods on blind test data. It is observed the 3D-Unet can segment low-resolution scans with an accuracy of 96% and an f1-score of 80.8% while preserving microstructure having less than 2% error in porosity and SSA.</p></div>","PeriodicalId":33804,"journal":{"name":"Applied Computing and Geosciences","volume":"23 ","pages":"Article 100193"},"PeriodicalIF":2.6,"publicationDate":"2024-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2590197424000405/pdfft?md5=436bc0a47d2a2e990851e57a7c794d0b&pid=1-s2.0-S2590197424000405-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142158096","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0