Pub Date : 2024-06-04DOI: 10.1016/j.cageo.2024.105625
Fantine Huot, Robert G. Clapp, Biondo L. Biondi
With fiber-optic seismic acquisition development, continuous dense seismic monitoring is becoming increasingly more accessible. Repurposing fiber cables in telecommunication conduits makes it possible to run seismic studies at low cost, even in locations where traditional seismometers are not easily installed, such as urban areas. However, due to the large volume of continuous streaming data, data collected in such a manner will go to waste unless we significantly automate the processing workflow. We train a convolutional neural network (CNN) for earthquake detection using 3000 events from a publicly available catalog and data acquired over three years by fiber cables in telecommunication conduits under the Stanford University campus. We performed a hyperparameter search both on the network architecture itself (e.g., number of layers, number of parameters) and on its training parameters, showing that CNNs with a small number of layers are sufficient for performing this detection task with high accuracy. We introduce a novel method for combining the deep learning results on fiber-optic and seismometer data to improve detection accuracy, dramatically reducing the false detection rate that is often a problem when processing large time-scale noisy continuous data. Consequently, we demonstrate that enhancing two sparse seismometer stations with an urban fiber system allows for the reliable detection of small earthquakes despite a low signal-to-noise ratio. We scale this processing method over three years of continuous data and show that this system reliably detects local small-amplitude earthquakes down to magnitudes as low as 0.5, leading to the discovery of previously uncataloged events.
{"title":"Detecting local earthquakes via fiber-optic cables in telecommunication conduits under Stanford University campus using deep learning","authors":"Fantine Huot, Robert G. Clapp, Biondo L. Biondi","doi":"10.1016/j.cageo.2024.105625","DOIUrl":"https://doi.org/10.1016/j.cageo.2024.105625","url":null,"abstract":"<div><p>With fiber-optic seismic acquisition development, continuous dense seismic monitoring is becoming increasingly more accessible. Repurposing fiber cables in telecommunication conduits makes it possible to run seismic studies at low cost, even in locations where traditional seismometers are not easily installed, such as urban areas. However, due to the large volume of continuous streaming data, data collected in such a manner will go to waste unless we significantly automate the processing workflow. We train a convolutional neural network (CNN) for earthquake detection using 3000 events from a publicly available catalog and data acquired over three years by fiber cables in telecommunication conduits under the Stanford University campus. We performed a hyperparameter search both on the network architecture itself (e.g., number of layers, number of parameters) and on its training parameters, showing that CNNs with a small number of layers are sufficient for performing this detection task with high accuracy. We introduce a novel method for combining the deep learning results on fiber-optic and seismometer data to improve detection accuracy, dramatically reducing the false detection rate that is often a problem when processing large time-scale noisy continuous data. Consequently, we demonstrate that enhancing two sparse seismometer stations with an urban fiber system allows for the reliable detection of small earthquakes despite a low signal-to-noise ratio. We scale this processing method over three years of continuous data and show that this system reliably detects local small-amplitude earthquakes down to magnitudes as low as 0.5, leading to the discovery of previously uncataloged events.</p></div>","PeriodicalId":55221,"journal":{"name":"Computers & Geosciences","volume":"190 ","pages":"Article 105625"},"PeriodicalIF":4.2,"publicationDate":"2024-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141433892","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-06-03DOI: 10.1016/j.cageo.2024.105638
Ferdinand Bhavsar, Nicolas Desassis, Fabien Ors, Thomas Romary
The simulation of geological facies in an unobservable volume is essential in various geoscience applications. Given the complexity of the problem, deep generative learning is a promising approach to overcome the limitations of traditional geostatistical simulation models, in particular their lack of physical realism. This research aims to investigate the application of generative adversarial networks and deep variational inference for conditionally simulating channelized reservoir in underground volumes. In this paper, we review the generative deep learning approaches, in particular the adversarial ones and the stabilization techniques that aim to facilitate their training. We also study the problem of conditioning deep learning models to observations through a variational Bayes approach, comparing a conditional neural network model to a Gaussian mixture model. The proposed approach is tested on 2D and 3D simulations generated by the stochastic process-based model Flumy. Morphological metrics are utilized to compare our proposed method with earlier iterations of generative adversarial networks. The results indicate that by utilizing recent stabilization techniques, generative adversarial networks can efficiently sample complex target data distributions.
{"title":"A stable deep adversarial learning approach for geological facies generation","authors":"Ferdinand Bhavsar, Nicolas Desassis, Fabien Ors, Thomas Romary","doi":"10.1016/j.cageo.2024.105638","DOIUrl":"https://doi.org/10.1016/j.cageo.2024.105638","url":null,"abstract":"<div><p>The simulation of geological facies in an unobservable volume is essential in various geoscience applications. Given the complexity of the problem, deep generative learning is a promising approach to overcome the limitations of traditional geostatistical simulation models, in particular their lack of physical realism. This research aims to investigate the application of generative adversarial networks and deep variational inference for conditionally simulating channelized reservoir in underground volumes. In this paper, we review the generative deep learning approaches, in particular the adversarial ones and the stabilization techniques that aim to facilitate their training. We also study the problem of conditioning deep learning models to observations through a variational Bayes approach, comparing a conditional neural network model to a Gaussian mixture model. The proposed approach is tested on 2D and 3D simulations generated by the stochastic process-based model Flumy. Morphological metrics are utilized to compare our proposed method with earlier iterations of generative adversarial networks. The results indicate that by utilizing recent stabilization techniques, generative adversarial networks can efficiently sample complex target data distributions.</p></div>","PeriodicalId":55221,"journal":{"name":"Computers & Geosciences","volume":"190 ","pages":"Article 105638"},"PeriodicalIF":4.4,"publicationDate":"2024-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0098300424001213/pdfft?md5=b3d713c3bb7a69df64f3f89d94fdd43a&pid=1-s2.0-S0098300424001213-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141294638","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-06-01DOI: 10.1016/j.cageo.2024.105641
Yifeng Xiao, Tongxi Wang, Hua Xiang
Oil source correlation can be used to identify the origin of crude oil by linking crude oil to source rocks; however, the manual methods, which are limited by the sample or parameter quantity or imbalanced datasets, are facing uncertainties. Although the existing multivariate statistical techniques can alleviate this problem, they are facing difficulties in processing imbalanced datasets and quantifying source beds. Therefore, a novel oil-source correlation analysis model called SVM-SelectKBest combining a support vector machine (SVM) with a feature selection algorithm to mitigate the common issue of dataset imbalance in oil-source correlations is proposed in this paper. The SVM-SelectKBest offers advantages over normal SVM by dynamically selecting the most relevant features and fine-tuning model parameters to achieve higher accuracy and better generalizability in complex datasets. SVM compensates for class imbalances by heavily penalizing the misclassification of the minority class, and SelectKBest streamlines the feature set to enhance SVM's effectiveness on critical variables. Furthermore, a shallow neural network (SensoryAttentionNet) is introduced into the proposed model to address the issue of quantifying the source bed proportions in crude oil. The result show that SVM-SelectKBest has better performance in identifying key geochemical parameters and discriminating oil source correlation, its accuracy in unbalanced datasets is improved by near 40% compared to SVM. The model obtains 25 key geochemical parameters such as C17 n-heptadecane, Pr pristane, and C18 n-octadecane, it achieves F1 scores of 1.0 on the training, validation, and test sets. SensoryAttentionNet also performs robustly, with a low variance of 0.05 between its predicted and actual values. All the results indicate the effectiveness of the proposed method in dealing with the imbalance problem in oil-source source correlation datasets and in determining the proportional contribution of source beds in crude oil.
{"title":"Optimizing oil-source correlation analysis using support vector machines and sensory attention networks","authors":"Yifeng Xiao, Tongxi Wang, Hua Xiang","doi":"10.1016/j.cageo.2024.105641","DOIUrl":"10.1016/j.cageo.2024.105641","url":null,"abstract":"<div><p>Oil source correlation can be used to identify the origin of crude oil by linking crude oil to source rocks; however, the manual methods, which are limited by the sample or parameter quantity or imbalanced datasets, are facing uncertainties. Although the existing multivariate statistical techniques can alleviate this problem, they are facing difficulties in processing imbalanced datasets and quantifying source beds. Therefore, a novel oil-source correlation analysis model called SVM-SelectKBest combining a support vector machine (SVM) with a feature selection algorithm to mitigate the common issue of dataset imbalance in oil-source correlations is proposed in this paper. The SVM-SelectKBest offers advantages over normal SVM by dynamically selecting the most relevant features and fine-tuning model parameters to achieve higher accuracy and better generalizability in complex datasets. SVM compensates for class imbalances by heavily penalizing the misclassification of the minority class, and SelectKBest streamlines the feature set to enhance SVM's effectiveness on critical variables. Furthermore, a shallow neural network (SensoryAttentionNet) is introduced into the proposed model to address the issue of quantifying the source bed proportions in crude oil. The result show that SVM-SelectKBest has better performance in identifying key geochemical parameters and discriminating oil source correlation, its accuracy in unbalanced datasets is improved by near 40% compared to SVM. The model obtains 25 key geochemical parameters such as C17 n-heptadecane, Pr pristane, and C18 n-octadecane, it achieves F1 scores of 1.0 on the training, validation, and test sets. SensoryAttentionNet also performs robustly, with a low variance of 0.05 between its predicted and actual values. All the results indicate the effectiveness of the proposed method in dealing with the imbalance problem in oil-source source correlation datasets and in determining the proportional contribution of source beds in crude oil.</p></div>","PeriodicalId":55221,"journal":{"name":"Computers & Geosciences","volume":"189 ","pages":"Article 105641"},"PeriodicalIF":4.4,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141231396","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-05-28DOI: 10.1016/j.cageo.2024.105639
Ana Alice Rodrigues Dantas Almeida , Rafael Lopes Mendonça , Natalia Maria Mendes Silva , Adriano Rolim da Paz
Digital elevation models obtained from LiDAR surveys typically have a few meters or sub-meter resolution. DEM-derived products in such a fine resolution may not be desired for several circumstances, such as matching the resolution with other spatial datasets, preparing input data for hydrological models, and reducing the computational cost. This leads to DEM coarsening for further river network extraction. An alternative could be to derive the river flow paths in the original DEM resolution and use this information to obtain the coarser river networks (a procedure known as flow directions upscaling). This approach is the macroscale hydrology benchmark for deriving river networks with spatial resolution on the order of a few kilometers or even larger, based on the available DEM with tenths or hundreds of meters resolution. However, no study has applied this procedure for the change of scale involving fine-resolution LiDAR DEM. This research evaluated for the first time in literature a flow direction upscaling algorithm for deriving relatively coarse-resolution (30, 100, and 200m) river networks from very fine-resolution (1 m) flow paths obtained from LiDAR DEM. Two river basins of contrasting characteristics located in Northeast Brazil are studied. Results were evaluated through visual inspection, percentage within buffer (PWB) metrics, and river length comparison. It is shown that using an upscaling algorithm improves the ability of the coarse network to preserve river networks’ spatial patterns across multiple scale changes. Considering both basins, PWB ranged from 80% to 100% (average of 97%) for the upscaling procedure, while the DEM resampling resulted in PWB between 40% and 100% (average of 85%). A flow direction upscaling algorithm already used for macroscale hydrology proved helpful for the LiDAR-related shift in scale, outperforming the DEM resampling. Increasing the scale change augments the difference in performance between them, making the upscaling procedure more recommended. In addition, such an upscaling procedure provided drainage networks in the 100-m and 200-m resolutions with higher quality than the one obtained in the 30-m resolution directly from a globally available DEM.
通过激光雷达勘测获得的数字高程模型通常只有几米或几米以下的分辨率。在一些情况下,例如与其他空间数据集的分辨率相匹配、为水文模型准备输入数据以及降低计算成本,可能并不需要如此精细分辨率的 DEM 衍生产品。这就导致在进一步提取河网时需要对 DEM 进行粗略处理。另一种方法是在原始 DEM 分辨率中提取河流流向,并利用这些信息获得更粗的河网(这一过程被称为流向放大)。这种方法是宏观水文学的基准,可根据现有的十分之一或数百米分辨率的 DEM 得出空间分辨率为几公里甚至更大的河网。然而,还没有研究将这一程序用于涉及精细分辨率 LiDAR DEM 的尺度变化。本研究首次在文献中评估了一种流向升级算法,该算法可根据从激光雷达 DEM 中获取的极高分辨率(1 米)流径推导出相对较粗分辨率(30、100 和 200 米)的河网。研究对象是位于巴西东北部的两个特点截然不同的河流流域。通过目测、缓冲区内百分比(PWB)度量和河流长度比较对结果进行了评估。结果表明,使用上标算法可以提高粗网络在多种尺度变化中保持河网空间模式的能力。考虑到两个流域的情况,上规模程序的 PWB 在 80% 至 100% 之间(平均为 97%),而 DEM 重采样的 PWB 在 40% 至 100% 之间(平均为 85%)。事实证明,已经用于宏观水文的流向放大算法有助于解决与激光雷达相关的尺度变化问题,其效果优于 DEM 重采样。随着尺度变化的增加,两者之间的性能差异也在扩大,因此更推荐使用上标程序。此外,与直接从全球可用的 DEM 中获得的 30 米分辨率的排水网络相比,这种升级程序提供的 100 米和 200 米分辨率的排水网络质量更高。
{"title":"Evaluation of LiDAR-derived river networks coarsening with spatial patterns preservation","authors":"Ana Alice Rodrigues Dantas Almeida , Rafael Lopes Mendonça , Natalia Maria Mendes Silva , Adriano Rolim da Paz","doi":"10.1016/j.cageo.2024.105639","DOIUrl":"https://doi.org/10.1016/j.cageo.2024.105639","url":null,"abstract":"<div><p>Digital elevation models obtained from LiDAR surveys typically have a few meters or sub-meter resolution. DEM-derived products in such a fine resolution may not be desired for several circumstances, such as matching the resolution with other spatial datasets, preparing input data for hydrological models, and reducing the computational cost. This leads to DEM coarsening for further river network extraction. An alternative could be to derive the river flow paths in the original DEM resolution and use this information to obtain the coarser river networks (a procedure known as flow directions upscaling). This approach is the macroscale hydrology benchmark for deriving river networks with spatial resolution on the order of a few kilometers or even larger, based on the available DEM with tenths or hundreds of meters resolution. However, no study has applied this procedure for the change of scale involving fine-resolution LiDAR DEM. This research evaluated for the first time in literature a flow direction upscaling algorithm for deriving relatively coarse-resolution (30, 100, and 200m) river networks from very fine-resolution (1 m) flow paths obtained from LiDAR DEM. Two river basins of contrasting characteristics located in Northeast Brazil are studied. Results were evaluated through visual inspection, percentage within buffer (PWB) metrics, and river length comparison. It is shown that using an upscaling algorithm improves the ability of the coarse network to preserve river networks’ spatial patterns across multiple scale changes. Considering both basins, PWB ranged from 80% to 100% (average of 97%) for the upscaling procedure, while the DEM resampling resulted in PWB between 40% and 100% (average of 85%). A flow direction upscaling algorithm already used for macroscale hydrology proved helpful for the LiDAR-related shift in scale, outperforming the DEM resampling. Increasing the scale change augments the difference in performance between them, making the upscaling procedure more recommended. In addition, such an upscaling procedure provided drainage networks in the 100-m and 200-m resolutions with higher quality than the one obtained in the 30-m resolution directly from a globally available DEM.</p></div>","PeriodicalId":55221,"journal":{"name":"Computers & Geosciences","volume":"189 ","pages":"Article 105639"},"PeriodicalIF":4.4,"publicationDate":"2024-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141242758","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-05-27DOI: 10.1016/j.cageo.2024.105627
Abdelhamid Nouayti , E. Chham , I. Berriban , M. Azahra , Mohamed Drissi El-Bouzaidi , J.A.G. Orza , M. Hadouachi , T. El Ghalbzouri , T. El Bardouni , H. El Yaakoubi , M.A. Ferro-García
In this paper, we present a new open-source software “Open-AMA” developed to investigate atmospheric circulation dynamics and environmental research. Open AMA presents an integral package to conduct several air mass analyses. It appears to be a powerful, versatile software package developed to meet the needs of researchers using python and C++ in order to facilitate and speed up working time. This software seamlessly integrates new models for source identification based on air trajectories and ambient air pollution concentration data and enhances certain existing ones. Beyond source identification, it offers a rich array of functionalities for making it automatic, quick and easy to get access many kinds data including gridded meteorological data, trajectory calculations, synoptic parameter extraction from back-trajectories. All this functionalities can be used through a user-friendly graphical interface. Open-AMA can be a significant leap forward in air quality research and analysis, empowering researchers with the tools they need to make informed decisions and address pressing environmental and public health challenges and enhance understanding of pollutant origins in the atmosphere.
在本文中,我们介绍了一个新的开源软件 "Open-AMA",该软件是为研究大气环流动力学和环境研究而开发的。Open AMA 是一个用于进行多项气团分析的完整软件包。它似乎是一个功能强大、用途广泛的软件包,使用 python 和 C++ 来满足研究人员的需求,以方便和加快工作时间。该软件无缝集成了基于空气轨迹和环境空气污染浓度数据的新污染源识别模型,并增强了某些现有模型。除污染源识别外,该软件还提供了丰富的功能,可自动、快速、方便地获取多种数据,包括网格气象数据、轨迹计算、从回溯轨迹中提取同步参数等。所有这些功能都可以通过用户友好的图形界面使用。Open-AMA 可以成为空气质量研究和分析领域的一次重大飞跃,为研究人员提供所需的工具,使他们能够做出明智的决策,应对紧迫的环境和公共卫生挑战,并加深对大气中污染物来源的了解。
{"title":"Open-AMA: Open-source software for air masses statistical analysis","authors":"Abdelhamid Nouayti , E. Chham , I. Berriban , M. Azahra , Mohamed Drissi El-Bouzaidi , J.A.G. Orza , M. Hadouachi , T. El Ghalbzouri , T. El Bardouni , H. El Yaakoubi , M.A. Ferro-García","doi":"10.1016/j.cageo.2024.105627","DOIUrl":"https://doi.org/10.1016/j.cageo.2024.105627","url":null,"abstract":"<div><p>In this paper, we present a new open-source software “Open-AMA” developed to investigate atmospheric circulation dynamics and environmental research. Open AMA presents an integral package to conduct several air mass analyses. It appears to be a powerful, versatile software package developed to meet the needs of researchers using python and C++ in order to facilitate and speed up working time. This software seamlessly integrates new models for source identification based on air trajectories and ambient air pollution concentration data and enhances certain existing ones. Beyond source identification, it offers a rich array of functionalities for making it automatic, quick and easy to get access many kinds data including gridded meteorological data, trajectory calculations, synoptic parameter extraction from back-trajectories. All this functionalities can be used through a user-friendly graphical interface. Open-AMA can be a significant leap forward in air quality research and analysis, empowering researchers with the tools they need to make informed decisions and address pressing environmental and public health challenges and enhance understanding of pollutant origins in the atmosphere.</p></div>","PeriodicalId":55221,"journal":{"name":"Computers & Geosciences","volume":"189 ","pages":"Article 105627"},"PeriodicalIF":4.4,"publicationDate":"2024-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141242757","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-05-25DOI: 10.1016/j.cageo.2024.105626
Nils B. Gies, Pierre Lanari, Jörg Hermann
Spectroscopic analytical techniques such as Fourier Transform Infrared Spectroscopy (FTIR), Raman or hyperspectral imaging are important in modern geosciences. In recent years there has been a shift from using exclusively single-spot analyses to 2-dimensional maps. Maps help to reveal patterns in a sample that would not be detected by single point measurements. Filtering and extracting signal information from multiple combined pixels can help improve the signal-to-noise ratio and thus the precision of the data. The combination of multi-layer numerical datasets obtained from different instruments or measurement settings opens up the possibility of exploring and investigating individual datasets in much greater detail. However, the amount of data and information in the dataset increases significantly when high-resolution spectroscopic and spatial data is used instead of spot analyses, thus making the data examination and data validation more challenging and time consuming. To investigate large datasets, we have developed SpecXY, a software solution for preparing, editing, extracting, and comparing spatially resolved spectral datasets. SpecXY aims to provide a user-friendly and open-source software solution for working with spectroscopic data by providing a simple user interface that is accessible to all users with basic computer skills. Advanced users with a basic understanding of MATLAB® programming can adapt, customise and extend SpecXY due to its modular and function-based program structure. SpecXY also provides innovative algorithms for analyzing spectral data, such as Monte Carlo deconvolution of peaks, and hybrid classification and filtering based on spectra in combination with user knowledge. Two examples illustrate possible applications of SpecXY: (1) multidimensional classification and correlation of spatially resolved spectroscopic data and quantified chemical element maps, and (2) classification, filtering, quantification of H2O in minerals and profile extraction of a high-resolution spectroscopic data set measured by Fourier Transform Infrared (FTIR) Spectroscopy coupled to a Focal Plane Array (FPA) detector.
{"title":"A workflow and software solution for spatially resolved spectroscopic and numerical data (SpecXY)","authors":"Nils B. Gies, Pierre Lanari, Jörg Hermann","doi":"10.1016/j.cageo.2024.105626","DOIUrl":"https://doi.org/10.1016/j.cageo.2024.105626","url":null,"abstract":"<div><p>Spectroscopic analytical techniques such as Fourier Transform Infrared Spectroscopy (FTIR), Raman or hyperspectral imaging are important in modern geosciences. In recent years there has been a shift from using exclusively single-spot analyses to 2-dimensional maps. Maps help to reveal patterns in a sample that would not be detected by single point measurements. Filtering and extracting signal information from multiple combined pixels can help improve the signal-to-noise ratio and thus the precision of the data. The combination of multi-layer numerical datasets obtained from different instruments or measurement settings opens up the possibility of exploring and investigating individual datasets in much greater detail. However, the amount of data and information in the dataset increases significantly when high-resolution spectroscopic and spatial data is used instead of spot analyses, thus making the data examination and data validation more challenging and time consuming. To investigate large datasets, we have developed SpecXY, a software solution for preparing, editing, extracting, and comparing spatially resolved spectral datasets. SpecXY aims to provide a user-friendly and open-source software solution for working with spectroscopic data by providing a simple user interface that is accessible to all users with basic computer skills. Advanced users with a basic understanding of MATLAB® programming can adapt, customise and extend SpecXY due to its modular and function-based program structure. SpecXY also provides innovative algorithms for analyzing spectral data, such as Monte Carlo deconvolution of peaks, and hybrid classification and filtering based on spectra in combination with user knowledge. Two examples illustrate possible applications of SpecXY: (1) multidimensional classification and correlation of spatially resolved spectroscopic data and quantified chemical element maps, and (2) classification, filtering, quantification of H<sub>2</sub>O in minerals and profile extraction of a high-resolution spectroscopic data set measured by Fourier Transform Infrared (FTIR) Spectroscopy coupled to a Focal Plane Array (FPA) detector.</p></div>","PeriodicalId":55221,"journal":{"name":"Computers & Geosciences","volume":"189 ","pages":"Article 105626"},"PeriodicalIF":4.4,"publicationDate":"2024-05-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0098300424001092/pdfft?md5=ae46d0df52c2b41688489543aec1765e&pid=1-s2.0-S0098300424001092-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141250207","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-05-23DOI: 10.1016/j.cageo.2024.105623
Lewen Qiu , Zhengguang Liu , Hongbo Yao , Jingtian Tang
Nowadays, there is a growing trend that direct current (DC) field surveys are shifting towards challenging areas characterized by mountainous topography and electrical anisotropy. Given these complex geological settings, there is an urgent need for 3-D DC forward modeling software capable of effectively addressing large-scale problems and delivering accurate modeling results to interpret field data. However, most open-source software packages face certain limitations, such as the high numerical cost to handle complex surface topography, the lack of consideration for anisotropic conductivity, the absence of mesh refinement techniques to guarantee accuracy in forward modeling, and the lack of parallel computing techniques to solve large-scale problems. In this study, we develop an efficient and highly accurate 3-D DC anisotropic forward modeling software, namely DC3DPAFEM, using the adaptive finite element algorithm based on the unstructured tetrahedral mesh. Firstly, we construct a strong compatible boundary value problem (BVP) for 3-D anisotropic DC problems by adopting a specialized secondary potential approach to handle the surface topography efficiently. Then, we develop a goal-oriented adaptive mesh refinement (AMR) technique to ensure accurate forward modeling results, even with a coarse initial mesh. To ensure time and memory efficiency, we employ a robust conjugate gradient (CG) algorithm preconditioned by the algebraic multigrid (AMG) solver to solve the large-scale linear system of equations resulting from complex geological structures. We aim to investigate the performance of the AMG scheme in anisotropic DC cases. Furthermore, we incorporate the domain decomposition technique into the iterative solution scheme for further efficiency gains. This technique significantly improves computing efficiency for large-scale problems in parallel clusters. Finally, we conduct comprehensive performance tests for DC3DPAFEM using a two-layer anisotropic model and a 3-D complex model with undulating terrain. The results of both examples validate the accuracy of DC3DPAFEM, as they closely align with the analytical solutions and the solutions obtained from the existing 3-D DC forward modeling code. Compared to traditional direct solver MUMPS and ILU-preconditioned iterative solvers, DC3DPAFEM exhibits highly scalable performance for large-scale problems, offering significant advantages in terms of memory and time consumption. Overall, DC3DPAFEM demonstrates substantial advances in efficiency, accuracy, and practicality through a series of numerical examples. This open-source code provides an efficient and available tool for developing a 3-D DC inversion method that can deal with large-scale problems involving intricate topography and anisotropic media.
如今,直流(DC)野外勘测正逐渐转向以山地地形和电各向异性为特征的挑战性地区。鉴于这些复杂的地质环境,迫切需要能够有效解决大规模问题并提供精确建模结果以解释野外数据的三维直流正演建模软件。然而,大多数开源软件包都存在一定的局限性,例如处理复杂地表地形的数值成本较高,缺乏对各向异性导电性的考虑,缺乏保证正演建模精度的网格细化技术,以及缺乏解决大规模问题的并行计算技术。在本研究中,我们利用基于非结构四面体网格的自适应有限元算法,开发了一种高效、高精度的三维直流各向异性正演建模软件,即 DC3DPAFEM。首先,我们为三维各向异性直流问题构建了强兼容边界值问题(BVP),采用专门的二次电动势方法高效处理表面形貌。然后,我们开发了一种面向目标的自适应网格细化(AMR)技术,以确保即使在初始网格较粗的情况下也能获得精确的前向建模结果。为确保时间和内存效率,我们采用了一种鲁棒共轭梯度(CG)算法,并通过代数多网格(AMG)求解器进行预处理,以求解复杂地质结构产生的大规模线性方程组。我们旨在研究 AMG 方案在各向异性直流情况下的性能。此外,我们还将域分解技术纳入迭代求解方案,以进一步提高效率。这种技术大大提高了并行集群中大规模问题的计算效率。最后,我们使用双层各向异性模型和带起伏地形的三维复杂模型对 DC3DPAFEM 进行了全面的性能测试。这两个例子的结果验证了 DC3DPAFEM 的准确性,因为它们与分析解以及现有三维 DC 正演建模代码得到的解非常接近。与传统的直接求解器 MUMPS 和 ILU 条件迭代求解器相比,DC3DPAFEM 在处理大规模问题时表现出高度可扩展的性能,在内存和时间消耗方面具有显著优势。总之,DC3DPAFEM 通过一系列数值示例展示了在效率、精度和实用性方面的巨大进步。该开源代码为开发三维直流反演方法提供了一个高效、可用的工具,可以处理涉及复杂地形和各向异性介质的大规模问题。
{"title":"DC3DPAFEM: An efficient and accurate 3-D direct current resistivity anisotropic forward modeling software for complex geological settings","authors":"Lewen Qiu , Zhengguang Liu , Hongbo Yao , Jingtian Tang","doi":"10.1016/j.cageo.2024.105623","DOIUrl":"10.1016/j.cageo.2024.105623","url":null,"abstract":"<div><p>Nowadays, there is a growing trend that direct current (DC) field surveys are shifting towards challenging areas characterized by mountainous topography and electrical anisotropy. Given these complex geological settings, there is an urgent need for 3-D DC forward modeling software capable of effectively addressing large-scale problems and delivering accurate modeling results to interpret field data. However, most open-source software packages face certain limitations, such as the high numerical cost to handle complex surface topography, the lack of consideration for anisotropic conductivity, the absence of mesh refinement techniques to guarantee accuracy in forward modeling, and the lack of parallel computing techniques to solve large-scale problems. In this study, we develop an efficient and highly accurate 3-D DC anisotropic forward modeling software, namely DC3DPAFEM, using the adaptive finite element algorithm based on the unstructured tetrahedral mesh. Firstly, we construct a strong compatible boundary value problem (BVP) for 3-D anisotropic DC problems by adopting a specialized secondary potential approach to handle the surface topography efficiently. Then, we develop a goal-oriented adaptive mesh refinement (AMR) technique to ensure accurate forward modeling results, even with a coarse initial mesh. To ensure time and memory efficiency, we employ a robust conjugate gradient (CG) algorithm preconditioned by the algebraic multigrid (AMG) solver to solve the large-scale linear system of equations resulting from complex geological structures. We aim to investigate the performance of the AMG scheme in anisotropic DC cases. Furthermore, we incorporate the domain decomposition technique into the iterative solution scheme for further efficiency gains. This technique significantly improves computing efficiency for large-scale problems in parallel clusters. Finally, we conduct comprehensive performance tests for DC3DPAFEM using a two-layer anisotropic model and a 3-D complex model with undulating terrain. The results of both examples validate the accuracy of DC3DPAFEM, as they closely align with the analytical solutions and the solutions obtained from the existing 3-D DC forward modeling code. Compared to traditional direct solver MUMPS and ILU-preconditioned iterative solvers, DC3DPAFEM exhibits highly scalable performance for large-scale problems, offering significant advantages in terms of memory and time consumption. Overall, DC3DPAFEM demonstrates substantial advances in efficiency, accuracy, and practicality through a series of numerical examples. This open-source code provides an efficient and available tool for developing a 3-D DC inversion method that can deal with large-scale problems involving intricate topography and anisotropic media.</p></div>","PeriodicalId":55221,"journal":{"name":"Computers & Geosciences","volume":"189 ","pages":"Article 105623"},"PeriodicalIF":4.4,"publicationDate":"2024-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141130601","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-05-23DOI: 10.1016/j.cageo.2024.105624
Gul Rukh Khattak, Gul Muhammad Khan, Suhail Yousaf
We present as a tool for discriminating seismic phases, leveraging artificial intelligence technique (Convolutional Neural Network) for short-time Frequency Transform of the seismic signal. Timely detection of the vertical () wave from an earthquake can generate a warning several tens of precious seconds before the more destructive waves strike. We propose a train-for-each-station approach for an Internet-of-Things-based Smart Earthquake Early Warning System, where lightweight neural networks trained for the seismic data belonging to each station are implemented on edge devices directly interfaced with seismometers. The approach has the potential to get the most from the sparse seismic network for Pakistan and other third-world countries. We train networks for multi-station and single-station data and achieve 96% and 99% accuracy, respectively, proving that train-for-each-station maximizes accuracy. The total processing time (including preprocessing and inference) is about for each event, thus suitable for real-time deployment. We further compare the performance of on simulated real-time data with several state-of-the-art contemporary algorithms. Our proposed approach demonstrates a robust response on diverse metrics. The classifies the vertical seismic signal component with high accuracy and the can classify any seismic data component, inculcating robustness against connectivity issues.
{"title":"ConvEQ: Convolutional neural network for earthquake phase classification using short time frequency transform","authors":"Gul Rukh Khattak, Gul Muhammad Khan, Suhail Yousaf","doi":"10.1016/j.cageo.2024.105624","DOIUrl":"10.1016/j.cageo.2024.105624","url":null,"abstract":"<div><p>We present <span><math><mrow><mi>C</mi><mi>o</mi><mi>n</mi><mi>v</mi><mi>E</mi><mi>Q</mi></mrow></math></span> as a tool for discriminating seismic phases, leveraging artificial intelligence technique (Convolutional Neural Network) for short-time Frequency Transform of the seismic signal. Timely detection of the vertical (<span><math><mi>P</mi></math></span>) wave from an earthquake can generate a warning several tens of precious seconds before the more destructive waves strike. We propose a train-for-each-station approach for an Internet-of-Things-based Smart Earthquake Early Warning System, where lightweight neural networks trained for the seismic data belonging to each station are implemented on edge devices directly interfaced with seismometers. The approach has the potential to get the most from the sparse seismic network for Pakistan and other third-world countries. We train networks for multi-station and single-station data and achieve 96% and 99% accuracy, respectively, proving that train-for-each-station maximizes accuracy. The total processing time (including preprocessing and inference) is about <span><math><mrow><mn>30</mn><mspace></mspace><mi>ms</mi></mrow></math></span> for each event, thus suitable for real-time deployment. We further compare the performance of <span><math><mrow><mi>C</mi><mi>o</mi><mi>n</mi><mi>v</mi><mi>E</mi><mi>Q</mi></mrow></math></span> on simulated real-time data with several state-of-the-art contemporary algorithms. Our proposed approach demonstrates a robust response on diverse metrics. The <span><math><mrow><mi>C</mi><mi>o</mi><mi>n</mi><mi>v</mi><mi>E</mi><mi>Q</mi><mi>Z</mi></mrow></math></span> classifies the vertical seismic signal component with high accuracy and the <span><math><mrow><mi>C</mi><mi>o</mi><mi>n</mi><mi>v</mi><mi>E</mi><mi>Q</mi><mi>X</mi></mrow></math></span> can classify any seismic data component, inculcating robustness against connectivity issues.</p></div>","PeriodicalId":55221,"journal":{"name":"Computers & Geosciences","volume":"189 ","pages":"Article 105624"},"PeriodicalIF":4.4,"publicationDate":"2024-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141137147","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-05-22DOI: 10.1016/j.cageo.2024.105628
Wei Fang , Yuxiang Fu , Victor S. Sheng
Traditional methods for fire point segmentation (FPS) in satellite remote sensing images (RSIs) overly rely on threshold judgment, which are greatly affected by factors such as regional time and show poor generalization. Besides, due to the difference between natural scene images (NSIs) and RSIs, directly apply NSIs-based deep learning methods to forest fire RSIs without any modification fails to achieve satisfactory results. To address these issues, first, we construct a Landsat8 RSI-FPS dataset covering different years, seasons and regions. Then, for the first time, we apply salient object detection (SOD) to FPS in forest fire monitoring and propose a novel network FPS-U2Net to improve the performance of FPS. FPS-U2Net is based on U2Netp (a lightweight U2Net), to make better use of the multi-level features from adjacent encoders, we propose multi-level aggregation module (MAM), which is placed between the encoder and decoder at the same stage to aggregate the adjacent multi-scale features and capture richer contextual information. To make up for the weakness of BCE loss, we employ the hybrid loss, BCE + IoU, for the training of the network, which can guide the network learn the salient information from pixel and map levels. Extensive experiments on three datasets demonstrate that our FPS-U2Net significantly outperforms the state-of-the-art semantic segmentation and SOD methods. FPS-U2Net can accurately segment fire regions and predict clear local details.
{"title":"FPS-U2Net: Combining U2Net and multi-level aggregation architecture for fire point segmentation in remote sensing images","authors":"Wei Fang , Yuxiang Fu , Victor S. Sheng","doi":"10.1016/j.cageo.2024.105628","DOIUrl":"https://doi.org/10.1016/j.cageo.2024.105628","url":null,"abstract":"<div><p>Traditional methods for fire point segmentation (FPS) in satellite remote sensing images (RSIs) overly rely on threshold judgment, which are greatly affected by factors such as regional time and show poor generalization. Besides, due to the difference between natural scene images (NSIs) and RSIs, directly apply NSIs-based deep learning methods to forest fire RSIs without any modification fails to achieve satisfactory results. To address these issues, first, we construct a Landsat8 RSI-FPS dataset covering different years, seasons and regions. Then, for the first time, we apply salient object detection (SOD) to FPS in forest fire monitoring and propose a novel network FPS-U<sup>2</sup>Net to improve the performance of FPS. FPS-U<sup>2</sup>Net is based on U<sup>2</sup>Netp (a lightweight U<sup>2</sup>Net), to make better use of the multi-level features from adjacent encoders, we propose multi-level aggregation module (MAM), which is placed between the encoder and decoder at the same stage to aggregate the adjacent multi-scale features and capture richer contextual information. To make up for the weakness of BCE loss, we employ the hybrid loss, BCE + IoU, for the training of the network, which can guide the network learn the salient information from pixel and map levels. Extensive experiments on three datasets demonstrate that our FPS-U<sup>2</sup>Net significantly outperforms the state-of-the-art semantic segmentation and SOD methods. FPS-U<sup>2</sup>Net can accurately segment fire regions and predict clear local details.</p></div>","PeriodicalId":55221,"journal":{"name":"Computers & Geosciences","volume":"189 ","pages":"Article 105628"},"PeriodicalIF":4.4,"publicationDate":"2024-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141097858","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The simultaneous prediction of the subsurface distribution of facies and acoustic impedance () from fullstack seismic data requires solving an inverse problem and is fundamental in natural resources exploration, carbon capture and storage, and environmental risk management. In recent years, deep generative models (DGM), such as variational autoencoders (VAE) and generative adversarial networks (GAN), were proposed to reproduce complex facies patterns honoring prior geological information. Variational Bayesian inference using inverse autoregressive flows (IAF) can be performed to infer the solution to a geophysical inverse problem from the encoded latent space of such pre-trained DGM. Successful applications of such approach on crosshole ground-penetrating radar synthetic data inversion demonstrated that the technique's accuracy is comparable to that of Markov chain Monte Carlo (MCMC) inference methods, while significantly reducing the computational cost. Nonetheless, these application examples did not account for the spatial uncertainty affecting the facies-dependent continuous physical property, from which the geophysical data are calculated. This uncertainty can significantly affect the inversion accuracy and its applicability to real data. In this work, specific VAE and GAN architectures are proposed to simultaneously predict facies and co-located , while accounting for their spatial uncertainties. The two types of generative networks are used in Bayesian inversion with IAF for the inversion of seismic data. The results are found to reproduce the statistics of the training images and solve the seismic inversion problem accurately, comparably to MCMC inversion. Furthermore, advantages and limitations of the two DGMs are evaluated by comparing the results obtained.
{"title":"Deep generative networks for multivariate fullstack seismic data inversion using inverse autoregressive flows","authors":"Roberto Miele , Shiran Levy , Niklas Linde , Amilcar Soares , Leonardo Azevedo","doi":"10.1016/j.cageo.2024.105622","DOIUrl":"10.1016/j.cageo.2024.105622","url":null,"abstract":"<div><p>The simultaneous prediction of the subsurface distribution of facies and acoustic impedance (<span><math><msub><mi>I</mi><mi>P</mi></msub></math></span>) from fullstack seismic data requires solving an inverse problem and is fundamental in natural resources exploration, carbon capture and storage, and environmental risk management. In recent years, deep generative models (DGM), such as variational autoencoders (VAE) and generative adversarial networks (GAN), were proposed to reproduce complex facies patterns honoring prior geological information. Variational Bayesian inference using inverse autoregressive flows (IAF) can be performed to infer the solution to a geophysical inverse problem from the encoded latent space of such pre-trained DGM. Successful applications of such approach on crosshole ground-penetrating radar synthetic data inversion demonstrated that the technique's accuracy is comparable to that of Markov chain Monte Carlo (MCMC) inference methods, while significantly reducing the computational cost. Nonetheless, these application examples did not account for the spatial uncertainty affecting the facies-dependent continuous physical property, from which the geophysical data are calculated. This uncertainty can significantly affect the inversion accuracy and its applicability to real data. In this work, specific VAE and GAN architectures are proposed to simultaneously predict facies and co-located <span><math><msub><mi>I</mi><mi>P</mi></msub></math></span>, while accounting for their spatial uncertainties. The two types of generative networks are used in Bayesian inversion with IAF for the inversion of seismic data. The results are found to reproduce the statistics of the training images and solve the seismic inversion problem accurately, comparably to MCMC inversion. Furthermore, advantages and limitations of the two DGMs are evaluated by comparing the results obtained.</p></div>","PeriodicalId":55221,"journal":{"name":"Computers & Geosciences","volume":"188 ","pages":"Article 105622"},"PeriodicalIF":4.4,"publicationDate":"2024-05-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0098300424001055/pdfft?md5=8765d44fb856d4c3f9c42a93b690c4fe&pid=1-s2.0-S0098300424001055-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141041984","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}