Artificial Intelligence in Geosciences最新文献_第7页

MLReal: Bridging the gap between training on synthetic data and real data applications in machine learning MLReal:弥合机器学习中合成数据训练与真实数据应用之间的差距

Artificial Intelligence in Geosciences

Pub Date : 2022-12-01 DOI: 10.1016/j.aiig.2022.09.002

Tariq Alkhalifah, Hanchen Wang, Oleg Ovcharenko

Among the biggest challenges we face in utilizing neural networks trained on waveform (i.e., seismic, electromagnetic, or ultrasound) data is its application to real data. The requirement for accurate labels often forces us to train our networks using synthetic data, where labels are readily available. However, synthetic data often fail to capture the reality of the field/real experiment, and we end up with poor performance of the trained neural networks (NNs) at the inference stage. This is because synthetic data lack many of the realistic features embedded in real data, including an accurate waveform source signature, realistic noise, and accurate reflectivity. In other words, the real data set is far from being a sample from the distribution of the synthetic training set. Thus, we describe a novel approach to enhance our supervised neural network (NN) training on synthetic data with real data features (domain adaptation). Specifically, for tasks in which the absolute values of the vertical axis (time or depth) of the input section are not crucial to the prediction, like classification, or can be corrected after the prediction, like velocity model building using a well, we suggest a series of linear operations on the input to the network data so that the training and application data have similar distributions. This is accomplished by applying two operations on the input data to the NN, whether the input is from the synthetic or real data subset domain: (1) The crosscorrelation of the input data section (i.e., shot gather, seismic image, etc.) with a fixed-location reference trace from the input data section. (2) The convolution of the resulting data with the mean (or a random sample) of the autocorrelated sections from the other subset domain. In the training stage, the input data are from the synthetic subset domain and the auto-corrected (we crosscorrelate each trace with itself) sections are from the real subset domain, and the random selection of sections from the real data is implemented at every epoch of the training. In the inference/application stage, the input data are from the real subset domain and the mean of the autocorrelated sections are from the synthetic data subset domain. Example applications on passive seismic data for microseismic event source location determination and on active seismic data for predicting low frequencies are used to demonstrate the power of this approach in improving the applicability of our trained NNs to real data.

在利用波形(即地震、电磁或超声波)数据训练的神经网络时，我们面临的最大挑战之一是将其应用于实际数据。对准确标签的要求常常迫使我们使用合成数据来训练我们的网络，在这些数据中，标签很容易获得。然而，合成数据往往不能捕捉现场/真实实验的现实，并且我们最终在推理阶段训练的神经网络(nn)的性能很差。这是因为合成数据缺乏真实数据中嵌入的许多真实特征，包括准确的波形源特征、真实的噪声和准确的反射率。换句话说，真实的数据集远不是合成训练集分布的样本。因此，我们描述了一种新的方法来增强我们的监督神经网络(NN)训练的合成数据与真实的数据特征(域适应)。具体来说，对于输入段纵轴(时间或深度)的绝对值对预测不重要的任务(如分类)，或者可以在预测后进行校正的任务(如使用井建立速度模型)，我们建议对网络数据的输入进行一系列线性操作，使训练数据和应用数据具有相似的分布。这是通过对输入数据对NN应用两种操作来实现的，无论输入是来自合成数据子集域还是真实数据子集域:(1)输入数据部分(即射击采集，地震图像等)与输入数据部分的固定位置参考轨迹的相互关系。(2)结果数据与来自其他子集域的自相关部分的平均值(或随机样本)的卷积。在训练阶段，输入数据来自合成子集域，自动校正(我们将每个轨迹与自身相互关联)的部分来自真实子集域，并且在训练的每个epoch都从真实数据中随机选择部分。在推理/应用阶段，输入数据来自真实子集域，自相关部分的平均值来自合成数据子集域。在微地震事件源定位的被动地震数据和低频预测的主动地震数据上的实例应用，证明了这种方法在提高我们训练的神经网络对实际数据的适用性方面的强大作用。

{"title":"MLReal: Bridging the gap between training on synthetic data and real data applications in machine learning","authors":"Tariq Alkhalifah, Hanchen Wang, Oleg Ovcharenko","doi":"10.1016/j.aiig.2022.09.002","DOIUrl":"https://doi.org/10.1016/j.aiig.2022.09.002","url":null,"abstract":"<div><p>Among the biggest challenges we face in utilizing neural networks trained on waveform (i.e., seismic, electromagnetic, or ultrasound) data is its application to real data. The requirement for accurate labels often forces us to train our networks using synthetic data, where labels are readily available. However, synthetic data often fail to capture the reality of the field/real experiment, and we end up with poor performance of the trained neural networks (NNs) at the inference stage. This is because synthetic data lack many of the realistic features embedded in real data, including an accurate waveform source signature, realistic noise, and accurate reflectivity. In other words, the real data set is far from being a sample from the distribution of the synthetic training set. Thus, we describe a novel approach to enhance our supervised neural network (NN) training on synthetic data with real data features (domain adaptation). Specifically, for tasks in which the absolute values of the vertical axis (time or depth) of the input section are not crucial to the prediction, like classification, or can be corrected after the prediction, like velocity model building using a well, we suggest a series of linear operations on the input to the network data so that the training and application data have similar distributions. This is accomplished by applying two operations on the input data to the NN, whether the input is from the synthetic or real data subset domain: (1) The crosscorrelation of the input data section (i.e., shot gather, seismic image, etc.) with a fixed-location reference trace from the input data section. (2) The convolution of the resulting data with the mean (or a random sample) of the autocorrelated sections from the other subset domain. In the training stage, the input data are from the synthetic subset domain and the auto-corrected (we crosscorrelate each trace with itself) sections are from the real subset domain, and the random selection of sections from the real data is implemented at every epoch of the training. In the inference/application stage, the input data are from the real subset domain and the mean of the autocorrelated sections are from the synthetic data subset domain. Example applications on passive seismic data for microseismic event source location determination and on active seismic data for predicting low frequencies are used to demonstrate the power of this approach in improving the applicability of our trained NNs to real data.</p></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"3 ","pages":"Pages 101-114"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666544122000260/pdfft?md5=3e63a5c64f3830cf6afacef439cdef2b&pid=1-s2.0-S2666544122000260-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91696854","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Advanced geochemical exploration knowledge using machine learning: Prediction of unknown elemental concentrations and operational prioritization of Re-analysis campaigns 利用机器学习的先进地球化学勘探知识:预测未知元素浓度和重新分析活动的操作优先级

Artificial Intelligence in Geosciences

Pub Date : 2022-12-01 DOI: 10.1016/j.aiig.2022.10.003

Steven E. Zhang , Julie E. Bourdeau , Glen T. Nwaila , Yousef Ghorbani

In exploration geochemistry, advances in the detection limit, breadth of elements analyze-able, accuracy and precision of analytical instruments have motivated the re-analysis of legacy samples to improve confidence in geochemical data and gain more insights into potentially mineralized areas. While a re-analysis campaign in a geochemical exploration program modernizes legacy geochemical data by providing more trustworthy and higher-dimensional geochemical data, especially where modern data is considerably different than legacy data, it is an expensive exercise. The risk associated with modernizing such legacy data lies within its uncertainty in return (e.g., the possibility of new discoveries, in primarily greenfield settings). Without any advanced knowledge of yet unanalyzed elements, the importance of re-analyses remains ambiguous. To address this uncertainty, we apply machine learning to multivariate geochemical data from different regions in Canada (i.e., the Churchill Province and the Trans-Hudson Orogen) in order to use legacy geochemical data to predict modern and higher dimensional multi-elemental concentrations ahead of planned re-analyses. Our study demonstrates that legacy and modern geochemical data can be repurposed to predict yet unanalyzed elements that will be realized from re-analyses and in a manner that significantly reduces the latency to downstream usage of modern geochemical data (e.g., prospectivity mapping). Findings from this study serve as a pillar of a framework for exploration geologists to predictively explore and prioritize potentially mineralized districts for further prospects in a timely manner before employing more invasive and expensive techniques.

在勘探地球化学领域，探测极限、可分析元素的广度、分析仪器的准确度和精密度等方面的进步，推动了对遗留样品的重新分析，以提高地球化学数据的可信度，并对潜在矿化区有更多的了解。虽然地球化学勘探项目中的重新分析活动通过提供更可靠、更高维的地球化学数据，使遗留地球化学数据现代化，特别是在现代数据与遗留数据有很大不同的情况下，这是一项昂贵的工作。对这些遗留数据进行现代化改造的风险在于其回报的不确定性(例如，新发现的可能性，主要是在绿地环境中)。由于对尚未分析的元素没有任何先进的知识，重新分析的重要性仍然模糊不清。为了解决这一不确定性，我们将机器学习应用于加拿大不同地区(即丘吉尔省和跨哈德逊造山带)的多元地球化学数据，以便在计划的重新分析之前使用传统的地球化学数据来预测现代和更高维度的多元素浓度。我们的研究表明，遗留的和现代的地球化学数据可以被重新利用来预测尚未分析的元素，这些元素将通过重新分析来实现，并且以一种显著减少现代地球化学数据下游使用的延迟的方式(例如，远景图)。这项研究的发现为勘探地质学家提供了一个框架的支柱，以便在采用更具侵入性和昂贵的技术之前，及时对潜在矿化地区进行预测勘探和优先排序，以便进一步进行勘探。

{"title":"Advanced geochemical exploration knowledge using machine learning: Prediction of unknown elemental concentrations and operational prioritization of Re-analysis campaigns","authors":"Steven E. Zhang , Julie E. Bourdeau , Glen T. Nwaila , Yousef Ghorbani","doi":"10.1016/j.aiig.2022.10.003","DOIUrl":"https://doi.org/10.1016/j.aiig.2022.10.003","url":null,"abstract":"<div><p>In exploration geochemistry, advances in the detection limit, breadth of elements analyze-able, accuracy and precision of analytical instruments have motivated the re-analysis of legacy samples to improve confidence in geochemical data and gain more insights into potentially mineralized areas. While a re-analysis campaign in a geochemical exploration program modernizes legacy geochemical data by providing more trustworthy and higher-dimensional geochemical data, especially where modern data is considerably different than legacy data, it is an expensive exercise. The risk associated with modernizing such legacy data lies within its uncertainty in return (e.g., the possibility of new discoveries, in primarily greenfield settings). Without any advanced knowledge of yet unanalyzed elements, the importance of re-analyses remains ambiguous. To address this uncertainty, we apply machine learning to multivariate geochemical data from different regions in Canada (i.e., the Churchill Province and the Trans-Hudson Orogen) in order to use legacy geochemical data to predict modern and higher dimensional multi-elemental concentrations ahead of planned re-analyses. Our study demonstrates that legacy and modern geochemical data can be repurposed to predict yet unanalyzed elements that will be realized from re-analyses and in a manner that significantly reduces the latency to downstream usage of modern geochemical data (e.g., prospectivity mapping). Findings from this study serve as a pillar of a framework for exploration geologists to predictively explore and prioritize potentially mineralized districts for further prospects in a timely manner before employing more invasive and expensive techniques.</p></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"3 ","pages":"Pages 86-100"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666544122000296/pdfft?md5=115f57a35bc434c4294614fe797ddff6&pid=1-s2.0-S2666544122000296-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91696855","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 5

A convolutional recurrent neural network for strong convective rainfall nowcasting using weather radar data in Southeastern Brazil 基于气象雷达资料的巴西东南部强对流降雨临近预报的卷积递归神经网络

Artificial Intelligence in Geosciences

Pub Date : 2022-12-01 DOI: 10.1016/j.aiig.2022.06.001

Angelica N. Caseri , Leonardo Bacelar Lima Santos , Stephan Stephany

Strong convective systems and the associated heavy rainfall events can trig-ger floods and landslides with severe detrimental consequences. These events have a high spatio-temporal variability, being difficult to predict by standard meteorological numerical models. This work proposes the M5Images method for performing the very short-term prediction (nowcasting) of heavy convective rainfall using weather radar data by means of a convolutional recurrent neural network. The recurrent part of it is a Long Short-Term Memory (LSTM) neural network. Prediction tests were performed for the city and surroundings of Campinas, located in the Southeastern Brazil. The convolutional recurrent neural network was trained using time series of rainfall rate images derived from weather radar data for a selected set of heavy rainfall events. The attained pre-diction performance was better than that given by the persistence forecasting method for different prediction times.

强对流系统和相关的强降雨事件可能引发洪水和山体滑坡，造成严重的有害后果。这些事件具有很高的时空变异性，难以用标准气象数值模式进行预测。本研究提出了M5Images方法，利用气象雷达数据，通过卷积递归神经网络对强对流降雨进行极短期预测(临近预报)。它的循环部分是一个长短期记忆(LSTM)神经网络。对位于巴西东南部的坎皮纳斯市及其周边地区进行了预测测试。卷积递归神经网络是使用来自天气雷达数据的降雨率图像的时间序列来训练一组选定的暴雨事件。在不同的预测时间下，所获得的预测效果优于持续性预测方法。

{"title":"A convolutional recurrent neural network for strong convective rainfall nowcasting using weather radar data in Southeastern Brazil","authors":"Angelica N. Caseri , Leonardo Bacelar Lima Santos , Stephan Stephany","doi":"10.1016/j.aiig.2022.06.001","DOIUrl":"https://doi.org/10.1016/j.aiig.2022.06.001","url":null,"abstract":"<div><p>Strong convective systems and the associated heavy rainfall events can trig-ger floods and landslides with severe detrimental consequences. These events have a high spatio-temporal variability, being difficult to predict by standard meteorological numerical models. This work proposes the M5Images method for performing the very short-term prediction (nowcasting) of heavy convective rainfall using weather radar data by means of a convolutional recurrent neural network. The recurrent part of it is a Long Short-Term Memory (LSTM) neural network. Prediction tests were performed for the city and surroundings of Campinas, located in the Southeastern Brazil. The convolutional recurrent neural network was trained using time series of rainfall rate images derived from weather radar data for a selected set of heavy rainfall events. The attained pre-diction performance was better than that given by the persistence forecasting method for different prediction times.</p></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"3 ","pages":"Pages 8-13"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666544122000211/pdfft?md5=e23aece2442afd1d3bbbab2bff69ba36&pid=1-s2.0-S2666544122000211-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91776947","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

ResGraphNet: GraphSAGE with embedded residual module for prediction of global monthly mean temperature ResGraphNet:内置残差模块的GraphSAGE，用于预测全球月平均温度

Artificial Intelligence in Geosciences

Pub Date : 2022-12-01 DOI: 10.1016/j.aiig.2022.11.001

Ziwei Chen , Zhiguo Wang , Yang Yang , Jinghuai Gao

Data-driven prediction of time series is significant in many scientific research fields such as global climate change and weather forecast. For global monthly mean temperature series, considering the strong potential of deep neural network for extracting data features, this paper proposes a data-driven model, ResGraphNet, which improves the prediction accuracy of time series by an embedded residual module in GraphSAGE layers. The experimental results of a global mean temperature dataset, HadCRUT5, show that compared with 11 traditional prediction technologies, the proposed ResGraphNet obtains the best accuracy. The error indicator predicted by the proposed ResGraphNet is smaller than that of the other 11 prediction models. Furthermore, the performance on seven temperature datasets shows the excellent generalization of the ResGraphNet. Finally, based on our proposed ResGraphNet, the predicted 2022 annual anomaly of global temperature is 0.74722 °C, which provides confidence for limiting warming to 1.5 °C above pre-industrial levels.

时间序列数据驱动预测在全球气候变化、天气预报等诸多科学研究领域具有重要意义。对于全球月平均温度序列，考虑到深度神经网络在提取数据特征方面的强大潜力，本文提出了一种数据驱动模型ResGraphNet，该模型通过在GraphSAGE层中嵌入残差模块来提高时间序列的预测精度。在全球平均温度数据集HadCRUT5上的实验结果表明，与11种传统预测技术相比，本文提出的ResGraphNet的预测精度最高。本文提出的ResGraphNet预测的误差指标小于其他11种预测模型。此外，在7个温度数据集上的性能显示了ResGraphNet的良好泛化效果。最后，基于我们提出的ResGraphNet，预测2022年全球温度的年距平为0.74722°C，这为将升温限制在比工业化前水平高1.5°C提供了信心。

{"title":"ResGraphNet: GraphSAGE with embedded residual module for prediction of global monthly mean temperature","authors":"Ziwei Chen , Zhiguo Wang , Yang Yang , Jinghuai Gao","doi":"10.1016/j.aiig.2022.11.001","DOIUrl":"10.1016/j.aiig.2022.11.001","url":null,"abstract":"<div><p>Data-driven prediction of time series is significant in many scientific research fields such as global climate change and weather forecast. For global monthly mean temperature series, considering the strong potential of deep neural network for extracting data features, this paper proposes a data-driven model, ResGraphNet, which improves the prediction accuracy of time series by an embedded residual module in GraphSAGE layers. The experimental results of a global mean temperature dataset, HadCRUT5, show that compared with 11 traditional prediction technologies, the proposed ResGraphNet obtains the best accuracy. The error indicator predicted by the proposed ResGraphNet is smaller than that of the other 11 prediction models. Furthermore, the performance on seven temperature datasets shows the excellent generalization of the ResGraphNet. Finally, based on our proposed ResGraphNet, the predicted 2022 annual anomaly of global temperature is 0.74722 °C, which provides confidence for limiting warming to 1.5 °C above pre-industrial levels.</p></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"3 ","pages":"Pages 148-156"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666544122000314/pdfft?md5=ee2f07a7b856a9a9839f99750242e44a&pid=1-s2.0-S2666544122000314-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85523982","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

Integrating the artificial intelligence and hybrid machine learning algorithms for improving the accuracy of spatial prediction of landslide hazards in Kurseong Himalayan Region 结合人工智能和混合机器学习算法提高库城喜马拉雅地区滑坡灾害空间预测精度

Artificial Intelligence in Geosciences

Pub Date : 2022-12-01 DOI: 10.1016/j.aiig.2022.06.002

Anik Saha, Sunil Saha

The aim of the current work is to compare susceptibility maps of landslides produced using machine learning techniques i.e. multilayer perception neural nets (MLP), kernel logistic regression (KLR), random forest (RF), and multivariate adaptive regression splines (MARS); novel ensemble approaches i.e. MLP-Bagging, KLR-Bagging, RF-Bagging and MARS-Bagging in the Kurseong-Himalayan region. For the ensemble models the RF, KLR, MLP and MARS were used as base classifiers, and Bagging was used as meta classifier. Another objective of the current work is to introduce and evaluate the effectiveness of the novel KLR-Bagging and MARS-Bagging ensembles in susceptibility to landslide. Compiling 303 landslide locations to calibrate and test the models, an inventory map was created. Eighteen LCFs were chosen using the Relief-F and multi-collinearity tests for mapping the landslide susceptibility. Applying receiver operating characteristic (ROC), precision, accuracy, incorrectly categorized proportion, mean-absolute-error (MAE), and root-mean-square-error (RMSE), the LSMs were subsequently verified. The different validation results showed RF-Bagging (AUC training 88.69% & testing 92.28%) with ensemble Meta classifier gives better performance than the MLP, KLR, RF, MARS, MLP-Bagging, KLR-Bagging, and MARS-Bagging based LSMs. RF model showed that the slope, altitude, rainfall, and geomorphology played the most vital role in landslide occurrence comparing the other LCFs. These results will help to reduce the losses caused by the landslides in the Kurseong and in other areas where geo-environmental and geological conditions more or less similar.

当前工作的目的是比较使用机器学习技术(即多层感知神经网络(MLP)、核逻辑回归(KLR)、随机森林(RF)和多变量自适应回归样条(MARS))产生的滑坡易感性图;新集合方法:MLP-Bagging、KLR-Bagging、RF-Bagging和MARS-Bagging。对于集成模型，使用RF、KLR、MLP和MARS作为基本分类器，使用Bagging作为元分类器。当前工作的另一个目标是介绍和评估新的KLR-Bagging和MARS-Bagging组合在滑坡敏感性方面的有效性。编制了303个滑坡位置来校准和测试模型，并创建了清单地图。采用Relief-F和多重共线性试验选择了18个lcf来绘制滑坡敏感性图。应用受试者工作特征(ROC)、精密度、准确度、不正确分类比例、平均绝对误差(MAE)和均方根误差(RMSE)对lsm进行验证。不同的验证结果显示:RF-Bagging (AUC training 88.69%;使用集成元分类器进行92.28%的测试，结果优于基于MLP、KLR、RF、MARS、MLP- bagging、KLR- bagging和MARS- bagging的lsm。RF模型表明，坡度、海拔、降雨量和地貌对滑坡发生的影响最为重要。这些结果将有助于减少龟城地区和其他地质环境和地质条件大致相似的地区的滑坡造成的损失。

{"title":"Integrating the artificial intelligence and hybrid machine learning algorithms for improving the accuracy of spatial prediction of landslide hazards in Kurseong Himalayan Region","authors":"Anik Saha, Sunil Saha","doi":"10.1016/j.aiig.2022.06.002","DOIUrl":"10.1016/j.aiig.2022.06.002","url":null,"abstract":"<div><p>The aim of the current work is to compare susceptibility maps of landslides produced using machine learning techniques i.e. multilayer perception neural nets (MLP), kernel logistic regression (KLR), random forest (RF), and multivariate adaptive regression splines (MARS); novel ensemble approaches i.e. MLP-Bagging, KLR-Bagging, RF-Bagging and MARS-Bagging in the Kurseong-Himalayan region. For the ensemble models the RF, KLR, MLP and MARS were used as base classifiers, and Bagging was used as meta classifier. Another objective of the current work is to introduce and evaluate the effectiveness of the novel KLR-Bagging and MARS-Bagging ensembles in susceptibility to landslide. Compiling 303 landslide locations to calibrate and test the models, an inventory map was created. Eighteen LCFs were chosen using the Relief-F and multi-collinearity tests for mapping the landslide susceptibility. Applying receiver operating characteristic (ROC), precision, accuracy, incorrectly categorized proportion, mean-absolute-error (MAE), and root-mean-square-error (RMSE), the LSMs were subsequently verified. The different validation results showed RF-Bagging (AUC training 88.69% & testing 92.28%) with ensemble Meta classifier gives better performance than the MLP, KLR, RF, MARS, MLP-Bagging, KLR-Bagging, and MARS-Bagging based LSMs. RF model showed that the slope, altitude, rainfall, and geomorphology played the most vital role in landslide occurrence comparing the other LCFs. These results will help to reduce the losses caused by the landslides in the Kurseong and in other areas where geo-environmental and geological conditions more or less similar.</p></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"3 ","pages":"Pages 14-27"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666544122000223/pdfft?md5=0375259296fcb4ea2649acf35b8f7633&pid=1-s2.0-S2666544122000223-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73739288","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 7

Artificial intelligence-based anomaly detection of the Assen iron deposit in South Africa using remote sensing data from the Landsat-8 Operational Land Imager 基于人工智能的南非Assen铁矿异常检测，使用来自Landsat-8操作陆地成像仪的遥感数据

Artificial Intelligence in Geosciences

Pub Date : 2022-12-01 DOI: 10.1016/j.aiig.2022.10.001

Glen T. Nwaila , Steven E. Zhang , Julie E. Bourdeau , Yousef Ghorbani , Emmanuel John M. Carranza

Most known mineral deposits were discovered by accident using expensive, time-consuming, and knowledge-based methods such as stream sediment geochemical data, diamond drilling, reconnaissance geochemical and geophysical surveys, and/or remote sensing. Recent years have seen a decrease in the number of newly discovered mineral deposits and a rise in demand for critical raw materials, prompting exploration geologists to seek more efficient and inventive ways for processing various data types at different phases of mineral exploration. Remote sensing is one of the most sought-after tools for early-phase mineral prospecting because of its broad coverage and low cost. Remote sensing images from satellites are publicly available and can be utilised for lithological mapping and mineral exploitation. In this study, we extend an artificial intelligence-based, unsupervised anomaly detection method to identify iron deposit occurrence using Landsat-8 Operational Land Imager (OLI) satellite imagery and machine learning. The novelty in our method includes: (1) knowledge-guided and unsupervised anomaly detection that does not assume any specific anomaly signatures; (2) detection of anomalies occurs only in the variable domain; and (3) a choice of a range of machine learning algorithms to balance between explain-ability and performance. Our new unsupervised method detects anomalies through three successive stages, namely (a) stage I – acquisition of satellite imagery, data processing and selection of bands, (b) stage II – predictive modelling and anomaly detection, and (c) stage III – construction of anomaly maps and analysis. In this study, the new method was tested over the Assen iron deposit in the Transvaal Supergroup (South Africa). It detected both the known areas of the Assen iron deposit and additional deposit occurrence features around the Assen iron mine that were not known. To summarise the anomalies in the area, principal component analysis was used on the reconstruction errors across all modelled bands. Our method enhanced the Assen deposit as an anomaly and attenuated the background, including anthropogenic structural anomalies, which resulted in substantially improved visual contrast and delineation of the iron deposit relative to the background. The results demonstrate the robustness of the proposed unsupervised anomaly detection method, and it could be useful for the delineation of mineral exploration targets. In particular, the method will be useful in areas where no data labels exist regarding the existence or specific spectral signatures of anomalies, such as mineral deposits under greenfield exploration.

大多数已知的矿床都是通过昂贵、耗时和基于知识的方法偶然发现的，例如水系沉积物地球化学数据、钻石钻探、侦察地球化学和地球物理调查以及/或遥感。近年来，新发现的矿藏数量减少，对关键原材料的需求增加，促使勘探地质学家寻求更有效和更创新的方法来处理矿产勘探不同阶段的各种数据类型。遥感因其覆盖范围广、成本低而成为早期矿产勘查最受欢迎的工具之一。来自卫星的遥感图像是公开的，可用于岩性测绘和矿物开采。在这项研究中，我们扩展了一种基于人工智能的无监督异常检测方法，利用Landsat-8操作陆地成像仪(OLI)卫星图像和机器学习来识别铁矿的存在。该方法的新颖性包括:(1)不假设任何特定异常特征的知识引导和无监督异常检测;(2)异常检测只发生在变量域中;(3)选择一系列机器学习算法来平衡可解释性和性能。我们的新方法通过三个连续的阶段来检测异常，即(a)第一阶段-获取卫星图像，数据处理和选择波段，(b)第二阶段-预测建模和异常检测，以及(c)第三阶段-构建异常图和分析。在这项研究中，新方法在德兰士瓦超级群(南非)的Assen铁矿床上进行了测试。它探测到了Assen铁矿的已知区域和Assen铁矿周围未知的额外矿床赋存特征。为了总结该区域的异常，对所有模拟波段的重建误差进行了主成分分析。我们的方法增强了Assen矿床的异常性，减弱了背景(包括人为构造异常)，从而大大改善了相对于背景的视觉对比度和铁矿圈定。结果表明，所提出的无监督异常检测方法具有较好的鲁棒性，可用于矿产勘查目标的圈定。特别是，该方法将在没有数据标签的地区有用，这些地区没有关于异常存在或特定光谱特征的数据标签，例如在绿地勘探下的矿床。

{"title":"Artificial intelligence-based anomaly detection of the Assen iron deposit in South Africa using remote sensing data from the Landsat-8 Operational Land Imager","authors":"Glen T. Nwaila , Steven E. Zhang , Julie E. Bourdeau , Yousef Ghorbani , Emmanuel John M. Carranza","doi":"10.1016/j.aiig.2022.10.001","DOIUrl":"10.1016/j.aiig.2022.10.001","url":null,"abstract":"<div><p>Most known mineral deposits were discovered by accident using expensive, time-consuming, and knowledge-based methods such as stream sediment geochemical data, diamond drilling, reconnaissance geochemical and geophysical surveys, and/or remote sensing. Recent years have seen a decrease in the number of newly discovered mineral deposits and a rise in demand for critical raw materials, prompting exploration geologists to seek more efficient and inventive ways for processing various data types at different phases of mineral exploration. Remote sensing is one of the most sought-after tools for early-phase mineral prospecting because of its broad coverage and low cost. Remote sensing images from satellites are publicly available and can be utilised for lithological mapping and mineral exploitation. In this study, we extend an artificial intelligence-based, unsupervised anomaly detection method to identify iron deposit occurrence using Landsat-8 Operational Land Imager (OLI) satellite imagery and machine learning. The novelty in our method includes: (1) knowledge-guided and unsupervised anomaly detection that does not assume any specific anomaly signatures; (2) detection of anomalies occurs only in the variable domain; and (3) a choice of a range of machine learning algorithms to balance between explain-ability and performance. Our new unsupervised method detects anomalies through three successive stages, namely (a) stage I – acquisition of satellite imagery, data processing and selection of bands, (b) stage II – predictive modelling and anomaly detection, and (c) stage III – construction of anomaly maps and analysis. In this study, the new method was tested over the Assen iron deposit in the Transvaal Supergroup (South Africa). It detected both the known areas of the Assen iron deposit and additional deposit occurrence features around the Assen iron mine that were not known. To summarise the anomalies in the area, principal component analysis was used on the reconstruction errors across all modelled bands. Our method enhanced the Assen deposit as an anomaly and attenuated the background, including anthropogenic structural anomalies, which resulted in substantially improved visual contrast and delineation of the iron deposit relative to the background. The results demonstrate the robustness of the proposed unsupervised anomaly detection method, and it could be useful for the delineation of mineral exploration targets. In particular, the method will be useful in areas where no data labels exist regarding the existence or specific spectral signatures of anomalies, such as mineral deposits under greenfield exploration.</p></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"3 ","pages":"Pages 71-85"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666544122000272/pdfft?md5=9ce5b9f88b3f1d81b5e69e17e51d8a1f&pid=1-s2.0-S2666544122000272-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89798656","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

Ensemble hybrid machine learning methods for gully erosion susceptibility mapping: K-fold cross validation approach 谷地侵蚀敏感性映射的集成混合机器学习方法:K-fold交叉验证方法

Artificial Intelligence in Geosciences

Pub Date : 2022-12-01 DOI: 10.1016/j.aiig.2022.07.001

Jagabandhu Roy, Sunil Saha

Gully erosion is one of the important problems creating barrier to agricultural development. The present research used the radial basis function neural network (RBFnn) and its ensemble with random sub-space (RSS) and rotation forest (RTF) ensemble Meta classifiers for the spatial mapping of gully erosion susceptibility (GES) in Hinglo river basin. 120 gullies were marked and grouped into four-fold. A total of 23 factors including topographical, hydrological, lithological, and soil physio-chemical properties were effectively used. GES maps were built by RBFnn, RSS-RBFnn, and RTF-RBFnn models. The very high susceptibility zone of RBFnn, RTF-RBFnn and RSS-RBFnn models covered 6.75%, 6.72% and 6.57% in Fold-1, 6.21%, 6.10% and 6.09% in Fold-2, 6.26%, 6.13% and 6.05% in Fold-3 and 7%, 6.975% and 6.42% in Fold-4 of the basin. Receiver operating characteristics (ROC) curve and statistical techniques such as mean-absolute-error (MAE), root-mean-absolute-error (RMSE) and relative gully density area (R-index) methods were used for evaluating the GES maps. The results of the ROC, MAE, RMSE and R-index methods showed that the models of susceptibility to gully erosion have excellent predictive efficiency. The simulation results based on machine learning are satisfactory and outstanding and could be used to forecast the areas vulnerable to gully erosion.

沟蚀是阻碍农业发展的重要问题之一。采用径向基函数神经网络(RBFnn)及其集合与随机子空间(RSS)和旋转森林(RTF)集合Meta分类器对兴洛河流域沟壑区侵蚀敏感性进行空间映射。120个沟壑被标记并分成四组。地形、水文、岩性、土壤理化性质等共23个因子被有效利用。采用RBFnn、RSS-RBFnn和RTF-RBFnn模型构建GES图谱。RBFnn、RTF-RBFnn和RSS-RBFnn模型的高敏感区分别为:Fold-1的6.75%、6.72%和6.57%，Fold-2的6.21%、6.10%和6.09%，Fold-3的6.26%、6.13%和6.05%，Fold-4的7%、6.975%和6.42%。采用受试者工作特征(ROC)曲线和平均绝对误差(MAE)、均方根绝对误差(RMSE)、相对沟密度面积(R-index)等统计技术对GES图谱进行评价。ROC、MAE、RMSE和r指数等方法的结果表明，该模型具有较好的预测效果。基于机器学习的模拟结果令人满意且突出，可用于易受沟壑区侵蚀的预测。

{"title":"Ensemble hybrid machine learning methods for gully erosion susceptibility mapping: K-fold cross validation approach","authors":"Jagabandhu Roy, Sunil Saha","doi":"10.1016/j.aiig.2022.07.001","DOIUrl":"10.1016/j.aiig.2022.07.001","url":null,"abstract":"<div><p>Gully erosion is one of the important problems creating barrier to agricultural development. The present research used the radial basis function neural network (RBFnn) and its ensemble with random sub-space (RSS) and rotation forest (RTF) ensemble Meta classifiers for the spatial mapping of gully erosion susceptibility (GES) in Hinglo river basin. 120 gullies were marked and grouped into four-fold. A total of 23 factors including topographical, hydrological, lithological, and soil physio-chemical properties were effectively used. GES maps were built by RBFnn, RSS-RBFnn, and RTF-RBFnn models. The very high susceptibility zone of RBFnn, RTF-RBFnn and RSS-RBFnn models covered 6.75%, 6.72% and 6.57% in Fold-1, 6.21%, 6.10% and 6.09% in Fold-2, 6.26%, 6.13% and 6.05% in Fold-3 and 7%, 6.975% and 6.42% in Fold-4 of the basin. Receiver operating characteristics (ROC) curve and statistical techniques such as mean-absolute-error (MAE), root-mean-absolute-error (RMSE) and relative gully density area (R-index) methods were used for evaluating the GES maps. The results of the ROC, MAE, RMSE and R-index methods showed that the models of susceptibility to gully erosion have excellent predictive efficiency. The simulation results based on machine learning are satisfactory and outstanding and could be used to forecast the areas vulnerable to gully erosion.</p></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"3 ","pages":"Pages 28-45"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666544122000235/pdfft?md5=fa746e3cb56d5094abe0b3f54d826092&pid=1-s2.0-S2666544122000235-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89849616","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 8

Synthetic shear sonic log generation utilizing hybrid machine learning techniques 利用混合机器学习技术生成合成剪切声波测井

Artificial Intelligence in Geosciences

Pub Date : 2022-12-01 DOI: 10.1016/j.aiig.2022.09.001

Jongkook Kim

Compressional and shear sonic logs (DTC and DTS, respectively) are one of the effective means for determining petrophysical/geomechanical properties. However, the DTS log has limited availability mainly due to high acquisition costs. This study introduces a hybrid machine learning approach to generating synthetic DTS logs. Five wireline logs such as gamma ray (GR), density (RHOB), neutron porosity (NPHI), deep resistivity (Rt), and DTS logs are used as input data for three supervised-machine learning models including support vector machine for regression (SVR), deep neural network (DNN), and long short-term memory (LSTM). The hybrid machine learning model utilizes two additional techniques. First, as an unsupervised-learning approach, data clustering is integrated with general machine learning models for the purpose of improving model accuracy. All the machine learning models using the data-clustered approach show higher accuracies in predicting target (DTS) values, compared to non-clustered models. Second, particle swarm optimization (PSO) is combined with the models to determine optimal hyperparameters. The PSO algorithm proves time-effective, automated advantages as it gets feedback from previous computations so that is able to narrow down candidates for optimal hyperparameters. Compared to previous studies focusing on the performance comparison among machine learning algorithms, this study introduces an advanced approach to further improve the performance by integrating the unsupervised learning technique and PSO optimization with the general models. Based on this study result, we recommend the hybrid machine learning approach for improving the reliability and efficiency of synthetic log generation.

压缩和剪切声波测井(分别为DTC和DTS)是确定岩石物理/地质力学性质的有效手段之一。然而，DTS日志的可用性有限，主要是由于高昂的获取成本。本研究介绍了一种混合机器学习方法来生成合成DTS日志。五种电缆测井数据，如伽马射线(GR)、密度(RHOB)、中子孔隙度(NPHI)、深部电阻率(Rt)和DTS测井，被用作三种监督机器学习模型的输入数据，包括回归支持向量机(SVR)、深度神经网络(DNN)和长短期记忆(LSTM)。混合机器学习模型利用了另外两种技术。首先，作为一种无监督学习方法，数据聚类与一般机器学习模型相结合，以提高模型的准确性。与非聚类模型相比，所有使用数据聚类方法的机器学习模型在预测目标(DTS)值方面都显示出更高的准确性。其次，将粒子群算法与模型相结合，确定最优超参数。粒子群算法证明了它的时效性和自动化优势，因为它从以前的计算中得到反馈，因此能够缩小最佳超参数的候选范围。与以往的研究关注机器学习算法之间的性能比较相比，本研究引入了一种先进的方法，通过将无监督学习技术和粒子群优化与一般模型相结合，进一步提高机器学习算法的性能。基于此研究结果，我们推荐混合机器学习方法来提高合成日志生成的可靠性和效率。

{"title":"Synthetic shear sonic log generation utilizing hybrid machine learning techniques","authors":"Jongkook Kim","doi":"10.1016/j.aiig.2022.09.001","DOIUrl":"https://doi.org/10.1016/j.aiig.2022.09.001","url":null,"abstract":"<div><p>Compressional and shear sonic logs (DTC and DTS, respectively) are one of the effective means for determining petrophysical/geomechanical properties. However, the DTS log has limited availability mainly due to high acquisition costs. This study introduces a hybrid machine learning approach to generating synthetic DTS logs. Five wireline logs such as gamma ray (GR), density (RHOB), neutron porosity (NPHI), deep resistivity (Rt), and DTS logs are used as input data for three supervised-machine learning models including support vector machine for regression (SVR), deep neural network (DNN), and long short-term memory (LSTM). The hybrid machine learning model utilizes two additional techniques. First, as an unsupervised-learning approach, data clustering is integrated with general machine learning models for the purpose of improving model accuracy. All the machine learning models using the data-clustered approach show higher accuracies in predicting target (DTS) values, compared to non-clustered models. Second, particle swarm optimization (PSO) is combined with the models to determine optimal hyperparameters. The PSO algorithm proves time-effective, automated advantages as it gets feedback from previous computations so that is able to narrow down candidates for optimal hyperparameters. Compared to previous studies focusing on the performance comparison among machine learning algorithms, this study introduces an advanced approach to further improve the performance by integrating the unsupervised learning technique and PSO optimization with the general models. Based on this study result, we recommend the hybrid machine learning approach for improving the reliability and efficiency of synthetic log generation.</p></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"3 ","pages":"Pages 53-70"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666544122000259/pdfft?md5=f8d8c2ffcf15e6348a6ff164b1ab9e0a&pid=1-s2.0-S2666544122000259-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91696381","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A new correlation for calculating wellhead oil flow rate using artificial neural network 用人工神经网络计算井口油流量的一种新关联

Artificial Intelligence in Geosciences

Pub Date : 2022-12-01 DOI: 10.1016/j.aiig.2022.04.001

Reda Abdel Azim

A separator and multiphase flow meters are considered the most accurate tools used to measure the surface oil flow rates. However, these tools are expensive and time consuming. Thus, this study aims to develop a correlation for accurate and quick evaluation of well surface flow rates and consequently the well inflow performance relationship. In order to achieve the abovementioned aim, this study uses artificial neural network (ANN) for flow rates prediction particularly in artificial lifted wells especially in the absence of wellhead pressure data. The ANN model is developed and validated by utilizing 350 data points collected from numerous fields located in Nile Delta and Western Desert of Egypt with inputs include; wellhead temperature, gas liquid ratio, water cut, surface and bottomhole temperatures, water cut, surface production rates, tubing cross section area, and well depth. The results of this study show that, the collected data are distributed as follows; 60% for training, 30% for testing and 10% for the validation processes with R² of 0.96 and mean square error (MSE) of 0.02. A comparison study is implemented between the new ANN correlation and other published correlations (Gilbert, Ros and Achong correlations) to show the robustness of the developed correlation. The results show that the developed correlation able to predict oil flow rates accurately with the lowest mean square error.

分离器和多相流量计被认为是测量地面油流量最精确的工具。然而，这些工具既昂贵又耗时。因此，本研究旨在建立一种关系式，以便准确、快速地评价井面流量，从而得出井流入动态关系。为了达到上述目的，本研究将人工神经网络(ANN)用于人工举升井的流量预测，特别是在没有井口压力数据的情况下。利用从尼罗河三角洲和埃及西部沙漠的多个油田收集的350个数据点开发和验证了人工神经网络模型，输入包括;井口温度、气液比、含水率、地面和井底温度、含水率、地面产量、油管横截面面积和井深。本研究结果表明，收集到的数据分布如下:60%用于培训，30%用于测试，10%用于验证过程，R2为0.96，均方误差(MSE)为0.02。将新的人工神经网络相关性与其他已发表的相关性(Gilbert, Ros和Achong相关性)进行比较研究，以显示所开发的相关性的稳健性。结果表明，所建立的相关关系能够以最小的均方误差准确预测油流量。

{"title":"A new correlation for calculating wellhead oil flow rate using artificial neural network","authors":"Reda Abdel Azim","doi":"10.1016/j.aiig.2022.04.001","DOIUrl":"https://doi.org/10.1016/j.aiig.2022.04.001","url":null,"abstract":"<div><p>A separator and multiphase flow meters are considered the most accurate tools used to measure the surface oil flow rates. However, these tools are expensive and time consuming. Thus, this study aims to develop a correlation for accurate and quick evaluation of well surface flow rates and consequently the well inflow performance relationship. In order to achieve the abovementioned aim, this study uses artificial neural network (ANN) for flow rates prediction particularly in artificial lifted wells especially in the absence of wellhead pressure data. The ANN model is developed and validated by utilizing 350 data points collected from numerous fields located in Nile Delta and Western Desert of Egypt with inputs include; wellhead temperature, gas liquid ratio, water cut, surface and bottomhole temperatures, water cut, surface production rates, tubing cross section area, and well depth. The results of this study show that, the collected data are distributed as follows; 60% for training, 30% for testing and 10% for the validation processes with R<sup>2</sup> of 0.96 and mean square error (MSE) of 0.02. A comparison study is implemented between the new ANN correlation and other published correlations (Gilbert, Ros and Achong correlations) to show the robustness of the developed correlation. The results show that the developed correlation able to predict oil flow rates accurately with the lowest mean square error.</p></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"3 ","pages":"Pages 1-7"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S266654412200020X/pdfft?md5=2c091668ca45ce23b755dc40f668900f&pid=1-s2.0-S266654412200020X-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91696383","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Attenuation of seismic migration smile artifacts with deep learning 基于深度学习的地震偏移微笑伪影衰减

Artificial Intelligence in Geosciences

Pub Date : 2022-12-01 DOI: 10.1016/j.aiig.2022.11.002

Jewoo Yoo, Paul Zwartjes

Attenuation of migration artifacts on Kirchhoff migrated seismic data can be challenging due to the relatively low amplitude of migration artifacts compared to reflections as well as the overlap in the kinematics of reflection and migration smiles. Several ‘conventional’ filtering methods exist and recently deep learning based workflows have been proposed. A deep learning workflow can be a simple and fast alternative to existing methods. In case of supervised training of a deep neural network using training data made by physics-based modelling or actual migrations is expensive and lacks diversity in terms of noise, amplitude, frequency content and wavelet. This can result in poor generalization beyond the training data without re-training and transfer learning. In this paper we demonstrate successful applications of migration smile separation using a conventional U-net architecture. The novelty in our approach is that we do not use synthetic data created from physics-based modelling, but instead use only synthetic data build form basic geometric shapes. Our domain of application is the migrated common offset domain, or simply the stack of the pre-stack migrated data, where reflections resemble local geology and migration smiles are upward convex hyperbolic patterns. Both patterns were randomly perturbed in many ways while maintaining their intrinsic features. This approach is inspired by the common practice of data augmentation in deep learning for machine vision applications. Since many of the standard data augmentation techniques lack a geophysical motivation, we have instead perturbed our synthetic training data in ways to make more sense for a signal processing perspective or given our ‘domain knowledge’ of the problem at hand. We did not have to retrain the network to produce good results on the field dataset. The large variety and diversity in examples enabled the trained neural network to show encouraging results on synthetic and field datasets that were not used in training.

Kirchhoff偏移地震数据上偏移伪影的衰减可能具有挑战性，因为与反射相比，偏移伪影的振幅相对较低，并且反射和偏移的运动学重叠。存在几种“传统”过滤方法，最近提出了基于深度学习的工作流。深度学习工作流可以是现有方法的一种简单快速的替代方法。在使用基于物理建模或实际迁移的训练数据对深度神经网络进行监督训练的情况下，成本高昂，并且在噪声、幅度、频率内容和小波方面缺乏多样性。如果没有重新训练和迁移学习，这可能导致训练数据之外的不良泛化。在本文中，我们展示了使用传统U-net架构的迁移微笑分离的成功应用。我们方法的新颖之处在于，我们不使用基于物理建模的合成数据，而是只使用从基本几何形状构建的合成数据。我们的应用领域是迁移的共同偏移域，或者简单地说就是叠前迁移数据的堆栈，其中反射类似于当地地质，迁移微笑是向上凸双曲模式。这两种模式在保持其固有特征的同时，在许多方面受到随机干扰。这种方法的灵感来自于机器视觉应用中深度学习中数据增强的常见实践。由于许多标准的数据增强技术缺乏地球物理动机，我们转而以某种方式干扰我们的合成训练数据，以使信号处理角度或给定我们手头问题的“领域知识”更有意义。我们不需要重新训练网络来在现场数据集上产生良好的结果。示例的多样性和多样性使训练后的神经网络在未用于训练的合成和现场数据集上显示出令人鼓舞的结果。

{"title":"Attenuation of seismic migration smile artifacts with deep learning","authors":"Jewoo Yoo, Paul Zwartjes","doi":"10.1016/j.aiig.2022.11.002","DOIUrl":"10.1016/j.aiig.2022.11.002","url":null,"abstract":"<div><p>Attenuation of migration artifacts on Kirchhoff migrated seismic data can be challenging due to the relatively low amplitude of migration artifacts compared to reflections as well as the overlap in the kinematics of reflection and migration smiles. Several ‘conventional’ filtering methods exist and recently deep learning based workflows have been proposed. A deep learning workflow can be a simple and fast alternative to existing methods. In case of supervised training of a deep neural network using training data made by physics-based modelling or actual migrations is expensive and lacks diversity in terms of noise, amplitude, frequency content and wavelet. This can result in poor generalization beyond the training data without re-training and transfer learning. In this paper we demonstrate successful applications of migration smile separation using a conventional U-net architecture. The novelty in our approach is that we do not use synthetic data created from physics-based modelling, but instead use only synthetic data build form basic geometric shapes. Our domain of application is the migrated common offset domain, or simply the stack of the pre-stack migrated data, where reflections resemble local geology and migration smiles are upward convex hyperbolic patterns. Both patterns were randomly perturbed in many ways while maintaining their intrinsic features. This approach is inspired by the common practice of data augmentation in deep learning for machine vision applications. Since many of the standard data augmentation techniques lack a geophysical motivation, we have instead perturbed our synthetic training data in ways to make more sense for a signal processing perspective or given our ‘domain knowledge’ of the problem at hand. We did not have to retrain the network to produce good results on the field dataset. The large variety and diversity in examples enabled the trained neural network to show encouraging results on synthetic and field datasets that were not used in training.</p></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"3 ","pages":"Pages 123-131"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666544122000302/pdfft?md5=43a70c119d2af5e0b7b62e57e6c51e6a&pid=1-s2.0-S2666544122000302-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86628522","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0