{"title":"Reconstruction of lithofacies using a supervised Self-Organizing Map: Application in pseudo-wells based on a synthetic geologic cross-section","authors":"Carreira V.R. , Bijani R. , Ponte-Neto C.F.","doi":"10.1016/j.aiig.2024.100072","DOIUrl":null,"url":null,"abstract":"<div><p>Recently, machine learning (ML) has been considered a powerful technological element of different society areas. To transform the computer into a decision maker, several sophisticated methods and algorithms are constantly created and analyzed. In geophysics, both supervised and unsupervised ML methods have dramatically contributed to the development of seismic and well-log data interpretation. In well-logging, ML algorithms are well-suited for lithologic reconstruction problems, once there is no analytical expressions for computing well-log data produced by a particular rock unit. Additionally, supervised ML methods are strongly dependent on a accurate-labeled training data-set, which is not a simple task to achieve, due to data absences or corruption. Once an adequate supervision is performed, the classification outputs tend to be more accurate than unsupervised methods. This work presents a supervised version of a Self-Organizing Map, named as SSOM, to solve a lithologic reconstruction problem from well-log data. Firstly, we go for a more controlled problem and simulate well-log data directly from an interpreted geologic cross-section. We then define two specific training data-sets composed by density (RHOB), sonic (DT), spontaneous potential (SP) and gamma-ray (GR) logs, all simulated through a Gaussian distribution function per lithology. Once the training data-set is created, we simulate a particular pseudo-well, referred to as classification well, for defining controlled tests. First one comprises a training data-set with no labeled log data of the simulated fault zone. In the second test, we intentionally improve the training data-set with the fault. To bespeak the obtained results for each test, we analyze confusion matrices, logplots, accuracy and precision. Apart from very thin layer misclassifications, the SSOM provides reasonable lithologic reconstructions, especially when the improved training data-set is considered for supervision. The set of numerical experiments shows that our SSOM is extremely well-suited for a supervised lithologic reconstruction, especially to recover lithotypes that are weakly-sampled in the training log-data. On the other hand, some misclassifications are also observed when the cortex could not group the slightly different lithologies.</p></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"5 ","pages":"Article 100072"},"PeriodicalIF":0.0000,"publicationDate":"2024-02-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666544124000133/pdfft?md5=9b25c5edb1e3ce0398ce55cee93baf8d&pid=1-s2.0-S2666544124000133-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial Intelligence in Geosciences","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666544124000133","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Recently, machine learning (ML) has been considered a powerful technological element of different society areas. To transform the computer into a decision maker, several sophisticated methods and algorithms are constantly created and analyzed. In geophysics, both supervised and unsupervised ML methods have dramatically contributed to the development of seismic and well-log data interpretation. In well-logging, ML algorithms are well-suited for lithologic reconstruction problems, once there is no analytical expressions for computing well-log data produced by a particular rock unit. Additionally, supervised ML methods are strongly dependent on a accurate-labeled training data-set, which is not a simple task to achieve, due to data absences or corruption. Once an adequate supervision is performed, the classification outputs tend to be more accurate than unsupervised methods. This work presents a supervised version of a Self-Organizing Map, named as SSOM, to solve a lithologic reconstruction problem from well-log data. Firstly, we go for a more controlled problem and simulate well-log data directly from an interpreted geologic cross-section. We then define two specific training data-sets composed by density (RHOB), sonic (DT), spontaneous potential (SP) and gamma-ray (GR) logs, all simulated through a Gaussian distribution function per lithology. Once the training data-set is created, we simulate a particular pseudo-well, referred to as classification well, for defining controlled tests. First one comprises a training data-set with no labeled log data of the simulated fault zone. In the second test, we intentionally improve the training data-set with the fault. To bespeak the obtained results for each test, we analyze confusion matrices, logplots, accuracy and precision. Apart from very thin layer misclassifications, the SSOM provides reasonable lithologic reconstructions, especially when the improved training data-set is considered for supervision. The set of numerical experiments shows that our SSOM is extremely well-suited for a supervised lithologic reconstruction, especially to recover lithotypes that are weakly-sampled in the training log-data. On the other hand, some misclassifications are also observed when the cortex could not group the slightly different lithologies.
最近,机器学习(ML)被认为是不同社会领域的一个强大技术要素。为了将计算机转化为决策制定者,人们不断创造和分析出一些复杂的方法和算法。在地球物理学领域,有监督和无监督的 ML 方法极大地促进了地震和测井数据解释的发展。在测井方面,一旦没有计算特定岩石单元产生的测井数据的分析表达式,ML 算法就非常适合岩性重建问题。此外,有监督的 ML 方法在很大程度上依赖于准确标记的训练数据集,而由于数据缺失或损坏,要实现这一点并不容易。一旦进行了充分的监督,分类结果往往比无监督方法更准确。本研究提出了一种监督版自组织图(SSOM),用于解决井记录数据的岩性重建问题。首先,我们要解决一个更可控的问题,直接从解释的地质横截面模拟井录数据。然后,我们定义了两个特定的训练数据集,分别由密度(RHOB)、声波(DT)、自发电位(SP)和伽马射线(GR)测井数据组成,所有数据均通过高斯分布函数按岩性进行模拟。一旦创建了训练数据集,我们就模拟一个特定的伪井,称为分类井,用于定义控制测试。第一个测试包括一个训练数据集,其中没有模拟断层带的标注测井数据。在第二次测试中,我们有意改进了带有断层的训练数据集。为了说明每次测试的结果,我们对混淆矩阵、对数图、准确率和精确度进行了分析。除了极薄层的错误分类外,SSOM 提供了合理的岩性重建,尤其是在将改进的训练数据集作为监督数据时。一组数值实验表明,我们的 SSOM 非常适合用于有监督的岩性重建,尤其是恢复训练日志数据中采样较弱的岩性。另一方面,当皮层无法将略有不同的岩性归类时,也会出现一些分类错误。