首页 > 最新文献

Artificial Intelligence in Geosciences最新文献

英文 中文
EQGraphNet: Advancing single-station earthquake magnitude estimation via deep graph networks with residual connections EQGraphNet:通过具有残差连接的深度图网络推进单站地震震级估算
Pub Date : 2024-08-22 DOI: 10.1016/j.aiig.2024.100089
Zhiguo Wang , Ziwei Chen , Huai Zhang

Magnitude estimation is a critical task in seismology, and conventional methods usually require dense seismic station arrays to provide data with sufficient spatiotemporal distribution. In this context, we propose the Earthquake Graph Network (EQGraphNet) to enhance the performance of single-station magnitude estimation. The backbone of the proposed model consists of eleven convolutional neural network layers and ten RCGL modules, where a RCGL combines a Residual Connection and a Graph convolutional Layer capable of mitigating the over-smoothing problem and simultaneously extracting temporal features of seismic signals. Our work uses the STanford EArthquake Dataset for model training and performance testing. Compared with three existing deep learning models, EQGraphNet demonstrates improved accuracy for both local magnitude and duration magnitude scales. To evaluate the robustness, we add natural background noise to the model input and find that EQGraphNet achieves the best results, particularly for signals with lower signal-to-noise ratios. Additionally, by replacing various network components and comparing their estimation performances, we illustrate the contribution of each part of EQGraphNet, validating the rationality of our approach. We also demonstrate the generalization capability of our model across different earthquakes occurring environments, achieving mean errors of ±0.1 units. Furthermore, by demonstrating the effectiveness of deeper architectures, this work encourages further exploration of deeper GNN models for both multi-station and single-station magnitude estimation.

震级估计是地震学中的一项关键任务,传统方法通常需要密集的地震台阵列来提供具有足够时空分布的数据。在这种情况下,我们提出了地震图网络(EQGraphNet)来提高单台站震级估计的性能。该模型的骨干由 11 个卷积神经网络层和 10 个 RCGL 模块组成,其中 RCGL 结合了残差连接和图卷积层,能够缓解过度平滑问题,同时提取地震信号的时间特征。我们的工作使用斯坦福大学地震数据集进行模型训练和性能测试。与现有的三个深度学习模型相比,EQGraphNet 在局部震级和持续时间震级尺度上的准确性都有所提高。为了评估鲁棒性,我们在模型输入中添加了自然背景噪声,结果发现 EQGraphNet 取得了最佳结果,尤其是对于信噪比较低的信号。此外,通过替换各种网络组件并比较其估计性能,我们说明了 EQGraphNet 各部分的贡献,验证了我们方法的合理性。我们还证明了我们的模型在不同地震发生环境下的泛化能力,其平均误差为 ±0.1 个单位。此外,通过证明更深层架构的有效性,这项工作鼓励进一步探索用于多站和单站震级估计的更深层 GNN 模型。
{"title":"EQGraphNet: Advancing single-station earthquake magnitude estimation via deep graph networks with residual connections","authors":"Zhiguo Wang ,&nbsp;Ziwei Chen ,&nbsp;Huai Zhang","doi":"10.1016/j.aiig.2024.100089","DOIUrl":"10.1016/j.aiig.2024.100089","url":null,"abstract":"<div><p>Magnitude estimation is a critical task in seismology, and conventional methods usually require dense seismic station arrays to provide data with sufficient spatiotemporal distribution. In this context, we propose the Earthquake Graph Network (EQGraphNet) to enhance the performance of single-station magnitude estimation. The backbone of the proposed model consists of eleven convolutional neural network layers and ten RCGL modules, where a RCGL combines a Residual Connection and a Graph convolutional Layer capable of mitigating the over-smoothing problem and simultaneously extracting temporal features of seismic signals. Our work uses the STanford EArthquake Dataset for model training and performance testing. Compared with three existing deep learning models, EQGraphNet demonstrates improved accuracy for both local magnitude and duration magnitude scales. To evaluate the robustness, we add natural background noise to the model input and find that EQGraphNet achieves the best results, particularly for signals with lower signal-to-noise ratios. Additionally, by replacing various network components and comparing their estimation performances, we illustrate the contribution of each part of EQGraphNet, validating the rationality of our approach. We also demonstrate the generalization capability of our model across different earthquakes occurring environments, achieving mean errors of <span><math><mo>±</mo></math></span>0.1 units. Furthermore, by demonstrating the effectiveness of deeper architectures, this work encourages further exploration of deeper GNN models for both multi-station and single-station magnitude estimation.</p></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"5 ","pages":"Article 100089"},"PeriodicalIF":0.0,"publicationDate":"2024-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666544124000303/pdfft?md5=c749cb3c43c7b43360a083719cc07ae7&pid=1-s2.0-S2666544124000303-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142049374","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Wide & deep learning for predicting relative mineral compositions of sediment cores solely based on XRF scans, a case study from Pleistocene Paleolake Olduvai, Tanzania 完全基于 XRF 扫描预测沉积岩芯相对矿物成分的广泛和深度学习,坦桑尼亚更新世古湖奥杜威案例研究
Pub Date : 2024-08-22 DOI: 10.1016/j.aiig.2024.100088
Gayantha R.L. Kodikara , Lindsay J. McHenry , Ian G. Stanistreet , Harald Stollhofen , Jackson K. Njau , Nicholas Toth , Kathy Schick

This study develops a method to use deep learning models to predict the mineral assemblages and their relative abundances in paleolake cores using high-resolution XRF core scan elemental data and X-ray diffraction (XRD) mineralogical results from the same core taken at coarser resolution. It uses the XRF core scan data along with published mineralogical information from the Olduvai Gorge Coring Project (OGCP) 2014 sediment cores 1A, 2A, and 3A from Paleolake Olduvai, Tanzania. Both regression and classification models were developed using a Keras deep learning framework to assess the predictability of mineral assemblages with their relative abundances (in regression models) or at least the mineral assemblages (in classification models) using XRF core scan data. Models were created using the Sequential class and Functional API with different model architectures. The correlation matrix of element ratios calculated from XRF element intensity records from the cores and XRD-derived mineralogical information was used to select the most useful features to train the models. 1057 training data records were used for the models. Lithological classes were also used for some models using Wide & Deep neural networks since those combine the benefits of memorization and generalization for mineral prediction. The results were validated using 265 validation data records unseen by the model and discuss the accuracy of models using six test records. The optimized Deep Neural Network (DNN) classification model achieved over 86% binary accuracy while the regression models were also able to predict the relative mineral abundances of samples with high accuracies. Overall, the study shows the efficacy of a carefully crafted Deep Learning (DL) model for predicting mineral assemblages and abundances using high-resolution XRF core scan data.

本研究开发了一种方法,利用高分辨率 XRF 岩心扫描元素数据和同一岩心较粗分辨率的 X 射线衍射 (XRD) 矿物学结果,使用深度学习模型预测古湖岩心中的矿物组合及其相对丰度。该研究使用了 XRF 岩心扫描数据以及已公布的来自坦桑尼亚奥杜威峡谷岩芯采集项目(OGCP)2014 年奥杜威古湖 1A、2A 和 3A 号沉积岩芯的矿物学信息。我们使用 Keras 深度学习框架开发了回归和分类模型,以评估使用 XRF 岩心扫描数据的矿物组合及其相对丰度(在回归模型中)或至少矿物组合(在分类模型中)的可预测性。使用具有不同模型结构的序列类和功能应用程序接口创建了模型。根据岩心的 XRF 元素强度记录和 XRD 衍生矿物学信息计算出的元素比率相关矩阵用于选择最有用的特征来训练模型。模型使用了 1057 条训练数据记录。由于深度神经网络结合了矿物预测的记忆性和概括性优势,因此一些模型还使用了宽amp;深度神经网络的岩性类别。使用模型未见过的 265 条验证数据对结果进行了验证,并使用 6 条测试数据对模型的准确性进行了讨论。经过优化的深度神经网络(DNN)分类模型达到了 86% 以上的二元准确率,而回归模型也能以较高的准确率预测样本的相对矿物丰度。总之,这项研究表明,精心设计的深度学习(DL)模型能够有效地利用高分辨率 XRF 岩心扫描数据预测矿物组合和丰度。
{"title":"Wide & deep learning for predicting relative mineral compositions of sediment cores solely based on XRF scans, a case study from Pleistocene Paleolake Olduvai, Tanzania","authors":"Gayantha R.L. Kodikara ,&nbsp;Lindsay J. McHenry ,&nbsp;Ian G. Stanistreet ,&nbsp;Harald Stollhofen ,&nbsp;Jackson K. Njau ,&nbsp;Nicholas Toth ,&nbsp;Kathy Schick","doi":"10.1016/j.aiig.2024.100088","DOIUrl":"10.1016/j.aiig.2024.100088","url":null,"abstract":"<div><p>This study develops a method to use deep learning models to predict the mineral assemblages and their relative abundances in paleolake cores using high-resolution XRF core scan elemental data and X-ray diffraction (XRD) mineralogical results from the same core taken at coarser resolution. It uses the XRF core scan data along with published mineralogical information from the Olduvai Gorge Coring Project (OGCP) 2014 sediment cores 1A, 2A, and 3A from Paleolake Olduvai, Tanzania. Both regression and classification models were developed using a Keras deep learning framework to assess the predictability of mineral assemblages with their relative abundances (in regression models) or at least the mineral assemblages (in classification models) using XRF core scan data. Models were created using the Sequential class and Functional API with different model architectures. The correlation matrix of element ratios calculated from XRF element intensity records from the cores and XRD-derived mineralogical information was used to select the most useful features to train the models. 1057 training data records were used for the models. Lithological classes were also used for some models using Wide &amp; Deep neural networks since those combine the benefits of memorization and generalization for mineral prediction. The results were validated using 265 validation data records unseen by the model and discuss the accuracy of models using six test records. The optimized Deep Neural Network (DNN) classification model achieved over 86% binary accuracy while the regression models were also able to predict the relative mineral abundances of samples with high accuracies. Overall, the study shows the efficacy of a carefully crafted Deep Learning (DL) model for predicting mineral assemblages and abundances using high-resolution XRF core scan data.</p></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"5 ","pages":"Article 100088"},"PeriodicalIF":0.0,"publicationDate":"2024-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666544124000297/pdfft?md5=8bd7402c96f4a311b4dbf3ffa0c2ef1b&pid=1-s2.0-S2666544124000297-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142075965","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
When linear inversion fails: Neural-network optimization for sparse-ray travel-time tomography of a volcanic edifice 当线性反演失败时火山大厦稀疏射线旅行时间断层成像的神经网络优化
Pub Date : 2024-08-21 DOI: 10.1016/j.aiig.2024.100086
Abolfazl Komeazi , Georg Rümpker , Johannes Faber , Fabian Limberger , Nishtha Srivastava

In this study, we present an artificial neural network (ANN)-based approach for travel-time tomography of a volcanic edifice under sparse-ray coverage. We employ ray tracing to simulate the propagation of seismic waves through the heterogeneous medium of a volcanic edifice, and an inverse modeling algorithm that uses an ANN to estimate the velocity structure from the “observed” travel-time data. The performance of the approach is evaluated through a 2-dimensional numerical study that simulates i) an active source seismic experiment with a few (explosive) sources placed on one side of the edifice and a dense line of receivers placed on the other side, and ii) earthquakes located inside the edifice with receivers placed on both sides of the edifice. The results are compared with those obtained from conventional damped linear inversion. The average Root Mean Square Error (RMSE) between the input and output models is approximately 0.03 km/s for the ANN inversions, whereas it is about 0.4 km/s for the linear inversions, demonstrating that the ANN-based approach outperforms the classical approach, particularly in situations with sparse ray coverage. Our study emphasizes the advantages of employing a relatively simple ANN architecture in conjunction with second-order optimizers to minimize the loss function. Compared to using first-order optimizers, our ANN architecture shows a ∼25% reduction in RMSE. The ANN-based approach is computationally efficient. We observed that even though the ANN is trained based on completely random velocity models, it is still capable of resolving previously unseen anomalous structures within the edifice with about 5% anomalous discrepancies, making it a potentially valuable tool for the detection of low velocity anomalies related to magmatic intrusions or mush.

在本研究中,我们提出了一种基于人工神经网络(ANN)的方法,用于在稀疏射线覆盖下对火山建筑物进行走时层析成像。我们采用射线追踪来模拟地震波在火山大厦异质介质中的传播,并采用逆建模算法,利用人工神经网络从 "观测到的 "走时数据中估计速度结构。通过一项二维数值研究对该方法的性能进行了评估,该研究模拟了 i) 在火山口一侧放置几个(爆炸)震源,在另一侧放置密集的接收器的主动源地震实验,以及 ii) 位于火山口内部,在火山口两侧放置接收器的地震实验。结果与传统的阻尼线性反演结果进行了比较。输入和输出模型之间的平均均方根误差(RMSE)在 ANN 反演中约为 0.03 km/s,而线性反演中约为 0.4 km/s,这表明基于 ANN 的方法优于传统方法,特别是在射线覆盖稀疏的情况下。我们的研究强调了采用相对简单的 ANN 架构和二阶优化器来最小化损失函数的优势。与使用一阶优化器相比,我们的 ANN 架构可将 RMSE 降低 25%。基于 ANN 的方法计算效率很高。我们观察到,即使是基于完全随机的速度模型训练的方差网络,它仍然能够以大约5%的异常差异解决先前未发现的建筑物内的异常结构,这使它成为检测与岩浆侵入或蘑菇相关的低速异常的一个有潜在价值的工具。
{"title":"When linear inversion fails: Neural-network optimization for sparse-ray travel-time tomography of a volcanic edifice","authors":"Abolfazl Komeazi ,&nbsp;Georg Rümpker ,&nbsp;Johannes Faber ,&nbsp;Fabian Limberger ,&nbsp;Nishtha Srivastava","doi":"10.1016/j.aiig.2024.100086","DOIUrl":"10.1016/j.aiig.2024.100086","url":null,"abstract":"<div><p>In this study, we present an artificial neural network (ANN)-based approach for travel-time tomography of a volcanic edifice under sparse-ray coverage. We employ ray tracing to simulate the propagation of seismic waves through the heterogeneous medium of a volcanic edifice, and an inverse modeling algorithm that uses an ANN to estimate the velocity structure from the “observed” travel-time data. The performance of the approach is evaluated through a 2-dimensional numerical study that simulates i) an active source seismic experiment with a few (explosive) sources placed on one side of the edifice and a dense line of receivers placed on the other side, and ii) earthquakes located inside the edifice with receivers placed on both sides of the edifice. The results are compared with those obtained from conventional damped linear inversion. The average Root Mean Square Error (RMSE) between the input and output models is approximately 0.03 km/s for the ANN inversions, whereas it is about 0.4 km/s for the linear inversions, demonstrating that the ANN-based approach outperforms the classical approach, particularly in situations with sparse ray coverage. Our study emphasizes the advantages of employing a relatively simple ANN architecture in conjunction with second-order optimizers to minimize the loss function. Compared to using first-order optimizers, our ANN architecture shows a ∼25% reduction in RMSE. The ANN-based approach is computationally efficient. We observed that even though the ANN is trained based on completely random velocity models, it is still capable of resolving previously unseen anomalous structures within the edifice with about 5% anomalous discrepancies, making it a potentially valuable tool for the detection of low velocity anomalies related to magmatic intrusions or mush.</p></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"5 ","pages":"Article 100086"},"PeriodicalIF":0.0,"publicationDate":"2024-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666544124000273/pdfft?md5=7b60a887781291fb9c1bbd214c747929&pid=1-s2.0-S2666544124000273-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142049373","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Water resource forecasting with machine learning and deep learning: A scientometric analysis 利用机器学习和深度学习进行水资源预测:科学计量分析
Pub Date : 2024-08-10 DOI: 10.1016/j.aiig.2024.100084
Chanjuan Liu , Jing Xu , Xi’an Li , Zhongyao Yu , Jinran Wu

Water prediction plays a crucial role in modern-day water resource management, encompassing both hydrological patterns and demand forecasts. To gain insights into its current focus, status, and emerging themes, this study analyzed 876 articles published between 2015 and 2022, retrieved from the Web of Science database. Leveraging CiteSpace visualization software, bibliometric techniques, and literature review methodologies, the investigation identified essential literature related to water prediction using machine learning and deep learning approaches. Through a comprehensive analysis, the study identified significant countries, institutions, authors, journals, and keywords in this field. By exploring this data, the research mapped out prevailing trends and cutting-edge areas, providing valuable insights for researchers and practitioners involved in water prediction through machine learning and deep learning. The study aims to guide future inquiries by highlighting key research domains and emerging areas of interest.

水资源预测在现代水资源管理中发挥着至关重要的作用,包括水文模式和需求预测。为了深入了解其当前的重点、现状和新兴主题,本研究分析了从科学网数据库中检索到的 2015 年至 2022 年间发表的 876 篇文章。利用 CiteSpace 可视化软件、文献计量学技术和文献综述方法,该研究利用机器学习和深度学习方法确定了与水预测相关的重要文献。通过综合分析,研究确定了该领域的重要国家、机构、作者、期刊和关键词。通过探索这些数据,研究绘制出了当前趋势和前沿领域,为通过机器学习和深度学习进行水资源预测的研究人员和从业人员提供了宝贵的见解。本研究旨在通过突出关键研究领域和新兴关注领域来指导未来的研究。
{"title":"Water resource forecasting with machine learning and deep learning: A scientometric analysis","authors":"Chanjuan Liu ,&nbsp;Jing Xu ,&nbsp;Xi’an Li ,&nbsp;Zhongyao Yu ,&nbsp;Jinran Wu","doi":"10.1016/j.aiig.2024.100084","DOIUrl":"10.1016/j.aiig.2024.100084","url":null,"abstract":"<div><p>Water prediction plays a crucial role in modern-day water resource management, encompassing both hydrological patterns and demand forecasts. To gain insights into its current focus, status, and emerging themes, this study analyzed 876 articles published between 2015 and 2022, retrieved from the Web of Science database. Leveraging CiteSpace visualization software, bibliometric techniques, and literature review methodologies, the investigation identified essential literature related to water prediction using machine learning and deep learning approaches. Through a comprehensive analysis, the study identified significant countries, institutions, authors, journals, and keywords in this field. By exploring this data, the research mapped out prevailing trends and cutting-edge areas, providing valuable insights for researchers and practitioners involved in water prediction through machine learning and deep learning. The study aims to guide future inquiries by highlighting key research domains and emerging areas of interest.</p></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"5 ","pages":"Article 100084"},"PeriodicalIF":0.0,"publicationDate":"2024-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S266654412400025X/pdfft?md5=8bb63629925bdc6599eb399ca1cbfe94&pid=1-s2.0-S266654412400025X-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141991036","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Exploring emerald global geochemical provenance through fingerprinting and machine learning methods 通过指纹识别和机器学习方法探索祖母绿的全球地球化学出处
Pub Date : 2024-08-10 DOI: 10.1016/j.aiig.2024.100085
Raquel Alonso-Perez , James M.D. Day , D. Graham Pearson , Yan Luo , Manuel A. Palacios , Raju Sudhakar , Aaron Palke

Emeralds – the green colored variety of beryl – occur as gem-quality specimens in over fifty deposits globally. While digital traceability methods for emerald have limitations, sample-based approaches offer robust alternatives, particularly for determining the geographic origin of emerald. Three factors make emerald suitable for provenance studies and hence for developing models for origin determination. First, the diverse elemental chemistry of emerald at minor (<1 wt%) and trace levels (<1 to 100’s ppmw) exhibits unique inter-element fractionations between global deposits. Second, minimally destructive techniques, including laser ablation inductively coupled plasma mass spectrometry (LA-ICP-MS), enable measurement of these diagnostic elemental signatures. Third, when applied to extensive datasets, machine learning (ML) techniques enable the creation of predictive models and statistical discrimination with adequate characterization of the deposits. This study employs a carefully selected dataset comprising more than 1000 LA-ICP-MS analyses of gem-quality emeralds, enriched with new analyses. This dataset represents the largest available for global emerald deposits. We conducted unsupervised exploratory analysis using Principal Component Analysis (PCA). For machine learning-based classification, we employed Support Vector Machine Classification (SVM-C), achieving an initial accuracy rate of 79%. This was enhanced to 96.8% through the use of hierarchical SVM-C with PCA filters as our modeling approach. The ML models were trained using the concentrations of eight statistically significant elements (Li, V, Cr, Fe, Sc, Ga, Rb, Cs). By leveraging high-quality LA-ICP-MS data and ML techniques, accurate identification of the geographical origin of emerald becomes possible. These models are important for accurate provenance of emerald, and from a geochemical perspective, for understanding the formation environments of beryl-bearing pegmatites and shales.

祖母绿--绿柱石的绿色品种--以宝石级标本的形式出现在全球五十多个矿床中。虽然祖母绿的数字溯源方法有其局限性,但基于样本的方法提供了可靠的替代方法,特别是在确定祖母绿的地理原产地方面。有三个因素使祖母绿适合用于产地研究,从而开发出确定原产地的模型。首先,祖母绿在微量(1 wt%)和痕量水平(1 到 100's ppmw)上的元素化学性质各不相同,在全球矿床之间表现出独特的元素间分馏。其次,包括激光烧蚀电感耦合等离子体质谱法(LA-ICP-MS)在内的破坏性最小的技术可以测量这些诊断性元素特征。第三,当应用于大量数据集时,机器学习(ML)技术能够创建预测模型,并在充分描述矿床特征的情况下进行统计判别。这项研究采用了一个精心挑选的数据集,其中包括对宝石级祖母绿进行的 1000 多项 LA-ICP-MS 分析,并添加了新的分析。该数据集是目前全球祖母绿矿床中最大的数据集。我们使用主成分分析法(PCA)进行了无监督探索性分析。在基于机器学习的分类方面,我们采用了支持向量机分类法(SVM-C),最初的准确率为 79%。通过使用带有 PCA 过滤器的分层 SVM-C 作为建模方法,准确率提高到 96.8%。我们使用八种具有统计意义的元素(锂、钒、铬、铁、钪、镓、铷、铯)的浓度来训练 ML 模型。通过利用高质量的 LA-ICP-MS 数据和 ML 技术,准确鉴定祖母绿的地理来源成为可能。这些模型对于准确确定祖母绿的产地非常重要,从地球化学的角度来看,对于了解含绿柱石伟晶岩和页岩的形成环境也非常重要。
{"title":"Exploring emerald global geochemical provenance through fingerprinting and machine learning methods","authors":"Raquel Alonso-Perez ,&nbsp;James M.D. Day ,&nbsp;D. Graham Pearson ,&nbsp;Yan Luo ,&nbsp;Manuel A. Palacios ,&nbsp;Raju Sudhakar ,&nbsp;Aaron Palke","doi":"10.1016/j.aiig.2024.100085","DOIUrl":"10.1016/j.aiig.2024.100085","url":null,"abstract":"<div><p>Emeralds – the green colored variety of beryl – occur as gem-quality specimens in over fifty deposits globally. While digital traceability methods for emerald have limitations, sample-based approaches offer robust alternatives, particularly for determining the geographic origin of emerald. Three factors make emerald suitable for provenance studies and hence for developing models for origin determination. First, the diverse elemental chemistry of emerald at minor (&lt;1 wt%) and trace levels (&lt;1 to 100’s ppmw) exhibits unique inter-element fractionations between global deposits. Second, minimally destructive techniques, including laser ablation inductively coupled plasma mass spectrometry (LA-ICP-MS), enable measurement of these diagnostic elemental signatures. Third, when applied to extensive datasets, machine learning (ML) techniques enable the creation of predictive models and statistical discrimination with adequate characterization of the deposits. This study employs a carefully selected dataset comprising more than 1000 LA-ICP-MS analyses of gem-quality emeralds, enriched with new analyses. This dataset represents the largest available for global emerald deposits. We conducted unsupervised exploratory analysis using Principal Component Analysis (PCA). For machine learning-based classification, we employed Support Vector Machine Classification (SVM-C), achieving an initial accuracy rate of 79%. This was enhanced to 96.8% through the use of hierarchical SVM-C with PCA filters as our modeling approach. The ML models were trained using the concentrations of eight statistically significant elements (Li, V, Cr, Fe, Sc, Ga, Rb, Cs). By leveraging high-quality LA-ICP-MS data and ML techniques, accurate identification of the geographical origin of emerald becomes possible. These models are important for accurate provenance of emerald, and from a geochemical perspective, for understanding the formation environments of beryl-bearing pegmatites and shales.</p></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"5 ","pages":"Article 100085"},"PeriodicalIF":0.0,"publicationDate":"2024-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666544124000261/pdfft?md5=8ac6027d08bf9d1a8618e5ecdb9f25b3&pid=1-s2.0-S2666544124000261-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142096082","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
High-resolution seismic inversion method based on joint data-driven in the time-frequency domain 基于时频域联合数据驱动的高分辨率地震反演方法
Pub Date : 2024-07-27 DOI: 10.1016/j.aiig.2024.100083
Yu Liu , Sisi Miao

Seismic inversion can be divided into time-domain inversion and frequency-domain inversion based on different transform domains. Time-domain inversion has stronger stability and noise resistance compared to frequency-domain inversion. Frequency domain inversion has stronger ability to identify small-scale bodies and higher inversion resolution. Therefore, the research on the joint inversion method in the time-frequency domain is of great significance for improving the inversion resolution, stability, and noise resistance. The introduction of prior information constraints can effectively reduce ambiguity in the inversion process. However, the existing model-driven time-frequency joint inversion assumes a specific prior distribution of the reservoir. These methods do not consider the original features of the data and are difficult to describe the relationship between time-domain features and frequency-domain features. Therefore, this paper proposes a high-resolution seismic inversion method based on joint data-driven in the time-frequency domain. The method is based on the impedance and reflectivity samples from logging, using joint dictionary learning to obtain adaptive feature information of the reservoir, and using sparse coefficients to capture the intrinsic relationship between impedance and reflectivity. The optimization result of the inversion is achieved through the regularization term of the joint dictionary sparse representation. We have finally achieved an inversion method that combines constraints on time-domain features and frequency features. By testing the model data and field data, the method has higher resolution in the inversion results and good noise resistance.

根据变换域的不同,地震反演可分为时域反演和频域反演。与频域反演相比,时域反演具有更强的稳定性和抗噪性。频域反演具有更强的识别小尺度体的能力和更高的反演分辨率。因此,研究时频域联合反演方法对提高反演分辨率、稳定性和抗噪能力具有重要意义。先验信息约束的引入可以有效减少反演过程中的模糊性。然而,现有的模型驱动时频联合反演假设储层具有特定的先验分布。这些方法没有考虑数据的原始特征,难以描述时域特征与频域特征之间的关系。因此,本文提出了一种基于时频域联合数据驱动的高分辨率地震反演方法。该方法基于测井获得的阻抗和反射率样本,利用联合字典学习获得储层的自适应特征信息,并利用稀疏系数捕捉阻抗和反射率之间的内在关系。反演的优化结果是通过联合字典稀疏表示的正则化项实现的。我们最终实现了一种结合时域特征和频率特性约束的反演方法。通过对模型数据和现场数据的测试,该方法的反演结果具有更高的分辨率和良好的抗噪能力。
{"title":"High-resolution seismic inversion method based on joint data-driven in the time-frequency domain","authors":"Yu Liu ,&nbsp;Sisi Miao","doi":"10.1016/j.aiig.2024.100083","DOIUrl":"10.1016/j.aiig.2024.100083","url":null,"abstract":"<div><p>Seismic inversion can be divided into time-domain inversion and frequency-domain inversion based on different transform domains. Time-domain inversion has stronger stability and noise resistance compared to frequency-domain inversion. Frequency domain inversion has stronger ability to identify small-scale bodies and higher inversion resolution. Therefore, the research on the joint inversion method in the time-frequency domain is of great significance for improving the inversion resolution, stability, and noise resistance. The introduction of prior information constraints can effectively reduce ambiguity in the inversion process. However, the existing model-driven time-frequency joint inversion assumes a specific prior distribution of the reservoir. These methods do not consider the original features of the data and are difficult to describe the relationship between time-domain features and frequency-domain features. Therefore, this paper proposes a high-resolution seismic inversion method based on joint data-driven in the time-frequency domain. The method is based on the impedance and reflectivity samples from logging, using joint dictionary learning to obtain adaptive feature information of the reservoir, and using sparse coefficients to capture the intrinsic relationship between impedance and reflectivity. The optimization result of the inversion is achieved through the regularization term of the joint dictionary sparse representation. We have finally achieved an inversion method that combines constraints on time-domain features and frequency features. By testing the model data and field data, the method has higher resolution in the inversion results and good noise resistance.</p></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"5 ","pages":"Article 100083"},"PeriodicalIF":0.0,"publicationDate":"2024-07-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666544124000248/pdfft?md5=a121e4ba7407f86ad1bada2790fbffb4&pid=1-s2.0-S2666544124000248-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141850013","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhancing economic sustainability in mature oil fields: Insights from the clustering approach to select candidate wells for extended shut-in 提高成熟油田的经济可持续性:用聚类方法选择延长停产的候选油井的启示
Pub Date : 2024-07-23 DOI: 10.1016/j.aiig.2024.100082
B. Lobut , E. Artun

Fluctuations in oil prices adversely affect decision making situations in which performance forecasting must be combined with realistic price forecasts. In periods of significant price drops, companies may consider extended duration of well shut-ins (i.e. temporarily stopping oil production) for economic reasons. For example, prices during the early days of the Covid-19 pandemic forced operators to consider shutting in all or some of their active wells. In the case of partial shut-in, selection of candidate wells may evolve as a challenging decision problem considering the uncertainties involved. In this study, a mature oil field with a long (50+ years) production history with 170+ wells is considered. Reservoirs with similar conditions face many challenges related to economic sustainability such as frequent maintenance requirements and low production rates. We aimed to solve this decision-making problem through unsupervised machine learning. Average reservoir characteristics at well locations, well production performance statistics and well locations are used as potential features that could characterize similarities and differences among wells. While reservoir characteristics are measured at well locations for the purpose of describing the subsurface reservoir, well performance consists of volumetric rates and pressures, which are frequently measured during oil production. After a multivariate data analysis that explored correlations among parameters, clustering algorithms were used to identify groups of wells that are similar with respect to aforementioned features. Using the field’s reservoir simulation model, scenarios of shutting in different groups of wells were simulated. Forecasted reservoir performance for three years was used for economic evaluation that assumed an oil price drop to $30/bbl for 6, 12 or 18 months. Results of economic analysis were analyzed to identify which group(s) of wells should have been shut-in by also considering the sensitivity to different price levels. It was observed that wells can be characterized in the 3-cluster case as low, medium and high performance wells. Analyzing the forecasting scenarios showed that shutting in all or high- and medium-performance wells altogether results in better economic outcomes. The results were most sensitive to the number of active wells and the oil price during the high-price period. This study demonstrated the effectiveness of unsupervised machine learning in well classification for operational decision making purposes. Operating companies may use this approach for improved decision making to select wells for extended shut-in during low oil-price periods. This approach would lead to cost savings especially in mature fields with low-profit margins.

油价波动对决策产生不利影响,在这种情况下,业绩预测必须与现实的价格预测相结合。在价格大幅下跌期间,出于经济原因,公司可能会考虑延长油井关闭时间(即暂时停止石油生产)。例如,Covid-19 大流行初期的价格迫使运营商考虑关闭全部或部分活跃油井。在部分关井的情况下,考虑到所涉及的不确定性,候选油井的选择可能会成为一个具有挑战性的决策问题。在本研究中,考虑的是一个拥有很长(50 多年)生产历史、170 多口油井的成熟油田。具有类似条件的油藏面临着许多与经济可持续性相关的挑战,如频繁的维护要求和较低的生产率。我们的目标是通过无监督机器学习来解决这一决策问题。油井位置的平均储层特征、油井生产性能统计数据和油井位置被用作潜在特征,可以描述油井之间的异同。油藏特征是在油井位置测量的,目的是描述地下油藏,而油井性能包括容积率和压力,在石油生产过程中经常测量。在对参数之间的相关性进行多变量数据分析后,使用聚类算法识别出与上述特征相似的油井组。利用油田的储层模拟模型,模拟了关闭不同井组的情况。使用三年的储油层性能预测进行经济评估,假设油价在 6、12 或 18 个月内跌至 30 美元/桶。对经济分析的结果进行了分析,通过考虑对不同价格水平的敏感性,确定哪一组(几组)油井应该关闭。据观察,在 3 组情况下,油井可分为低效井、中效井和高效井。对预测方案的分析表明,关闭所有油井或中高产油井会带来更好的经济效益。在高油价时期,结果对活跃油井的数量和油价最为敏感。这项研究证明了无监督机器学习在油井分类中对运营决策的有效性。运营公司可以利用这种方法改进决策,选择在低油价时期延长停产的油井。这种方法可以节约成本,尤其是在利润率较低的成熟油田。
{"title":"Enhancing economic sustainability in mature oil fields: Insights from the clustering approach to select candidate wells for extended shut-in","authors":"B. Lobut ,&nbsp;E. Artun","doi":"10.1016/j.aiig.2024.100082","DOIUrl":"10.1016/j.aiig.2024.100082","url":null,"abstract":"<div><p>Fluctuations in oil prices adversely affect decision making situations in which performance forecasting must be combined with realistic price forecasts. In periods of significant price drops, companies may consider extended duration of well shut-ins (i.e. temporarily stopping oil production) for economic reasons. For example, prices during the early days of the Covid-19 pandemic forced operators to consider shutting in all or some of their active wells. In the case of partial shut-in, selection of candidate wells may evolve as a challenging decision problem considering the uncertainties involved. In this study, a mature oil field with a long (50+ years) production history with 170+ wells is considered. Reservoirs with similar conditions face many challenges related to economic sustainability such as frequent maintenance requirements and low production rates. We aimed to solve this decision-making problem through unsupervised machine learning. Average reservoir characteristics at well locations, well production performance statistics and well locations are used as potential features that could characterize similarities and differences among wells. While reservoir characteristics are measured at well locations for the purpose of describing the subsurface reservoir, well performance consists of volumetric rates and pressures, which are frequently measured during oil production. After a multivariate data analysis that explored correlations among parameters, clustering algorithms were used to identify groups of wells that are similar with respect to aforementioned features. Using the field’s reservoir simulation model, scenarios of shutting in different groups of wells were simulated. Forecasted reservoir performance for three years was used for economic evaluation that assumed an oil price drop to $30/bbl for 6, 12 or 18 months. Results of economic analysis were analyzed to identify which group(s) of wells should have been shut-in by also considering the sensitivity to different price levels. It was observed that wells can be characterized in the 3-cluster case as low, medium and high performance wells. Analyzing the forecasting scenarios showed that shutting in all or high- and medium-performance wells altogether results in better economic outcomes. The results were most sensitive to the number of active wells and the oil price during the high-price period. This study demonstrated the effectiveness of unsupervised machine learning in well classification for operational decision making purposes. Operating companies may use this approach for improved decision making to select wells for extended shut-in during low oil-price periods. This approach would lead to cost savings especially in mature fields with low-profit margins.</p></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"5 ","pages":"Article 100082"},"PeriodicalIF":0.0,"publicationDate":"2024-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666544124000236/pdfft?md5=f0af05f8a34df1aaea52508e477709e5&pid=1-s2.0-S2666544124000236-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141851938","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Locally varying geostatistical machine learning for spatial prediction 用于空间预测的局部变化地质统计机器学习
Pub Date : 2024-07-02 DOI: 10.1016/j.aiig.2024.100081
Francky Fouedjio , Emet Arya

Machine learning methods dealing with the spatial auto-correlation of the response variable have garnered significant attention in the context of spatial prediction. Nonetheless, under these methods, the relationship between the response variable and explanatory variables is assumed to be homogeneous throughout the entire study area. This assumption, known as spatial stationarity, is very questionable in real-world situations due to the influence of contextual factors. Therefore, allowing the relationship between the target variable and predictor variables to vary spatially within the study region is more reasonable. However, existing machine learning techniques accounting for the spatially varying relationship between the dependent variable and the predictor variables do not capture the spatial auto-correlation of the dependent variable itself. Moreover, under these techniques, local machine learning models are effectively built using only fewer observations, which can lead to well-known issues such as over-fitting and the curse of dimensionality. This paper introduces a novel geostatistical machine learning approach where both the spatial auto-correlation of the response variable and the spatial non-stationarity of the regression relationship between the response and predictor variables are explicitly considered. The basic idea consists of relying on the local stationarity assumption to build a collection of local machine learning models while leveraging on the local spatial auto-correlation of the response variable to locally augment the training dataset. The proposed method’s effectiveness is showcased via experiments conducted on synthetic spatial data with known characteristics as well as real-world spatial data. In the synthetic (resp. real) case study, the proposed method’s predictive accuracy, as indicated by the Root Mean Square Error (RMSE) on the test set, is 17% (resp. 7%) better than that of popular machine learning methods dealing with the response variable’s spatial auto-correlation. Additionally, this method is not only valuable for spatial prediction but also offers a deeper understanding of how the relationship between the target and predictor variables varies across space, and it can even be used to investigate the local significance of predictor variables.

在空间预测方面,处理响应变量空间自相关性的机器学习方法备受关注。然而,在这些方法中,响应变量与解释变量之间的关系被假定为在整个研究区域内是同质的。由于受到环境因素的影响,这种被称为空间静止性的假设在现实世界中很成问题。因此,允许目标变量和预测变量之间的关系在研究区域内发生空间变化更为合理。然而,考虑因变量与预测变量之间空间变化关系的现有机器学习技术并不能捕捉因变量本身的空间自相关性。此外,在这些技术下,只需使用较少的观测数据就能有效地建立局部机器学习模型,这可能会导致众所周知的问题,如过拟合和维度诅咒。本文介绍了一种新颖的地理统计机器学习方法,其中明确考虑了响应变量的空间自相关性以及响应变量与预测变量之间回归关系的空间非平稳性。该方法的基本思路是依靠本地静态假设来建立一系列本地机器学习模型,同时利用响应变量的本地空间自相关性来本地增强训练数据集。通过对具有已知特征的合成空间数据和真实世界空间数据进行实验,展示了所提方法的有效性。在合成(或真实)案例研究中,根据测试集上的均方根误差(RMSE),建议方法的预测准确性比处理响应变量空间自相关性的流行机器学习方法高出 17%(或 7%)。此外,这种方法不仅对空间预测有价值,还能更深入地了解目标变量和预测变量之间的关系在不同空间的变化情况,甚至可以用来研究预测变量的局部重要性。
{"title":"Locally varying geostatistical machine learning for spatial prediction","authors":"Francky Fouedjio ,&nbsp;Emet Arya","doi":"10.1016/j.aiig.2024.100081","DOIUrl":"https://doi.org/10.1016/j.aiig.2024.100081","url":null,"abstract":"<div><p>Machine learning methods dealing with the spatial auto-correlation of the response variable have garnered significant attention in the context of spatial prediction. Nonetheless, under these methods, the relationship between the response variable and explanatory variables is assumed to be homogeneous throughout the entire study area. This assumption, known as spatial stationarity, is very questionable in real-world situations due to the influence of contextual factors. Therefore, allowing the relationship between the target variable and predictor variables to vary spatially within the study region is more reasonable. However, existing machine learning techniques accounting for the spatially varying relationship between the dependent variable and the predictor variables do not capture the spatial auto-correlation of the dependent variable itself. Moreover, under these techniques, local machine learning models are effectively built using only fewer observations, which can lead to well-known issues such as over-fitting and the curse of dimensionality. This paper introduces a novel geostatistical machine learning approach where both the spatial auto-correlation of the response variable and the spatial non-stationarity of the regression relationship between the response and predictor variables are explicitly considered. The basic idea consists of relying on the local stationarity assumption to build a collection of local machine learning models while leveraging on the local spatial auto-correlation of the response variable to locally augment the training dataset. The proposed method’s effectiveness is showcased via experiments conducted on synthetic spatial data with known characteristics as well as real-world spatial data. In the synthetic (resp. real) case study, the proposed method’s predictive accuracy, as indicated by the Root Mean Square Error (RMSE) on the test set, is 17% (resp. 7%) better than that of popular machine learning methods dealing with the response variable’s spatial auto-correlation. Additionally, this method is not only valuable for spatial prediction but also offers a deeper understanding of how the relationship between the target and predictor variables varies across space, and it can even be used to investigate the local significance of predictor variables.</p></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"5 ","pages":"Article 100081"},"PeriodicalIF":0.0,"publicationDate":"2024-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666544124000224/pdfft?md5=54078444bfb0fb6f7d6f252ccf51265a&pid=1-s2.0-S2666544124000224-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141582759","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The 3-billion fossil question: How to automate classification of microfossils 30 亿化石问题:如何自动分类微化石
Pub Date : 2024-06-08 DOI: 10.1016/j.aiig.2024.100080
Iver Martinsen , David Wade , Benjamin Ricaud , Fred Godtliebsen

Microfossil classification is an important discipline in subsurface exploration, for both oil & gas and Carbon Capture and Storage (CCS). The abundance and distribution of species found in sedimentary rocks provide valuable information about the age and depositional environment. However, the analysis is difficult and time-consuming, as it is based on manual work by human experts. Attempts to automate this process face two key challenges: (1) the input data are very large - our dataset is projected to grow to 3 billion microfossils, and (2) there are not enough labeled data to use the standard procedure of training a deep learning classifier. We propose an efficient pipeline for processing and grouping fossils by genus, or even species, from microscope slides using self-supervised learning. First we show how to efficiently extract crops from whole slide images by adapting previously trained object detection algorithms. Second, we provide a comparison of a range of self-supervised learning methods to classify and identify microfossils from very few labels. We obtain excellent results with both convolutional neural networks and vision transformers fine-tuned by self-supervision. Our approach is fast and computationally light, providing a handy tool for geologists working with microfossils.

微化石分类是油气和碳捕获与封存(CCS)地下勘探的一门重要学科。沉积岩中物种的数量和分布提供了有关年龄和沉积环境的宝贵信息。然而,这项分析工作十分困难且耗时,因为它是基于人类专家的手工操作。尝试将这一过程自动化面临两个关键挑战:(1)输入数据非常庞大--我们的数据集预计将增长到 30 亿个微化石;(2)没有足够的标记数据来使用训练深度学习分类器的标准程序。我们提出了一种高效的方法,利用自监督学习从显微镜切片中按属、甚至种处理化石并对其进行分组。首先,我们展示了如何通过调整以前训练过的物体检测算法,从整张玻片图像中高效提取作物。其次,我们比较了一系列自我监督学习方法,以便从极少的标签中对微化石进行分类和识别。我们利用卷积神经网络和经自我监督微调的视觉转换器取得了出色的结果。我们的方法速度快、计算量小,为从事微化石研究的地质学家提供了一个便捷的工具。
{"title":"The 3-billion fossil question: How to automate classification of microfossils","authors":"Iver Martinsen ,&nbsp;David Wade ,&nbsp;Benjamin Ricaud ,&nbsp;Fred Godtliebsen","doi":"10.1016/j.aiig.2024.100080","DOIUrl":"https://doi.org/10.1016/j.aiig.2024.100080","url":null,"abstract":"<div><p>Microfossil classification is an important discipline in subsurface exploration, for both oil &amp; gas and Carbon Capture and Storage (CCS). The abundance and distribution of species found in sedimentary rocks provide valuable information about the age and depositional environment. However, the analysis is difficult and time-consuming, as it is based on manual work by human experts. Attempts to automate this process face two key challenges: (1) the input data are very large - our dataset is projected to grow to 3 billion microfossils, and (2) there are not enough labeled data to use the standard procedure of training a deep learning classifier. We propose an efficient pipeline for processing and grouping fossils by genus, or even species, from microscope slides using self-supervised learning. First we show how to efficiently extract crops from whole slide images by adapting previously trained object detection algorithms. Second, we provide a comparison of a range of self-supervised learning methods to classify and identify microfossils from very few labels. We obtain excellent results with both convolutional neural networks and vision transformers fine-tuned by self-supervision. Our approach is fast and computationally light, providing a handy tool for geologists working with microfossils.</p></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"5 ","pages":"Article 100080"},"PeriodicalIF":0.0,"publicationDate":"2024-06-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666544124000212/pdfft?md5=b8887f1ac00c8bb87b2ff8b6e47d5830&pid=1-s2.0-S2666544124000212-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141322751","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Application of ChatGPT in soil science research and the perceptions of soil scientists in Indonesia ChatGPT 在土壤科学研究中的应用和印度尼西亚土壤科学家的看法
Pub Date : 2024-05-22 DOI: 10.1016/j.aiig.2024.100078
Destika Cahyana , Agus Hadiarto , Irawan , Diah Puspita Hati , Mira Media Pratamaningsih , Vicca Karolinoerita , Anny Mulyani , Sukarman , Muhammad Hikmat , Fadhlullah Ramadhani , Rachmat Abdul Gani , Edi Yatno , R. Bambang Heryanto , Suratman , Nuni Gofar , Abraham Suriadikusumah

Since its arrival in late November 2022, ChatGPT-3.5 has rapidly gained popularity and significantly impacted how research is planned, conducted, and published using a generative artificial intelligence approach. ChatGPT-4 was released four months later and became more popular in November 2023. However, there is little study about the perception of scientists of these chatbots, especially in soil science. This article presents the new findings of a brief research investigating soil scientists' responses and perceptions towards chatbots in Indonesia. This artificial intelligence application facilitates conversation-based interactions in text format. The study evaluated ten ChatGPT answers to fundamental questions in soil science, which has developed into a normal science with a mutually agreed-upon paradigm. The evaluation was carried out by seven soil scientists recognized for their expertise in Indonesia, using a scale of 1–100. In addition, a questionnaire was distributed to soil scientists at the National Research and Innovation Agency of the Republic of Indonesia (BRIN), universities, and Indonesian Soil Science Society (HITI) members to gauge their perception of ChatGPT's presence in the research field. The study results indicate that the scores of ChatGPT answers range from 82.99 to 92.24. ChatGPT-4 is better than both the paid and free versions of ChatGPT-3.5. There is no significant difference between the English and Indonesian versions of ChatGPT-4.0. However, the perception of general soil scientists about the level of trust is only 55%. Furthermore, 80% of soil scientists believe that chatbots can only be used as digital tools to assist in soil science research and cannot be used without the involvement of soil scientists.

自 2022 年 11 月底问世以来,ChatGPT-3.5 迅速受到欢迎,并极大地影响了使用生成式人工智能方法规划、开展和发布研究的方式。ChatGPT-4 在四个月后发布,并于 2023 年 11 月变得更加流行。然而,有关科学家对这些聊天机器人的看法的研究却很少,尤其是在土壤科学领域。本文介绍了一项简要研究的新发现,该研究调查了印度尼西亚土壤科学家对聊天机器人的反应和看法。这种人工智能应用促进了基于文本格式的对话式互动。该研究评估了十个 ChatGPT 对土壤科学基本问题的回答,土壤科学已经发展成为一门具有共同认可范式的正常科学。评估由印度尼西亚公认的七位土壤学家进行,他们的专业知识得到了认可,评估采用 1-100 分制。此外,还向印度尼西亚共和国国家研究与创新局(BRIN)、大学和印度尼西亚土壤科学学会(HITI)成员中的土壤科学家发放了调查问卷,以了解他们对 ChatGPT 在研究领域的存在的看法。研究结果表明,ChatGPT 答案的得分在 82.99 到 92.24 之间。ChatGPT-4 优于付费版和免费版的 ChatGPT-3.5。英文版和印尼语版的 ChatGPT-4.0 之间没有明显差异。然而,普通土壤科学家对信任度的认知度仅为 55%。此外,80% 的土壤科学家认为聊天机器人只能作为辅助土壤科学研究的数字工具,没有土壤科学家的参与就无法使用。
{"title":"Application of ChatGPT in soil science research and the perceptions of soil scientists in Indonesia","authors":"Destika Cahyana ,&nbsp;Agus Hadiarto ,&nbsp;Irawan ,&nbsp;Diah Puspita Hati ,&nbsp;Mira Media Pratamaningsih ,&nbsp;Vicca Karolinoerita ,&nbsp;Anny Mulyani ,&nbsp;Sukarman ,&nbsp;Muhammad Hikmat ,&nbsp;Fadhlullah Ramadhani ,&nbsp;Rachmat Abdul Gani ,&nbsp;Edi Yatno ,&nbsp;R. Bambang Heryanto ,&nbsp;Suratman ,&nbsp;Nuni Gofar ,&nbsp;Abraham Suriadikusumah","doi":"10.1016/j.aiig.2024.100078","DOIUrl":"10.1016/j.aiig.2024.100078","url":null,"abstract":"<div><p>Since its arrival in late November 2022, ChatGPT-3.5 has rapidly gained popularity and significantly impacted how research is planned, conducted, and published using a generative artificial intelligence approach. ChatGPT-4 was released four months later and became more popular in November 2023. However, there is little study about the perception of scientists of these chatbots, especially in soil science. This article presents the new findings of a brief research investigating soil scientists' responses and perceptions towards chatbots in Indonesia. This artificial intelligence application facilitates conversation-based interactions in text format. The study evaluated ten ChatGPT answers to fundamental questions in soil science, which has developed into a normal science with a mutually agreed-upon paradigm. The evaluation was carried out by seven soil scientists recognized for their expertise in Indonesia, using a scale of 1–100. In addition, a questionnaire was distributed to soil scientists at the National Research and Innovation Agency of the Republic of Indonesia (BRIN), universities, and Indonesian Soil Science Society (HITI) members to gauge their perception of ChatGPT's presence in the research field. The study results indicate that the scores of ChatGPT answers range from 82.99 to 92.24. ChatGPT-4 is better than both the paid and free versions of ChatGPT-3.5. There is no significant difference between the English and Indonesian versions of ChatGPT-4.0. However, the perception of general soil scientists about the level of trust is only 55%. Furthermore, 80% of soil scientists believe that chatbots can only be used as digital tools to assist in soil science research and cannot be used without the involvement of soil scientists.</p></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"5 ","pages":"Article 100078"},"PeriodicalIF":0.0,"publicationDate":"2024-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666544124000194/pdfft?md5=a1f673f82f524163cc02389c4b2b6d47&pid=1-s2.0-S2666544124000194-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141137044","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Artificial Intelligence in Geosciences
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1