Data-centric approach for predicting critical metals distribution: Heavy rare earth elements in Cretaceous Mediterranean-type karst bauxite deposits, southern Italy
Authors: Roberto Buccione, Ouafi Ameur-Zaimeche, Abdelhamid Ouladmansour, Rabah Kechiched, Giovanni Mongelli
Journal: Chemie Der Erde-Geochemistry, volume 84, issue 2, Article 126026 (impact factor 2.6; JCR Q2, Geochemistry & Geophysics)
Publication date: 2024-05-01
DOI: 10.1016/j.chemer.2023.126026
URL: https://www.sciencedirect.com/science/article/pii/S0009281923000776
Citations: 0
Abstract
In the last few years, considerable effort has been devoted to understanding the factors that control the distribution of critical metals (CMs) in karst bauxites, which are residual deposits hosted in carbonate rocks. Most of this work concerns Mediterranean-type karst bauxite deposits of Cretaceous age in southern Italy. In addition, there is increasing interest in assessing the usefulness of machine learning applied to geochemical datasets. With this in mind, we explored a data-centric machine learning approach that seeks the most suitable set of inputs, limited to Al2O3, Fe2O3, TiO2, and SiO2, the most abundant major oxides in these ores, for predicting the distribution of heavy rare earth elements (HREE) in the karst bauxite deposits of southern Italy.
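The search for a suitable input set can be pictured as enumerating the candidate combinations of the four oxides. The sketch below is only an illustration: the abstract does not say which combinations were actually tested, so the exhaustive enumeration and the helper name `candidate_input_sets` are assumptions.

```python
from itertools import combinations

# The four major oxides named in the abstract as candidate inputs.
OXIDES = ("Al2O3", "Fe2O3", "TiO2", "SiO2")


def candidate_input_sets(features, min_size=1):
    """Return every subset of `features` with at least `min_size` members.

    Hypothetical helper: exhaustive enumeration is an assumption, not the
    procedure reported by the authors.
    """
    subsets = []
    for size in range(min_size, len(features) + 1):
        subsets.extend(combinations(features, size))
    return subsets


input_sets = candidate_input_sets(OXIDES)
for subset in input_sets:
    print(", ".join(subset))
```

Each tuple in `input_sets` would then be used to train and score one model, so that input sets with and without a given oxide can be compared.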
Among the machine learning techniques used, Artificial Neural Network (ANN), Support Vector Regression (SVR), Random Forest (RF), and Extreme Gradient Boosting (XGBoost) are those that effectively predict HREE concentrations. A predictive model based only on Al2O3, Fe2O3, and SiO2 performs worst, which suggests that TiO2 is a relevant input variable for predicting HREE concentrations in the karst bauxite deposits considered. The XGBoost model delivered the highest accuracy in predicting HREE for the validation data records (R² ≈ 0.830, RMSE ≈ 7.299, MAE ≈ 5.091).
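For readers unfamiliar with the three scores quoted above, the following minimal sketch computes them from their standard definitions. The function name `regression_metrics` and the sample values are illustrative assumptions and are not data from the study. RMSE and MAE are expressed in the units of the predicted quantity.

```python
import math


def regression_metrics(y_true, y_pred):
    """Return (R2, RMSE, MAE) for paired observed and predicted values."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    n = len(y_true)
    mean_true = sum(y_true) / n
    residuals = [t - p for t, p in zip(y_true, y_pred)]
    ss_res = sum(r * r for r in residuals)              # residual sum of squares
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)  # total sum of squares
    if ss_tot == 0:
        raise ValueError("R2 is undefined when all observed values are equal")
    r2 = 1.0 - ss_res / ss_tot            # coefficient of determination
    rmse = math.sqrt(ss_res / n)          # root mean squared error
    mae = sum(abs(r) for r in residuals) / n  # mean absolute error
    return r2, rmse, mae


# Illustrative values only; not measurements or predictions from the study.
observed = [10.0, 20.0, 30.0, 40.0]
predicted = [12.0, 18.0, 33.0, 39.0]
r2, rmse, mae = regression_metrics(observed, predicted)
print(f"R2={r2:.3f} RMSE={rmse:.3f} MAE={mae:.3f}")
```

An R² close to 1 together with low RMSE and MAE indicates predictions that track the observed values closely.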
Moreover, Fe2O3 is the input variable most strongly correlated with the output variable and is a significant predictor in our model, suggesting that iron oxyhydroxides play a relevant role in distributing HREE, likely by scavenging them from soil solutions.
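Ranking the inputs by their correlation with the target could be sketched as follows. The abstract does not state which correlation coefficient was used, so Pearson's r is an assumption here. All numbers are made up for illustration and were chosen so that Fe2O3 ranks first, mirroring the statement above; they are not measurements from the study.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


# Made-up oxide contents and HREE values; NOT data from the study.
samples = {
    "Al2O3": [55.0, 52.0, 58.0, 50.0, 54.0],
    "Fe2O3": [20.0, 24.0, 18.0, 27.0, 22.0],
    "TiO2": [2.5, 2.7, 2.4, 2.6, 2.8],
    "SiO2": [6.0, 5.0, 7.5, 4.0, 6.5],
}
hree = [40.0, 52.0, 35.0, 60.0, 45.0]

# Sort inputs by the absolute strength of their correlation with the target.
ranking = sorted(
    ((name, pearson_r(values, hree)) for name, values in samples.items()),
    key=lambda item: abs(item[1]),
    reverse=True,
)
for name, r in ranking:
    print(f"{name}: r = {r:+.3f}")
```

A strong correlation alone does not establish the scavenging mechanism; it only flags the variable as a candidate worth interpreting geochemically.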
The next step of our research will involve comprehensive cross-validation studies across multiple areas where Mediterranean-type karst bauxite deposits occur, providing a thorough assessment of the model's performance. By addressing these tasks and exploring avenues for improvement, the data-centric approach can advance its potential as a cheap and fast technique for a preliminary economic evaluation of the potential abundance of HREE, as well as of other CMs, in karst bauxite ores, benefiting applications that rely on these critical resources.
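One possible reading of cross-validation across multiple areas is a leave-one-area-out scheme, in which each area in turn is held out for testing while the model is trained on the others. The abstract does not specify the scheme, so this is an assumption, and the area labels and the function name `leave_one_area_out` below are hypothetical.

```python
def leave_one_area_out(area_labels):
    """Yield (held_out_area, train_indices, test_indices) for each distinct area.

    `area_labels` holds one label per sample, in sample order.
    """
    for held_out in sorted(set(area_labels)):
        test_idx = [i for i, a in enumerate(area_labels) if a == held_out]
        train_idx = [i for i, a in enumerate(area_labels) if a != held_out]
        yield held_out, train_idx, test_idx


# Hypothetical area labels, one per sample; not the study's sampling sites.
areas = ["area_A", "area_A", "area_B", "area_C", "area_B", "area_C"]
folds = list(leave_one_area_out(areas))
for held_out, train_idx, test_idx in folds:
    print(f"hold out {held_out}: train={train_idx} test={test_idx}")
```

Holding out whole areas, rather than random samples, tests whether a model trained on some deposits transfers to a deposit it has never seen.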
About the journal:
GEOCHEMISTRY was founded as Chemie der Erde in 1914 in Jena and is therefore one of the oldest journals for geochemistry-related topics.
GEOCHEMISTRY (formerly Chemie der Erde / Geochemistry) publishes original research papers, short communications, reviews of selected topics, and high-class invited review articles addressed to a broad geoscience audience. Publications dealing with interdisciplinary questions are particularly welcome. Young scientists are especially encouraged to submit their work. Contributions will be published exclusively in English. The journal, through very personalized consultation and its worldwide distribution, offers entry into the world of international scientific communication and promotes interdisciplinary discussion on chemical problems in a broad spectrum of geosciences.
The following topics are covered by the expertise of the members of the editorial board (see below):
- cosmochemistry, meteoritics
- igneous, metamorphic, and sedimentary petrology
- volcanology
- low & high temperature geochemistry
- experimental - theoretical - field related studies
- mineralogy - crystallography
- environmental geosciences
- archaeometry