Snehamoy Chatterjee , C. Özgen Karacan , Maria Mastalerz
{"title":"Exploring the uncertainty of machine learning models and geostatistical mapping of rare earth element potential in Indiana coals, USA","authors":"Snehamoy Chatterjee , C. Özgen Karacan , Maria Mastalerz","doi":"10.1016/j.coal.2023.104419","DOIUrl":null,"url":null,"abstract":"<div><p><span>Rare earth elements and </span>yttrium<span><span><span><span> (REEs) have a wide range of applications in high- and low-carbon technologies. The strategic significance of REEs has grown due to their expanding applications in manufacturing industries and the constrained availability of these essential resources. This research explores the applicability of machine learning models and their uncertainty for assessing the REE potential in coal beds using various coal parameters as inputs. The work focuses on developing a predictive model based on geological variables, excluding considerations related to potential shifts in the commodities market. The Indiana Coal Quality Database was used as the data source. The promising and unpromising indicators derived from the outlook coefficient of samples from the database were used as the REE potential indicator for machine learning classification models. The filter-based approach with bootstrap was used to evaluate the importance of the coal parameters and their prediction uncertainties. Four </span>machine learning methods (linear </span>discriminant analysis (LDA), random forest (RF), </span>support vector machine<span> (SVM), and artificial neural networks (ANN), a data balancing and augmentation approach (Synthetic Minority Over-sampling Technique), and bootstrap resampling techniques were used for building the models and evaluating their prediction capabilities under uncertainty. It was determined that the SVM bootstrap model with ten-times balanced and augmented data provided superior results compared with other models. Finally, stochastic spatial maps of the REE potential within the coal basin were generated using sequential indicator simulation. The spatial maps of the REE potential showed that a 29% area of the Indiana section of the Illinois coal basin has economic potential of REEs, with 90% confidence.</span></span></p></div>","PeriodicalId":13864,"journal":{"name":"International Journal of Coal Geology","volume":null,"pages":null},"PeriodicalIF":5.6000,"publicationDate":"2023-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Coal Geology","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0166516223002379","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENERGY & FUELS","Score":null,"Total":0}
引用次数: 0
Abstract
Rare earth elements and yttrium (REEs) have a wide range of applications in high- and low-carbon technologies. The strategic significance of REEs has grown due to their expanding applications in manufacturing industries and the constrained availability of these essential resources. This research explores the applicability of machine learning models and their uncertainty for assessing the REE potential in coal beds using various coal parameters as inputs. The work focuses on developing a predictive model based on geological variables, excluding considerations related to potential shifts in the commodities market. The Indiana Coal Quality Database was used as the data source. The promising and unpromising indicators derived from the outlook coefficient of samples from the database were used as the REE potential indicator for machine learning classification models. The filter-based approach with bootstrap was used to evaluate the importance of the coal parameters and their prediction uncertainties. Four machine learning methods (linear discriminant analysis (LDA), random forest (RF), support vector machine (SVM), and artificial neural networks (ANN), a data balancing and augmentation approach (Synthetic Minority Over-sampling Technique), and bootstrap resampling techniques were used for building the models and evaluating their prediction capabilities under uncertainty. It was determined that the SVM bootstrap model with ten-times balanced and augmented data provided superior results compared with other models. Finally, stochastic spatial maps of the REE potential within the coal basin were generated using sequential indicator simulation. The spatial maps of the REE potential showed that a 29% area of the Indiana section of the Illinois coal basin has economic potential of REEs, with 90% confidence.
期刊介绍:
The International Journal of Coal Geology deals with fundamental and applied aspects of the geology and petrology of coal, oil/gas source rocks and shale gas resources. The journal aims to advance the exploration, exploitation and utilization of these resources, and to stimulate environmental awareness as well as advancement of engineering for effective resource management.