Mohammad Parsa, Christopher J. M. Lawley, Renato Cumani, Ernst Schetselaar, Jeff Harris, David R. Lentz, Steven E. Zhang, Julie E. Bourdeau
{"title":"加拿大碳酸盐岩寄生 REE +/- Nb 矿床的预测建模","authors":"Mohammad Parsa, Christopher J. M. Lawley, Renato Cumani, Ernst Schetselaar, Jeff Harris, David R. Lentz, Steven E. Zhang, Julie E. Bourdeau","doi":"10.1007/s11053-024-10369-7","DOIUrl":null,"url":null,"abstract":"<p>Carbonatites are the primary geological sources for rare earth elements (REEs) and niobium (Nb). This study applies machine learning techniques to generate national-scale prospectivity models and support mineral exploration targeting of Canadian carbonatite-hosted REE +/− Nb deposits. Extreme target feature label imbalance, diverse geological settings hosting these deposits throughout Canada, selecting negative labels, and issues regarding the interpretability of some machine learning models are major challenges impeding data-driven prospectivity modeling of carbonatite-hosted REE +/− Nb deposits. A multi-stage framework, exploiting global hierarchical tessellation model systems, data-space similarity measures, ensemble modeling, and Shapley additive explanations was coupled with convolutional neural networks (CNN) and random forest to meet the objectives of this work. A <i>risk</i>–<i>return</i> analysis was further implemented to assist with model interpretation and visualization. Multiple models were compared in terms of their predictive ability and their capability of reducing the search space for mineral exploration. The best-performing model, derived using a CNN that incorporates public geoscience datasets, exhibits an area under the curve for receiver operating characteristics plot of 0.96 for the testing labels, reducing the search area by 80%, while predicting all known carbonatite-hosted REE +/− Nb occurrences. The framework used in our study allows for an explicit definition of input vectors and provides a clear interpretation of outcomes generated by prospectivity models.</p>","PeriodicalId":54284,"journal":{"name":"Natural Resources Research","volume":"10 1","pages":""},"PeriodicalIF":4.8000,"publicationDate":"2024-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Predictive Modeling of Canadian Carbonatite-Hosted REE +/− Nb Deposits\",\"authors\":\"Mohammad Parsa, Christopher J. M. Lawley, Renato Cumani, Ernst Schetselaar, Jeff Harris, David R. Lentz, Steven E. Zhang, Julie E. Bourdeau\",\"doi\":\"10.1007/s11053-024-10369-7\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Carbonatites are the primary geological sources for rare earth elements (REEs) and niobium (Nb). This study applies machine learning techniques to generate national-scale prospectivity models and support mineral exploration targeting of Canadian carbonatite-hosted REE +/− Nb deposits. Extreme target feature label imbalance, diverse geological settings hosting these deposits throughout Canada, selecting negative labels, and issues regarding the interpretability of some machine learning models are major challenges impeding data-driven prospectivity modeling of carbonatite-hosted REE +/− Nb deposits. A multi-stage framework, exploiting global hierarchical tessellation model systems, data-space similarity measures, ensemble modeling, and Shapley additive explanations was coupled with convolutional neural networks (CNN) and random forest to meet the objectives of this work. A <i>risk</i>–<i>return</i> analysis was further implemented to assist with model interpretation and visualization. Multiple models were compared in terms of their predictive ability and their capability of reducing the search space for mineral exploration. The best-performing model, derived using a CNN that incorporates public geoscience datasets, exhibits an area under the curve for receiver operating characteristics plot of 0.96 for the testing labels, reducing the search area by 80%, while predicting all known carbonatite-hosted REE +/− Nb occurrences. The framework used in our study allows for an explicit definition of input vectors and provides a clear interpretation of outcomes generated by prospectivity models.</p>\",\"PeriodicalId\":54284,\"journal\":{\"name\":\"Natural Resources Research\",\"volume\":\"10 1\",\"pages\":\"\"},\"PeriodicalIF\":4.8000,\"publicationDate\":\"2024-06-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Natural Resources Research\",\"FirstCategoryId\":\"89\",\"ListUrlMain\":\"https://doi.org/10.1007/s11053-024-10369-7\",\"RegionNum\":2,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"GEOSCIENCES, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Natural Resources Research","FirstCategoryId":"89","ListUrlMain":"https://doi.org/10.1007/s11053-024-10369-7","RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GEOSCIENCES, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
摘要
碳酸盐岩是稀土元素(REE)和铌(Nb)的主要地质来源。本研究应用机器学习技术生成全国规模的远景模型,支持加拿大碳酸盐岩孕育的稀土元素+/-铌矿床的矿产勘探目标。目标特征标签极度不平衡、加拿大各地孕育这些矿床的地质环境各不相同、选择负面标签以及一些机器学习模型的可解释性问题,这些都是阻碍对碳酸盐岩孕育的 REE +/- Nb 矿床进行数据驱动的远景建模的主要挑战。为了实现这项工作的目标,利用全局分层细分模型系统、数据空间相似性度量、集合建模和夏普利加法解释的多阶段框架与卷积神经网络(CNN)和随机森林相结合。还进一步实施了风险回报分析,以协助模型解释和可视化。对多个模型的预测能力和缩小矿产勘探搜索空间的能力进行了比较。表现最好的模型是使用结合了公共地球科学数据集的 CNN 得出的,测试标签的接收器操作特征曲线图下面积为 0.96,搜索范围缩小了 80%,同时预测了所有已知的碳酸盐岩寄生 REE +/- Nb 矿点。我们研究中使用的框架允许对输入向量进行明确定义,并对勘探模型产生的结果进行清晰解释。
Predictive Modeling of Canadian Carbonatite-Hosted REE +/− Nb Deposits
Carbonatites are the primary geological sources for rare earth elements (REEs) and niobium (Nb). This study applies machine learning techniques to generate national-scale prospectivity models and support mineral exploration targeting of Canadian carbonatite-hosted REE +/− Nb deposits. Extreme target feature label imbalance, diverse geological settings hosting these deposits throughout Canada, selecting negative labels, and issues regarding the interpretability of some machine learning models are major challenges impeding data-driven prospectivity modeling of carbonatite-hosted REE +/− Nb deposits. A multi-stage framework, exploiting global hierarchical tessellation model systems, data-space similarity measures, ensemble modeling, and Shapley additive explanations was coupled with convolutional neural networks (CNN) and random forest to meet the objectives of this work. A risk–return analysis was further implemented to assist with model interpretation and visualization. Multiple models were compared in terms of their predictive ability and their capability of reducing the search space for mineral exploration. The best-performing model, derived using a CNN that incorporates public geoscience datasets, exhibits an area under the curve for receiver operating characteristics plot of 0.96 for the testing labels, reducing the search area by 80%, while predicting all known carbonatite-hosted REE +/− Nb occurrences. The framework used in our study allows for an explicit definition of input vectors and provides a clear interpretation of outcomes generated by prospectivity models.
期刊介绍:
This journal publishes quantitative studies of natural (mainly but not limited to mineral) resources exploration, evaluation and exploitation, including environmental and risk-related aspects. Typical articles use geoscientific data or analyses to assess, test, or compare resource-related aspects. NRR covers a wide variety of resources including minerals, coal, hydrocarbon, geothermal, water, and vegetation. Case studies are welcome.