Uncover implicit associations among geochemical elements using machine learning

IF 3.6 2区 地球科学 Q1 GEOLOGY Ore Geology Reviews Pub Date : 2025-04-01 Epub Date: 2025-02-16 DOI:10.1016/j.oregeorev.2025.106506
Shuguang Zhou , Zhizhong Cheng , Jinlin Wang , Nuo Li , Guo Jiang
{"title":"Uncover implicit associations among geochemical elements using machine learning","authors":"Shuguang Zhou ,&nbsp;Zhizhong Cheng ,&nbsp;Jinlin Wang ,&nbsp;Nuo Li ,&nbsp;Guo Jiang","doi":"10.1016/j.oregeorev.2025.106506","DOIUrl":null,"url":null,"abstract":"<div><div>The production of geochemical data serves diverse purposes, and a variety of analytical methods are utilized for analyzing geochemical element content. However, due to limitations in project funds, censored or missing values are common in geochemical data. This scarcity of data becomes more pronounced when dealing with large datasets. Regrettably, numerous data analysis techniques are unable to process datasets containing missing values, which presents a significant hurdle for researchers who depend on geochemical data. To address this issue, here we employed a random forest model to simulate the geochemical elements of rocks and stream sediments. By comparing and analyzing the effects of model parameters and feature variable selection on the simulation results of major and trace elements, the study found that with appropriate model parameters and variable selection, the simulation results for many elements are reliable, and the generalization performance of the random forest model is satisfactory. This research sheds light on the inherent correlations among various elements in nature, offers solutions to the challenges posed by missing values in geochemical data, and provides valuable technical support for disciplines such as geology, environmental science and soil science.</div></div>","PeriodicalId":19644,"journal":{"name":"Ore Geology Reviews","volume":"179 ","pages":"Article 106506"},"PeriodicalIF":3.6000,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ore Geology Reviews","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0169136825000666","RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/2/16 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"GEOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

The production of geochemical data serves diverse purposes, and a variety of analytical methods are utilized for analyzing geochemical element content. However, due to limitations in project funds, censored or missing values are common in geochemical data. This scarcity of data becomes more pronounced when dealing with large datasets. Regrettably, numerous data analysis techniques are unable to process datasets containing missing values, which presents a significant hurdle for researchers who depend on geochemical data. To address this issue, here we employed a random forest model to simulate the geochemical elements of rocks and stream sediments. By comparing and analyzing the effects of model parameters and feature variable selection on the simulation results of major and trace elements, the study found that with appropriate model parameters and variable selection, the simulation results for many elements are reliable, and the generalization performance of the random forest model is satisfactory. This research sheds light on the inherent correlations among various elements in nature, offers solutions to the challenges posed by missing values in geochemical data, and provides valuable technical support for disciplines such as geology, environmental science and soil science.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用机器学习揭示地球化学元素之间的隐含联系
地球化学数据的产生具有多种用途,分析地球化学元素含量的分析方法也多种多样。然而,由于项目资金的限制,地球化学资料中经常出现删减或缺失值的情况。在处理大型数据集时,这种数据的稀缺性变得更加明显。遗憾的是,许多数据分析技术无法处理包含缺失值的数据集,这对依赖地球化学数据的研究人员来说是一个重大障碍。为了解决这个问题,我们采用随机森林模型来模拟岩石和河流沉积物的地球化学元素。通过比较分析模型参数和特征变量选择对主量元素和微量元素模拟结果的影响,研究发现,在适当的模型参数和特征变量选择下,对许多元素的模拟结果是可靠的,随机森林模型的泛化性能是令人满意的。该研究揭示了自然界中各种元素之间的内在相关性,解决了地球化学数据缺失带来的挑战,并为地质、环境科学和土壤科学等学科提供了宝贵的技术支持。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Ore Geology Reviews
Ore Geology Reviews 地学-地质学
CiteScore
6.50
自引率
27.30%
发文量
546
审稿时长
22.9 weeks
期刊介绍: Ore Geology Reviews aims to familiarize all earth scientists with recent advances in a number of interconnected disciplines related to the study of, and search for, ore deposits. The reviews range from brief to longer contributions, but the journal preferentially publishes manuscripts that fill the niche between the commonly shorter journal articles and the comprehensive book coverages, and thus has a special appeal to many authors and readers.
期刊最新文献
Chenjiazhuang uraninite: mineralogy, genesis, and potential as a U-Pb dating reference material Linking modern-paleo lake systems: climate-driven Li-B differentiation and paleolake evolution validation in Tibetan Plateau salt lakes Hydrothermal evolution and ore genesis of the Sigou Mo–W deposit in the Northern Qinling Belt Gold enrichment mechanisms of the Qijiagou deposit (Jiaodong Peninsula): Remobilization by coupled dissolution-reprecipitation and scavenging by low-melting-point chalcophile element melts Redox control on Middle Permian Mn mineralization in South China
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1