Prediction of soil arsenic concentration in European soils: A dimensionality reduction and ensemble learning approach

IF 7.7 Q2 ENGINEERING, ENVIRONMENTAL Journal of hazardous materials advances Pub Date : 2025-02-01 Epub Date: 2025-01-15 DOI:10.1016/j.hazadv.2025.100604
Mohammad Sadegh Barkhordari , Chongchong Qi
{"title":"Prediction of soil arsenic concentration in European soils: A dimensionality reduction and ensemble learning approach","authors":"Mohammad Sadegh Barkhordari ,&nbsp;Chongchong Qi","doi":"10.1016/j.hazadv.2025.100604","DOIUrl":null,"url":null,"abstract":"<div><div>Arsenic contamination in soils poses significant risks to human health and the environment, necessitating accurate prediction methods to support effective mitigation strategies. This study addresses critical gaps in previous research, including multicollinearity among predictor variables, limited consideration of anthropogenic factors, and insufficient use of dimensionality reduction techniques. Principal Component Analysis (PCA) was employed for feature extraction, and six ensemble learning models were compared to enhance prediction accuracy for arsenic concentrations in European soils. Key environmental, chemical, physical, and anthropogenic factors were incorporated. Random Forest emerged as the top-performing model, achieving a mean squared error of 0.71 and a prediction accuracy of 89 % on test data. The results highlight the significant role of anthropogenic factors—particularly agricultural practices—in influencing arsenic levels, alongside chemical properties like phosphorus concentration and soil pH. The study demonstrates that advanced feature engineering, including PCA, can address multicollinearity while improving machine learning model performance. The findings provide critical insights for environmental risk assessment and policymaking, emphasizing the need for targeted interventions in regions with high anthropogenic activity. By combining robust data preprocessing and state-of-the-art ensemble learning techniques, this research offers a scalable and effective framework for predicting soil contamination and guiding remediation efforts across diverse geographic settings.</div></div>","PeriodicalId":73763,"journal":{"name":"Journal of hazardous materials advances","volume":"17 ","pages":"Article 100604"},"PeriodicalIF":7.7000,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of hazardous materials advances","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2772416625000166","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/15 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"ENGINEERING, ENVIRONMENTAL","Score":null,"Total":0}
引用次数: 0

Abstract

Arsenic contamination in soils poses significant risks to human health and the environment, necessitating accurate prediction methods to support effective mitigation strategies. This study addresses critical gaps in previous research, including multicollinearity among predictor variables, limited consideration of anthropogenic factors, and insufficient use of dimensionality reduction techniques. Principal Component Analysis (PCA) was employed for feature extraction, and six ensemble learning models were compared to enhance prediction accuracy for arsenic concentrations in European soils. Key environmental, chemical, physical, and anthropogenic factors were incorporated. Random Forest emerged as the top-performing model, achieving a mean squared error of 0.71 and a prediction accuracy of 89 % on test data. The results highlight the significant role of anthropogenic factors—particularly agricultural practices—in influencing arsenic levels, alongside chemical properties like phosphorus concentration and soil pH. The study demonstrates that advanced feature engineering, including PCA, can address multicollinearity while improving machine learning model performance. The findings provide critical insights for environmental risk assessment and policymaking, emphasizing the need for targeted interventions in regions with high anthropogenic activity. By combining robust data preprocessing and state-of-the-art ensemble learning techniques, this research offers a scalable and effective framework for predicting soil contamination and guiding remediation efforts across diverse geographic settings.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
欧洲土壤中砷浓度的预测:一种降维和集合学习方法
土壤中的砷污染对人类健康和环境构成重大风险,因此需要准确的预测方法来支持有效的缓解战略。本研究解决了先前研究中的关键空白,包括预测变量之间的多重共线性,对人为因素的考虑有限,以及降维技术的使用不足。采用主成分分析(PCA)进行特征提取,并对6种集成学习模型进行比较,以提高欧洲土壤砷浓度的预测精度。主要的环境、化学、物理和人为因素被纳入。随机森林成为表现最好的模型,在测试数据上实现了0.71的均方误差和89%的预测精度。研究结果强调了人为因素(尤其是农业实践)在影响砷水平方面的重要作用,以及磷浓度和土壤ph等化学性质。该研究表明,包括PCA在内的先进特征工程可以解决多重共线性问题,同时提高机器学习模型的性能。这些发现为环境风险评估和政策制定提供了重要见解,强调了在高人为活动区域进行有针对性干预的必要性。通过结合强大的数据预处理和最先进的集成学习技术,本研究为预测土壤污染和指导不同地理环境下的修复工作提供了一个可扩展和有效的框架。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Journal of hazardous materials advances
Journal of hazardous materials advances Environmental Engineering
CiteScore
4.80
自引率
0.00%
发文量
0
审稿时长
50 days
期刊最新文献
Valorization of Philippine phosphogypsum through optimized calcium extraction via salt and acid leaching for indirect mineral carbonation applications Ecological bleaching process using ozone as alternative to hydrogen peroxide: A comparative life cycle assessment Fungal hyphae upcycling: A green bio-adsorbent for sustainable lithium recovery from salt lake brines Ecological disaster in the making: Assessing the impact of Sino-Metals Leach tailings dam collapse on the Kafue River ecosystem Ecological risk of diethylhexyl phthalate (DEHP) and non-phthalate plasticizers in Osaka Bay - Japan and in rivers of Yogyakarta - Indonesia
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1