The application of transfer machine learning to predict and impute missing sulphate levels in different Acid Mine Drainage treatment plants

Taskeen Hasrod , Yannick B. Nuapia , Hlanganani Tutu
{"title":"The application of transfer machine learning to predict and impute missing sulphate levels in different Acid Mine Drainage treatment plants","authors":"Taskeen Hasrod ,&nbsp;Yannick B. Nuapia ,&nbsp;Hlanganani Tutu","doi":"10.1016/j.clwat.2024.100029","DOIUrl":null,"url":null,"abstract":"<div><p>An accurately pre-trained stacking ensemble machine learning regressor was used to predict sulphate levels in two other Acid Mine Drainage (AMD) treatment plants using Transfer Learning (TL). The model was trained on the large Central Rand (CR) water quality dataset and was used to predict and impute the sulphate levels in the scanty East Rand (ER) and West Rand (W<em>R</em>) datasets which would not have been sufficient to train ML models from scratch. TL was successfully used to overcome this barrier and rapidly predicted sulphate levels in the East Rand and West Rand plants using the pre-trained model and achieved a high level of accuracy (Mean Squared Error:0.00124, Mean Absolute Error:0.0290 and R<sup>2</sup>:0.963) for the East Rand plant when comparing the predicted and true sulphate values. No true sulphate values existed for the West Rand plant; however, TL was successful in imputing these missing values and rapidly completed the West Rand dataset by providing the historic sulphate levels. This was possible due to the high degree of similarity between all domains (treatment plants) since they had similar geographic locations, the same treatment process, possessed the same important features and had the same relationships between variables. TL was successful in providing three accurate datasets for AMD sulphate levels, an important accomplishment towards having reliable data for use in design of experiments aimed at recovering valuable resources such as elemental sulphur, gypsum and important metals from AMD.</p></div>","PeriodicalId":100257,"journal":{"name":"Cleaner Water","volume":"2 ","pages":"Article 100029"},"PeriodicalIF":0.0000,"publicationDate":"2024-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2950263224000279/pdfft?md5=c39a7a24f8f6b6b582c4de57512044b8&pid=1-s2.0-S2950263224000279-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cleaner Water","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2950263224000279","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

An accurately pre-trained stacking ensemble machine learning regressor was used to predict sulphate levels in two other Acid Mine Drainage (AMD) treatment plants using Transfer Learning (TL). The model was trained on the large Central Rand (CR) water quality dataset and was used to predict and impute the sulphate levels in the scanty East Rand (ER) and West Rand (WR) datasets which would not have been sufficient to train ML models from scratch. TL was successfully used to overcome this barrier and rapidly predicted sulphate levels in the East Rand and West Rand plants using the pre-trained model and achieved a high level of accuracy (Mean Squared Error:0.00124, Mean Absolute Error:0.0290 and R2:0.963) for the East Rand plant when comparing the predicted and true sulphate values. No true sulphate values existed for the West Rand plant; however, TL was successful in imputing these missing values and rapidly completed the West Rand dataset by providing the historic sulphate levels. This was possible due to the high degree of similarity between all domains (treatment plants) since they had similar geographic locations, the same treatment process, possessed the same important features and had the same relationships between variables. TL was successful in providing three accurate datasets for AMD sulphate levels, an important accomplishment towards having reliable data for use in design of experiments aimed at recovering valuable resources such as elemental sulphur, gypsum and important metals from AMD.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
应用迁移机器学习来预测和估算不同酸性矿井排水处理厂中缺失的硫酸盐含量
利用迁移学习(TL)技术,将预先训练好的堆叠集合机器学习回归器用于预测另外两家酸性矿井排水(AMD)处理厂的硫酸盐含量。该模型在大型中央兰德(CR)水质数据集上进行了训练,并用于预测和估算稀少的东兰德(ER)和西兰德(WR)数据集中的硫酸盐含量。TL 成功克服了这一障碍,使用预先训练好的模型快速预测了东兰德和西兰德工厂的硫酸盐含量,并在比较东兰德工厂的预测值和真实硫酸盐值时达到了很高的准确度(平均平方误差:0.00124,平均绝对误差:0.0290,R2:0.963)。West Rand 工厂没有真实的硫酸盐值;然而,TL 成功地填补了这些缺失值,并通过提供历史硫酸盐水平迅速完成了 West Rand 数据集。之所以能够做到这一点,是因为所有域(处理厂)之间具有高度的相似性,因为它们具有相似的地理位置、相同的处理工艺、相同的重要特征以及变量之间的相同关系。TL 成功地为 AMD 的硫酸盐水平提供了三个准确的数据集,这是一项重要的成就,可为旨在从 AMD 中回收元素硫、石膏和重要金属等宝贵资源的实验设计提供可靠的数据。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Harnessing microbial synergy: A comprehensive evaluation of consortia-mediated bioremediation strategies for petroleum refinery wastewater treatment Resources optimization using Pareto analysis for sea water desalination plants The incorporation of activated carbon as a substrate in a constructed wetland. A review Long-term AI prediction of ammonium levels in rivers using transformer and ensemble models Groundwater salinization challenges in agriculturally valuable low-lying North Sea region: A review
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1