Long-term AI prediction of ammonium levels in rivers using transformer and ensemble models

Ali J. Ali, Ashraf A. Ahmed
{"title":"Long-term AI prediction of ammonium levels in rivers using transformer and ensemble models","authors":"Ali J. Ali,&nbsp;Ashraf A. Ahmed","doi":"10.1016/j.clwat.2024.100051","DOIUrl":null,"url":null,"abstract":"<div><div>This study provides a cutting-edge machine learning approach to forecast ammonium (<span><math><msubsup><mrow><mi>NH</mi></mrow><mrow><mn>4</mn></mrow><mrow><mo>+</mo></mrow></msubsup></math></span>) levels in River Lee London. Ammonium concentrations were predicted over several time intervals using a complete dataset that includes temperature, turbidity, chlorophyll, dissolved oxygen, conductivity, and pH. Our technique captures the intricate connections between environmental conditions and ammonium concentrations using developed algorithms, including Temporal Fusion Transformer (TFT), Random Forest (RF) and Extreme Gradient Boosting (XGBoost) levels versus the important factors, considerably improving prediction accuracy. The novel aspect of this study is the utilisation of the TFT model for multi-horizon forecasting, which offers high accuracy and interpretability in hydrological predictions by combining convolutional components with an attention mechanism. The study also demonstrates the effectiveness of the TFT model in capturing short-term fluctuations while retaining accuracy over long time periods, which is a major difficulty in environmental modelling. The models used, have exceptional forecasting skills, predicting 150, 200, 365, 730, and 1095 days based on daily average and 12, 24 and 30 months based on monthly average. This dual-scale model combines flexibility and resilience, making it an effective tool for forecasting both short- and long-term environmental changes. The RF model excelled in long-term forecasts, sustaining high R-squared (R²) (0.97) values and low root mean square error (RMSE) (0.18), and the second best one was the XGBoost with optimiser with R<sup>2</sup> of (0.92) and RMSE of (0.25) with forecasting 1095 days. The results also found that whilst the TFT captured the fluctuations in the short-term, it struggled with the longer-term predictions due to data granularity. The XGBoost model did remarkably well in monthly forecasts up to 12 months, maintaining low RSME. The findings also highlight the necessity of proactive water management techniques to reduce the risk of potential ecological effects, including hypoxia and oxygen depletion. The findings support resource managers in addressing prospective ammonium toxicity concerns such as oxygen depletion and ecological stress.</div></div>","PeriodicalId":100257,"journal":{"name":"Cleaner Water","volume":"2 ","pages":"Article 100051"},"PeriodicalIF":0.0000,"publicationDate":"2024-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cleaner Water","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2950263224000498","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

This study provides a cutting-edge machine learning approach to forecast ammonium (NH4+) levels in River Lee London. Ammonium concentrations were predicted over several time intervals using a complete dataset that includes temperature, turbidity, chlorophyll, dissolved oxygen, conductivity, and pH. Our technique captures the intricate connections between environmental conditions and ammonium concentrations using developed algorithms, including Temporal Fusion Transformer (TFT), Random Forest (RF) and Extreme Gradient Boosting (XGBoost) levels versus the important factors, considerably improving prediction accuracy. The novel aspect of this study is the utilisation of the TFT model for multi-horizon forecasting, which offers high accuracy and interpretability in hydrological predictions by combining convolutional components with an attention mechanism. The study also demonstrates the effectiveness of the TFT model in capturing short-term fluctuations while retaining accuracy over long time periods, which is a major difficulty in environmental modelling. The models used, have exceptional forecasting skills, predicting 150, 200, 365, 730, and 1095 days based on daily average and 12, 24 and 30 months based on monthly average. This dual-scale model combines flexibility and resilience, making it an effective tool for forecasting both short- and long-term environmental changes. The RF model excelled in long-term forecasts, sustaining high R-squared (R²) (0.97) values and low root mean square error (RMSE) (0.18), and the second best one was the XGBoost with optimiser with R2 of (0.92) and RMSE of (0.25) with forecasting 1095 days. The results also found that whilst the TFT captured the fluctuations in the short-term, it struggled with the longer-term predictions due to data granularity. The XGBoost model did remarkably well in monthly forecasts up to 12 months, maintaining low RSME. The findings also highlight the necessity of proactive water management techniques to reduce the risk of potential ecological effects, including hypoxia and oxygen depletion. The findings support resource managers in addressing prospective ammonium toxicity concerns such as oxygen depletion and ecological stress.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用变压器和集合模型对河流中的氨含量进行长期 AI 预测
本研究提供了一种先进的机器学习方法,用于预测伦敦利河的氨(NH4+)含量。利用包括温度、浊度、叶绿素、溶解氧、电导率和 pH 值在内的完整数据集预测了多个时间间隔内的氨浓度。我们的技术利用开发的算法(包括时态融合变换器 (TFT)、随机森林 (RF) 和极梯度提升 (XGBoost) 等)捕捉环境条件与氨浓度之间错综复杂的联系,与重要因素进行对比,从而大大提高了预测的准确性。本研究的新颖之处在于利用 TFT 模型进行多视距预测,该模型通过将卷积成分与注意力机制相结合,为水文预测提供了高准确性和可解释性。这项研究还证明了 TFT 模型在捕捉短期波动的同时保持长期准确性方面的有效性,而这正是环境建模中的一大难题。所使用的模型具有卓越的预测能力,根据日平均值可预测 150 天、200 天、365 天、730 天和 1095 天,根据月平均值可预测 12 个月、24 个月和 30 个月。这种双尺度模型兼具灵活性和弹性,是预测短期和长期环境变化的有效工具。射频模型在长期预测方面表现出色,保持了较高的 R 平方(R²)(0.97)值和较低的均方根误差(RMSE)(0.18),其次是带有优化器的 XGBoost 模型,其 R2 值为 0.92,均方根误差为 0.25,预测天数为 1095 天。结果还发现,虽然 TFT 模型捕捉到了短期波动,但由于数据粒度的原因,它在长期预测方面显得力不从心。XGBoost 模型在长达 12 个月的月度预测中表现出色,保持了较低的 RSME。研究结果还强调,必须采用积极的水资源管理技术来降低潜在的生态影响风险,包括缺氧和氧气耗尽。研究结果有助于资源管理人员解决氨毒性的潜在问题,如氧气耗竭和生态压力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
The incorporation of activated carbon as a substrate in a constructed wetland. A review Long-term AI prediction of ammonium levels in rivers using transformer and ensemble models Groundwater salinization challenges in agriculturally valuable low-lying North Sea region: A review Sequential novel use of Moringa oleifera Lam., biochar, and sand to remove turbidity, E. coli, and heavy metals from drinking water Waste biomass-based graphene oxide decorated with ternary metal oxide (MnO-NiO-ZnO) composite for adsorption of methylene blue dye
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1