{"title":"Long-term AI prediction of ammonium levels in rivers using transformer and ensemble models","authors":"Ali J. Ali, Ashraf A. Ahmed","doi":"10.1016/j.clwat.2024.100051","DOIUrl":null,"url":null,"abstract":"<div><div>This study provides a cutting-edge machine learning approach to forecast ammonium (<span><math><msubsup><mrow><mi>NH</mi></mrow><mrow><mn>4</mn></mrow><mrow><mo>+</mo></mrow></msubsup></math></span>) levels in River Lee London. Ammonium concentrations were predicted over several time intervals using a complete dataset that includes temperature, turbidity, chlorophyll, dissolved oxygen, conductivity, and pH. Our technique captures the intricate connections between environmental conditions and ammonium concentrations using developed algorithms, including Temporal Fusion Transformer (TFT), Random Forest (RF) and Extreme Gradient Boosting (XGBoost) levels versus the important factors, considerably improving prediction accuracy. The novel aspect of this study is the utilisation of the TFT model for multi-horizon forecasting, which offers high accuracy and interpretability in hydrological predictions by combining convolutional components with an attention mechanism. The study also demonstrates the effectiveness of the TFT model in capturing short-term fluctuations while retaining accuracy over long time periods, which is a major difficulty in environmental modelling. The models used, have exceptional forecasting skills, predicting 150, 200, 365, 730, and 1095 days based on daily average and 12, 24 and 30 months based on monthly average. This dual-scale model combines flexibility and resilience, making it an effective tool for forecasting both short- and long-term environmental changes. The RF model excelled in long-term forecasts, sustaining high R-squared (R²) (0.97) values and low root mean square error (RMSE) (0.18), and the second best one was the XGBoost with optimiser with R<sup>2</sup> of (0.92) and RMSE of (0.25) with forecasting 1095 days. The results also found that whilst the TFT captured the fluctuations in the short-term, it struggled with the longer-term predictions due to data granularity. The XGBoost model did remarkably well in monthly forecasts up to 12 months, maintaining low RSME. The findings also highlight the necessity of proactive water management techniques to reduce the risk of potential ecological effects, including hypoxia and oxygen depletion. The findings support resource managers in addressing prospective ammonium toxicity concerns such as oxygen depletion and ecological stress.</div></div>","PeriodicalId":100257,"journal":{"name":"Cleaner Water","volume":"2 ","pages":"Article 100051"},"PeriodicalIF":0.0000,"publicationDate":"2024-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cleaner Water","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2950263224000498","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This study provides a cutting-edge machine learning approach to forecast ammonium () levels in River Lee London. Ammonium concentrations were predicted over several time intervals using a complete dataset that includes temperature, turbidity, chlorophyll, dissolved oxygen, conductivity, and pH. Our technique captures the intricate connections between environmental conditions and ammonium concentrations using developed algorithms, including Temporal Fusion Transformer (TFT), Random Forest (RF) and Extreme Gradient Boosting (XGBoost) levels versus the important factors, considerably improving prediction accuracy. The novel aspect of this study is the utilisation of the TFT model for multi-horizon forecasting, which offers high accuracy and interpretability in hydrological predictions by combining convolutional components with an attention mechanism. The study also demonstrates the effectiveness of the TFT model in capturing short-term fluctuations while retaining accuracy over long time periods, which is a major difficulty in environmental modelling. The models used, have exceptional forecasting skills, predicting 150, 200, 365, 730, and 1095 days based on daily average and 12, 24 and 30 months based on monthly average. This dual-scale model combines flexibility and resilience, making it an effective tool for forecasting both short- and long-term environmental changes. The RF model excelled in long-term forecasts, sustaining high R-squared (R²) (0.97) values and low root mean square error (RMSE) (0.18), and the second best one was the XGBoost with optimiser with R2 of (0.92) and RMSE of (0.25) with forecasting 1095 days. The results also found that whilst the TFT captured the fluctuations in the short-term, it struggled with the longer-term predictions due to data granularity. The XGBoost model did remarkably well in monthly forecasts up to 12 months, maintaining low RSME. The findings also highlight the necessity of proactive water management techniques to reduce the risk of potential ecological effects, including hypoxia and oxygen depletion. The findings support resource managers in addressing prospective ammonium toxicity concerns such as oxygen depletion and ecological stress.