{"title":"Enhancing short-term algal bloom forecasting through an anti-mimicking hybrid deep learning method","authors":"Yaqin Zhang, Yichong Wang, Jixin Chen, Lizhen Lin, Wupeng Xiao, Bangqin Huang","doi":"10.1016/j.jenvman.2025.124832","DOIUrl":null,"url":null,"abstract":"<div><div>Accurately predicting algal blooms remains a critical challenge due to their dynamic and non-stationary nature, compounded by high-frequency fluctuations and noise in monitoring data. Additionally, a common issue in time-series forecasting is data replication, where models tend to replicate historical patterns rather than capturing true future variations, leading to inaccurate forecasts during abrupt changes. To address these challenges, we developed a hybrid deep learning model (TAB) that integrates a Temporal Convolutional Network (TCN), an attention mechanism, and Bidirectional Long Short-Term Memory (BiLSTM) network. Furthermore, we employed a novel distortion loss function—DIstortion Loss including shApe and TimE (DILATE)—which incorporates both shape and temporal losses to enhance the model's predictive robustness. Using <em>in situ</em> algal bloom data from Jiangdong Reservoir, Jiulong River, China, the TAB model accurately forecasted hourly chlorophyll-<em>a</em> dynamics for the subsequent 24 h, achieving an <em>R</em><sup>2</sup> of 0.74, a mean absolute percentage error of 21.22%, a root mean square error of 4.12 μg L<sup>−1</sup>, and a Kling-Gupta efficiency of 0.78. The DILATE loss function effectively mitigated data replication and improved low-peak chlorophyll-<em>a</em> prediction accuracy by 10.57%, compared to the traditional mean squared error (MSE) loss, particularly in the concentration range of 0–20 μg L<sup>−1</sup>, which represents the most frequently observed levels in the dataset. SHapley Additive exPlanations (SHAP) analysis identified key environmental drivers, improving model interpretability and computational efficiency. This study demonstrates the potential of advanced deep learning models in algal bloom prediction and provides insights for further optimization in forecasting performance and broader applications.</div></div>","PeriodicalId":356,"journal":{"name":"Journal of Environmental Management","volume":"379 ","pages":"Article 124832"},"PeriodicalIF":8.0000,"publicationDate":"2025-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Environmental Management","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0301479725008084","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Accurately predicting algal blooms remains a critical challenge due to their dynamic and non-stationary nature, compounded by high-frequency fluctuations and noise in monitoring data. Additionally, a common issue in time-series forecasting is data replication, where models tend to replicate historical patterns rather than capturing true future variations, leading to inaccurate forecasts during abrupt changes. To address these challenges, we developed a hybrid deep learning model (TAB) that integrates a Temporal Convolutional Network (TCN), an attention mechanism, and Bidirectional Long Short-Term Memory (BiLSTM) network. Furthermore, we employed a novel distortion loss function—DIstortion Loss including shApe and TimE (DILATE)—which incorporates both shape and temporal losses to enhance the model's predictive robustness. Using in situ algal bloom data from Jiangdong Reservoir, Jiulong River, China, the TAB model accurately forecasted hourly chlorophyll-a dynamics for the subsequent 24 h, achieving an R2 of 0.74, a mean absolute percentage error of 21.22%, a root mean square error of 4.12 μg L−1, and a Kling-Gupta efficiency of 0.78. The DILATE loss function effectively mitigated data replication and improved low-peak chlorophyll-a prediction accuracy by 10.57%, compared to the traditional mean squared error (MSE) loss, particularly in the concentration range of 0–20 μg L−1, which represents the most frequently observed levels in the dataset. SHapley Additive exPlanations (SHAP) analysis identified key environmental drivers, improving model interpretability and computational efficiency. This study demonstrates the potential of advanced deep learning models in algal bloom prediction and provides insights for further optimization in forecasting performance and broader applications.
期刊介绍:
The Journal of Environmental Management is a journal for the publication of peer reviewed, original research for all aspects of management and the managed use of the environment, both natural and man-made.Critical review articles are also welcome; submission of these is strongly encouraged.