{"title":"A novel algal bloom risk assessment framework by integrating environmental factors based on explainable machine learning","authors":"Lingfang Gao , Yulin Shangguan , Zhong Sun , Qiaohui Shen , Lianqing Zhou","doi":"10.1016/j.ecoinf.2025.103098","DOIUrl":null,"url":null,"abstract":"<div><div>In recent years, the algal blooms have intensified, posing mounting threats to aquatic ecosystems and water security. However, most previous studies merely detected algal blooms according to the characteristics of the water body at the time of algal bloom occurrence, overlooking the influence of environmental factors on algae proliferation. This study proposes a novel algal bloom risk assessment framework that integrates explainable machine learning with multivariate environmental analysis. Specifically, the Shapley Additive Explanations (SHAP) effect values were used to separately explore the relationship between chlorophyll <em>a</em> (Chla) and six factors, namely the total phosphorus (TP), total nitrogen (TN), TN: TP ratio (RNP), dissolved oxygen (DO), temperature, and precipitation, across riverine and lacustrine ecosystems. Results identified TP and temperature as dominant regulators, accounting for the first two in lakes and the second and third positions in rivers. The thermal effect varies between different ecosystems: Chla decreases after reaching a peak in lakes, while Chla increases linearly with temperature in rivers. In addition, DO played an important role in rivers. The Chla concentration was estimated using Random Forest and thresholds for bloom identification were adjusted (25 μg/L for lakes and 40 μg/L for rivers), reflecting hydrodynamic and optical disparities. The risk framework was applied to the Qiantang River Basin (2020−2022), and results showed low annual risk (mean Algal Bloom Risk Index <0.5) but identified spring susceptibility related to nutrient resuspension and thermal stratification. By quantifying the impact of environmental factors on algal blooms, this study improves algal bloom risk assessment in rivers and lakes, which advances proactive bloom management in mixed river-lake basins under intensifying anthropogenic and climatic pressures.</div></div>","PeriodicalId":51024,"journal":{"name":"Ecological Informatics","volume":"87 ","pages":"Article 103098"},"PeriodicalIF":5.8000,"publicationDate":"2025-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ecological Informatics","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1574954125001074","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ECOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
In recent years, the algal blooms have intensified, posing mounting threats to aquatic ecosystems and water security. However, most previous studies merely detected algal blooms according to the characteristics of the water body at the time of algal bloom occurrence, overlooking the influence of environmental factors on algae proliferation. This study proposes a novel algal bloom risk assessment framework that integrates explainable machine learning with multivariate environmental analysis. Specifically, the Shapley Additive Explanations (SHAP) effect values were used to separately explore the relationship between chlorophyll a (Chla) and six factors, namely the total phosphorus (TP), total nitrogen (TN), TN: TP ratio (RNP), dissolved oxygen (DO), temperature, and precipitation, across riverine and lacustrine ecosystems. Results identified TP and temperature as dominant regulators, accounting for the first two in lakes and the second and third positions in rivers. The thermal effect varies between different ecosystems: Chla decreases after reaching a peak in lakes, while Chla increases linearly with temperature in rivers. In addition, DO played an important role in rivers. The Chla concentration was estimated using Random Forest and thresholds for bloom identification were adjusted (25 μg/L for lakes and 40 μg/L for rivers), reflecting hydrodynamic and optical disparities. The risk framework was applied to the Qiantang River Basin (2020−2022), and results showed low annual risk (mean Algal Bloom Risk Index <0.5) but identified spring susceptibility related to nutrient resuspension and thermal stratification. By quantifying the impact of environmental factors on algal blooms, this study improves algal bloom risk assessment in rivers and lakes, which advances proactive bloom management in mixed river-lake basins under intensifying anthropogenic and climatic pressures.
期刊介绍:
The journal Ecological Informatics is devoted to the publication of high quality, peer-reviewed articles on all aspects of computational ecology, data science and biogeography. The scope of the journal takes into account the data-intensive nature of ecology, the growing capacity of information technology to access, harness and leverage complex data as well as the critical need for informing sustainable management in view of global environmental and climate change.
The nature of the journal is interdisciplinary at the crossover between ecology and informatics. It focuses on novel concepts and techniques for image- and genome-based monitoring and interpretation, sensor- and multimedia-based data acquisition, internet-based data archiving and sharing, data assimilation, modelling and prediction of ecological data.