Freshwater ecosystems play a crucial role in maintaining ecological balance and supporting biodiversity. Assessing water quality and identifying pollution factors affected by seasonal changes is of utmost importance. The traditional approach for identifying pollution sources and seasonal water quality assessment is often labor-intensive, time-consuming, and complex. In response, our study introduces a hybrid receptor, multivariate statistical and machine learning (ML) based framework as an alternative. Positive matrix factorization (PMF) and receptor model were used to identify key pollutant sources, while Principal Component Analysis (PCA) was used to corroborate pollutant sources. Additionally, various supervised ML classification algorithms are used to predict and identify pollution sources on full physicochemical datasets and reduced in situ parameter scenarios. ML model stability and generalization capacity were evaluated using learning curves, cross-validation method. The findings indicate significant seasonal variability in water quality index, ranging from 38.5 (poor) to 78.6 (good). PMF and PCA revealed three dominant pollution factors, such as biogeochemical processes, sediment resuspension/nutrient influx, and anthropogenic pollution, affecting the seasonal water quality by 15–30%, 25–45%, and 40–70%, respectively. Model performance indicators using confusion matrices, ROC-AUC, and other evaluation metrics, for full dataset identified XGBoost as the best model, achieving the highest accuracy of 96.3%, followed by CatBoosting (93.6%), Random Forest (93.6%), Support Vector Classifier (91.8%), K-Nearest Neighbors (87.2%) and Decision Tree (84.3%). However, with the in situ data the ML models show stable and consistent classification accuracy in the range of approximately 70–81%. Overall, the modelling framework provides an effective approach for capturing seasonal variability and pollution source attribution, contributing to improved ecological understanding and predictive management of freshwater system.
扫码关注我们
求助内容:
应助结果提醒方式:
