{"title":"Low-cost sensor-based algal bloom labeling: a comparative study of SVM and logic methods","authors":"Jai-yeop Lee","doi":"10.1007/s10661-025-13815-y","DOIUrl":null,"url":null,"abstract":"<p>This study explores a low-cost sensor system for real-time algae bloom detection and water management. Harmful algal blooms (HABs) threaten water quality, ecosystems, and public health. Traditional detection methods, like satellite imagery and unmanned aerial vehicle (UAV), are expensive and not always suited for real-time monitoring. The proposed system uses sunlight and illuminance sensors to predict algae blooms and water level fluctuations. Data from these sensors are analyzed using support vector machines (SVM) and logical algorithms, with labeling based on sensor readings (e.g., “algae”, “sunny”, “shade”, “aqua”). A multiple linear regression (MLR) model is also developed to predict chlorophyll-a (Chl-a) concentrations. Both SVM and logical algorithms proved effective. The classification using SVM with four sensor values achieved an accuracy of 92.6%, while applying principal component analysis (PCA) before SVM classification resulted in 91.0% accuracy. In contrast, applying a sequential logical sequence to the boundary conditions of a single SVM model improved accuracy to 95.1%, and incorporating PCA-transformed SVM boundary conditions achieved 100.0% accuracy. This surpassed the performance of nonlinear decision models such as random forest and gradient boosting, which achieved 99.2% accuracy. The MLR model successfully predicted Chl-a levels with a 14.3% error rate for values above 5 mg/L. The developed system is an efficient alternative to traditional methods, enhancing real-time monitoring in water quality management.</p>","PeriodicalId":544,"journal":{"name":"Environmental Monitoring and Assessment","volume":"197 4","pages":""},"PeriodicalIF":3.0000,"publicationDate":"2025-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Monitoring and Assessment","FirstCategoryId":"93","ListUrlMain":"https://link.springer.com/article/10.1007/s10661-025-13815-y","RegionNum":4,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
This study explores a low-cost sensor system for real-time algae bloom detection and water management. Harmful algal blooms (HABs) threaten water quality, ecosystems, and public health. Traditional detection methods, like satellite imagery and unmanned aerial vehicle (UAV), are expensive and not always suited for real-time monitoring. The proposed system uses sunlight and illuminance sensors to predict algae blooms and water level fluctuations. Data from these sensors are analyzed using support vector machines (SVM) and logical algorithms, with labeling based on sensor readings (e.g., “algae”, “sunny”, “shade”, “aqua”). A multiple linear regression (MLR) model is also developed to predict chlorophyll-a (Chl-a) concentrations. Both SVM and logical algorithms proved effective. The classification using SVM with four sensor values achieved an accuracy of 92.6%, while applying principal component analysis (PCA) before SVM classification resulted in 91.0% accuracy. In contrast, applying a sequential logical sequence to the boundary conditions of a single SVM model improved accuracy to 95.1%, and incorporating PCA-transformed SVM boundary conditions achieved 100.0% accuracy. This surpassed the performance of nonlinear decision models such as random forest and gradient boosting, which achieved 99.2% accuracy. The MLR model successfully predicted Chl-a levels with a 14.3% error rate for values above 5 mg/L. The developed system is an efficient alternative to traditional methods, enhancing real-time monitoring in water quality management.
本研究探索了一种低成本的实时藻华检测和水管理传感器系统。有害藻华(HABs)威胁着水质、生态系统和公众健康。传统的检测方法,如卫星图像和无人机(UAV),价格昂贵,并不总是适合实时监控。该系统使用阳光和照度传感器来预测藻类繁殖和水位波动。来自这些传感器的数据使用支持向量机(SVM)和逻辑算法进行分析,并基于传感器读数进行标记(例如,“藻类”,“阳光明媚”,“阴影”,“水”)。建立了叶绿素- A (Chl-a)浓度的多元线性回归(MLR)模型。结果表明,支持向量机和逻辑算法都是有效的。4个传感器值的SVM分类准确率为92.6%,在SVM分类前应用主成分分析(PCA)分类准确率为91.0%。相比之下,将顺序逻辑序列应用于单个SVM模型的边界条件可将准确率提高到95.1%,将pca变换的SVM边界条件应用于SVM模型可将准确率提高到100.0%。这超过了非线性决策模型的性能,如随机森林和梯度增强,达到99.2%的准确率。MLR模型成功地预测了Chl-a水平,对于高于5 mg/L的值,错误率为14.3%。该系统是传统方法的有效替代,增强了水质管理的实时监测。
期刊介绍:
Environmental Monitoring and Assessment emphasizes technical developments and data arising from environmental monitoring and assessment, the use of scientific principles in the design of monitoring systems at the local, regional and global scales, and the use of monitoring data in assessing the consequences of natural resource management actions and pollution risks to man and the environment.