{"title":"Prediction of high turbidity in rivers using LSTM algorithm","authors":"Jungsu Park, Hyunho Lee","doi":"10.11001/jksww.2020.34.1.035","DOIUrl":null,"url":null,"abstract":"Turbidity has various effects on the water quality and ecosystem of a river. High turbidity during floods increases the operation cost of a drinking water supply system. Thus, the management of turbidity is essential for providing safe water to the public. There have been various efforts to estimate turbidity in river systems for proper management and early warning of high turbidity in the water supply process. Advanced data analysis technology using machine learning has been increasingly used in water quality management processes. Artificial neural networks(ANNs) is one of the first algorithms applied, where the overfitting of a model to observed data and vanishing gradient in the backpropagation process limit the wide application of ANNs in practice. In recent years, deep learning, which overcomes the limitations of ANNs, has been applied in water quality management. LSTM(Long-Short Term Memory) is one of novel deep learning algorithms that is widely used in the analysis of time series data. In this study, LSTM is used for the prediction of high turbidity(>30 NTU) in a river from the relationship of turbidity to discharge, which enables early warning of high turbidity in a drinking water supply system. The model showed 0.98, 0.99, 0.98 and 0.99 for precision, recall, F1-score and accuracy respectively, for the prediction of high turbidity in a river with 2 hour frequency data. The sensitivity of the model to the observation intervals of data is also compared with time periods of 2 hour, 8 hour, 1 day and 2 days. The model shows higher precision with shorter observation intervals, which underscores the importance of collecting high frequency data for better management of water resources in the future.","PeriodicalId":16258,"journal":{"name":"Journal of Korean Society of Water and Wastewater","volume":"249 1","pages":"35-43"},"PeriodicalIF":0.0000,"publicationDate":"2020-02-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Korean Society of Water and Wastewater","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.11001/jksww.2020.34.1.035","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Turbidity has various effects on the water quality and ecosystem of a river. High turbidity during floods increases the operation cost of a drinking water supply system. Thus, the management of turbidity is essential for providing safe water to the public. There have been various efforts to estimate turbidity in river systems for proper management and early warning of high turbidity in the water supply process. Advanced data analysis technology using machine learning has been increasingly used in water quality management processes. Artificial neural networks(ANNs) is one of the first algorithms applied, where the overfitting of a model to observed data and vanishing gradient in the backpropagation process limit the wide application of ANNs in practice. In recent years, deep learning, which overcomes the limitations of ANNs, has been applied in water quality management. LSTM(Long-Short Term Memory) is one of novel deep learning algorithms that is widely used in the analysis of time series data. In this study, LSTM is used for the prediction of high turbidity(>30 NTU) in a river from the relationship of turbidity to discharge, which enables early warning of high turbidity in a drinking water supply system. The model showed 0.98, 0.99, 0.98 and 0.99 for precision, recall, F1-score and accuracy respectively, for the prediction of high turbidity in a river with 2 hour frequency data. The sensitivity of the model to the observation intervals of data is also compared with time periods of 2 hour, 8 hour, 1 day and 2 days. The model shows higher precision with shorter observation intervals, which underscores the importance of collecting high frequency data for better management of water resources in the future.
浑浊度对河流水质和生态系统有多种影响。洪水期间的高浊度增加了饮用水供应系统的运行成本。因此,浊度管理对于向公众提供安全用水至关重要。为了在供水过程中进行适当的管理和高浑浊度的早期预警,已经有各种努力来估计河流系统的浑浊度。使用机器学习的先进数据分析技术已越来越多地用于水质管理过程。人工神经网络是最早应用的算法之一,但模型对观测数据的过拟合和反向传播过程中的梯度消失等问题限制了人工神经网络在实际中的广泛应用。近年来,深度学习克服了人工神经网络的局限性,在水质管理中得到了应用。LSTM(Long-Short Term Memory)是一种新型的深度学习算法,广泛应用于时间序列数据的分析。本研究从浑浊度与流量的关系出发,利用LSTM对某河流的高浑浊度(>30 NTU)进行预测,实现了饮用水供应系统高浑浊度的预警。对于2小时频率数据的高浊度预测,模型的精密度、召回率、f1评分和准确度分别为0.98、0.99、0.98和0.99。比较了2小时、8小时、1天和2天时间段模型对数据观测间隔的敏感性。该模型以更短的观测间隔显示出更高的精度,这强调了收集高频数据对未来更好地管理水资源的重要性。