Domain adaptation through active learning strategies for anomaly classification in wastewater treatment plants.

IF 2.5 4区 环境科学与生态学 Q3 ENGINEERING, ENVIRONMENTAL Water Science and Technology Pub Date : 2024-12-01 Epub Date: 2024-11-27 DOI:10.2166/wst.2024.387
Francesca Bellamoli, Marco Vian, Mattia Di Iorio, Farid Melgani
{"title":"Domain adaptation through active learning strategies for anomaly classification in wastewater treatment plants.","authors":"Francesca Bellamoli, Marco Vian, Mattia Di Iorio, Farid Melgani","doi":"10.2166/wst.2024.387","DOIUrl":null,"url":null,"abstract":"<p><p>The increasing use of intermittent aeration controllers in wastewater treatment plants (WWTPs) aims to reduce aeration costs via continuous ammonia and oxygen measurements but faces challenges in detecting sensor and process anomalies. Applying machine learning to this unbalanced, multivariate, multiclass classification challenge requires much data, difficult to obtain from a new plant. This study develops a machine learning algorithm to identify anomalies in intermittent aeration WWTPs, adaptable to new plants with limited data. Utilizing active learning, the method iteratively selects samples from the target domain to fine-tune a gradient-boosting model initially trained on data from 17 plants. Three sampling strategies were tested, with low probability and high entropy sampling proving effective in early adaptation, achieving an F2-score close to the optimal with minimal sample use. The objective is to deploy these models as decision support systems for WWTP management, providing a strategy for efficient model adaptation to new plants, and optimizing labeling efforts<b>.</b></p>","PeriodicalId":23653,"journal":{"name":"Water Science and Technology","volume":"90 11","pages":"3123-3138"},"PeriodicalIF":2.5000,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Water Science and Technology","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.2166/wst.2024.387","RegionNum":4,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/11/27 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"ENGINEERING, ENVIRONMENTAL","Score":null,"Total":0}
引用次数: 0

Abstract

The increasing use of intermittent aeration controllers in wastewater treatment plants (WWTPs) aims to reduce aeration costs via continuous ammonia and oxygen measurements but faces challenges in detecting sensor and process anomalies. Applying machine learning to this unbalanced, multivariate, multiclass classification challenge requires much data, difficult to obtain from a new plant. This study develops a machine learning algorithm to identify anomalies in intermittent aeration WWTPs, adaptable to new plants with limited data. Utilizing active learning, the method iteratively selects samples from the target domain to fine-tune a gradient-boosting model initially trained on data from 17 plants. Three sampling strategies were tested, with low probability and high entropy sampling proving effective in early adaptation, achieving an F2-score close to the optimal with minimal sample use. The objective is to deploy these models as decision support systems for WWTP management, providing a strategy for efficient model adaptation to new plants, and optimizing labeling efforts.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
通过主动学习策略对污水处理厂的异常分类进行领域适应。
污水处理厂(WWTP)越来越多地使用间歇曝气控制器,旨在通过连续测量氨氮和氧气来降低曝气成本,但在检测传感器和过程异常方面却面临着挑战。将机器学习应用于这一不平衡、多变量、多类分类挑战需要大量数据,而这些数据很难从新工厂获得。本研究开发了一种机器学习算法,用于识别间歇曝气式污水处理厂的异常情况,适用于数据有限的新工厂。该方法利用主动学习,从目标域中迭代选择样本,对梯度提升模型进行微调,该模型最初是在 17 家工厂的数据基础上进行训练的。测试了三种取样策略,其中低概率和高熵取样在早期适应中被证明是有效的,在使用最少样本的情况下实现了接近最优的 F2 分数。我们的目标是将这些模型作为污水处理厂管理的决策支持系统,为新工厂提供高效的模型适应策略,并优化标签工作。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Water Science and Technology
Water Science and Technology 环境科学-工程:环境
CiteScore
4.90
自引率
3.70%
发文量
366
审稿时长
4.4 months
期刊介绍: Water Science and Technology publishes peer-reviewed papers on all aspects of the science and technology of water and wastewater. Papers are selected by a rigorous peer review procedure with the aim of rapid and wide dissemination of research results, development and application of new techniques, and related managerial and policy issues. Scientists, engineers, consultants, managers and policy-makers will find this journal essential as a permanent record of progress of research activities and their practical applications.
期刊最新文献
Factorial experiment to identify two-way interactions between temperature, harvesting period, hydraulic retention time, and light intensity that influence the biomass productivity and phosphorus removal efficiency of a microalgae-bacteria biofilm. Hybrid modelling framework for ozonation and biological activated carbon in tertiary wastewater treatment. Impact of chemical oxygen demand to nitrogen ratio on ANAMMOX bacterial growth in an up-flow anaerobic sludge blanket reactor. Reduction of blue and total water footprints per unit biomass yield of silage maize with grey water footprint input in subsurface drip irrigation. Simulation-based process optimization of full-scale advanced wastewater treatment systems using powdered activated carbon.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1