A machine learning framework to predict PPCP removal through various wastewater and water reuse treatment trains
Joung Min Choi, Vineeth Manthapuri, Ishi Keenum, Connor L Brown, Kang Xia, Chaoqi Chen, Peter J Vikesland, Matthew F Blair, Charles Bott, Amy Pruden, Liqing Zhang
Environmental Science: Water Research & Technology. Published 2024-12-19. DOI: 10.1039/d4ew00892h (https://doi.org/10.1039/d4ew00892h)
Full text (PMC): https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11694563/pdf/
Abstract
The persistence of pharmaceuticals and personal care products (PPCPs) through wastewater treatment and resulting contamination of aquatic environments and drinking water is a pervasive concern, necessitating means of identifying effective treatment strategies for PPCP removal. In this study, we employed machine learning (ML) models to classify 149 PPCPs based on their chemical properties and predict their removal via wastewater and water reuse treatment trains. We evaluated two distinct clustering approaches: C1 (clustering based on the most efficient individual treatment process) and C2 (clustering based on the removal pattern of PPCPs across treatments). For this, we grouped PPCPs based on their relative abundances by comparing peak areas measured via non-target profiling using ultra-performance liquid chromatography-tandem mass spectrometry through two field-scale treatment trains. The resulting clusters were then classified using Abraham descriptors and log Kow as input to the three ML models: support vector machines (SVM), logistic regression, and random forest (RF). SVM achieved the highest accuracy, 79.1%, in predicting PPCP removal. Notably, a 58-75% overlap was observed between the ML clusters of PPCPs and the Abraham descriptor and log Kow clusters of PPCPs, indicating the potential of using Abraham descriptors and log Kow to predict the fate of PPCPs through various treatment trains. Given the myriad of PPCPs of concern, this approach can supplement information gathered from experimental testing to help optimize the design of wastewater and water reuse treatment trains for PPCP removal.
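As a rough illustration of the C1 grouping and the cluster-overlap comparison described in the abstract, the following is a minimal sketch and not the authors' implementation. It rests on several assumptions: the stage and process names, the peak-area values, and the descriptor-based groups are all invented; C1 membership is taken to be the process with the highest fractional drop in peak area; and overlap is taken to be the share of each removal-based cluster that falls into its best-matching descriptor-based cluster. The study may define any of these differently.

```python
from collections import Counter

# Hypothetical sampling points along one treatment train, and the process
# between each consecutive pair. All names and values here are invented.
STAGES = ["influent", "after_biological", "after_ozone", "after_GAC"]
PROCESSES = ["biological", "ozone", "GAC"]  # process i sits between stage i and i+1

# Hypothetical peak areas (arbitrary units), one value per stage in STAGES.
peak_areas = {
    "compound_A": [1000.0, 200.0, 150.0, 140.0],
    "compound_B": [800.0, 760.0, 100.0, 90.0],
    "compound_C": [500.0, 480.0, 450.0, 20.0],
    "compound_D": [900.0, 300.0, 290.0, 280.0],
}


def stepwise_removal(areas):
    """Fractional removal across each process: (in - out) / in."""
    return [
        (a_in - a_out) / a_in if a_in > 0 else 0.0
        for a_in, a_out in zip(areas, areas[1:])
    ]


def c1_label(areas):
    """Assumed C1-style label: the process with the highest fractional removal."""
    removal = stepwise_removal(areas)
    best = max(range(len(removal)), key=removal.__getitem__)
    return PROCESSES[best]


def cluster_overlap(labels_a, labels_b):
    """For each cluster in labels_a, return the share of its members that
    fall into its best-matching cluster of labels_b (an assumed definition)."""
    overlap = {}
    for cluster in set(labels_a.values()):
        members = [name for name, lab in labels_a.items() if lab == cluster]
        counts = Counter(labels_b[name] for name in members)
        overlap[cluster] = counts.most_common(1)[0][1] / len(members)
    return overlap


# Removal-based (C1-style) labels computed from the peak areas.
c1_labels = {name: c1_label(areas) for name, areas in peak_areas.items()}

# Hypothetical groups derived from chemical descriptors alone.
descriptor_labels = {
    "compound_A": "group_1",
    "compound_B": "group_2",
    "compound_C": "group_3",
    "compound_D": "group_2",
}

overlap = cluster_overlap(c1_labels, descriptor_labels)
for cluster, share in sorted(overlap.items()):
    print(f"{cluster}: {share:.0%} of members share one descriptor group")
```

In a workflow like the one the abstract outlines, removal-based labels of this kind would serve as the classification targets, with Abraham descriptors and log Kow as the input features to a classifier such as an SVM.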
About the journal:
Environmental Science: Water Research & Technology seeks to showcase high quality research about fundamental science, innovative technologies, and management practices that promote sustainable water.