Machine Learning-based Classification for the Prioritization of Potentially Hazardous Chemicals with Structural Alerts in Nontarget Screening

IF 11.3 1区 环境科学与生态学 Q1 ENGINEERING, ENVIRONMENTAL 环境科学与技术 Pub Date : 2025-03-07 DOI:10.1021/acs.est.4c10498
Nienke Meekel, Anneli Kruve, Marja H. Lamoree, Frederic M. Been
{"title":"Machine Learning-based Classification for the Prioritization of Potentially Hazardous Chemicals with Structural Alerts in Nontarget Screening","authors":"Nienke Meekel, Anneli Kruve, Marja H. Lamoree, Frederic M. Been","doi":"10.1021/acs.est.4c10498","DOIUrl":null,"url":null,"abstract":"Nontarget screening (NTS) with liquid chromatography high-resolution mass spectrometry (LC-HRMS) is commonly used to detect unknown organic micropollutants in the environment. One of the main challenges in NTS is the prioritization of relevant LC-HRMS features. A novel prioritization strategy based on structural alerts to select NTS features that correspond to potentially hazardous chemicals is presented here. This strategy leverages raw tandem mass spectra (MS<sup>2</sup>) and machine learning models to predict the probability that NTS features correspond to chemicals with structural alerts. The models were trained on fragments and neutral losses from the experimental MS<sup>2</sup> data. The feasibility of this approach is evaluated for two groups: aromatic amines and organophosphorus structural alerts. The neural network classification model for organophosphorus structural alerts achieved an Area Under the Curve of the Receiver Operating Characteristics (AUC-ROC) of 0.97 and a true positive rate of 0.65 on the test set. The random forest model for the classification of aromatic amines achieved an AUC-ROC value of 0.82 and a true positive rate of 0.58 on the test set. The models were successfully applied to prioritize LC-HRMS features in surface water samples, showcasing the high potential to develop and implement this approach further.","PeriodicalId":36,"journal":{"name":"环境科学与技术","volume":"9 1","pages":""},"PeriodicalIF":11.3000,"publicationDate":"2025-03-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"环境科学与技术","FirstCategoryId":"1","ListUrlMain":"https://doi.org/10.1021/acs.est.4c10498","RegionNum":1,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ENVIRONMENTAL","Score":null,"Total":0}
引用次数: 0

Abstract

Nontarget screening (NTS) with liquid chromatography high-resolution mass spectrometry (LC-HRMS) is commonly used to detect unknown organic micropollutants in the environment. One of the main challenges in NTS is the prioritization of relevant LC-HRMS features. A novel prioritization strategy based on structural alerts to select NTS features that correspond to potentially hazardous chemicals is presented here. This strategy leverages raw tandem mass spectra (MS2) and machine learning models to predict the probability that NTS features correspond to chemicals with structural alerts. The models were trained on fragments and neutral losses from the experimental MS2 data. The feasibility of this approach is evaluated for two groups: aromatic amines and organophosphorus structural alerts. The neural network classification model for organophosphorus structural alerts achieved an Area Under the Curve of the Receiver Operating Characteristics (AUC-ROC) of 0.97 and a true positive rate of 0.65 on the test set. The random forest model for the classification of aromatic amines achieved an AUC-ROC value of 0.82 and a true positive rate of 0.58 on the test set. The models were successfully applied to prioritize LC-HRMS features in surface water samples, showcasing the high potential to develop and implement this approach further.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于机器学习的非目标筛选中具有结构警报的潜在危险化学品优先级分类
液相色谱-高分辨率质谱法(LC-HRMS)非靶筛选(NTS)是检测环境中未知有机微污染物的常用方法。NTS的主要挑战之一是相关LC-HRMS功能的优先级。本文提出了一种基于结构警报的新型优先级策略,以选择与潜在危险化学品对应的NTS特征。该策略利用原始串联质谱(MS2)和机器学习模型来预测NTS特征与具有结构警报的化学物质对应的概率。这些模型是用来自MS2实验数据的碎片和中性损失进行训练的。该方法的可行性评估了两组:芳香胺和有机磷结构警报。有机磷结构预警的神经网络分类模型在测试集上的接受者工作特征曲线下面积(AUC-ROC)为0.97,真阳性率为0.65。芳香胺分类的随机森林模型在测试集上的AUC-ROC值为0.82,真阳性率为0.58。该模型已成功应用于地表水样品中LC-HRMS特征的优先排序,显示出进一步开发和实施该方法的巨大潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
环境科学与技术
环境科学与技术 环境科学-工程:环境
CiteScore
17.50
自引率
9.60%
发文量
12359
审稿时长
2.8 months
期刊介绍: Environmental Science & Technology (ES&T) is a co-sponsored academic and technical magazine by the Hubei Provincial Environmental Protection Bureau and the Hubei Provincial Academy of Environmental Sciences. Environmental Science & Technology (ES&T) holds the status of Chinese core journals, scientific papers source journals of China, Chinese Science Citation Database source journals, and Chinese Academic Journal Comprehensive Evaluation Database source journals. This publication focuses on the academic field of environmental protection, featuring articles related to environmental protection and technical advancements.
期刊最新文献
Bridging the Microbial and Mineral Carbon Pumps: Biochar for Synergistic Soil Carbon Sequestration. A Missing Source of Atmospheric NO2-: Heterogeneous Dark Transformation of PAN on Industrial Mineral Dust. Tunable Strategies for Controlled CO2 Hydrate Formation: Integrating Promoter-Driven Regulation and Cage Occupancy Analysis. From Launch to Legacy: Charting the Path of the Intergovernmental Science-Policy Panel on Chemicals, Waste, and Pollution. Spatial Organization of Biomass Controls Intrinsic Permeability of Porous Systems.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1