Machine learning prediction of dioxin lipophilicity and key feature Identification

IF 3 3区 化学 Q3 CHEMISTRY, PHYSICAL Computational and Theoretical Chemistry Pub Date : 2025-02-01 Epub Date: 2024-12-06 DOI:10.1016/j.comptc.2024.115032
Yingwei Wang, Yufei Li
{"title":"Machine learning prediction of dioxin lipophilicity and key feature Identification","authors":"Yingwei Wang,&nbsp;Yufei Li","doi":"10.1016/j.comptc.2024.115032","DOIUrl":null,"url":null,"abstract":"<div><div>Dioxins are potent exogenous ligands for the aryl hydrocarbon receptor (AHR) within human cell membranes. Their lipophilicity is a critical factor influencing the immunotoxicity mediated by AHR. This study utilizes experimental data on the lipophilicity of certain PCDDs as the dependent variable, and molecular descriptors of PCDDs as independent variables, to construct five machine learning models for predicting PCDDs’ lipophilicity. The evaluation metrics of these models indicate that the XGBoost model exhibits excellent predictive performance, achieving an 86% accuracy in predicting the logKow values of 75 PCDDs. An XGBoost-Bayesian stacked model was developed by employing a stacking algorithm, enhancing the prediction accuracy to 96%. This improved model was successfully applied to predict the log<em>K</em><sub>ow</sub> values of 175 PCDFs and validated through molecular membrane dynamics. The SHAP method identified key molecular descriptors influencing dioxin lipophilicity. This study offers a theoretical basis for investigating the toxicity of dioxins via AHR receptors.</div></div>","PeriodicalId":284,"journal":{"name":"Computational and Theoretical Chemistry","volume":"1244 ","pages":"Article 115032"},"PeriodicalIF":3.0000,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computational and Theoretical Chemistry","FirstCategoryId":"92","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2210271X24005711","RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/12/6 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"CHEMISTRY, PHYSICAL","Score":null,"Total":0}
引用次数: 0

Abstract

Dioxins are potent exogenous ligands for the aryl hydrocarbon receptor (AHR) within human cell membranes. Their lipophilicity is a critical factor influencing the immunotoxicity mediated by AHR. This study utilizes experimental data on the lipophilicity of certain PCDDs as the dependent variable, and molecular descriptors of PCDDs as independent variables, to construct five machine learning models for predicting PCDDs’ lipophilicity. The evaluation metrics of these models indicate that the XGBoost model exhibits excellent predictive performance, achieving an 86% accuracy in predicting the logKow values of 75 PCDDs. An XGBoost-Bayesian stacked model was developed by employing a stacking algorithm, enhancing the prediction accuracy to 96%. This improved model was successfully applied to predict the logKow values of 175 PCDFs and validated through molecular membrane dynamics. The SHAP method identified key molecular descriptors influencing dioxin lipophilicity. This study offers a theoretical basis for investigating the toxicity of dioxins via AHR receptors.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
二恶英亲脂性的机器学习预测及关键特征识别
二恶英是人细胞膜内芳烃受体(AHR)的有效外源性配体。它们的亲脂性是影响AHR介导的免疫毒性的关键因素。本研究以某些pcdd的亲脂性实验数据为因变量,以pcdd的分子描述符为自变量,构建了预测pcdd亲脂性的5个机器学习模型。这些模型的评估指标表明,XGBoost模型具有出色的预测性能,在预测75个pcdd的logKow值时达到86%的准确率。采用叠加算法建立XGBoost-Bayesian叠加模型,将预测精度提高到96%。该改进模型成功地用于预测175个pcdf的logKow值,并通过分子膜动力学进行了验证。SHAP方法确定了影响二恶英亲脂性的关键分子描述子。本研究为研究二恶英通过AHR受体的毒性提供了理论基础。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
4.20
自引率
10.70%
发文量
331
审稿时长
31 days
期刊介绍: Computational and Theoretical Chemistry publishes high quality, original reports of significance in computational and theoretical chemistry including those that deal with problems of structure, properties, energetics, weak interactions, reaction mechanisms, catalysis, and reaction rates involving atoms, molecules, clusters, surfaces, and bulk matter.
期刊最新文献
Pressure-dependent kinetics of the NH3+OH reaction from cryogenic to high temperatures An explicit density-matrix formulation for spin-resolved local reactivity in open-shell systems Stark effect and complex dielectric response in doped toroidal core/shell quantum rings with finite confinement A DFT study on the structural, electronic, and catalytic properties of CuₙSnₙ (n = 2–14) clusters Tailoring graphene-based diatomic catalysts via structural descriptors toward efficient CO2 reduction reaction: A DFT-ML study
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1