Application of machine learning to predict unbound drug bioavailability in the brain

J. F. Morales, M. E. Ruiz, Robert E. Stratford, Alan Talevi
{"title":"Application of machine learning to predict unbound drug bioavailability in the brain","authors":"J. F. Morales, M. E. Ruiz, Robert E. Stratford, Alan Talevi","doi":"10.3389/fddsv.2024.1360732","DOIUrl":null,"url":null,"abstract":"Purpose: Optimizing brain bioavailability is highly relevant for the development of drugs targeting the central nervous system. Several pharmacokinetic parameters have been used for measuring drug bioavailability in the brain. The most biorelevant among them is possibly the unbound brain-to-plasma partition coefficient, Kpuu,brain,ss, which relates unbound brain and plasma drug concentrations under steady-state conditions. In this study, we developed new in silico models to predict Kpuu,brain,ss.Methods: A manually curated 157-compound dataset was compiled from literature and split into training and test sets using a clustering approach. Additional models were trained with a refined dataset generated by removing known P-gp and/or Breast Cancer Resistance Protein substrates from the original dataset. Different supervised machine learning algorithms have been tested, including Support Vector Machine, Gradient Boosting Machine, k-nearest neighbors, classificatory Partial Least Squares, Random Forest, Extreme Gradient Boosting, Deep Learning and Linear Discriminant Analysis. Good practices of predictive Quantitative Structure-Activity Relationships modeling were followed for the development of the models.Results: The best performance in the complete dataset was achieved by extreme gradient boosting, with an accuracy in the test set of 85.1%. A similar estimation of accuracy was observed in a prospective validation experiment, using a small sample of compounds and comparing predicted unbound brain bioavailability with observed experimental data.Conclusion: New in silico models were developed to predict the Kpuu,brain,ss of drug candidates. The dataset used in this study is publicly disclosed, so that the models may be reproduced, refined, or expanded, as a useful tool to assist drug discovery processes.","PeriodicalId":507971,"journal":{"name":"Frontiers in Drug Discovery","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Drug Discovery","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/fddsv.2024.1360732","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Purpose: Optimizing brain bioavailability is highly relevant for the development of drugs targeting the central nervous system. Several pharmacokinetic parameters have been used for measuring drug bioavailability in the brain. The most biorelevant among them is possibly the unbound brain-to-plasma partition coefficient, Kpuu,brain,ss, which relates unbound brain and plasma drug concentrations under steady-state conditions. In this study, we developed new in silico models to predict Kpuu,brain,ss.Methods: A manually curated 157-compound dataset was compiled from literature and split into training and test sets using a clustering approach. Additional models were trained with a refined dataset generated by removing known P-gp and/or Breast Cancer Resistance Protein substrates from the original dataset. Different supervised machine learning algorithms have been tested, including Support Vector Machine, Gradient Boosting Machine, k-nearest neighbors, classificatory Partial Least Squares, Random Forest, Extreme Gradient Boosting, Deep Learning and Linear Discriminant Analysis. Good practices of predictive Quantitative Structure-Activity Relationships modeling were followed for the development of the models.Results: The best performance in the complete dataset was achieved by extreme gradient boosting, with an accuracy in the test set of 85.1%. A similar estimation of accuracy was observed in a prospective validation experiment, using a small sample of compounds and comparing predicted unbound brain bioavailability with observed experimental data.Conclusion: New in silico models were developed to predict the Kpuu,brain,ss of drug candidates. The dataset used in this study is publicly disclosed, so that the models may be reproduced, refined, or expanded, as a useful tool to assist drug discovery processes.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
应用机器学习预测非结合药物在大脑中的生物利用度
目的:优化脑部生物利用度与开发以中枢神经系统为靶点的药物密切相关。有几种药代动力学参数被用于测量药物在大脑中的生物利用度。其中与生物相关性最高的可能是非结合脑-血浆分配系数 Kpuu,brain,ss,它关系到稳态条件下非结合脑和血浆药物浓度。在这项研究中,我们开发了新的硅学模型来预测 Kpuu,brain,ss:方法:我们从文献中整理出一个由 157 种化合物组成的人工数据集,并使用聚类方法将其分成训练集和测试集。通过从原始数据集中移除已知的 P-gp 和/或乳腺癌抗性蛋白底物,生成了一个完善的数据集,并使用该数据集训练了其他模型。对不同的监督机器学习算法进行了测试,包括支持向量机、梯度提升机、k-近邻、分类偏最小二乘法、随机森林、极端梯度提升、深度学习和线性判别分析。模型的开发遵循了定量结构-活性关系预测建模的良好做法:极端梯度提升法在完整数据集中的表现最佳,测试集的准确率为 85.1%。在前瞻性验证实验中也观察到了类似的准确率,该实验使用了少量化合物样本,并将预测的非结合脑生物利用率与观察到的实验数据进行了比较:结论:开发了新的硅学模型来预测候选药物的脑生物利用度。本研究中使用的数据集已公开披露,因此这些模型可以复制、改进或扩展,成为协助药物发现过程的有用工具。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
The role of physicochemical and topological parameters in drug design Editorial: Recent trends in anti-cancer drug discovery by in silico methods A small molecule inhibitor of leucine carboxyl methyltransferase-1 inhibits cancer cell survival A review on dynamics of permeability-glycoprotein in efflux of chemotherapeutic drugs The (misleading) role of animal models in drug development
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1