利用有界数据点对分子效力改进进行分类

IF 3.597 Q2 Pharmacology, Toxicology and Pharmaceutics MedChemComm Pub Date : 2024-05-31 DOI:10.1039/D4MD00325J
Zachary Fralish, Paul Skaluba and Daniel Reker
{"title":"利用有界数据点对分子效力改进进行分类","authors":"Zachary Fralish, Paul Skaluba and Daniel Reker","doi":"10.1039/D4MD00325J","DOIUrl":null,"url":null,"abstract":"<p >Molecular machine learning algorithms are becoming increasingly powerful at predicting the potency of potential drug candidates to guide molecular discovery, lead series prioritization, and structural optimization. However, a substantial amount of inhibition data is bounded and inaccessible to traditional regression algorithms. Here, we develop a novel molecular pairing approach to process this data. This creates a new classification task of predicting which one of two paired molecules is more potent. This novel classification task can be accurately solved by various, established molecular machine learning algorithms, including XGBoost and Chemprop. Across 230 ChEMBL IC<small><sub>50</sub></small> datasets, both tree-based and neural network-based “DeltaClassifiers” show improvements over traditional regression approaches in correctly classifying molecular potency improvements. The Chemprop-based deep DeltaClassifier outperformed all here evaluated regression approaches for paired molecules with shared and with distinct scaffolds, highlighting the promise of this approach for molecular optimization and scaffold-hopping.</p>","PeriodicalId":88,"journal":{"name":"MedChemComm","volume":" 7","pages":" 2474-2482"},"PeriodicalIF":3.5970,"publicationDate":"2024-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.rsc.org/en/content/articlepdf/2024/md/d4md00325j?page=search","citationCount":"0","resultStr":"{\"title\":\"Leveraging bounded datapoints to classify molecular potency improvements†\",\"authors\":\"Zachary Fralish, Paul Skaluba and Daniel Reker\",\"doi\":\"10.1039/D4MD00325J\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p >Molecular machine learning algorithms are becoming increasingly powerful at predicting the potency of potential drug candidates to guide molecular discovery, lead series prioritization, and structural optimization. However, a substantial amount of inhibition data is bounded and inaccessible to traditional regression algorithms. Here, we develop a novel molecular pairing approach to process this data. This creates a new classification task of predicting which one of two paired molecules is more potent. This novel classification task can be accurately solved by various, established molecular machine learning algorithms, including XGBoost and Chemprop. Across 230 ChEMBL IC<small><sub>50</sub></small> datasets, both tree-based and neural network-based “DeltaClassifiers” show improvements over traditional regression approaches in correctly classifying molecular potency improvements. The Chemprop-based deep DeltaClassifier outperformed all here evaluated regression approaches for paired molecules with shared and with distinct scaffolds, highlighting the promise of this approach for molecular optimization and scaffold-hopping.</p>\",\"PeriodicalId\":88,\"journal\":{\"name\":\"MedChemComm\",\"volume\":\" 7\",\"pages\":\" 2474-2482\"},\"PeriodicalIF\":3.5970,\"publicationDate\":\"2024-05-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://pubs.rsc.org/en/content/articlepdf/2024/md/d4md00325j?page=search\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"MedChemComm\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://pubs.rsc.org/en/content/articlelanding/2024/md/d4md00325j\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"Pharmacology, Toxicology and Pharmaceutics\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"MedChemComm","FirstCategoryId":"1085","ListUrlMain":"https://pubs.rsc.org/en/content/articlelanding/2024/md/d4md00325j","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Pharmacology, Toxicology and Pharmaceutics","Score":null,"Total":0}
引用次数: 0

摘要

分子机器学习算法在预测潜在候选药物的药效以指导分子发现、先导系列优先排序和结构优化方面正变得越来越强大。然而,大量的抑制数据是有边界的,传统回归算法无法获取。在此,我们开发了一种新的分子配对方法来处理这些数据。这就产生了一个新的分类任务,即预测两个配对分子中哪一个更有效。包括 XGBoost 和 Chemprop 在内的各种成熟的分子机器学习算法都能准确地解决这项新的分类任务。在 230 个 ChEMBL IC50 数据集中,基于树的 "DeltaClassifiers "和基于神经网络的 "DeltaClassifiers "在正确分类分子效价改进方面都比传统回归方法有所提高。对于具有共享支架和不同支架的配对分子,基于 Chemprop 的深度 DeltaClassifier 的表现优于所有在此评估的回归方法,这凸显了这种方法在分子优化和支架跳转方面的前景。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

摘要图片

摘要图片

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Leveraging bounded datapoints to classify molecular potency improvements†

Molecular machine learning algorithms are becoming increasingly powerful at predicting the potency of potential drug candidates to guide molecular discovery, lead series prioritization, and structural optimization. However, a substantial amount of inhibition data is bounded and inaccessible to traditional regression algorithms. Here, we develop a novel molecular pairing approach to process this data. This creates a new classification task of predicting which one of two paired molecules is more potent. This novel classification task can be accurately solved by various, established molecular machine learning algorithms, including XGBoost and Chemprop. Across 230 ChEMBL IC50 datasets, both tree-based and neural network-based “DeltaClassifiers” show improvements over traditional regression approaches in correctly classifying molecular potency improvements. The Chemprop-based deep DeltaClassifier outperformed all here evaluated regression approaches for paired molecules with shared and with distinct scaffolds, highlighting the promise of this approach for molecular optimization and scaffold-hopping.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
MedChemComm
MedChemComm BIOCHEMISTRY & MOLECULAR BIOLOGY-CHEMISTRY, MEDICINAL
CiteScore
4.70
自引率
0.00%
发文量
0
审稿时长
2.2 months
期刊介绍: Research and review articles in medicinal chemistry and related drug discovery science; the official journal of the European Federation for Medicinal Chemistry. In 2020, MedChemComm will change its name to RSC Medicinal Chemistry. Issue 12, 2019 will be the last issue as MedChemComm.
期刊最新文献
Back cover Introduction to the themed collection in honour of Professor Christian Leumann Back cover Correction: computational design, synthesis, and assessment of 3-(4-(4-(1,3,4-oxadiazol-2-yl)-1H-imidazol-2-yl)phenyl)-1,2,4-oxadiazole derivatives as effective epidermal growth factor receptor inhibitors: a prospective strategy for anticancer therapy Introduction to the themed collection on ‘AI in Medicinal Chemistry’
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1