Accurate and Transferable Drug-Target Interaction Prediction with DrugLAMP.

Zhengchao Luo, Wei Wu, Qichen Sun, Jinzhuo Wang
{"title":"Accurate and Transferable Drug-Target Interaction Prediction with DrugLAMP.","authors":"Zhengchao Luo, Wei Wu, Qichen Sun, Jinzhuo Wang","doi":"10.1093/bioinformatics/btae693","DOIUrl":null,"url":null,"abstract":"<p><strong>Motivation: </strong>Accurate prediction of drug-target interactions (DTIs), especially for novel targets or drugs, is crucial for accelerating drug discovery. Recent advances in pretrained language models (PLMs) and multi-modal learning present new opportunities to enhance DTI prediction by leveraging vast unlabeled molecular data and integrating complementary information from multiple modalities.</p><p><strong>Results: </strong>We introduce DrugLAMP (PLM-Assisted Multi-modal Prediction), a PLM-based multi-modal framework for accurate and transferable DTI prediction. DrugLAMP integrates molecular graph and protein sequence features extracted by PLMs and traditional feature extractors. We introduce two novel multi-modal fusion modules: (1) Pocket-guided Co-Attention (PGCA), which uses protein pocket information to guide the attention mechanism on drug features, and (2) Paired Multi-modal Attention (PMMA), which enables effective cross-modal interactions between drug and protein features. These modules work together to enhance the model's ability to capture complex drug-protein interactions. Moreover, the Contrastive Compound-Protein Pre-training (2C2P) module enhances the model's generalization to real-world scenarios by aligning features across modalities and conditions. Comprehensive experiments demonstrate DrugLAMP's state-of-the-art performance on both standard benchmarks and challenging settings simulating real-world drug discovery, where test drugs/targets are unseen during training. Visualizations of attention maps and application to predict cryptic pockets and drug side effects further showcase DrugLAMP's strong interpretability and generalizability. Ablation studies confirm the contributions of the proposed modules.</p><p><strong>Availability: </strong>Source code and datasets are freely available at https://github.com/Lzcstan/DrugLAMP. All data originate from public sources.</p><p><strong>Supplementary information: </strong>Supplementary data are available at Bioinformatics online.</p>","PeriodicalId":93899,"journal":{"name":"Bioinformatics (Oxford, England)","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bioinformatics (Oxford, England)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/bioinformatics/btae693","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Motivation: Accurate prediction of drug-target interactions (DTIs), especially for novel targets or drugs, is crucial for accelerating drug discovery. Recent advances in pretrained language models (PLMs) and multi-modal learning present new opportunities to enhance DTI prediction by leveraging vast unlabeled molecular data and integrating complementary information from multiple modalities.

Results: We introduce DrugLAMP (PLM-Assisted Multi-modal Prediction), a PLM-based multi-modal framework for accurate and transferable DTI prediction. DrugLAMP integrates molecular graph and protein sequence features extracted by PLMs and traditional feature extractors. We introduce two novel multi-modal fusion modules: (1) Pocket-guided Co-Attention (PGCA), which uses protein pocket information to guide the attention mechanism on drug features, and (2) Paired Multi-modal Attention (PMMA), which enables effective cross-modal interactions between drug and protein features. These modules work together to enhance the model's ability to capture complex drug-protein interactions. Moreover, the Contrastive Compound-Protein Pre-training (2C2P) module enhances the model's generalization to real-world scenarios by aligning features across modalities and conditions. Comprehensive experiments demonstrate DrugLAMP's state-of-the-art performance on both standard benchmarks and challenging settings simulating real-world drug discovery, where test drugs/targets are unseen during training. Visualizations of attention maps and application to predict cryptic pockets and drug side effects further showcase DrugLAMP's strong interpretability and generalizability. Ablation studies confirm the contributions of the proposed modules.

Availability: Source code and datasets are freely available at https://github.com/Lzcstan/DrugLAMP. All data originate from public sources.

Supplementary information: Supplementary data are available at Bioinformatics online.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用 DrugLAMP 进行准确、可转移的药物-靶点相互作用预测。
动机准确预测药物-靶点相互作用(DTI),尤其是新靶点或药物的相互作用,对于加速药物发现至关重要。预训练语言模型(PLM)和多模态学习的最新进展为利用大量未标记的分子数据和整合来自多种模态的互补信息来增强 DTI 预测提供了新的机遇:我们介绍了DrugLAMP(PLM-Assisted Multi-modal Prediction,PLM辅助多模态预测),这是一个基于PLM的多模态框架,用于准确和可转移的DTI预测。DrugLAMP整合了由PLM和传统特征提取器提取的分子图和蛋白质序列特征。我们引入了两个新颖的多模态融合模块:(1) Pocket-guided Co-Attention (PGCA),该模块利用蛋白质口袋信息引导对药物特征的关注机制;(2) Paired Multi-modal Attention (PMMA),该模块实现了药物特征和蛋白质特征之间有效的跨模态交互。这些模块共同作用,增强了模型捕捉复杂的药物-蛋白质相互作用的能力。此外,对比化合物-蛋白质预训练(2C2P)模块通过调整跨模态和条件的特征,增强了模型对真实世界场景的泛化能力。综合实验证明了DrugLAMP在标准基准和模拟真实世界药物发现的挑战性设置上的一流性能,在这些设置中,测试药物/靶点在训练过程中是不可见的。注意力图的可视化以及在预测隐秘口袋和药物副作用方面的应用进一步展示了DrugLAMP强大的可解释性和通用性。消融研究证实了拟议模块的贡献:源代码和数据集可在 https://github.com/Lzcstan/DrugLAMP 免费获取。所有数据均来自公开来源:补充数据可在 Bioinformatics online 上获取。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
RUCova: Removal of Unwanted Covariance in mass cytometry data. ViraLM: Empowering Virus Discovery through the Genome Foundation Model. CVR-BBI: An Open-Source VR Platform for Multi-User Collaborative Brain to Brain Interfaces. Expert-guided protein Language Models enable accurate and blazingly fast fitness prediction. FungiFun3: Systemic gene set enrichment analysis for fungal species.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1