Accurate and Transferable Drug-Target Interaction Prediction with DrugLAMP.

Bioinformatics (Oxford, England) Pub Date : 2024-11-21 DOI:10.1093/bioinformatics/btae693

Zhengchao Luo, Wei Wu, Qichen Sun, Jinzhuo Wang

{"title":"Accurate and Transferable Drug-Target Interaction Prediction with DrugLAMP.","authors":"Zhengchao Luo, Wei Wu, Qichen Sun, Jinzhuo Wang","doi":"10.1093/bioinformatics/btae693","DOIUrl":null,"url":null,"abstract":"Motivation: Accurate prediction of drug-target interactions (DTIs), especially for novel targets or drugs, is crucial for accelerating drug discovery. Recent advances in pretrained language models (PLMs) and multi-modal learning present new opportunities to enhance DTI prediction by leveraging vast unlabeled molecular data and integrating complementary information from multiple modalities.Results: We introduce DrugLAMP (PLM-Assisted Multi-modal Prediction), a PLM-based multi-modal framework for accurate and transferable DTI prediction. DrugLAMP integrates molecular graph and protein sequence features extracted by PLMs and traditional feature extractors. We introduce two novel multi-modal fusion modules: (1) Pocket-guided Co-Attention (PGCA), which uses protein pocket information to guide the attention mechanism on drug features, and (2) Paired Multi-modal Attention (PMMA), which enables effective cross-modal interactions between drug and protein features. These modules work together to enhance the model's ability to capture complex drug-protein interactions. Moreover, the Contrastive Compound-Protein Pre-training (2C2P) module enhances the model's generalization to real-world scenarios by aligning features across modalities and conditions. Comprehensive experiments demonstrate DrugLAMP's state-of-the-art performance on both standard benchmarks and challenging settings simulating real-world drug discovery, where test drugs/targets are unseen during training. Visualizations of attention maps and application to predict cryptic pockets and drug side effects further showcase DrugLAMP's strong interpretability and generalizability. Ablation studies confirm the contributions of the proposed modules.Availability: Source code and datasets are freely available at https://github.com/Lzcstan/DrugLAMP. All data originate from public sources.Supplementary information: Supplementary data are available at Bioinformatics online.","PeriodicalId":93899,"journal":{"name":"Bioinformatics (Oxford, England)","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bioinformatics (Oxford, England)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/bioinformatics/btae693","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Motivation: Accurate prediction of drug-target interactions (DTIs), especially for novel targets or drugs, is crucial for accelerating drug discovery. Recent advances in pretrained language models (PLMs) and multi-modal learning present new opportunities to enhance DTI prediction by leveraging vast unlabeled molecular data and integrating complementary information from multiple modalities.

Results: We introduce DrugLAMP (PLM-Assisted Multi-modal Prediction), a PLM-based multi-modal framework for accurate and transferable DTI prediction. DrugLAMP integrates molecular graph and protein sequence features extracted by PLMs and traditional feature extractors. We introduce two novel multi-modal fusion modules: (1) Pocket-guided Co-Attention (PGCA), which uses protein pocket information to guide the attention mechanism on drug features, and (2) Paired Multi-modal Attention (PMMA), which enables effective cross-modal interactions between drug and protein features. These modules work together to enhance the model's ability to capture complex drug-protein interactions. Moreover, the Contrastive Compound-Protein Pre-training (2C2P) module enhances the model's generalization to real-world scenarios by aligning features across modalities and conditions. Comprehensive experiments demonstrate DrugLAMP's state-of-the-art performance on both standard benchmarks and challenging settings simulating real-world drug discovery, where test drugs/targets are unseen during training. Visualizations of attention maps and application to predict cryptic pockets and drug side effects further showcase DrugLAMP's strong interpretability and generalizability. Ablation studies confirm the contributions of the proposed modules.

Availability: Source code and datasets are freely available at https://github.com/Lzcstan/DrugLAMP. All data originate from public sources.

Supplementary information: Supplementary data are available at Bioinformatics online.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

利用 DrugLAMP 进行准确、可转移的药物-靶点相互作用预测。

动机准确预测药物-靶点相互作用（DTI），尤其是新靶点或药物的相互作用，对于加速药物发现至关重要。预训练语言模型（PLM）和多模态学习的最新进展为利用大量未标记的分子数据和整合来自多种模态的互补信息来增强 DTI 预测提供了新的机遇：我们介绍了DrugLAMP（PLM-Assisted Multi-modal Prediction，PLM辅助多模态预测），这是一个基于PLM的多模态框架，用于准确和可转移的DTI预测。DrugLAMP整合了由PLM和传统特征提取器提取的分子图和蛋白质序列特征。我们引入了两个新颖的多模态融合模块：(1) Pocket-guided Co-Attention (PGCA)，该模块利用蛋白质口袋信息引导对药物特征的关注机制；(2) Paired Multi-modal Attention (PMMA)，该模块实现了药物特征和蛋白质特征之间有效的跨模态交互。这些模块共同作用，增强了模型捕捉复杂的药物-蛋白质相互作用的能力。此外，对比化合物-蛋白质预训练（2C2P）模块通过调整跨模态和条件的特征，增强了模型对真实世界场景的泛化能力。综合实验证明了DrugLAMP在标准基准和模拟真实世界药物发现的挑战性设置上的一流性能，在这些设置中，测试药物/靶点在训练过程中是不可见的。注意力图的可视化以及在预测隐秘口袋和药物副作用方面的应用进一步展示了DrugLAMP强大的可解释性和通用性。消融研究证实了拟议模块的贡献：源代码和数据集可在 https://github.com/Lzcstan/DrugLAMP 免费获取。所有数据均来自公开来源：补充数据可在 Bioinformatics online 上获取。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Bioinformatics (Oxford, England)

自引率

0.00%

发文量