Improving generalizability of drug-target binding prediction by pre-trained multi-view molecular representations.

Xike Ouyang, Yannuo Feng, Chen Cui, Yunhe Li, Li Zhang, Han Wang
{"title":"Improving generalizability of drug-target binding prediction by pre-trained multi-view molecular representations.","authors":"Xike Ouyang, Yannuo Feng, Chen Cui, Yunhe Li, Li Zhang, Han Wang","doi":"10.1093/bioinformatics/btaf002","DOIUrl":null,"url":null,"abstract":"<p><strong>Motivation: </strong>Most drugs start on their journey inside the body by binding the right target proteins. This is the reason that numerous efforts have been devoted to predicting the drug-target binding during drug development. However, the inherent diversity among molecular properties, coupled with limited training data availability, poses challenges to the accuracy and generalizability of these methods beyond their training domain.</p><p><strong>Results: </strong>In this work, we proposed a neural networks construction for high accurate and generalizable drug-target binding prediction, named Pre-trained Multi-view Molecular Representations (PMMR). The method uses pre-trained models to transfer representations of target proteins and drugs to the domain of drug-target binding prediction, mitigating the issue of poor generalizability stemming from limited data. Then, two typical representations of drug molecules, Graphs and SMILES strings, are learned respectively by a Graph Neural Network and a Transformer to achieve complementarity between local and global features. PMMR was evaluated on drug-target affinity and interaction benchmark datasets, and it derived preponderant performance contrast to peer methods, especially generalizability in cold-start scenarios. Furthermore, our state-of-the-art method was indicated to have the potential for drug discovery by a case study of cyclin-dependent kinase 2.</p><p><strong>Availability and implementation: </strong>https://github.com/NENUBioCompute/PMMR.</p>","PeriodicalId":93899,"journal":{"name":"Bioinformatics (Oxford, England)","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-12-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11751634/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bioinformatics (Oxford, England)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/bioinformatics/btaf002","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Motivation: Most drugs start on their journey inside the body by binding the right target proteins. This is the reason that numerous efforts have been devoted to predicting the drug-target binding during drug development. However, the inherent diversity among molecular properties, coupled with limited training data availability, poses challenges to the accuracy and generalizability of these methods beyond their training domain.

Results: In this work, we proposed a neural networks construction for high accurate and generalizable drug-target binding prediction, named Pre-trained Multi-view Molecular Representations (PMMR). The method uses pre-trained models to transfer representations of target proteins and drugs to the domain of drug-target binding prediction, mitigating the issue of poor generalizability stemming from limited data. Then, two typical representations of drug molecules, Graphs and SMILES strings, are learned respectively by a Graph Neural Network and a Transformer to achieve complementarity between local and global features. PMMR was evaluated on drug-target affinity and interaction benchmark datasets, and it derived preponderant performance contrast to peer methods, especially generalizability in cold-start scenarios. Furthermore, our state-of-the-art method was indicated to have the potential for drug discovery by a case study of cyclin-dependent kinase 2.

Availability and implementation: https://github.com/NENUBioCompute/PMMR.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用预训练的多视点分子表征提高药物与靶点结合预测的通用性。
动机:大多数药物通过结合正确的靶蛋白开始其在体内的旅程。这就是为什么在药物开发过程中对药物-靶点结合进行预测的原因。然而,分子性质的固有多样性,加上训练数据的有限性,对这些方法的准确性和泛化性提出了挑战。结果:在这项工作中,我们提出了一种用于高精度和可推广的药物靶点结合预测的神经网络结构,称为预训练多视图分子表征(PMMR)。该方法使用预先训练的模型将靶蛋白和药物的表示转移到药物-靶标结合预测领域,减轻了由于数据有限而导致的较差泛化性的问题。然后,利用图神经网络(Graph Neural Network, GNN)和Transformer分别学习药物分子的两种典型表征Graph和SMILES字符串,实现局部特征和全局特征的互补。PMMR在药物靶标亲和力和相互作用基准数据集上进行了评估,与同类方法相比,它的性能优于同类方法,特别是在冷启动场景下的通用性。此外,通过周期蛋白依赖性激酶2 (CDK2)的案例研究表明,我们最先进的方法具有药物发现的潜力。可用性和实施:https://github.com/NENUBioCompute/PMMR.Supplementary信息:补充数据可在生物信息学网站在线获得。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
HTSinfer: Inferring metadata from bulk illumina RNA-Seq libraries. MOSTPLAS: A Self-correction Multi-label Learning Model for Plasmid Host Range Prediction. GCLink: a graph contrastive link prediction framework for gene regulatory network inference. PNL: a software to build polygenic risk scores using a Super Learner approach based on PairNet, a Convolutional Neural Network. TiltRec: An ultra-fast and open-source toolkit for cryo-electron tomographic reconstruction.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1