TPepPro: a deep learning model for predicting peptide-protein interactions.

Xiaohong Jin, Zimeng Chen, Dan Yu, Qianhui Jiang, Zhuobin Chen, Bin Yan, Jing Qin, Yong Liu, Junwen Wang
{"title":"TPepPro: a deep learning model for predicting peptide-protein interactions.","authors":"Xiaohong Jin, Zimeng Chen, Dan Yu, Qianhui Jiang, Zhuobin Chen, Bin Yan, Jing Qin, Yong Liu, Junwen Wang","doi":"10.1093/bioinformatics/btae708","DOIUrl":null,"url":null,"abstract":"<p><strong>Motivation: </strong>Peptides and their derivatives hold potential as therapeutic agents. The rising interest in developing peptide drugs is evidenced by increasing approval rates by the FDA of USA. To identify the most potential peptides, study on peptide-protein interactions (PepPIs) presents a very important approach but poses considerable technical challenges. In experimental aspects, the transient nature of PepPIs and the high flexibility of peptides contribute to elevated costs and inefficiency. Traditional docking and molecular dynamics simulation methods require substantial computational resources, and the predictive accuracy of their results remain unsatisfactory.</p><p><strong>Results: </strong>To address this gap, we proposed TPepPro, a Transformer-based model for PepPI prediction. We trained TPepPro on a dataset of 19,187 pairs of peptide-protein complexes with both sequential and structural features. TPepPro utilizes a strategy that combines local protein sequence feature extraction with global protein structure feature extraction. Moreover, TPepPro optimizes the architecture of structural featuring neural network in BN-ReLU arrangement, which notably reduced the amount of computing resources required for PepPIs prediction. According to comparison analysis, the accuracy reached 0.855 in TPepPro, achieving an 8.1% improvement compared to the second-best model TAGPPI. TPepPro achieved an AUC of 0.922, surpassing the second-best model TAGPPI with 0.844. Moreover, the newly developed TPepPro identify certain PepPIs that can be validated according to previous experimental evidence, thus indicating the efficiency of TPepPro to detect high potential PepPIs that would be helpful for amino acid drug applications.</p><p><strong>Availability and implementation: </strong>The source code of TPepPro is available at https://github.com/wanglabhku/TPepPro.</p>","PeriodicalId":93899,"journal":{"name":"Bioinformatics (Oxford, England)","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-12-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11681936/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bioinformatics (Oxford, England)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/bioinformatics/btae708","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Motivation: Peptides and their derivatives hold potential as therapeutic agents. The rising interest in developing peptide drugs is evidenced by increasing approval rates by the FDA of USA. To identify the most potential peptides, study on peptide-protein interactions (PepPIs) presents a very important approach but poses considerable technical challenges. In experimental aspects, the transient nature of PepPIs and the high flexibility of peptides contribute to elevated costs and inefficiency. Traditional docking and molecular dynamics simulation methods require substantial computational resources, and the predictive accuracy of their results remain unsatisfactory.

Results: To address this gap, we proposed TPepPro, a Transformer-based model for PepPI prediction. We trained TPepPro on a dataset of 19,187 pairs of peptide-protein complexes with both sequential and structural features. TPepPro utilizes a strategy that combines local protein sequence feature extraction with global protein structure feature extraction. Moreover, TPepPro optimizes the architecture of structural featuring neural network in BN-ReLU arrangement, which notably reduced the amount of computing resources required for PepPIs prediction. According to comparison analysis, the accuracy reached 0.855 in TPepPro, achieving an 8.1% improvement compared to the second-best model TAGPPI. TPepPro achieved an AUC of 0.922, surpassing the second-best model TAGPPI with 0.844. Moreover, the newly developed TPepPro identify certain PepPIs that can be validated according to previous experimental evidence, thus indicating the efficiency of TPepPro to detect high potential PepPIs that would be helpful for amino acid drug applications.

Availability and implementation: The source code of TPepPro is available at https://github.com/wanglabhku/TPepPro.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
TPepPro:预测多肽-蛋白质相互作用的深度学习模型。
动机肽及其衍生物具有作为治疗药物的潜力。美国食品和药物管理局(FDA)对多肽药物的批准率不断提高,证明了人们对开发多肽药物的兴趣日益高涨。要找出最有潜力的多肽,研究多肽与蛋白质的相互作用是一个非常重要的方法,但也带来了相当大的技术挑战。在实验方面,肽与蛋白质相互作用(PepPIs)的瞬时性和肽的高度灵活性导致成本和效率的提高。传统的对接和分子动力学模拟方法需要大量的计算资源,其结果的预测准确性仍不能令人满意:为了弥补这一不足,我们提出了基于 Transformer 的 PepPI 预测模型 TPepPro。我们在一个包含 19,187 对多肽-蛋白质复合物的数据集上训练了 TPepPro,该数据集同时具有序列和结构特征。TPepPro 采用了一种将局部蛋白质序列特征提取与全局蛋白质结构特征提取相结合的策略。此外,TPepPro 还优化了 BN-ReLU 排列的结构特征神经网络架构,从而显著降低了肽-蛋白质相互作用预测所需的计算资源。根据对比分析,TPepPro 的准确率达到了 0.855,比排名第二的 TAGPPI 提高了 8.1%。TPepPro 的 AUC 为 0.922,超过了排名第二的 TAGPPI 的 0.844。此外,新开发的 TPepPro 还能识别出某些可根据以前的实验证据进行验证的 PepPIs,从而表明 TPepPro 能有效地检测出有助于氨基酸药物应用的高潜力 PepPIs:TPepPro 的源代码可从 https://github.com/wanglabhku/TPepPro.Supplementary 信息中获取:Supplementary data are available at Bioinformatics online.\.
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
HTSinfer: Inferring metadata from bulk illumina RNA-Seq libraries. MOSTPLAS: A Self-correction Multi-label Learning Model for Plasmid Host Range Prediction. GCLink: a graph contrastive link prediction framework for gene regulatory network inference. PNL: a software to build polygenic risk scores using a Super Learner approach based on PairNet, a Convolutional Neural Network. TiltRec: An ultra-fast and open-source toolkit for cryo-electron tomographic reconstruction.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1