TPepPro: a deep learning model for predicting peptide-protein interactions.

Bioinformatics (Oxford, England) Pub Date : 2024-12-26 DOI:10.1093/bioinformatics/btae708

Xiaohong Jin, Zimeng Chen, Dan Yu, Qianhui Jiang, Zhuobin Chen, Bin Yan, Jing Qin, Yong Liu, Junwen Wang

{"title":"TPepPro: a deep learning model for predicting peptide-protein interactions.","authors":"Xiaohong Jin, Zimeng Chen, Dan Yu, Qianhui Jiang, Zhuobin Chen, Bin Yan, Jing Qin, Yong Liu, Junwen Wang","doi":"10.1093/bioinformatics/btae708","DOIUrl":null,"url":null,"abstract":"Motivation: Peptides and their derivatives hold potential as therapeutic agents. The rising interest in developing peptide drugs is evidenced by increasing approval rates by the FDA of USA. To identify the most potential peptides, study on peptide-protein interactions (PepPIs) presents a very important approach but poses considerable technical challenges. In experimental aspects, the transient nature of PepPIs and the high flexibility of peptides contribute to elevated costs and inefficiency. Traditional docking and molecular dynamics simulation methods require substantial computational resources, and the predictive accuracy of their results remain unsatisfactory.Results: To address this gap, we proposed TPepPro, a Transformer-based model for PepPI prediction. We trained TPepPro on a dataset of 19,187 pairs of peptide-protein complexes with both sequential and structural features. TPepPro utilizes a strategy that combines local protein sequence feature extraction with global protein structure feature extraction. Moreover, TPepPro optimizes the architecture of structural featuring neural network in BN-ReLU arrangement, which notably reduced the amount of computing resources required for PepPIs prediction. According to comparison analysis, the accuracy reached 0.855 in TPepPro, achieving an 8.1% improvement compared to the second-best model TAGPPI. TPepPro achieved an AUC of 0.922, surpassing the second-best model TAGPPI with 0.844. Moreover, the newly developed TPepPro identify certain PepPIs that can be validated according to previous experimental evidence, thus indicating the efficiency of TPepPro to detect high potential PepPIs that would be helpful for amino acid drug applications.Availability and implementation: The source code of TPepPro is available at https://github.com/wanglabhku/TPepPro.","PeriodicalId":93899,"journal":{"name":"Bioinformatics (Oxford, England)","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-12-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11681936/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bioinformatics (Oxford, England)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/bioinformatics/btae708","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Motivation: Peptides and their derivatives hold potential as therapeutic agents. The rising interest in developing peptide drugs is evidenced by increasing approval rates by the FDA of USA. To identify the most potential peptides, study on peptide-protein interactions (PepPIs) presents a very important approach but poses considerable technical challenges. In experimental aspects, the transient nature of PepPIs and the high flexibility of peptides contribute to elevated costs and inefficiency. Traditional docking and molecular dynamics simulation methods require substantial computational resources, and the predictive accuracy of their results remain unsatisfactory.

Results: To address this gap, we proposed TPepPro, a Transformer-based model for PepPI prediction. We trained TPepPro on a dataset of 19,187 pairs of peptide-protein complexes with both sequential and structural features. TPepPro utilizes a strategy that combines local protein sequence feature extraction with global protein structure feature extraction. Moreover, TPepPro optimizes the architecture of structural featuring neural network in BN-ReLU arrangement, which notably reduced the amount of computing resources required for PepPIs prediction. According to comparison analysis, the accuracy reached 0.855 in TPepPro, achieving an 8.1% improvement compared to the second-best model TAGPPI. TPepPro achieved an AUC of 0.922, surpassing the second-best model TAGPPI with 0.844. Moreover, the newly developed TPepPro identify certain PepPIs that can be validated according to previous experimental evidence, thus indicating the efficiency of TPepPro to detect high potential PepPIs that would be helpful for amino acid drug applications.

Availability and implementation: The source code of TPepPro is available at https://github.com/wanglabhku/TPepPro.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

TPepPro：预测多肽-蛋白质相互作用的深度学习模型。

动机肽及其衍生物具有作为治疗药物的潜力。美国食品和药物管理局（FDA）对多肽药物的批准率不断提高，证明了人们对开发多肽药物的兴趣日益高涨。要找出最有潜力的多肽，研究多肽与蛋白质的相互作用是一个非常重要的方法，但也带来了相当大的技术挑战。在实验方面，肽与蛋白质相互作用（PepPIs）的瞬时性和肽的高度灵活性导致成本和效率的提高。传统的对接和分子动力学模拟方法需要大量的计算资源，其结果的预测准确性仍不能令人满意：为了弥补这一不足，我们提出了基于 Transformer 的 PepPI 预测模型 TPepPro。我们在一个包含 19,187 对多肽-蛋白质复合物的数据集上训练了 TPepPro，该数据集同时具有序列和结构特征。TPepPro 采用了一种将局部蛋白质序列特征提取与全局蛋白质结构特征提取相结合的策略。此外，TPepPro 还优化了 BN-ReLU 排列的结构特征神经网络架构，从而显著降低了肽-蛋白质相互作用预测所需的计算资源。根据对比分析，TPepPro 的准确率达到了 0.855，比排名第二的 TAGPPI 提高了 8.1%。TPepPro 的 AUC 为 0.922，超过了排名第二的 TAGPPI 的 0.844。此外，新开发的 TPepPro 还能识别出某些可根据以前的实验证据进行验证的 PepPIs，从而表明 TPepPro 能有效地检测出有助于氨基酸药物应用的高潜力 PepPIs：TPepPro 的源代码可从 https://github.com/wanglabhku/TPepPro.Supplementary 信息中获取：Supplementary data are available at Bioinformatics online.\.

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Bioinformatics (Oxford, England)

自引率

0.00%

发文量