基于Transformer的蛋白质-蛋白质相互作用位点预测集成框架。

IF 11 1区 综合性期刊 Q1 Multidisciplinary Research Pub Date : 2023-09-27 eCollection Date: 2023-01-01 DOI:10.34133/research.0240
Minjie Mou, Ziqi Pan, Zhimeng Zhou, Lingyan Zheng, Hanyu Zhang, Shuiyang Shi, Fengcheng Li, Xiuna Sun, Feng Zhu
{"title":"基于Transformer的蛋白质-蛋白质相互作用位点预测集成框架。","authors":"Minjie Mou,&nbsp;Ziqi Pan,&nbsp;Zhimeng Zhou,&nbsp;Lingyan Zheng,&nbsp;Hanyu Zhang,&nbsp;Shuiyang Shi,&nbsp;Fengcheng Li,&nbsp;Xiuna Sun,&nbsp;Feng Zhu","doi":"10.34133/research.0240","DOIUrl":null,"url":null,"abstract":"<p><p>The identification of protein-protein interaction (PPI) sites is essential in the research of protein function and the discovery of new drugs. So far, a variety of computational tools based on machine learning have been developed to accelerate the identification of PPI sites. However, existing methods suffer from the low predictive accuracy or the limited scope of application. Specifically, some methods learned only global or local sequential features, leading to low predictive accuracy, while others achieved improved performance by extracting residue interactions from structures but were limited in their application scope for the serious dependence on precise structure information. There is an urgent need to develop a method that integrates comprehensive information to realize proteome-wide accurate profiling of PPI sites. Herein, a novel ensemble framework for PPI sites prediction, EnsemPPIS, was therefore proposed based on transformer and gated convolutional networks. EnsemPPIS can effectively capture not only global and local patterns but also residue interactions. Specifically, EnsemPPIS was unique in (a) extracting residue interactions from protein sequences with transformer and (b) further integrating global and local sequential features with the ensemble learning strategy. Compared with various existing methods, EnsemPPIS exhibited either superior performance or broader applicability on multiple PPI sites prediction tasks. Moreover, pattern analysis based on the interpretability of EnsemPPIS demonstrated that EnsemPPIS was fully capable of learning residue interactions within the local structure of PPI sites using only sequence information. The web server of EnsemPPIS is freely available at http://idrblab.org/ensemppis.</p>","PeriodicalId":21120,"journal":{"name":"Research","volume":"6 ","pages":"0240"},"PeriodicalIF":11.0000,"publicationDate":"2023-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10528219/pdf/","citationCount":"0","resultStr":"{\"title\":\"A Transformer-Based Ensemble Framework for the Prediction of Protein-Protein Interaction Sites.\",\"authors\":\"Minjie Mou,&nbsp;Ziqi Pan,&nbsp;Zhimeng Zhou,&nbsp;Lingyan Zheng,&nbsp;Hanyu Zhang,&nbsp;Shuiyang Shi,&nbsp;Fengcheng Li,&nbsp;Xiuna Sun,&nbsp;Feng Zhu\",\"doi\":\"10.34133/research.0240\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>The identification of protein-protein interaction (PPI) sites is essential in the research of protein function and the discovery of new drugs. So far, a variety of computational tools based on machine learning have been developed to accelerate the identification of PPI sites. However, existing methods suffer from the low predictive accuracy or the limited scope of application. Specifically, some methods learned only global or local sequential features, leading to low predictive accuracy, while others achieved improved performance by extracting residue interactions from structures but were limited in their application scope for the serious dependence on precise structure information. There is an urgent need to develop a method that integrates comprehensive information to realize proteome-wide accurate profiling of PPI sites. Herein, a novel ensemble framework for PPI sites prediction, EnsemPPIS, was therefore proposed based on transformer and gated convolutional networks. EnsemPPIS can effectively capture not only global and local patterns but also residue interactions. Specifically, EnsemPPIS was unique in (a) extracting residue interactions from protein sequences with transformer and (b) further integrating global and local sequential features with the ensemble learning strategy. Compared with various existing methods, EnsemPPIS exhibited either superior performance or broader applicability on multiple PPI sites prediction tasks. Moreover, pattern analysis based on the interpretability of EnsemPPIS demonstrated that EnsemPPIS was fully capable of learning residue interactions within the local structure of PPI sites using only sequence information. The web server of EnsemPPIS is freely available at http://idrblab.org/ensemppis.</p>\",\"PeriodicalId\":21120,\"journal\":{\"name\":\"Research\",\"volume\":\"6 \",\"pages\":\"0240\"},\"PeriodicalIF\":11.0000,\"publicationDate\":\"2023-09-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10528219/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Research\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://doi.org/10.34133/research.0240\",\"RegionNum\":1,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2023/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q1\",\"JCRName\":\"Multidisciplinary\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Research","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.34133/research.0240","RegionNum":1,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2023/1/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"Multidisciplinary","Score":null,"Total":0}
引用次数: 0

摘要

蛋白质-蛋白质相互作用(PPI)位点的鉴定在蛋白质功能研究和新药发现中至关重要。到目前为止,已经开发了各种基于机器学习的计算工具来加速PPI位点的识别。然而,现有的方法存在预测精度低或应用范围有限的问题。具体而言,一些方法只学习全局或局部序列特征,导致预测精度较低,而另一些方法通过从结构中提取残基相互作用来提高性能,但由于严重依赖于精确的结构信息,其应用范围有限。迫切需要开发一种整合综合信息的方法,以实现PPI位点的全蛋白质组精确图谱。因此,本文提出了一种新的基于变换器和门控卷积网络的PPI位点预测集成框架EnsemPPIS。EnsemPPIS不仅可以有效地捕捉全局和局部模式,还可以捕捉残留物的相互作用。具体而言,EnsemPPIS在(a)使用transformer从蛋白质序列中提取残基相互作用,以及(b)将全局和局部序列特征与集成学习策略进一步集成方面是独特的。与现有的各种方法相比,EnsemPPIS在多个PPI位点的预测任务中表现出优越的性能或更广泛的适用性。此外,基于EnsemPPIS可解释性的模式分析表明,EnsemPPIS完全能够仅使用序列信息学习PPI位点局部结构内的残基相互作用。EnsemPPIS的网络服务器可在http://idrblab.org/ensemppis.
本文章由计算机程序翻译,如有差异,请以英文原文为准。

摘要图片

摘要图片

摘要图片

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
A Transformer-Based Ensemble Framework for the Prediction of Protein-Protein Interaction Sites.

The identification of protein-protein interaction (PPI) sites is essential in the research of protein function and the discovery of new drugs. So far, a variety of computational tools based on machine learning have been developed to accelerate the identification of PPI sites. However, existing methods suffer from the low predictive accuracy or the limited scope of application. Specifically, some methods learned only global or local sequential features, leading to low predictive accuracy, while others achieved improved performance by extracting residue interactions from structures but were limited in their application scope for the serious dependence on precise structure information. There is an urgent need to develop a method that integrates comprehensive information to realize proteome-wide accurate profiling of PPI sites. Herein, a novel ensemble framework for PPI sites prediction, EnsemPPIS, was therefore proposed based on transformer and gated convolutional networks. EnsemPPIS can effectively capture not only global and local patterns but also residue interactions. Specifically, EnsemPPIS was unique in (a) extracting residue interactions from protein sequences with transformer and (b) further integrating global and local sequential features with the ensemble learning strategy. Compared with various existing methods, EnsemPPIS exhibited either superior performance or broader applicability on multiple PPI sites prediction tasks. Moreover, pattern analysis based on the interpretability of EnsemPPIS demonstrated that EnsemPPIS was fully capable of learning residue interactions within the local structure of PPI sites using only sequence information. The web server of EnsemPPIS is freely available at http://idrblab.org/ensemppis.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Research
Research Multidisciplinary-Multidisciplinary
CiteScore
13.40
自引率
3.60%
发文量
0
审稿时长
14 weeks
期刊介绍: Research serves as a global platform for academic exchange, collaboration, and technological advancements. This journal welcomes high-quality research contributions from any domain, with open arms to authors from around the globe. Comprising fundamental research in the life and physical sciences, Research also highlights significant findings and issues in engineering and applied science. The journal proudly features original research articles, reviews, perspectives, and editorials, fostering a diverse and dynamic scholarly environment.
期刊最新文献
A Cellulose Ionogel with Rubber-Like Stretchability for Low-Grade Heat Harvesting. Single Phototrophic Bacterium-Mediated Iron Cycling in Aquatic Environments. The Inhibition of Interfacial Ice Formation and Stress Accumulation with Zwitterionic Betaine and Trehalose for High-Efficiency Skin Cryopreservation. Unveiling the Power of Gut Microbiome in Predicting Neoadjuvant Immunochemotherapy Responses in Esophageal Squamous Cell Carcinoma. A Flexible, Large-Scale Sensing Array with Low-Power In-Sensor Intelligence.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1