Denoising Protein-Protein interaction network via variational graph auto-encoder for protein complex detection.

IF 0.7 4区 生物学 Q4 MATHEMATICAL & COMPUTATIONAL BIOLOGY Journal of Bioinformatics and Computational Biology Pub Date : 2020-06-01 DOI:10.1142/S0219720020400107
Heng Yao, Jihong Guan, Tianying Liu
{"title":"Denoising Protein-Protein interaction network via variational graph auto-encoder for protein complex detection.","authors":"Heng Yao,&nbsp;Jihong Guan,&nbsp;Tianying Liu","doi":"10.1142/S0219720020400107","DOIUrl":null,"url":null,"abstract":"<p><p>Identifying protein complexes is an important issue in computational biology, as it benefits the understanding of cellular functions and the design of drugs. In the past decades, many computational methods have been proposed by mining dense subgraphs in Protein-Protein Interaction Networks (PINs). However, the high rate of false positive/negative interactions in PINs prevents accurately detecting complexes directly from the raw PINs. In this paper, we propose a denoising approach for protein complex detection by using variational graph auto-encoder. First, we embed a PIN to vector space by a stacked graph convolutional network (GCN), then decide which interactions in the PIN are credible. If the probability of an interaction being credible is less than a threshold, we delete the interaction. In such a way, we reconstruct a reliable PIN. Following that, we detect protein complexes in the reconstructed PIN by using several typical detection methods, including CPM, Coach, DPClus, GraphEntropy, IPCA and MCODE, and compare the results with those obtained directly from the original PIN. We conduct the empirical evaluation on four yeast PPI datasets (Gavin, Krogan, DIP and Wiphi) and two human PPI datasets (Reactome and Reactomekb), against two yeast complex benchmarks (CYC2008 and MIPS) and three human complex benchmarks (REACT, REACT_uniprotkb and CORE_COMPLEX_human), respectively. Experimental results show that with the reconstructed PINs obtained by our denoising approach, complex detection performance can get obviously boosted, in most cases by over 5%, sometimes even by 200%. Furthermore, we compare our approach with two existing denoising methods (RWS and RedNemo) while varying different matching rates on separate complex distributions. Our results show that in most cases (over 2/3), the proposed approach outperforms the existing methods.</p>","PeriodicalId":48910,"journal":{"name":"Journal of Bioinformatics and Computational Biology","volume":"18 3","pages":"2040010"},"PeriodicalIF":0.7000,"publicationDate":"2020-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1142/S0219720020400107","citationCount":"12","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Bioinformatics and Computational Biology","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1142/S0219720020400107","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"MATHEMATICAL & COMPUTATIONAL BIOLOGY","Score":null,"Total":0}
引用次数: 12

Abstract

Identifying protein complexes is an important issue in computational biology, as it benefits the understanding of cellular functions and the design of drugs. In the past decades, many computational methods have been proposed by mining dense subgraphs in Protein-Protein Interaction Networks (PINs). However, the high rate of false positive/negative interactions in PINs prevents accurately detecting complexes directly from the raw PINs. In this paper, we propose a denoising approach for protein complex detection by using variational graph auto-encoder. First, we embed a PIN to vector space by a stacked graph convolutional network (GCN), then decide which interactions in the PIN are credible. If the probability of an interaction being credible is less than a threshold, we delete the interaction. In such a way, we reconstruct a reliable PIN. Following that, we detect protein complexes in the reconstructed PIN by using several typical detection methods, including CPM, Coach, DPClus, GraphEntropy, IPCA and MCODE, and compare the results with those obtained directly from the original PIN. We conduct the empirical evaluation on four yeast PPI datasets (Gavin, Krogan, DIP and Wiphi) and two human PPI datasets (Reactome and Reactomekb), against two yeast complex benchmarks (CYC2008 and MIPS) and three human complex benchmarks (REACT, REACT_uniprotkb and CORE_COMPLEX_human), respectively. Experimental results show that with the reconstructed PINs obtained by our denoising approach, complex detection performance can get obviously boosted, in most cases by over 5%, sometimes even by 200%. Furthermore, we compare our approach with two existing denoising methods (RWS and RedNemo) while varying different matching rates on separate complex distributions. Our results show that in most cases (over 2/3), the proposed approach outperforms the existing methods.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于变分图自编码器的蛋白质-蛋白质相互作用网络去噪。
识别蛋白质复合物是计算生物学中的一个重要问题,因为它有利于理解细胞功能和药物设计。在过去的几十年里,通过挖掘蛋白质-蛋白质相互作用网络(PINs)中的密集子图,提出了许多计算方法。然而,pin中假阳性/负相互作用的高比率阻碍了直接从原始pin中准确检测复合物。本文提出了一种基于变分图自编码器的蛋白质复合体检测去噪方法。首先,我们通过堆叠图卷积网络(GCN)将PIN嵌入到向量空间,然后确定PIN中哪些交互是可信的。如果交互可信的概率小于阈值,我们删除交互。通过这种方式,我们重建了一个可靠的PIN。随后,我们采用CPM、Coach、DPClus、GraphEntropy、IPCA和MCODE等几种典型的检测方法对重构PIN中的蛋白复合物进行检测,并与直接从原始PIN中获得的结果进行比较。我们对4个酵母PPI数据集(Gavin, Krogan, DIP和Wiphi)和2个人类PPI数据集(reacome和Reactomekb)分别针对2个酵母复合物基准(CYC2008和MIPS)和3个人类复合物基准(REACT, REACT_uniprotkb和CORE_COMPLEX_human)进行了实证评估。实验结果表明,用我们的去噪方法得到的重构pin可以明显提高复杂检测的性能,大多数情况下可以提高5%以上,有时甚至可以提高200%。此外,我们将我们的方法与两种现有的去噪方法(RWS和RedNemo)进行比较,同时在不同的复杂分布上改变不同的匹配率。我们的结果表明,在大多数情况下(超过2/3),所提出的方法优于现有的方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Journal of Bioinformatics and Computational Biology
Journal of Bioinformatics and Computational Biology MATHEMATICAL & COMPUTATIONAL BIOLOGY-
CiteScore
2.10
自引率
0.00%
发文量
57
期刊介绍: The Journal of Bioinformatics and Computational Biology aims to publish high quality, original research articles, expository tutorial papers and review papers as well as short, critical comments on technical issues associated with the analysis of cellular information. The research papers will be technical presentations of new assertions, discoveries and tools, intended for a narrower specialist community. The tutorials, reviews and critical commentary will be targeted at a broader readership of biologists who are interested in using computers but are not knowledgeable about scientific computing, and equally, computer scientists who have an interest in biology but are not familiar with current thrusts nor the language of biology. Such carefully chosen tutorials and articles should greatly accelerate the rate of entry of these new creative scientists into the field.
期刊最新文献
Predicting ncRNA-Protein interactions with a graph attention model exploiting personalized subgraphs. Early lifespan prediction in Caenorhabditis elegans via contrastive learning and channel attention. Study of the mechanism of step-by-step interaction of viral proteins during replication and transcription. Mendelian randomization and AlphaFold3 analysis suggest putative causal plasma proteins in graves' disease. PLMABFW: A deep learning framework for predicting Antibody-Antigen interactions using protein language model.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1