A New Measurement for Evaluating Clusters in Protein Interaction Networks

Min Li, Xuehong Wu, Jianxin Wang, Yi Pan
DOI: 10.1109/BIBM.2011.47 (https://doi.org/10.1109/BIBM.2011.47)
Published in: 2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW), pp. 63-68
Publication date: 2011-11-12
Citations: 2

Abstract

Clustering protein-protein interaction networks is one of the most prevalent approaches to identifying protein complexes and functional modules, which is crucial for understanding the principles of cellular organization and for predicting protein functions. Many computational methods have been proposed in the past few years. However, evaluating how well the clusters are identified remains a challenging task. Even the most popular measurements, F-measure and P-value, are biased when used to evaluate the identified clusters. In this paper, we propose a new measurement, named hF-measure, to evaluate clusters more finely and distinctly. First, we define the hierarchical consistency and the hierarchical similarity. Then, we propose the hierarchical measurement hF-measure, which takes into account the hierarchical organization of functional annotations and the functional similarities among proteins. The hF-measure can discriminate between different types of errors that F-measure cannot distinguish. Experimental results based on Gene Ontology (GO) and yeast functional modules show that hF-measure evaluates clusters more accurately than F-measure.
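The abstract does not give the formula for hF-measure, so the sketch below is only an illustration of the underlying idea, not the authors' definition. It computes the standard F-measure for a predicted cluster against a reference module and compares it with a hypothetical hierarchy-aware variant (`soft_f_measure`) in which each protein in the cluster earns partial credit according to how close its annotation is to the module's function in a toy annotation hierarchy. The hierarchy, the protein names, the Jaccard-style `term_similarity`, and `soft_f_measure` itself are all assumptions made for this example.

```python
def f_measure(cluster, module):
    """Standard F-measure between a predicted cluster and a reference module."""
    overlap = len(cluster & module)
    if overlap == 0:
        return 0.0
    precision = overlap / len(cluster)
    recall = overlap / len(module)
    return 2 * precision * recall / (precision + recall)


def ancestors(term, parents):
    """Return the term together with all of its ancestors in the hierarchy."""
    seen, stack = set(), [term]
    while stack:
        current = stack.pop()
        if current not in seen:
            seen.add(current)
            stack.extend(parents.get(current, ()))
    return seen


def term_similarity(a, b, parents):
    """Jaccard overlap of ancestor sets (an assumed similarity, for illustration)."""
    anc_a, anc_b = ancestors(a, parents), ancestors(b, parents)
    return len(anc_a & anc_b) / len(anc_a | anc_b)


def soft_f_measure(cluster, module, module_term, annotation, parents):
    """Hypothetical hierarchy-aware F-measure: precision uses partial credit."""
    if not cluster or not module:
        return 0.0
    credit = sum(term_similarity(annotation[p], module_term, parents) for p in cluster)
    precision = credit / len(cluster)
    recall = len(cluster & module) / len(module)
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Toy hierarchy (child -> parents); a stand-in for a GO-like annotation tree.
parents = {"A": ["root"], "B": ["root"], "A1": ["A"], "A2": ["A"], "B1": ["B"]}
annotation = {"p1": "A1", "p2": "A1", "p3": "A1", "q_near": "A2", "q_far": "B1"}

module = {"p1", "p2", "p3"}          # reference module with function A1
cluster_x = {"p1", "p2", "q_near"}   # one wrong protein from a sibling function
cluster_y = {"p1", "p2", "q_far"}    # one wrong protein from a distant function

print(f_measure(cluster_x, module), f_measure(cluster_y, module))
print(soft_f_measure(cluster_x, module, "A1", annotation, parents),
      soft_f_measure(cluster_y, module, "A1", annotation, parents))
```

In this toy setup the two clusters receive the same standard F-measure, because each contains exactly one protein outside the module, while the hierarchy-aware score ranks the cluster whose wrong protein has a closely related annotation above the one whose wrong protein is functionally distant. That is the kind of distinction between error types that the abstract attributes to hF-measure.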