基于多网络的人蛋白亚细胞定位鉴定

IF 0.5 4区 生物学 Q4 BIOCHEMICAL RESEARCH METHODS Current Proteomics Pub Date : 2022-05-31 DOI:10.2174/1570164619666220531113704
Rui Wang, Lei Chen
{"title":"基于多网络的人蛋白亚细胞定位鉴定","authors":"Rui Wang, Lei Chen","doi":"10.2174/1570164619666220531113704","DOIUrl":null,"url":null,"abstract":"\n\nProtein function is closely related to its location within the cell. Determination of protein subcellular location is helpful to uncover its functions. However, traditional biological experiments to determine the subcellular location are of high cost and low efficiency, which cannot meet today’s needs. In recent years, lots of computational models have been set up to identify protein subcellular locations. Most models used features derived from protein sequences. Recently, features extracted from protein-protein interaction (PPI) network become popular to study various protein-related problems.\n\n\n\nA novel model with features derived from multiple PPI networks was proposed to predict protein subcellular location.\n\n\n\nProtein features were obtained by a new designed network embedding algorithm, Mnode2vec, which was a generalized version of the classic Node2vec algorithm. Two classic classification algorithms: support vector machine and random forest, were employed to build the model.\n\n\n\nSuch model provided good performance and was superior to the model with features extracted by Node2vec. Also, this model outperformed some classic models. Furthermore, Mnode2vec can produce powerful features when the path length was small.\n\n\n\nThe proposed model can be a powerful tool to determine protein subcellular location and Mnode2vec can efficiently extract informative features from multiple networks.\n","PeriodicalId":50601,"journal":{"name":"Current Proteomics","volume":"38 1","pages":""},"PeriodicalIF":0.5000,"publicationDate":"2022-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":"{\"title\":\"Identification of human protein subcellular location with multiple networks\",\"authors\":\"Rui Wang, Lei Chen\",\"doi\":\"10.2174/1570164619666220531113704\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"\\n\\nProtein function is closely related to its location within the cell. Determination of protein subcellular location is helpful to uncover its functions. However, traditional biological experiments to determine the subcellular location are of high cost and low efficiency, which cannot meet today’s needs. In recent years, lots of computational models have been set up to identify protein subcellular locations. Most models used features derived from protein sequences. Recently, features extracted from protein-protein interaction (PPI) network become popular to study various protein-related problems.\\n\\n\\n\\nA novel model with features derived from multiple PPI networks was proposed to predict protein subcellular location.\\n\\n\\n\\nProtein features were obtained by a new designed network embedding algorithm, Mnode2vec, which was a generalized version of the classic Node2vec algorithm. Two classic classification algorithms: support vector machine and random forest, were employed to build the model.\\n\\n\\n\\nSuch model provided good performance and was superior to the model with features extracted by Node2vec. Also, this model outperformed some classic models. Furthermore, Mnode2vec can produce powerful features when the path length was small.\\n\\n\\n\\nThe proposed model can be a powerful tool to determine protein subcellular location and Mnode2vec can efficiently extract informative features from multiple networks.\\n\",\"PeriodicalId\":50601,\"journal\":{\"name\":\"Current Proteomics\",\"volume\":\"38 1\",\"pages\":\"\"},\"PeriodicalIF\":0.5000,\"publicationDate\":\"2022-05-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"10\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Current Proteomics\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.2174/1570164619666220531113704\",\"RegionNum\":4,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"BIOCHEMICAL RESEARCH METHODS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Current Proteomics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.2174/1570164619666220531113704","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 10

摘要

蛋白质的功能与其在细胞中的位置密切相关。确定蛋白质的亚细胞位置有助于揭示其功能。然而,传统的生物实验确定亚细胞位置成本高、效率低,已不能满足当今的需求。近年来,人们建立了许多计算模型来识别蛋白质亚细胞的位置。大多数模型使用来自蛋白质序列的特征。近年来,从蛋白质-蛋白质相互作用(PPI)网络中提取特征成为研究各种蛋白质相关问题的热点。提出了一种具有多个PPI网络特征的新模型来预测蛋白质亚细胞定位。蛋白质特征的获取采用了一种新的网络嵌入算法Mnode2vec,该算法是经典Node2vec算法的推广版本。采用支持向量机和随机森林两种经典分类算法建立模型。该模型具有良好的性能,优于Node2vec提取特征的模型。此外,这款机型的表现也优于一些经典机型。此外,当路径长度较小时,Mnode2vec可以产生强大的特征。该模型可以作为确定蛋白质亚细胞位置的有力工具,并且Mnode2vec可以有效地从多个网络中提取信息特征。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Identification of human protein subcellular location with multiple networks
Protein function is closely related to its location within the cell. Determination of protein subcellular location is helpful to uncover its functions. However, traditional biological experiments to determine the subcellular location are of high cost and low efficiency, which cannot meet today’s needs. In recent years, lots of computational models have been set up to identify protein subcellular locations. Most models used features derived from protein sequences. Recently, features extracted from protein-protein interaction (PPI) network become popular to study various protein-related problems. A novel model with features derived from multiple PPI networks was proposed to predict protein subcellular location. Protein features were obtained by a new designed network embedding algorithm, Mnode2vec, which was a generalized version of the classic Node2vec algorithm. Two classic classification algorithms: support vector machine and random forest, were employed to build the model. Such model provided good performance and was superior to the model with features extracted by Node2vec. Also, this model outperformed some classic models. Furthermore, Mnode2vec can produce powerful features when the path length was small. The proposed model can be a powerful tool to determine protein subcellular location and Mnode2vec can efficiently extract informative features from multiple networks.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Current Proteomics
Current Proteomics BIOCHEMICAL RESEARCH METHODS-BIOCHEMISTRY & MOLECULAR BIOLOGY
CiteScore
1.60
自引率
0.00%
发文量
25
审稿时长
>0 weeks
期刊介绍: Research in the emerging field of proteomics is growing at an extremely rapid rate. The principal aim of Current Proteomics is to publish well-timed in-depth/mini review articles in this fast-expanding area on topics relevant and significant to the development of proteomics. Current Proteomics is an essential journal for everyone involved in proteomics and related fields in both academia and industry. Current Proteomics publishes in-depth/mini review articles in all aspects of the fast-expanding field of proteomics. All areas of proteomics are covered together with the methodology, software, databases, technological advances and applications of proteomics, including functional proteomics. Diverse technologies covered include but are not limited to: Protein separation and characterization techniques 2-D gel electrophoresis and image analysis Techniques for protein expression profiling including mass spectrometry-based methods and algorithms for correlative database searching Determination of co-translational and post- translational modification of proteins Protein/peptide microarrays Biomolecular interaction analysis Analysis of protein complexes Yeast two-hybrid projects Protein-protein interaction (protein interactome) pathways and cell signaling networks Systems biology Proteome informatics (bioinformatics) Knowledge integration and management tools High-throughput protein structural studies (using mass spectrometry, nuclear magnetic resonance and X-ray crystallography) High-throughput computational methods for protein 3-D structure as well as function determination Robotics, nanotechnology, and microfluidics.
期刊最新文献
Exploring Phytochemical Compounds: A Computational Study for HIV-1 Reverse Transcriptase Inhibition Molecular Docking, Pharmacophore Mapping, and Virtual Screening of Novel Glucokinase Activators as Antidiabetic Agents Comprehensive Analysis of Tertiary Lymphoid Structures in Pancreatic Cancer: Molecular Characteristics and Prognostic Implications miR-124 in Neuroblastoma: Mechanistic Insights, Biomarker Potential, and Therapeutic Prospects The Relationship of Transposable Elements with Non-Coding RNAs in the Emergence of Human Proteins and Peptides
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1