{"title":"基于多网络的人蛋白亚细胞定位鉴定","authors":"Rui Wang, Lei Chen","doi":"10.2174/1570164619666220531113704","DOIUrl":null,"url":null,"abstract":"\n\nProtein function is closely related to its location within the cell. Determination of protein subcellular location is helpful to uncover its functions. However, traditional biological experiments to determine the subcellular location are of high cost and low efficiency, which cannot meet today’s needs. In recent years, lots of computational models have been set up to identify protein subcellular locations. Most models used features derived from protein sequences. Recently, features extracted from protein-protein interaction (PPI) network become popular to study various protein-related problems.\n\n\n\nA novel model with features derived from multiple PPI networks was proposed to predict protein subcellular location.\n\n\n\nProtein features were obtained by a new designed network embedding algorithm, Mnode2vec, which was a generalized version of the classic Node2vec algorithm. Two classic classification algorithms: support vector machine and random forest, were employed to build the model.\n\n\n\nSuch model provided good performance and was superior to the model with features extracted by Node2vec. Also, this model outperformed some classic models. Furthermore, Mnode2vec can produce powerful features when the path length was small.\n\n\n\nThe proposed model can be a powerful tool to determine protein subcellular location and Mnode2vec can efficiently extract informative features from multiple networks.\n","PeriodicalId":50601,"journal":{"name":"Current Proteomics","volume":"38 1","pages":""},"PeriodicalIF":0.5000,"publicationDate":"2022-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":"{\"title\":\"Identification of human protein subcellular location with multiple networks\",\"authors\":\"Rui Wang, Lei Chen\",\"doi\":\"10.2174/1570164619666220531113704\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"\\n\\nProtein function is closely related to its location within the cell. Determination of protein subcellular location is helpful to uncover its functions. However, traditional biological experiments to determine the subcellular location are of high cost and low efficiency, which cannot meet today’s needs. In recent years, lots of computational models have been set up to identify protein subcellular locations. Most models used features derived from protein sequences. Recently, features extracted from protein-protein interaction (PPI) network become popular to study various protein-related problems.\\n\\n\\n\\nA novel model with features derived from multiple PPI networks was proposed to predict protein subcellular location.\\n\\n\\n\\nProtein features were obtained by a new designed network embedding algorithm, Mnode2vec, which was a generalized version of the classic Node2vec algorithm. Two classic classification algorithms: support vector machine and random forest, were employed to build the model.\\n\\n\\n\\nSuch model provided good performance and was superior to the model with features extracted by Node2vec. Also, this model outperformed some classic models. Furthermore, Mnode2vec can produce powerful features when the path length was small.\\n\\n\\n\\nThe proposed model can be a powerful tool to determine protein subcellular location and Mnode2vec can efficiently extract informative features from multiple networks.\\n\",\"PeriodicalId\":50601,\"journal\":{\"name\":\"Current Proteomics\",\"volume\":\"38 1\",\"pages\":\"\"},\"PeriodicalIF\":0.5000,\"publicationDate\":\"2022-05-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"10\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Current Proteomics\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.2174/1570164619666220531113704\",\"RegionNum\":4,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"BIOCHEMICAL RESEARCH METHODS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Current Proteomics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.2174/1570164619666220531113704","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
Identification of human protein subcellular location with multiple networks
Protein function is closely related to its location within the cell. Determination of protein subcellular location is helpful to uncover its functions. However, traditional biological experiments to determine the subcellular location are of high cost and low efficiency, which cannot meet today’s needs. In recent years, lots of computational models have been set up to identify protein subcellular locations. Most models used features derived from protein sequences. Recently, features extracted from protein-protein interaction (PPI) network become popular to study various protein-related problems.
A novel model with features derived from multiple PPI networks was proposed to predict protein subcellular location.
Protein features were obtained by a new designed network embedding algorithm, Mnode2vec, which was a generalized version of the classic Node2vec algorithm. Two classic classification algorithms: support vector machine and random forest, were employed to build the model.
Such model provided good performance and was superior to the model with features extracted by Node2vec. Also, this model outperformed some classic models. Furthermore, Mnode2vec can produce powerful features when the path length was small.
The proposed model can be a powerful tool to determine protein subcellular location and Mnode2vec can efficiently extract informative features from multiple networks.
Current ProteomicsBIOCHEMICAL RESEARCH METHODS-BIOCHEMISTRY & MOLECULAR BIOLOGY
CiteScore
1.60
自引率
0.00%
发文量
25
审稿时长
>0 weeks
期刊介绍:
Research in the emerging field of proteomics is growing at an extremely rapid rate. The principal aim of Current Proteomics is to publish well-timed in-depth/mini review articles in this fast-expanding area on topics relevant and significant to the development of proteomics. Current Proteomics is an essential journal for everyone involved in proteomics and related fields in both academia and industry.
Current Proteomics publishes in-depth/mini review articles in all aspects of the fast-expanding field of proteomics. All areas of proteomics are covered together with the methodology, software, databases, technological advances and applications of proteomics, including functional proteomics. Diverse technologies covered include but are not limited to:
Protein separation and characterization techniques
2-D gel electrophoresis and image analysis
Techniques for protein expression profiling including mass spectrometry-based methods and algorithms for correlative database searching
Determination of co-translational and post- translational modification of proteins
Protein/peptide microarrays
Biomolecular interaction analysis
Analysis of protein complexes
Yeast two-hybrid projects
Protein-protein interaction (protein interactome) pathways and cell signaling networks
Systems biology
Proteome informatics (bioinformatics)
Knowledge integration and management tools
High-throughput protein structural studies (using mass spectrometry, nuclear magnetic resonance and X-ray crystallography)
High-throughput computational methods for protein 3-D structure as well as function determination
Robotics, nanotechnology, and microfluidics.