Nansheng Chen, Xiaolong Zhang, Haomin Gan, Jing Hu
{"title":"Identification of protein hot regions by combing structure-based classification, energy-based clustering and sequence-based conservation in evolution","authors":"Nansheng Chen, Xiaolong Zhang, Haomin Gan, Jing Hu","doi":"10.1504/ijdmb.2020.10031424","DOIUrl":null,"url":null,"abstract":"Revealing the protein hot regions is the key point for understanding the protein-protein interaction, while due to the long period and labour-consuming of experimental methods, it is very helpful to use computational method to improve the efficiency to predict hot regions. In previous methods, some methods are based on a single side, such as structure, energy, and sequence, every side has its limitations. In this paper, we proposed a new method that combines structure-based classification, energy-based clustering and sequence-based conservation. This method makes full use of three sides of protein features and minimise the limitations of using one single side. Experimental results show that the proposed method increases the prediction accuracy of protein hot regions.","PeriodicalId":54964,"journal":{"name":"International Journal of Data Mining and Bioinformatics","volume":"24 1","pages":"74-95"},"PeriodicalIF":0.2000,"publicationDate":"2020-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Data Mining and Bioinformatics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1504/ijdmb.2020.10031424","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"MATHEMATICAL & COMPUTATIONAL BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Revealing the protein hot regions is the key point for understanding the protein-protein interaction, while due to the long period and labour-consuming of experimental methods, it is very helpful to use computational method to improve the efficiency to predict hot regions. In previous methods, some methods are based on a single side, such as structure, energy, and sequence, every side has its limitations. In this paper, we proposed a new method that combines structure-based classification, energy-based clustering and sequence-based conservation. This method makes full use of three sides of protein features and minimise the limitations of using one single side. Experimental results show that the proposed method increases the prediction accuracy of protein hot regions.
期刊介绍:
Mining bioinformatics data is an emerging area at the intersection between bioinformatics and data mining. The objective of IJDMB is to facilitate collaboration between data mining researchers and bioinformaticians by presenting cutting edge research topics and methodologies in the area of data mining for bioinformatics. This perspective acknowledges the inter-disciplinary nature of research in data mining and bioinformatics and provides a unified forum for researchers/practitioners/students/policy makers to share the latest research and developments in this fast growing multi-disciplinary research area.