{"title":"利用序列间一一依赖关系的轮廓隐马尔可夫模型(PHMM)的推广","authors":"Vahid Rezaei Tabar, H. Pezeshk","doi":"10.22059/PBS.2016.590014","DOIUrl":null,"url":null,"abstract":"The Profile Hidden Markov Model (PHMM) can be poor at capturing dependency between observations because of the statistical assumptions it makes. To overcome this limitation, the dependency between residues in a multiple sequence alignment (MSA) which is the representative of a PHMM can be combined with the PHMM. Based on the fact that sequences appearing in the final MSA are written based on their similarity; the one-by-one dependency between corresponding amino acids of two current sequences can be append to PHMM. This perspective makes it possible to consider a generalization of PHMM. For estimating the parameters of generalized PHMM (emission and transition probabilities), we introduce new forward and backward algorithms. The performance of generalized PHMM is discussed by applying it to the twenty protein families in Pfam database. Results show that the generalized PHMM significantly increases the accuracy of ordinary PHMM.","PeriodicalId":20726,"journal":{"name":"Progress in Biological Sciences","volume":"30 1","pages":"117-127"},"PeriodicalIF":0.0000,"publicationDate":"2017-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A generalization of Profile Hidden Markov Model (PHMM) using one-by-one dependency between sequences\",\"authors\":\"Vahid Rezaei Tabar, H. Pezeshk\",\"doi\":\"10.22059/PBS.2016.590014\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The Profile Hidden Markov Model (PHMM) can be poor at capturing dependency between observations because of the statistical assumptions it makes. To overcome this limitation, the dependency between residues in a multiple sequence alignment (MSA) which is the representative of a PHMM can be combined with the PHMM. Based on the fact that sequences appearing in the final MSA are written based on their similarity; the one-by-one dependency between corresponding amino acids of two current sequences can be append to PHMM. This perspective makes it possible to consider a generalization of PHMM. For estimating the parameters of generalized PHMM (emission and transition probabilities), we introduce new forward and backward algorithms. The performance of generalized PHMM is discussed by applying it to the twenty protein families in Pfam database. Results show that the generalized PHMM significantly increases the accuracy of ordinary PHMM.\",\"PeriodicalId\":20726,\"journal\":{\"name\":\"Progress in Biological Sciences\",\"volume\":\"30 1\",\"pages\":\"117-127\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-09-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Progress in Biological Sciences\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.22059/PBS.2016.590014\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Progress in Biological Sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.22059/PBS.2016.590014","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A generalization of Profile Hidden Markov Model (PHMM) using one-by-one dependency between sequences
The Profile Hidden Markov Model (PHMM) can be poor at capturing dependency between observations because of the statistical assumptions it makes. To overcome this limitation, the dependency between residues in a multiple sequence alignment (MSA) which is the representative of a PHMM can be combined with the PHMM. Based on the fact that sequences appearing in the final MSA are written based on their similarity; the one-by-one dependency between corresponding amino acids of two current sequences can be append to PHMM. This perspective makes it possible to consider a generalization of PHMM. For estimating the parameters of generalized PHMM (emission and transition probabilities), we introduce new forward and backward algorithms. The performance of generalized PHMM is discussed by applying it to the twenty protein families in Pfam database. Results show that the generalized PHMM significantly increases the accuracy of ordinary PHMM.