Irina Fedulova, Zheng Ouyang, Charles R. Buck, Xiang Zhang
{"title":"PepTiger:从从头序列中识别容错蛋白的搜索引擎","authors":"Irina Fedulova, Zheng Ouyang, Charles R. Buck, Xiang Zhang","doi":"10.2174/1874383800701010001","DOIUrl":null,"url":null,"abstract":"In recent years a number of de novo sequencing software products became available providing possible partial or complete amino acid sequence tags for MS/MS spectra of peptides. However, for a variety of reasons including spectral chemical noise and imperfect fragmentation these sequence tags almost always contain errors. Additional difficulties arise from actual protein sequence variation and post-translational modifications. We present a search engine named PepTiger which is capable of correctly matching de novo sequence tags with errors to protein sequences in a protein database. The algorithm is based on approximate string matching followed by a novel scoring procedure which takes into account mass differences and the string distance between de novo sequence and matched peptides and similarities between theoretical and experimental MS/MS spectra. Comparison of PepTiger with other protein identification software shows that PepTiger is better able to assign de novo sequence tags with errors to the correct peptide sequences.","PeriodicalId":88758,"journal":{"name":"The open spectroscopy journal","volume":"1 1","pages":"1-8"},"PeriodicalIF":0.0000,"publicationDate":"2007-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"PepTiger: Search Engine for Error-Tolerant Protein Identification from de Novo Sequences\",\"authors\":\"Irina Fedulova, Zheng Ouyang, Charles R. Buck, Xiang Zhang\",\"doi\":\"10.2174/1874383800701010001\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In recent years a number of de novo sequencing software products became available providing possible partial or complete amino acid sequence tags for MS/MS spectra of peptides. However, for a variety of reasons including spectral chemical noise and imperfect fragmentation these sequence tags almost always contain errors. Additional difficulties arise from actual protein sequence variation and post-translational modifications. We present a search engine named PepTiger which is capable of correctly matching de novo sequence tags with errors to protein sequences in a protein database. The algorithm is based on approximate string matching followed by a novel scoring procedure which takes into account mass differences and the string distance between de novo sequence and matched peptides and similarities between theoretical and experimental MS/MS spectra. Comparison of PepTiger with other protein identification software shows that PepTiger is better able to assign de novo sequence tags with errors to the correct peptide sequences.\",\"PeriodicalId\":88758,\"journal\":{\"name\":\"The open spectroscopy journal\",\"volume\":\"1 1\",\"pages\":\"1-8\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2007-09-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"The open spectroscopy journal\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2174/1874383800701010001\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"The open spectroscopy journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2174/1874383800701010001","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
PepTiger: Search Engine for Error-Tolerant Protein Identification from de Novo Sequences
In recent years a number of de novo sequencing software products became available providing possible partial or complete amino acid sequence tags for MS/MS spectra of peptides. However, for a variety of reasons including spectral chemical noise and imperfect fragmentation these sequence tags almost always contain errors. Additional difficulties arise from actual protein sequence variation and post-translational modifications. We present a search engine named PepTiger which is capable of correctly matching de novo sequence tags with errors to protein sequences in a protein database. The algorithm is based on approximate string matching followed by a novel scoring procedure which takes into account mass differences and the string distance between de novo sequence and matched peptides and similarities between theoretical and experimental MS/MS spectra. Comparison of PepTiger with other protein identification software shows that PepTiger is better able to assign de novo sequence tags with errors to the correct peptide sequences.