PepTiger:从从头序列中识别容错蛋白的搜索引擎

Irina Fedulova, Zheng Ouyang, Charles R. Buck, Xiang Zhang
{"title":"PepTiger:从从头序列中识别容错蛋白的搜索引擎","authors":"Irina Fedulova, Zheng Ouyang, Charles R. Buck, Xiang Zhang","doi":"10.2174/1874383800701010001","DOIUrl":null,"url":null,"abstract":"In recent years a number of de novo sequencing software products became available providing possible partial or complete amino acid sequence tags for MS/MS spectra of peptides. However, for a variety of reasons including spectral chemical noise and imperfect fragmentation these sequence tags almost always contain errors. Additional difficulties arise from actual protein sequence variation and post-translational modifications. We present a search engine named PepTiger which is capable of correctly matching de novo sequence tags with errors to protein sequences in a protein database. The algorithm is based on approximate string matching followed by a novel scoring procedure which takes into account mass differences and the string distance between de novo sequence and matched peptides and similarities between theoretical and experimental MS/MS spectra. Comparison of PepTiger with other protein identification software shows that PepTiger is better able to assign de novo sequence tags with errors to the correct peptide sequences.","PeriodicalId":88758,"journal":{"name":"The open spectroscopy journal","volume":"1 1","pages":"1-8"},"PeriodicalIF":0.0000,"publicationDate":"2007-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"PepTiger: Search Engine for Error-Tolerant Protein Identification from de Novo Sequences\",\"authors\":\"Irina Fedulova, Zheng Ouyang, Charles R. Buck, Xiang Zhang\",\"doi\":\"10.2174/1874383800701010001\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In recent years a number of de novo sequencing software products became available providing possible partial or complete amino acid sequence tags for MS/MS spectra of peptides. However, for a variety of reasons including spectral chemical noise and imperfect fragmentation these sequence tags almost always contain errors. Additional difficulties arise from actual protein sequence variation and post-translational modifications. We present a search engine named PepTiger which is capable of correctly matching de novo sequence tags with errors to protein sequences in a protein database. The algorithm is based on approximate string matching followed by a novel scoring procedure which takes into account mass differences and the string distance between de novo sequence and matched peptides and similarities between theoretical and experimental MS/MS spectra. Comparison of PepTiger with other protein identification software shows that PepTiger is better able to assign de novo sequence tags with errors to the correct peptide sequences.\",\"PeriodicalId\":88758,\"journal\":{\"name\":\"The open spectroscopy journal\",\"volume\":\"1 1\",\"pages\":\"1-8\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2007-09-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"The open spectroscopy journal\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2174/1874383800701010001\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"The open spectroscopy journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2174/1874383800701010001","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

摘要

近年来,许多从头开始的测序软件产品成为可用的,为肽的MS/MS光谱提供可能的部分或完整氨基酸序列标签。然而,由于各种原因,包括光谱化学噪声和不完善的碎片化,这些序列标签几乎总是包含错误。额外的困难来自于实际的蛋白质序列变异和翻译后修饰。我们提出了一个名为PepTiger的搜索引擎,它能够正确地将有错误的从头序列标签与蛋白质数据库中的蛋白质序列进行匹配。该算法基于近似字符串匹配,然后是一种新的评分程序,该程序考虑了质量差异和新生序列与匹配肽之间的字符串距离以及理论和实验MS/MS谱之间的相似性。PepTiger与其他蛋白质鉴定软件的比较表明,PepTiger能够更好地将有错误的从头序列标签分配到正确的肽序列。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
PepTiger: Search Engine for Error-Tolerant Protein Identification from de Novo Sequences
In recent years a number of de novo sequencing software products became available providing possible partial or complete amino acid sequence tags for MS/MS spectra of peptides. However, for a variety of reasons including spectral chemical noise and imperfect fragmentation these sequence tags almost always contain errors. Additional difficulties arise from actual protein sequence variation and post-translational modifications. We present a search engine named PepTiger which is capable of correctly matching de novo sequence tags with errors to protein sequences in a protein database. The algorithm is based on approximate string matching followed by a novel scoring procedure which takes into account mass differences and the string distance between de novo sequence and matched peptides and similarities between theoretical and experimental MS/MS spectra. Comparison of PepTiger with other protein identification software shows that PepTiger is better able to assign de novo sequence tags with errors to the correct peptide sequences.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Optimal Conditions for a Multimode Laser Diode with Delayed Optical Feedback in Terahertz Time-Domain Spectroscopy A Spectroscopy-Based Multi-Analytical Approach for Studies in Conservation: Decorations in the Alexander Palace (Tsarskoye Selo) Rotational Isomerism of the Side Chains of Hydroxypropyl Cellulose in Aqueous Solution Observed Using Attenuated Total Reflectance Infrared Spectroscopy Effect of Alkaline Salts on Pyrolyzed Solid Wastes in Used Edible Oils: An Attenuated Total Reflectance Analysis of Surface Compounds as a Function of the Temperature Narrow-Linewidth Pr:YLF Laser for High-Resolution Raman Trace Gas Spectroscopy
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1