Classification of Collagens via Peptide Ambiguation, in a Paleoproteomic LC-MS/MS-Based Taxonomic Pipeline.

IF 3.6 2区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS Journal of Proteome Research Pub Date : 2025-04-04 Epub Date: 2025-03-13 DOI:10.1021/acs.jproteome.4c00962
Ian Engels, Alexandra Burnett, Prudence Robert, Camille Pironneau, Grégory Abrams, Robbin Bouwmeester, Peter Van der Plaetsen, Kévin Di Modica, Marcel Otte, Lawrence Guy Straus, Valentin Fischer, Fabrice Bray, Bart Mesuere, Isabelle De Groote, Dieter Deforce, Simon Daled, Maarten Dhaenens
{"title":"Classification of Collagens via Peptide Ambiguation, in a Paleoproteomic LC-MS/MS-Based Taxonomic Pipeline.","authors":"Ian Engels, Alexandra Burnett, Prudence Robert, Camille Pironneau, Grégory Abrams, Robbin Bouwmeester, Peter Van der Plaetsen, Kévin Di Modica, Marcel Otte, Lawrence Guy Straus, Valentin Fischer, Fabrice Bray, Bart Mesuere, Isabelle De Groote, Dieter Deforce, Simon Daled, Maarten Dhaenens","doi":"10.1021/acs.jproteome.4c00962","DOIUrl":null,"url":null,"abstract":"<p><p>Liquid chromatography-mass spectrometry (LC-MS/MS) extends the matrix-assisted laser desorption ionization-time of flight (MALDI-TOF) Zooarcheology by Mass Spectrometry (ZooMS) \"mass fingerprinting\" approach to species identification by providing fragmentation spectra for each peptide. However, ancient bone samples generate sparse data containing only a few collagen proteins, rendering target-decoy strategies unusable and increasing uncertainty in peptide annotation. To ameliorate this issue, we present a ZooMS/MS data pipeline that builds on a manually curated Collagen database and comprises two novel algorithms: isoBLAST and ClassiCOL. isoBLAST first extends peptide ambiguity by generating all \"potential peptide candidates\" isobaric to the annotated precursor. The exhaustive set of candidates created is then used to retain or reject different potential paths at each taxonomic branching point from superkingdom to species, until the greatest possible specificity is reached. Uniquely, ClassiCOL allows for the identification of taxonomic mixtures, including contaminated samples, as well as suggesting taxonomies not represented in sequence databases, including extinct taxa. All considered ambiguity is then graphically represented with clear prioritization of the potential taxa in the sample. Using public as well as in-house data acquired on different instruments, we demonstrate the performance of this universal postprocessing and explore the identification of both genetic and sample mixtures. Diet reconstruction from 40,000-year-old cave hyena coprolites illustrates the exciting potential of this approach.</p>","PeriodicalId":48,"journal":{"name":"Journal of Proteome Research","volume":" ","pages":"1907-1925"},"PeriodicalIF":3.6000,"publicationDate":"2025-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11976868/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Proteome Research","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1021/acs.jproteome.4c00962","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/3/13 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0

Abstract

Liquid chromatography-mass spectrometry (LC-MS/MS) extends the matrix-assisted laser desorption ionization-time of flight (MALDI-TOF) Zooarcheology by Mass Spectrometry (ZooMS) "mass fingerprinting" approach to species identification by providing fragmentation spectra for each peptide. However, ancient bone samples generate sparse data containing only a few collagen proteins, rendering target-decoy strategies unusable and increasing uncertainty in peptide annotation. To ameliorate this issue, we present a ZooMS/MS data pipeline that builds on a manually curated Collagen database and comprises two novel algorithms: isoBLAST and ClassiCOL. isoBLAST first extends peptide ambiguity by generating all "potential peptide candidates" isobaric to the annotated precursor. The exhaustive set of candidates created is then used to retain or reject different potential paths at each taxonomic branching point from superkingdom to species, until the greatest possible specificity is reached. Uniquely, ClassiCOL allows for the identification of taxonomic mixtures, including contaminated samples, as well as suggesting taxonomies not represented in sequence databases, including extinct taxa. All considered ambiguity is then graphically represented with clear prioritization of the potential taxa in the sample. Using public as well as in-house data acquired on different instruments, we demonstrate the performance of this universal postprocessing and explore the identification of both genetic and sample mixtures. Diet reconstruction from 40,000-year-old cave hyena coprolites illustrates the exciting potential of this approach.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于肽歧义的胶原蛋白分类,基于LC-MS/ ms的古蛋白质组学分类管道。
液相色谱-质谱(LC-MS/MS)扩展基质辅助激光解吸电离飞行时间(MALDI-TOF)质谱(ZooMS)动物考古“质量指纹图谱”方法通过提供每个肽段的碎片化光谱进行物种鉴定。然而,古代骨样本产生的稀疏数据只包含少数胶原蛋白,使得目标诱饵策略无法使用,并增加了肽注释的不确定性。为了改善这个问题,我们提出了一个ZooMS/MS数据管道,该管道建立在人工管理的胶原蛋白数据库上,包括两种新颖的算法:isoBLAST和ClassiCOL。isoBLAST首先通过生成标注前体的所有“潜在候选肽”等压来扩展肽的模糊性。然后使用创建的详尽的候选集来保留或拒绝从超级王国到物种的每个分类分支点的不同潜在路径,直到达到最大可能的特异性。独特的是,ClassiCOL允许识别分类混合,包括污染样本,以及建议在序列数据库中未表示的分类,包括灭绝的分类群。所有考虑的模糊性,然后图形表示与明确优先级的潜在分类群在样本中。利用在不同仪器上获得的公共和内部数据,我们展示了这种通用后处理的性能,并探索了遗传和样品混合物的鉴定。从4万年前的洞穴鬣狗粪化石中重建饮食说明了这种方法令人兴奋的潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Journal of Proteome Research
Journal of Proteome Research 生物-生化研究方法
CiteScore
9.00
自引率
4.50%
发文量
251
审稿时长
3 months
期刊介绍: Journal of Proteome Research publishes content encompassing all aspects of global protein analysis and function, including the dynamic aspects of genomics, spatio-temporal proteomics, metabonomics and metabolomics, clinical and agricultural proteomics, as well as advances in methodology including bioinformatics. The theme and emphasis is on a multidisciplinary approach to the life sciences through the synergy between the different types of "omics".
期刊最新文献
Proteomic Profiling Reveals the Synergistic Effects of Astaxanthin and Melatonin on the Inhibition of Cryoinjuries in Ram Sperm. Offline Two-Stage SEC and LC-MS/MS for Comprehensive Characterization of Yeast Ribosomal Populations. 2-Mercaptoethanol/DMSO Workflow Enables Highly Reproducible Quantitative Proteomics. Multiomics Integrated Analysis of the Inhibitory Mechanism of Tannic Acid on MRSA Biofilms. Comparison of Common Plasma Proteomics Workflows Reveals Distinct Preanalytical Biases by Sample Preparation and Nanoparticle Enrichment.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1