MARLOWE: An Untargeted Proteomics, Statistical Approach to Taxonomic Classification for Forensics.

IF 3.6 2区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS Journal of Proteome Research Pub Date : 2025-03-07 Epub Date: 2025-02-03 DOI:10.1021/acs.jproteome.3c00477
Fanny Chu, Sarah C Jenson, Anthony S Barente, Natalie C Heller, Eric D Merkley, Kristin H Jarman
{"title":"MARLOWE: An Untargeted Proteomics, Statistical Approach to Taxonomic Classification for Forensics.","authors":"Fanny Chu, Sarah C Jenson, Anthony S Barente, Natalie C Heller, Eric D Merkley, Kristin H Jarman","doi":"10.1021/acs.jproteome.3c00477","DOIUrl":null,"url":null,"abstract":"<p><p>General proteomics research for fundamental science typically addresses laboratory- or patient-derived samples of known origin and composition. However, in a few research areas, such as environmental proteomics, clinical identification of infectious organisms, archeology, art/cultural history, and forensics, attributing the origin of a protein-containing sample to the organisms that produced it is a central focus. A small number of groups have approached this problem and developed software tools for taxonomic characterization and/or identification using bottom-up proteomics. Most such tools identify peptides via database search, and many rely on organism-specific peptides as markers. Our group recently introduced MARLOWE, a software tool for taxonomic characterization of unknown samples based on <i>de novo</i> peptide identification and signal-erosion-resistant strong peptides, which are shared peptides distributed in a taxonomy-dependent manner. In the current work, we further characterize the utility of MARLOWE using publicly available proteomics data from forensically-relevant samples. MARLOWE characterizes samples based on their protein profile, and returns ranked organism lists of potential contributors and taxonomic scores based on shared strong peptides between organisms. Overall, the correct characterization rate ranges between 44 and 100%, depending on the sample type and data acquisition parameters (with lower numbers associated with lower-quality data sets). MARLOWE demonstrates successful characterization of true contributors and close relatives, and provides sufficient specificity to distinguish certain microbial species. MARLOWE demonstrates its ability to provide insight into potential taxonomic sources for a wide range of sample types without prior assumptions about sample contents. This approach can find utility in forensic science and also broadly in bioanalytical applications that utilize proteomics approaches for taxonomic characterization.</p>","PeriodicalId":48,"journal":{"name":"Journal of Proteome Research","volume":" ","pages":"995-1007"},"PeriodicalIF":3.6000,"publicationDate":"2025-03-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Proteome Research","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1021/acs.jproteome.3c00477","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/2/3 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0

Abstract

General proteomics research for fundamental science typically addresses laboratory- or patient-derived samples of known origin and composition. However, in a few research areas, such as environmental proteomics, clinical identification of infectious organisms, archeology, art/cultural history, and forensics, attributing the origin of a protein-containing sample to the organisms that produced it is a central focus. A small number of groups have approached this problem and developed software tools for taxonomic characterization and/or identification using bottom-up proteomics. Most such tools identify peptides via database search, and many rely on organism-specific peptides as markers. Our group recently introduced MARLOWE, a software tool for taxonomic characterization of unknown samples based on de novo peptide identification and signal-erosion-resistant strong peptides, which are shared peptides distributed in a taxonomy-dependent manner. In the current work, we further characterize the utility of MARLOWE using publicly available proteomics data from forensically-relevant samples. MARLOWE characterizes samples based on their protein profile, and returns ranked organism lists of potential contributors and taxonomic scores based on shared strong peptides between organisms. Overall, the correct characterization rate ranges between 44 and 100%, depending on the sample type and data acquisition parameters (with lower numbers associated with lower-quality data sets). MARLOWE demonstrates successful characterization of true contributors and close relatives, and provides sufficient specificity to distinguish certain microbial species. MARLOWE demonstrates its ability to provide insight into potential taxonomic sources for a wide range of sample types without prior assumptions about sample contents. This approach can find utility in forensic science and also broadly in bioanalytical applications that utilize proteomics approaches for taxonomic characterization.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
一种非靶向蛋白质组学,用于法医学分类分类的统计方法。
基础科学的一般蛋白质组学研究通常涉及已知来源和组成的实验室或患者来源的样品。然而,在一些研究领域,如环境蛋白质组学、感染性生物的临床鉴定、考古学、艺术/文化史和法医学,将含有蛋白质的样品的起源归因于产生它的生物是一个中心焦点。少数小组已经解决了这个问题,并开发了使用自下而上的蛋白质组学进行分类表征和/或鉴定的软件工具。大多数此类工具通过数据库搜索来识别肽,并且许多工具依赖于生物体特异性肽作为标记。我们的团队最近推出了MARLOWE,这是一个基于从头肽鉴定和抗信号侵蚀强肽的未知样品分类特征的软件工具,这些强肽是以分类依赖的方式分布的共享肽。在目前的工作中,我们进一步描述了MARLOWE的效用,使用公开可用的来自法医相关样本的蛋白质组学数据。MARLOWE根据样品的蛋白质特征特征,并根据生物之间共享的强肽返回潜在贡献者的生物体排名列表和分类分数。总体而言,正确的表征率范围在44%到100%之间,具体取决于样本类型和数据采集参数(较低的数字与较低质量的数据集相关)。MARLOWE展示了对真正贡献者和近亲的成功表征,并提供了足够的特异性来区分某些微生物物种。MARLOWE展示了它的能力,提供洞察到潜在的分类来源的广泛的样本类型,而没有事先假设样本内容。这种方法可以在法医科学中找到效用,也广泛应用于利用蛋白质组学方法进行分类表征的生物分析应用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Journal of Proteome Research
Journal of Proteome Research 生物-生化研究方法
CiteScore
9.00
自引率
4.50%
发文量
251
审稿时长
3 months
期刊介绍: Journal of Proteome Research publishes content encompassing all aspects of global protein analysis and function, including the dynamic aspects of genomics, spatio-temporal proteomics, metabonomics and metabolomics, clinical and agricultural proteomics, as well as advances in methodology including bioinformatics. The theme and emphasis is on a multidisciplinary approach to the life sciences through the synergy between the different types of "omics".
期刊最新文献
Proteomics Analysis Revealed the Damage Mechanism of Hexavalent Uranium on BEAS-2B Cells through Oxidative Stress. Identifying Diabetic Cardiomyopathy Biomarkers via Proteomic and Glycation Modification Analysis Using DIA and PRM. MaSpeQC: An easy-to-use Interactive Pipeline to Assess the Performance of LC-MS/MS Instrumentation. Dissection of Vitronectin-Regulated Secretome in Hepatocellular Carcinoma. Improved Open Modification Searching via Unified Spectral Search with Predicted Libraries and Enhanced Vector Representations in ANN-SoLo.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1