metaExpertPro: A Computational Workflow for Metaproteomics Spectral Library Construction and Data-Independent Acquisition Mass Spectrometry Data Analysis.

IF 6.1 2区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS Molecular & Cellular Proteomics Pub Date : 2024-10-01 Epub Date: 2024-09-13 DOI:10.1016/j.mcpro.2024.100840
Yingying Sun, Ziyuan Xing, Shuang Liang, Zelei Miao, Lai-Bao Zhuo, Wenhao Jiang, Hui Zhao, Huanhuan Gao, Yuting Xie, Yan Zhou, Liang Yue, Xue Cai, Yu-Ming Chen, Ju-Sheng Zheng, Tiannan Guo
{"title":"metaExpertPro: A Computational Workflow for Metaproteomics Spectral Library Construction and Data-Independent Acquisition Mass Spectrometry Data Analysis.","authors":"Yingying Sun, Ziyuan Xing, Shuang Liang, Zelei Miao, Lai-Bao Zhuo, Wenhao Jiang, Hui Zhao, Huanhuan Gao, Yuting Xie, Yan Zhou, Liang Yue, Xue Cai, Yu-Ming Chen, Ju-Sheng Zheng, Tiannan Guo","doi":"10.1016/j.mcpro.2024.100840","DOIUrl":null,"url":null,"abstract":"<p><p>Analysis of large-scale data-independent acquisition mass spectrometry metaproteomics data remains a computational challenge. Here, we present a computational pipeline called metaExpertPro for metaproteomics data analysis. This pipeline encompasses spectral library generation using data-dependent acquisition MS, protein identification and quantification using data-independent acquisition mass spectrometry, functional and taxonomic annotation, as well as quantitative matrix generation for both microbiota and hosts. By integrating FragPipe and DIA-NN, metaExpertPro offers compatibility with both Orbitrap and timsTOF MS instruments. To evaluate the depth and accuracy of identification and quantification, we conducted extensive assessments using human fecal samples and benchmark tests. Performance tests conducted on human fecal samples indicated that metaExpertPro quantified an average of 45,000 peptides in a 60-min diaPASEF injection. Notably, metaExpertPro outperformed three existing software tools by characterizing a higher number of peptides and proteins. Importantly, metaExpertPro maintained a low factual false discovery rate of approximately 5% for protein groups across four benchmark tests. Applying a filter of five peptides per genus, metaExpertPro achieved relatively high accuracy (F-score = 0.67-0.90) in genus diversity and showed a high correlation (r<sub>Spearman</sub> = 0.73-0.82) between the measured and true genus relative abundance in benchmark tests. Additionally, the quantitative results at the protein, taxonomy, and function levels exhibited high reproducibility and consistency across the commonly adopted public human gut microbial protein databases IGC and UHGP. In a metaproteomic analysis of dyslipidemia patients, metaExpertPro revealed characteristic alterations in microbial functions and potential interactions between the microbiota and the host.</p>","PeriodicalId":18712,"journal":{"name":"Molecular & Cellular Proteomics","volume":" ","pages":"100840"},"PeriodicalIF":6.1000,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Molecular & Cellular Proteomics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1016/j.mcpro.2024.100840","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/9/13 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0

Abstract

Analysis of large-scale data-independent acquisition mass spectrometry metaproteomics data remains a computational challenge. Here, we present a computational pipeline called metaExpertPro for metaproteomics data analysis. This pipeline encompasses spectral library generation using data-dependent acquisition MS, protein identification and quantification using data-independent acquisition mass spectrometry, functional and taxonomic annotation, as well as quantitative matrix generation for both microbiota and hosts. By integrating FragPipe and DIA-NN, metaExpertPro offers compatibility with both Orbitrap and timsTOF MS instruments. To evaluate the depth and accuracy of identification and quantification, we conducted extensive assessments using human fecal samples and benchmark tests. Performance tests conducted on human fecal samples indicated that metaExpertPro quantified an average of 45,000 peptides in a 60-min diaPASEF injection. Notably, metaExpertPro outperformed three existing software tools by characterizing a higher number of peptides and proteins. Importantly, metaExpertPro maintained a low factual false discovery rate of approximately 5% for protein groups across four benchmark tests. Applying a filter of five peptides per genus, metaExpertPro achieved relatively high accuracy (F-score = 0.67-0.90) in genus diversity and showed a high correlation (rSpearman = 0.73-0.82) between the measured and true genus relative abundance in benchmark tests. Additionally, the quantitative results at the protein, taxonomy, and function levels exhibited high reproducibility and consistency across the commonly adopted public human gut microbial protein databases IGC and UHGP. In a metaproteomic analysis of dyslipidemia patients, metaExpertPro revealed characteristic alterations in microbial functions and potential interactions between the microbiota and the host.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
metaExpertPro:用于元蛋白质组学谱库构建和独立于数据采集的质谱数据分析的计算工作流程。
大规模数据独立采集质谱(DIA-MS)元蛋白质组学数据分析仍然是一项计算挑战。在这里,我们介绍了一种用于元蛋白质组学数据分析的计算管道,称为 metaExpertPro。该管道包括使用数据依赖性采集质谱(DDA-MS)生成谱库、使用 DIA-MS 鉴定和定量蛋白质、功能和分类注释,以及生成微生物群和宿主的定量矩阵。通过集成 FragPipe 和 DIA-NN,metaExpertPro 可与 Orbitrap 和 timsTOF MS 仪器兼容。为了评估鉴定和量化的深度和准确性,我们使用人类粪便样本和基准测试进行了广泛的评估。对人体粪便样本进行的性能测试表明,在 60 分钟的 diaPASEF 注射中,metaExpertPro 平均量化了 45,000 个肽段。值得注意的是,metaExpertPro 对更多肽段和蛋白质进行了表征,其性能优于现有的三种软件工具。重要的是,在四项基准测试中,metaExpertPro 对蛋白质组的实际错误发现率(FDR)保持在较低水平,约为 5%。通过对每个属的五条肽进行过滤,metaExpertPro 在属的多样性方面达到了相对较高的准确度(F-score = 0.67-0.90),并在基准测试中显示出测量结果与真实属相对丰度之间的高度相关性(rSpearman = 0.73-0.82)。此外,蛋白质、分类和功能层面的定量结果在普遍采用的公共人类肠道微生物蛋白质数据库 IGC 和 UHGP 中表现出高度的可重复性和一致性。在对血脂异常(DLP)患者的元蛋白组分析中,metaExpertPro 揭示了微生物功能的特征性改变以及微生物群与宿主之间潜在的相互作用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Molecular & Cellular Proteomics
Molecular & Cellular Proteomics 生物-生化研究方法
CiteScore
11.50
自引率
4.30%
发文量
131
审稿时长
84 days
期刊介绍: The mission of MCP is to foster the development and applications of proteomics in both basic and translational research. MCP will publish manuscripts that report significant new biological or clinical discoveries underpinned by proteomic observations across all kingdoms of life. Manuscripts must define the biological roles played by the proteins investigated or their mechanisms of action. The journal also emphasizes articles that describe innovative new computational methods and technological advancements that will enable future discoveries. Manuscripts describing such approaches do not have to include a solution to a biological problem, but must demonstrate that the technology works as described, is reproducible and is appropriate to uncover yet unknown protein/proteome function or properties using relevant model systems or publicly available data. Scope: -Fundamental studies in biology, including integrative "omics" studies, that provide mechanistic insights -Novel experimental and computational technologies -Proteogenomic data integration and analysis that enable greater understanding of physiology and disease processes -Pathway and network analyses of signaling that focus on the roles of post-translational modifications -Studies of proteome dynamics and quality controls, and their roles in disease -Studies of evolutionary processes effecting proteome dynamics, quality and regulation -Chemical proteomics, including mechanisms of drug action -Proteomics of the immune system and antigen presentation/recognition -Microbiome proteomics, host-microbe and host-pathogen interactions, and their roles in health and disease -Clinical and translational studies of human diseases -Metabolomics to understand functional connections between genes, proteins and phenotypes
期刊最新文献
Integrative Multi-PTM Proteomics Reveals Dynamic Global, Redox, Phosphorylation, and Acetylation Regulation in Cytokine-treated Pancreatic Beta Cells. Gradient-Elution Nanoflow Liquid Chromatography without a Binary Pump: Smoothed Step Gradients Enable Reproducible, Sensitive, and Low-Cost Separations for Single-Cell Proteomics. In-depth analysis of miRNA binding sites reveals the complex response of uterine epithelium to miR-26a-5p and miR-125b-5p during early pregnancy. Bridging the Gap from Proteomics Technology to Clinical Application: Highlights from the 68th Benzon Foundation Symposium. Knockdown proteomics reveals USP7 as a regulator of cell-cell adhesion in colorectal cancer via AJUBA.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1