非靶向代谢组学处理步骤的模块化比较

IF 5.7 2区 化学 Q1 CHEMISTRY, ANALYTICAL Analytica Chimica Acta Pub Date : 2024-11-27 DOI:10.1016/j.aca.2024.343491
Markus Aigensberger, Christoph Bueschl, Ezequias Castillo-Lopez, Sara Ricci, Raul Rivera-Chacon, Qendrim Zebeli, Franz Berthiller, Heidi E. Schwartz-Zimmermann
{"title":"非靶向代谢组学处理步骤的模块化比较","authors":"Markus Aigensberger, Christoph Bueschl, Ezequias Castillo-Lopez, Sara Ricci, Raul Rivera-Chacon, Qendrim Zebeli, Franz Berthiller, Heidi E. Schwartz-Zimmermann","doi":"10.1016/j.aca.2024.343491","DOIUrl":null,"url":null,"abstract":"<h3>Background</h3>Untargeted metabolomics requires robust and reliable strategies for data processing to extract relevant information form the underlying raw data. Multiple platforms for data processing are available, but the choice of software tool can have an impact on the analysis. This study provides a comprehensive evaluation of four workflows based on commonly used metabolomics software tools: XCMS, Compound Discoverer, MS-DIAL, and MZmine. These tools were applied to a dataset derived from bovine saliva samples spiked with small polar molecules analysed by anion exchange chromatography coupled to high resolution mass spectrometry.<h3>Results</h3>The analysis revealed significant differences in the number and overlap of detected features, with only approximately 8% of the features included in all four peak tables. Among the overlapping features, MS-DIAL demonstrated the greatest similarity to manual integration, while XCMS and MZmine also performed well. In contrast, Compound Discoverer had issues to reliably integrate high baseline peaks. This study also explores various post-processing strategies, including missing value imputation, transformation, scaling, and filtering. The assessment of missing values indicated that they primarily originated from low abundance, making imputation with small values the most effective approach. No clear evidence suggested that transformation is necessary for downstream statistical analyses. Auto-scaling emerged as the most suitable strategy for data scaling. Low thresholds for blank filtering were found to be the most effective in enhancing data quality. The optimization of filtering thresholds required a careful balance to remove unnecessary information while retaining vital data.<h3>Significance and Novelty</h3>This work provides an overview of commonly applied strategies in untargeted metabolomics analysis, emphasizing the importance of careful workflow selection and optimization. It serves as a resource for refining data processing strategies to achieve accurate and reliable results, while also offering fresh insights into the challenges encountered throughout the untargeted metabolomics processing pipeline.","PeriodicalId":240,"journal":{"name":"Analytica Chimica Acta","volume":"79 1","pages":""},"PeriodicalIF":5.7000,"publicationDate":"2024-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Modular comparison of untargeted metabolomics processing steps\",\"authors\":\"Markus Aigensberger, Christoph Bueschl, Ezequias Castillo-Lopez, Sara Ricci, Raul Rivera-Chacon, Qendrim Zebeli, Franz Berthiller, Heidi E. Schwartz-Zimmermann\",\"doi\":\"10.1016/j.aca.2024.343491\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<h3>Background</h3>Untargeted metabolomics requires robust and reliable strategies for data processing to extract relevant information form the underlying raw data. Multiple platforms for data processing are available, but the choice of software tool can have an impact on the analysis. This study provides a comprehensive evaluation of four workflows based on commonly used metabolomics software tools: XCMS, Compound Discoverer, MS-DIAL, and MZmine. These tools were applied to a dataset derived from bovine saliva samples spiked with small polar molecules analysed by anion exchange chromatography coupled to high resolution mass spectrometry.<h3>Results</h3>The analysis revealed significant differences in the number and overlap of detected features, with only approximately 8% of the features included in all four peak tables. Among the overlapping features, MS-DIAL demonstrated the greatest similarity to manual integration, while XCMS and MZmine also performed well. In contrast, Compound Discoverer had issues to reliably integrate high baseline peaks. This study also explores various post-processing strategies, including missing value imputation, transformation, scaling, and filtering. The assessment of missing values indicated that they primarily originated from low abundance, making imputation with small values the most effective approach. No clear evidence suggested that transformation is necessary for downstream statistical analyses. Auto-scaling emerged as the most suitable strategy for data scaling. Low thresholds for blank filtering were found to be the most effective in enhancing data quality. The optimization of filtering thresholds required a careful balance to remove unnecessary information while retaining vital data.<h3>Significance and Novelty</h3>This work provides an overview of commonly applied strategies in untargeted metabolomics analysis, emphasizing the importance of careful workflow selection and optimization. It serves as a resource for refining data processing strategies to achieve accurate and reliable results, while also offering fresh insights into the challenges encountered throughout the untargeted metabolomics processing pipeline.\",\"PeriodicalId\":240,\"journal\":{\"name\":\"Analytica Chimica Acta\",\"volume\":\"79 1\",\"pages\":\"\"},\"PeriodicalIF\":5.7000,\"publicationDate\":\"2024-11-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Analytica Chimica Acta\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://doi.org/10.1016/j.aca.2024.343491\",\"RegionNum\":2,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CHEMISTRY, ANALYTICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Analytica Chimica Acta","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1016/j.aca.2024.343491","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, ANALYTICAL","Score":null,"Total":0}
引用次数: 0

摘要

背景非靶向代谢组学需要稳健可靠的数据处理策略,以便从基础原始数据中提取相关信息。目前有多种数据处理平台,但软件工具的选择会对分析产生影响。本研究全面评估了基于常用代谢组学软件工具的四种工作流程:XCMS、Compound Discoverer、MS-DIAL 和 MZmine。结果分析表明,检测到的特征在数量和重叠方面存在显著差异,只有约 8%的特征包含在所有四个峰表中。在重叠特征中,MS-DIAL 与人工整合的相似度最高,而 XCMS 和 MZmine 的表现也不错。相比之下,Compound Discoverer 在可靠地整合高基线峰方面存在问题。本研究还探讨了各种后处理策略,包括缺失值估算、转换、缩放和过滤。对缺失值的评估表明,缺失值主要来自低丰度,因此使用小值估算是最有效的方法。没有明确的证据表明下游统计分析需要转换。自动缩放是最合适的数据缩放策略。空白过滤的低阈值对提高数据质量最为有效。过滤阈值的优化需要仔细权衡,既要去除不必要的信息,又要保留重要的数据。 这项工作概述了非靶向代谢组学分析中常用的策略,强调了仔细选择和优化工作流程的重要性。它为完善数据处理策略以获得准确可靠的结果提供了资源,同时也为整个非靶向代谢组学处理流程中遇到的挑战提供了新的见解。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

摘要图片

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Modular comparison of untargeted metabolomics processing steps

Background

Untargeted metabolomics requires robust and reliable strategies for data processing to extract relevant information form the underlying raw data. Multiple platforms for data processing are available, but the choice of software tool can have an impact on the analysis. This study provides a comprehensive evaluation of four workflows based on commonly used metabolomics software tools: XCMS, Compound Discoverer, MS-DIAL, and MZmine. These tools were applied to a dataset derived from bovine saliva samples spiked with small polar molecules analysed by anion exchange chromatography coupled to high resolution mass spectrometry.

Results

The analysis revealed significant differences in the number and overlap of detected features, with only approximately 8% of the features included in all four peak tables. Among the overlapping features, MS-DIAL demonstrated the greatest similarity to manual integration, while XCMS and MZmine also performed well. In contrast, Compound Discoverer had issues to reliably integrate high baseline peaks. This study also explores various post-processing strategies, including missing value imputation, transformation, scaling, and filtering. The assessment of missing values indicated that they primarily originated from low abundance, making imputation with small values the most effective approach. No clear evidence suggested that transformation is necessary for downstream statistical analyses. Auto-scaling emerged as the most suitable strategy for data scaling. Low thresholds for blank filtering were found to be the most effective in enhancing data quality. The optimization of filtering thresholds required a careful balance to remove unnecessary information while retaining vital data.

Significance and Novelty

This work provides an overview of commonly applied strategies in untargeted metabolomics analysis, emphasizing the importance of careful workflow selection and optimization. It serves as a resource for refining data processing strategies to achieve accurate and reliable results, while also offering fresh insights into the challenges encountered throughout the untargeted metabolomics processing pipeline.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Analytica Chimica Acta
Analytica Chimica Acta 化学-分析化学
CiteScore
10.40
自引率
6.50%
发文量
1081
审稿时长
38 days
期刊介绍: Analytica Chimica Acta has an open access mirror journal Analytica Chimica Acta: X, sharing the same aims and scope, editorial team, submission system and rigorous peer review. Analytica Chimica Acta provides a forum for the rapid publication of original research, and critical, comprehensive reviews dealing with all aspects of fundamental and applied modern analytical chemistry. The journal welcomes the submission of research papers which report studies concerning the development of new and significant analytical methodologies. In determining the suitability of submitted articles for publication, particular scrutiny will be placed on the degree of novelty and impact of the research and the extent to which it adds to the existing body of knowledge in analytical chemistry.
期刊最新文献
Flexible disk ultramicroelectrode: facile preparation and high-resolution scanning electrochemical microscopy imaging Quantification of five antineoplastic agents in swab samples using UPLC-ESI-MS/MS: method development and validation Modular comparison of untargeted metabolomics processing steps Deep eutectic solution elution assisted ligand affinity assay: a useful tool for the active coumarins screening from Fructus cnidii Serially Coupled Columns Enhance Rapid Separation and Predictive Interaction Understanding of 93 Fentanyl Analogs
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1