Toward Automated Preprocessing of Untargeted LC-MS-Based Metabolomics Feature Lists from Human Biofluids

IF 6.7 1区 化学 Q1 CHEMISTRY, ANALYTICAL Analytical Chemistry Pub Date : 2025-01-06 DOI:10.1021/acs.analchem.4c03124
Amy Hughes, Pablo Vangeenderhuysen, Marilyn De Graeve, Beata Pomian, Tim S. Nawrot, Jeroen Raes, Simon J. S. Cameron, Lynn Vanhaecke
{"title":"Toward Automated Preprocessing of Untargeted LC-MS-Based Metabolomics Feature Lists from Human Biofluids","authors":"Amy Hughes, Pablo Vangeenderhuysen, Marilyn De Graeve, Beata Pomian, Tim S. Nawrot, Jeroen Raes, Simon J. S. Cameron, Lynn Vanhaecke","doi":"10.1021/acs.analchem.4c03124","DOIUrl":null,"url":null,"abstract":"Maximizing the extraction of true, high-quality, nonredundant features from biofluids analyzed via LC-MS systems is challenging. Here, the R packages IPO and AutoTuner were used to optimize XCMS parameter settings for the retrieval of metabolite or lipid features in both ionization modes from either faecal or urine samples from two cohorts (<i>n</i> = 621). The feature lists obtained were compared with those where the parameter values were selected manually. Three categories were used to compare feature lists: 1) feature quality through removing false positives, 2) tentative metabolite identification using the Human Metabolome Database (HMDB) and 3) feature utility such as analyzing the proportion of features within intensity threshold bins. Furthermore, a PCA-based approach to feature filtering using QC samples and variable loadings was also explored under this category. Overall, more features were observed after automated selection of parameter values for all data sets (1.3- to 3.7-fold), which propagated through comparative exercises. For example, a greater number of features (on average 51 vs 45%) had a coefficient of variation (CV) &lt; 30%. Additionally, there was a significant increase (7.6–10.4%) in the number of faecal metabolites that could be tentatively annotated, and more features were present in higher intensity threshold bins. Considering the overlap across all three categories, a greater number of features were also retained. Automated approaches that guide selection of optimal parameter values for preprocessing are important to decrease the time invested for this step, while taking advantage of the wealth of data that LC-MS systems provide.","PeriodicalId":27,"journal":{"name":"Analytical Chemistry","volume":"37 1","pages":""},"PeriodicalIF":6.7000,"publicationDate":"2025-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Analytical Chemistry","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1021/acs.analchem.4c03124","RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, ANALYTICAL","Score":null,"Total":0}
引用次数: 0

Abstract

Maximizing the extraction of true, high-quality, nonredundant features from biofluids analyzed via LC-MS systems is challenging. Here, the R packages IPO and AutoTuner were used to optimize XCMS parameter settings for the retrieval of metabolite or lipid features in both ionization modes from either faecal or urine samples from two cohorts (n = 621). The feature lists obtained were compared with those where the parameter values were selected manually. Three categories were used to compare feature lists: 1) feature quality through removing false positives, 2) tentative metabolite identification using the Human Metabolome Database (HMDB) and 3) feature utility such as analyzing the proportion of features within intensity threshold bins. Furthermore, a PCA-based approach to feature filtering using QC samples and variable loadings was also explored under this category. Overall, more features were observed after automated selection of parameter values for all data sets (1.3- to 3.7-fold), which propagated through comparative exercises. For example, a greater number of features (on average 51 vs 45%) had a coefficient of variation (CV) < 30%. Additionally, there was a significant increase (7.6–10.4%) in the number of faecal metabolites that could be tentatively annotated, and more features were present in higher intensity threshold bins. Considering the overlap across all three categories, a greater number of features were also retained. Automated approaches that guide selection of optimal parameter values for preprocessing are important to decrease the time invested for this step, while taking advantage of the wealth of data that LC-MS systems provide.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于lc - ms的人体生物体液代谢组学特征自动预处理研究
最大限度地从LC-MS系统分析的生物流体中提取真实、高质量、无冗余的特征是一项挑战。在这里,使用R包IPO和AutoTuner优化XCMS参数设置,以便在两种电离模式下从两个队列(n = 621)的粪便或尿液样本中检索代谢物或脂质特征。将得到的特征列表与手动选择参数值的特征列表进行比较。通过三个类别来比较特征列表:1)通过去除假阳性的特征质量,2)使用人类代谢组数据库(HMDB)进行初步代谢物鉴定,3)特征效用,如分析强度阈值箱内特征的比例。此外,在此类别下还探讨了基于pca的使用QC样本和可变负载的特征滤波方法。总体而言,在所有数据集的参数值自动选择后(1.3- 3.7倍),通过比较练习传播,观察到更多的特征。例如,更多的特征(平均51 vs 45%)具有变异系数(CV) <;30%。此外,可以初步注释的粪便代谢物数量显著增加(7.6-10.4%),并且在高强度阈值箱中存在更多特征。考虑到这三个类别之间的重叠,还保留了更多的特征。指导选择最佳预处理参数值的自动化方法对于减少这一步投入的时间很重要,同时利用LC-MS系统提供的丰富数据。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Analytical Chemistry
Analytical Chemistry 化学-分析化学
CiteScore
12.10
自引率
12.20%
发文量
1949
审稿时长
1.4 months
期刊介绍: Analytical Chemistry, a peer-reviewed research journal, focuses on disseminating new and original knowledge across all branches of analytical chemistry. Fundamental articles may explore general principles of chemical measurement science and need not directly address existing or potential analytical methodology. They can be entirely theoretical or report experimental results. Contributions may cover various phases of analytical operations, including sampling, bioanalysis, electrochemistry, mass spectrometry, microscale and nanoscale systems, environmental analysis, separations, spectroscopy, chemical reactions and selectivity, instrumentation, imaging, surface analysis, and data processing. Papers discussing known analytical methods should present a significant, original application of the method, a notable improvement, or results on an important analyte.
期刊最新文献
Graphene Field-Effect Transistors Loaded with DNA Walkers for Detection of miRNAs Uncovering Proton Transmembrane Dynamics in Single Bacteria with Array Enhanced Autocorrelation Spectroscopy Lysine-Engineered Seawater-Tolerant Nanozyme for Sustainable Inactivation of Drug-Resistant Bacteria in Marine Environments. Breath Aerosol Admittance Orthonormal to Sweat Conductivity as a Reliable 2-D Diagnostic Criterion for Cystic Fibrosis. Automated Platform for Mass-Spectrometry-Based Multiomics Analysis from Small Volumes of Biofluids
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1