{"title":"Data Processing of Product Ion Spectra: Redundancy of Product Ion Spectra of Small Molecules in Data-Dependent Acquisition Dataset","authors":"Fumio Matsuda","doi":"10.5702/massspectrometry.A0138","DOIUrl":null,"url":null,"abstract":"Non-targeted metabolome analysis studies comprehensively acquire product ion spectra from the observed ions by the data-dependent acquisition (DDA) mode of tandem mass spectrometry (MS). A DDA dataset redundantly contains closely similar product ion spectra of metabolites commonly existing among the biological samples analyzed in a metabolome study. Moreover, a single DDA data file often includes two or more closely similar raw spectra obtained from an identical precursor ion. The redundancy of product ion spectra has been used to generate an averaged product ion spectrum from a set of similar product ion spectra recorded in a DDA dataset. The spectral averaging improved the accuracy of m/z values and signal-to-noise levels of product ion spectra. However, the origins of redundancy, variations among datasets, and these effects on the spectral averaging procedure needed to be better characterized. This study investigated the nature of the redundancy by comparing the averaging results of eight DDA datasets of non-targeted metabolomics studies. The comparison revealed a significant variation in redundancy among datasets. The DDA datasets obtained by the quadrupole (Q)-Orbitrap-MS datasets had more significant intrafile redundancy than that of the Q-time-of-flight-MS. For evaluating the similarity score between two production spectra, the optimal threshold level of the cosine-product method was approximately 0.8–0.9. Moreover, contamination of biological samples such as plasticizers was another origin of spectral redundancy. The results will be the basis for further development of methods for processing of product ion spectra data. Copyright © 2023 Fumio Matsuda. This is an open-access article distributed under the terms of Creative Commons Attribution Non-Commercial 4.0 International License, which permits use, distribution, and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes. Please cite this article as: Mass Spectrom (Tokyo) 2023; 12(1): A0138","PeriodicalId":18243,"journal":{"name":"Mass spectrometry","volume":"84 9","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Mass spectrometry","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5702/massspectrometry.A0138","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Physics and Astronomy","Score":null,"Total":0}
引用次数: 0
产物离子谱数据处理:数据依赖性采集数据集中小分子产物离子谱的冗余性
非靶向代谢组学分析研究通过串联质谱(MS)的数据依赖获取(DDA)模式,从观察到的离子中全面获取产物离子谱。DDA数据集冗余包含代谢组学研究中分析的生物样品中普遍存在的代谢物的密切相似的产物离子谱。此外,单个DDA数据文件通常包括从相同前体离子获得的两个或多个密切相似的原始光谱。利用生成物离子谱的冗余性,从DDA数据集中记录的一组相似的生成物离子谱中生成平均生成物离子谱。谱平均提高了m/z值的精度和产物离子谱的信噪比。然而,冗余的起源,数据集之间的变化,以及这些对光谱平均过程的影响需要更好地表征。本研究通过比较非靶向代谢组学研究的8个DDA数据集的平均结果来研究冗余的性质。比较揭示了数据集之间冗余度的显著差异。四极(Q)-Orbitrap-MS数据集获得的DDA数据集比Q-time-of-flight- ms数据集具有更显著的文件内冗余。对于评价两种生产光谱之间的相似性得分,余弦乘积法的最佳阈值水平约为0.8 ~ 0.9。此外,生物样品的污染,如增塑剂是光谱冗余的另一个来源。研究结果将为进一步发展生成物离子谱数据处理方法奠定基础。版权所有©2023松田文雄这是一篇在知识共享署名非商业4.0国际许可协议下发布的开放获取文章,该协议允许在任何媒体上使用、分发和复制,前提是原始作品被正确引用,不得用于商业目的。请将本文引用为:Mass spectrum (Tokyo) 2023;12 (1): A0138
本文章由计算机程序翻译,如有差异,请以英文原文为准。