Chapter 11. Data Formats of the Proteomics Standards Initiative

J. Vizcaíno, S. Perkins, A. Jones, E. Deutsch
Proteome Informatics, published 2016-11-15. DOI: 10.1039/9781782626732-00229 (https://doi.org/10.1039/9781782626732-00229)

Abstract

The existence and adoption of data standards in computational proteomics, as in any other field, is generally perceived to be crucial for the further development of the discipline. Here we give an up-to-date overview of the open standard data formats that have been developed under the umbrella of the Proteomics Standards Initiative (PSI). We focus on the formats related to mass spectrometry (MS). Most of them are based on XML (Extensible Markup Language) schemas: mzML (for primary MS data, the output of mass spectrometers), mzIdentML (for peptide and protein identification data), mzQuantML (for peptide and protein quantification data) and TraML (for reporting transition lists for selected reaction monitoring approaches). In addition, mzTab was developed as a simpler tab-delimited format to support peptide, protein and small molecule identification and quantification data in the same file. For each format, we explain its main characteristics, describe the main existing software implementations and give an update on the ongoing work to extend it to support new use cases. Additionally, we discuss other data formats that have been inspired by the PSI formats. Finally, other PSI data standard formats (not MS related) are outlined in brief.