Rafael Stroggilos, Aggeliki Tserga, Jerome Zoidakis, Antonia Vlahou, Manousos Makridakis
Title: Tissue proteomics repositories for data reanalysis
Journal: Mass Spectrometry Reviews, 43(6), pp. 1270-1284
Publication date: 2023-08-03
Publication type: Journal Article
DOI: 10.1002/mas.21860
URL: https://onlinelibrary.wiley.com/doi/10.1002/mas.21860
Citation count: 0
Abstract
We are approaching the third decade since the first proteomics repositories were established in the mid-2000s. New experimental approaches and technologies continuously enrich the field while producing vast amounts of mass spectrometry data. Together with initiatives to establish standard terminology and file formats, proteomics is rapidly transforming into a mature component of systems biology. Here we describe the ProteomeXchange consortium repositories. We specifically search, collect and evaluate public human tissue datasets (categorized as “complete” by the repository) submitted in 2015–2022, to both map the existing information and assess dataset reusability. Human tissue data are variably represented in the repositories reviewed, accounting for between 10% and 25% of the total data submitted, with cancers being the most represented, followed by neuronal and cardiovascular diseases. About half of the retrieved datasets were found to lack the annotations or metadata necessary to directly replicate the analysis. This poses a serious challenge to data reusability and highlights the need to increase awareness in the community of the MAGE-TAB file format for metadata. Overall, proteomics repositories have evolved greatly over the past 7 years: they have grown in size and become equipped with various powerful applications and tools that enable data searching and analytical tasks. However, to make the most of this potential, priority must be given to finding ways to secure detailed metadata for each submission, which is likely the next major milestone for proteomics repositories.
About the journal:
The aim of the journal Mass Spectrometry Reviews is to publish well-written reviews on selected topics in the various sub-fields of mass spectrometry as a means to summarize the research that has been performed in that area, to focus the attention of other researchers, to critically review the published material, and to stimulate further research in that area.
The scope of the published reviews includes, but is not limited to, topics such as theoretical treatments, instrumental design, ionization methods, analyzers, detectors, application to the qualitative and quantitative analysis of various compounds or elements, basic ion chemistry and structure studies, ion energetics studies, and studies on biomolecules, polymers, etc.