{"title":"arcMS: transformation of multi-dimensional high-resolution mass spectrometry data to columnar format for compact storage and fast access.","authors":"Julien Le Roux, Julien Sade","doi":"10.1093/bioadv/vbae160","DOIUrl":null,"url":null,"abstract":"<p><strong>Summary: </strong>The arcMS R package addresses the challenges posed by proprietary and open-source high-resolution mass spectrometry data formats by providing functions to collect MS<sup>E</sup> data from the Waters UNIFI software and store it in the efficient Apache Parquet format, facilitating fast, easy access, and compatibility with various programming environments. This solution facilitates the manipulation of complex mass spectrometry data, including ion mobility or other additional dimensions, enabling potential integration into efficient and open-source workflows.</p><p><strong>Availability and implementation: </strong>arcMS is an open-source R package and is available on GitHub at https://github.com/leesulab/arcMS. The complete documentation, including details on UNIFI configuration and tutorials for data conversion, access to Parquet files, and filtration of data, is available at https://leesulab.github.io/arcMS. An R/Shiny companion application is also provided for visualization of Parquet data and demonstration of data filtering with the Arrow library https://github.com/leesulab/arcms-dataviz.</p>","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":"4 1","pages":"vbae160"},"PeriodicalIF":2.4000,"publicationDate":"2024-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11873790/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bioinformatics advances","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/bioadv/vbae160","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"MATHEMATICAL & COMPUTATIONAL BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Summary: The arcMS R package addresses the challenges posed by proprietary and open-source high-resolution mass spectrometry data formats by providing functions to collect MSE data from the Waters UNIFI software and store it in the efficient Apache Parquet format, facilitating fast, easy access, and compatibility with various programming environments. This solution facilitates the manipulation of complex mass spectrometry data, including ion mobility or other additional dimensions, enabling potential integration into efficient and open-source workflows.
Availability and implementation: arcMS is an open-source R package and is available on GitHub at https://github.com/leesulab/arcMS. The complete documentation, including details on UNIFI configuration and tutorials for data conversion, access to Parquet files, and filtration of data, is available at https://leesulab.github.io/arcMS. An R/Shiny companion application is also provided for visualization of Parquet data and demonstration of data filtering with the Arrow library https://github.com/leesulab/arcms-dataviz.