Micha J. Birklbauer, Fränze Müller, Sowmya Sivakumar Geetha, Manuel Matzinger, Karl Mechtler, Viktoria Dorfer
{"title":"Proteome-wide non-cleavable crosslink identification with MS Annika 3.0 reveals the structure of the C. elegans Box C/D complex","authors":"Micha J. Birklbauer, Fränze Müller, Sowmya Sivakumar Geetha, Manuel Matzinger, Karl Mechtler, Viktoria Dorfer","doi":"10.1038/s42004-024-01386-x","DOIUrl":null,"url":null,"abstract":"The field of crosslinking mass spectrometry has seen substantial advancements over the past decades, enabling the structural analysis of proteins and protein complexes and serving as a powerful tool in protein–protein interaction studies. However, data analysis of large non-cleavable crosslink studies is still a mostly unsolved problem due to its n-squared complexity. We here introduce an algorithm for the identification of non-cleavable crosslinks implemented in our crosslinking search engine MS Annika that is based on sparse matrix multiplication and allows for proteome-wide searches on commodity hardware. We compare our algorithm to other state-of-the-art crosslinking search engines commonly used in the field and conclude that MS Annika unifies high sensitivity, accurate FDR estimation and computational performance, outperforming competing tools. Application of this algorithm enabled us to employ a proteome-wide search of C. elegans nuclei samples, where we were able to uncover previously unknown protein interactions and conclude a comprehensive structural analysis that provides a detailed view of the Box C/D complex. Moreover, our algorithm will enable researchers to conduct similar studies that were previously unfeasible. Crosslinking mass spectrometry enables the structural analysis of proteins and protein complexes and serves as a powerful tool in protein-protein interaction studies, however, the data analysis of large non-cleavable crosslink studies remains challenging. Here, the authors report an algorithm MS Annika 3.0 for proteome-wide identification of non-cleavable crosslinks showing high sensitivity, accurate FDR estimation and computational performance, uncovering the structure of the C. elegans Box C/D complex.","PeriodicalId":10529,"journal":{"name":"Communications Chemistry","volume":" ","pages":"1-17"},"PeriodicalIF":5.9000,"publicationDate":"2024-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.nature.com/articles/s42004-024-01386-x.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Communications Chemistry","FirstCategoryId":"92","ListUrlMain":"https://www.nature.com/articles/s42004-024-01386-x","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
The field of crosslinking mass spectrometry has seen substantial advancements over the past decades, enabling the structural analysis of proteins and protein complexes and serving as a powerful tool in protein–protein interaction studies. However, data analysis of large non-cleavable crosslink studies is still a mostly unsolved problem due to its n-squared complexity. We here introduce an algorithm for the identification of non-cleavable crosslinks implemented in our crosslinking search engine MS Annika that is based on sparse matrix multiplication and allows for proteome-wide searches on commodity hardware. We compare our algorithm to other state-of-the-art crosslinking search engines commonly used in the field and conclude that MS Annika unifies high sensitivity, accurate FDR estimation and computational performance, outperforming competing tools. Application of this algorithm enabled us to employ a proteome-wide search of C. elegans nuclei samples, where we were able to uncover previously unknown protein interactions and conclude a comprehensive structural analysis that provides a detailed view of the Box C/D complex. Moreover, our algorithm will enable researchers to conduct similar studies that were previously unfeasible. Crosslinking mass spectrometry enables the structural analysis of proteins and protein complexes and serves as a powerful tool in protein-protein interaction studies, however, the data analysis of large non-cleavable crosslink studies remains challenging. Here, the authors report an algorithm MS Annika 3.0 for proteome-wide identification of non-cleavable crosslinks showing high sensitivity, accurate FDR estimation and computational performance, uncovering the structure of the C. elegans Box C/D complex.
期刊介绍:
Communications Chemistry is an open access journal from Nature Research publishing high-quality research, reviews and commentary in all areas of the chemical sciences. Research papers published by the journal represent significant advances bringing new chemical insight to a specialized area of research. We also aim to provide a community forum for issues of importance to all chemists, regardless of sub-discipline.