Jing Gao, Hongxu Chen, Hongrui Yin, Xin Chen, Zhicheng Yang, Yuqiu Wang, Jianhong Wu, Yinping Tian, Hong Shao, Liuqing Wen* and Hu Zhou*,
{"title":"Decoding Protein Glycosylation by an Integrative Mass Spectrometry-Based De Novo Sequencing Strategy","authors":"Jing Gao, Hongxu Chen, Hongrui Yin, Xin Chen, Zhicheng Yang, Yuqiu Wang, Jianhong Wu, Yinping Tian, Hong Shao, Liuqing Wen* and Hu Zhou*, ","doi":"10.1021/jacsau.4c0096010.1021/jacsau.4c00960","DOIUrl":null,"url":null,"abstract":"<p >Glycoproteins, representing more than 50% of human proteins and most biopharmaceuticals, are crucial for regulating various biological processes. The complexity of multiple glycosylation sites often leads to incomplete sequence coverage and ambiguous glycan modification profiles. Here, we developed an integrative mass spectrometry-based approach for decoding unknown glycoproteins, which is featured with the combination of deglycosylation-mediated <i>de novo</i> sequencing with glycosylation site characterization. We utilized the enzymatic deglycosylation of <i>N-</i>/ <i>O-</i>glycans to achieve comprehensive sequence coverage. Additionally, EThcD fragmentation enables the identification of high-quality long peptides, facilitating precise protein assembly. We subsequently applied this method to <i>de novo</i> sequencing of the highly glycosylated therapeutic fusion protein Etanercept (Enbrel). We also sequenced three new tumor necrosis factor receptor:Fc-fusion biologics with largely unknown sequences, unveiling subtle distinctions in the primary sequences. Furthermore, we characterized <i>N-</i> and <i>O-</i>glycosylation modifications of these proteins at subunit, glycopeptide, and glycan levels. This strategy bridges the gap between the <i>de novo</i> sequencing and glycosylation modification, providing comprehensive information on the primary structure and glycosylation modifications for glycoproteins. Notably, our method could be a robust solution for accurate sequencing of the glycoproteins and has practical value not only in basic research but also in the biopharmaceutical industry.</p>","PeriodicalId":94060,"journal":{"name":"JACS Au","volume":"5 2","pages":"702–713 702–713"},"PeriodicalIF":8.5000,"publicationDate":"2025-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.acs.org/doi/epdf/10.1021/jacsau.4c00960","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"JACS Au","FirstCategoryId":"1085","ListUrlMain":"https://pubs.acs.org/doi/10.1021/jacsau.4c00960","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
Glycoproteins, representing more than 50% of human proteins and most biopharmaceuticals, are crucial for regulating various biological processes. The complexity of multiple glycosylation sites often leads to incomplete sequence coverage and ambiguous glycan modification profiles. Here, we developed an integrative mass spectrometry-based approach for decoding unknown glycoproteins, which is featured with the combination of deglycosylation-mediated de novo sequencing with glycosylation site characterization. We utilized the enzymatic deglycosylation of N-/ O-glycans to achieve comprehensive sequence coverage. Additionally, EThcD fragmentation enables the identification of high-quality long peptides, facilitating precise protein assembly. We subsequently applied this method to de novo sequencing of the highly glycosylated therapeutic fusion protein Etanercept (Enbrel). We also sequenced three new tumor necrosis factor receptor:Fc-fusion biologics with largely unknown sequences, unveiling subtle distinctions in the primary sequences. Furthermore, we characterized N- and O-glycosylation modifications of these proteins at subunit, glycopeptide, and glycan levels. This strategy bridges the gap between the de novo sequencing and glycosylation modification, providing comprehensive information on the primary structure and glycosylation modifications for glycoproteins. Notably, our method could be a robust solution for accurate sequencing of the glycoproteins and has practical value not only in basic research but also in the biopharmaceutical industry.