Pub Date : 2026-01-07DOI: 10.1016/j.mcpro.2026.101506
Deanna L Plubell, Philip M Remes, Christine C Wu, Cristina C Jacob, Gennifer E Merrihew, Chris Hsu, Nick Shulman, Brendan X MacLean, Lilian Heil, Kathleen L Poston, Tom Montine, Michael J MacCoss
The development of targeted assays that monitor biomedically relevant proteins is an important step in bridging discovery experiments to large scale clinical studies. Targeted assays are currently unable to scale to hundreds or thousands of targets. We demonstrate the generation of large-scale assays using a novel hybrid nominal mass instrument. The scale of these assays is achievable with the StellarTM mass spectrometer through the accommodation of shifting retention times by real-time alignment, while being sensitive and fast enough to handle many concurrent targets. Assays were constructed using precursor information from gas-phase fractionated (GPF) data-independent acquisition (DIA). We demonstrate the ability to schedule methods from orbitrap and linear ion trap acquired GPF DIA library, and compare the quantification of a matrix-matched calibration curve from orbitrap DIA and linear ion trap parallel reaction monitoring (PRM). Two applications of these proposed workflows are shown with a cerebrospinal fluid (CSF) neurodegenerative disease protein PRM assay and with a Mag-Net enriched plasma extracellular vesicle (EV) protein survey PRM assay. In CSF, our assay targets proteins discovered previously to be associated with Alzheimer's disease in a small independent sample set. For the Mag-Net enriched plasma survey assay, we observe that proteins selected based on their measurement robustness are still able to capture differences in abundance across disease groups in a small sample set. These highlight the application of highly multiplex, targeted protein assays in clinical research.
{"title":"Development of highly multiplex targeted proteomics assays in biofluids using a nominal mass ion trap mass spectrometer.","authors":"Deanna L Plubell, Philip M Remes, Christine C Wu, Cristina C Jacob, Gennifer E Merrihew, Chris Hsu, Nick Shulman, Brendan X MacLean, Lilian Heil, Kathleen L Poston, Tom Montine, Michael J MacCoss","doi":"10.1016/j.mcpro.2026.101506","DOIUrl":"10.1016/j.mcpro.2026.101506","url":null,"abstract":"<p><p>The development of targeted assays that monitor biomedically relevant proteins is an important step in bridging discovery experiments to large scale clinical studies. Targeted assays are currently unable to scale to hundreds or thousands of targets. We demonstrate the generation of large-scale assays using a novel hybrid nominal mass instrument. The scale of these assays is achievable with the Stellar<sup>TM</sup> mass spectrometer through the accommodation of shifting retention times by real-time alignment, while being sensitive and fast enough to handle many concurrent targets. Assays were constructed using precursor information from gas-phase fractionated (GPF) data-independent acquisition (DIA). We demonstrate the ability to schedule methods from orbitrap and linear ion trap acquired GPF DIA library, and compare the quantification of a matrix-matched calibration curve from orbitrap DIA and linear ion trap parallel reaction monitoring (PRM). Two applications of these proposed workflows are shown with a cerebrospinal fluid (CSF) neurodegenerative disease protein PRM assay and with a Mag-Net enriched plasma extracellular vesicle (EV) protein survey PRM assay. In CSF, our assay targets proteins discovered previously to be associated with Alzheimer's disease in a small independent sample set. For the Mag-Net enriched plasma survey assay, we observe that proteins selected based on their measurement robustness are still able to capture differences in abundance across disease groups in a small sample set. These highlight the application of highly multiplex, targeted protein assays in clinical research.</p>","PeriodicalId":18712,"journal":{"name":"Molecular & Cellular Proteomics","volume":" ","pages":"101506"},"PeriodicalIF":5.5,"publicationDate":"2026-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145945061","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-02DOI: 10.1016/j.mcpro.2025.101483
Lan Huang, Anne-Claude Gavin, Jyoti S Choudhary
{"title":"Special Issue on Women in Proteomics.","authors":"Lan Huang, Anne-Claude Gavin, Jyoti S Choudhary","doi":"10.1016/j.mcpro.2025.101483","DOIUrl":"10.1016/j.mcpro.2025.101483","url":null,"abstract":"","PeriodicalId":18712,"journal":{"name":"Molecular & Cellular Proteomics","volume":"25 1","pages":"101483"},"PeriodicalIF":5.5,"publicationDate":"2026-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12809484/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145896695","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-31DOI: 10.1016/j.mcpro.2025.101505
Kannan Venugopal, Fiona Achcar, Witold E Wolski, Paolo Nanni, Leandro Lemgruber Soares, Gavin J Wright, Matthias Marti
Malaria transmission from humans to mosquitoes is essential to the parasite life cycle. In the human malaria parasite, Plasmodium falciparum, the rate of commitment to produce the sexual transmission stages, or gametocytes varies and is governed by genetic, epigenetic and environmental factors. The sexually committed parasite has so far remained elusive due to the lack of markers to efficiently isolate these parasites for subsequent functional studies including proteomic analysis of the isolated population. Here, we demonstrate that MSRP1 is a highly specific sexual commitment marker. Using this marker, we generated and validated reporter parasite lines for subsequent FACS-based isolation of sexually and asexually committed parasites. Proteomics of isolated parasites defined distinct protein signatures, including several merozoite surface proteins, indicating functional differences between the two parasite populations. This study provides a blueprint for systematic characterisation of the parasite stage at this crucial juncture in the life cycle.
{"title":"Defining the proteome of sexually committed parasites in Plasmodium falciparum.","authors":"Kannan Venugopal, Fiona Achcar, Witold E Wolski, Paolo Nanni, Leandro Lemgruber Soares, Gavin J Wright, Matthias Marti","doi":"10.1016/j.mcpro.2025.101505","DOIUrl":"https://doi.org/10.1016/j.mcpro.2025.101505","url":null,"abstract":"<p><p>Malaria transmission from humans to mosquitoes is essential to the parasite life cycle. In the human malaria parasite, Plasmodium falciparum, the rate of commitment to produce the sexual transmission stages, or gametocytes varies and is governed by genetic, epigenetic and environmental factors. The sexually committed parasite has so far remained elusive due to the lack of markers to efficiently isolate these parasites for subsequent functional studies including proteomic analysis of the isolated population. Here, we demonstrate that MSRP1 is a highly specific sexual commitment marker. Using this marker, we generated and validated reporter parasite lines for subsequent FACS-based isolation of sexually and asexually committed parasites. Proteomics of isolated parasites defined distinct protein signatures, including several merozoite surface proteins, indicating functional differences between the two parasite populations. This study provides a blueprint for systematic characterisation of the parasite stage at this crucial juncture in the life cycle.</p>","PeriodicalId":18712,"journal":{"name":"Molecular & Cellular Proteomics","volume":" ","pages":"101505"},"PeriodicalIF":5.5,"publicationDate":"2025-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145892718","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-30DOI: 10.1016/j.mcpro.2025.101468
Jeffrey Shabanowitz, Jennifer G Abelin
{"title":"Special Issue: Celebrating the Career of Donald F. Hunt.","authors":"Jeffrey Shabanowitz, Jennifer G Abelin","doi":"10.1016/j.mcpro.2025.101468","DOIUrl":"10.1016/j.mcpro.2025.101468","url":null,"abstract":"","PeriodicalId":18712,"journal":{"name":"Molecular & Cellular Proteomics","volume":"25 1","pages":"101468"},"PeriodicalIF":5.5,"publicationDate":"2025-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12804007/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145878696","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-24DOI: 10.1016/j.mcpro.2025.101503
Christofer Daniel Sánchez, Aswath Balakrishnan, Blake Krisko, Bulbul Ahmmed, Luna Witchey, Oceani Valenzuela, Minas Minasyan, Anthony Pak, Haik Mkhikian
Although the plasma membrane (PM) is among the most biologically important and therapeutically targeted cellular compartments, it is among the most challenging to faithfully capture using proteomic approaches. The quality of quantitative surfaceomics data depends heavily on the effectiveness of the cell surface enrichment used during sample preparation. Enrichment improves sensitivity for low abundance PM proteins and ensures that the changes detected reflect PM expression changes rather than whole cell changes. Cell surface biotinylation with PM-impermeable, amine-reactive reagents is a facile, accessible, and unbiased approach to enrich PM proteins. However, it results in unexpectedly high contamination with intracellular proteins, reducing its utility. We report that biotinylating human cells with amine-reactive reagents intracellularly labels a small but reproducible population of nonviable cells. Although these dead cells represent only 5 ± 2% of the total, we find that in T cell preparations the dead cells account for 90% of labelled proteins. Depleting Annexin V positive dead T cells postlabelling removes ∼99% of the intracellularly labelled cells, resulting in markedly improved PM identifications, peptide counts, and intensity-based absolute quantification intensities. Correspondingly, we found substantial depletion of intracellular proteins, particularly of nuclear origin. Overall, the cumulative intensity of PM proteins increased from 4% to 55.8% with dead cell depletion. Finally, we demonstrate that immature ER/Golgi glycoforms of CD11a and CD18 are selectively removed by dead-cell depletion. We conclude that high intracellular labelling of nonviable cells is the major source of intracellular protein contaminants in amine-reactive surface enrichment methods and can be reduced by dead-cell depletion postlabelling, improving both the sensitivity and accuracy of PM proteomics.
{"title":"Improved T Cell Surfaceomics by Depleting Intracellularly Labelled Dead Cells.","authors":"Christofer Daniel Sánchez, Aswath Balakrishnan, Blake Krisko, Bulbul Ahmmed, Luna Witchey, Oceani Valenzuela, Minas Minasyan, Anthony Pak, Haik Mkhikian","doi":"10.1016/j.mcpro.2025.101503","DOIUrl":"10.1016/j.mcpro.2025.101503","url":null,"abstract":"<p><p>Although the plasma membrane (PM) is among the most biologically important and therapeutically targeted cellular compartments, it is among the most challenging to faithfully capture using proteomic approaches. The quality of quantitative surfaceomics data depends heavily on the effectiveness of the cell surface enrichment used during sample preparation. Enrichment improves sensitivity for low abundance PM proteins and ensures that the changes detected reflect PM expression changes rather than whole cell changes. Cell surface biotinylation with PM-impermeable, amine-reactive reagents is a facile, accessible, and unbiased approach to enrich PM proteins. However, it results in unexpectedly high contamination with intracellular proteins, reducing its utility. We report that biotinylating human cells with amine-reactive reagents intracellularly labels a small but reproducible population of nonviable cells. Although these dead cells represent only 5 ± 2% of the total, we find that in T cell preparations the dead cells account for 90% of labelled proteins. Depleting Annexin V positive dead T cells postlabelling removes ∼99% of the intracellularly labelled cells, resulting in markedly improved PM identifications, peptide counts, and intensity-based absolute quantification intensities. Correspondingly, we found substantial depletion of intracellular proteins, particularly of nuclear origin. Overall, the cumulative intensity of PM proteins increased from 4% to 55.8% with dead cell depletion. Finally, we demonstrate that immature ER/Golgi glycoforms of CD11a and CD18 are selectively removed by dead-cell depletion. We conclude that high intracellular labelling of nonviable cells is the major source of intracellular protein contaminants in amine-reactive surface enrichment methods and can be reduced by dead-cell depletion postlabelling, improving both the sensitivity and accuracy of PM proteomics.</p>","PeriodicalId":18712,"journal":{"name":"Molecular & Cellular Proteomics","volume":" ","pages":"101503"},"PeriodicalIF":5.5,"publicationDate":"2025-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145843875","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-24DOI: 10.1016/j.mcpro.2025.101501
Daniela Klaproth-Andrade, Yanik Bruns, Wassim Gabriel, Christian Nix, Valter Bergant, Andreas Pichlmair, Mathias Wilhelm, Julien Gagneur
Post-translational modifications (PTMs) play a central role in cellular regulation and are implicated in numerous diseases. Database searching remains the standard for identifying modified peptides from tandem mass spectra but is hindered by the combinatorial expansion of modification types and sites. De novo peptide sequencing offers an attractive alternative, yet existing methods remain limited to unmodified peptides or a narrow set of PTMs. Here, we curated a large dataset of spectra from endogenous and synthetic peptides from ProteomeTools spanning 19 biologically relevant amino acid-PTM combinations, covering phosphorylation, acetylation, and ubiquitination. We used this dataset to develop Modanovo, an extension of the Casanovo transformer architecture for de novo peptide sequencing. Modanovo achieved robust performance across these amino acid-PTM combinations (median area under the precision-coverage curve 0.92), while maintaining performance on unmodified peptides (0.93), nearly identical to Casanovo (0.94). The model outperformed π-PrimeNovo-PTM and InstaNovo-P and showed increased precision and complementarity to the database search tool MSFragger. Robustness was confirmed across independent datasets, particularly at peptide lengths frequently represented in the curated dataset. Applied to a phosphoproteomics dataset from monkeypox virus-infected cells, Modanovo recovered numerous confident peptides not reported by database search, including new viral phosphosites supported by spectral evidence, thereby demonstrating its complementarity to database-driven identification approaches. These results establish Modanovo as a broadly applicable model for comprehensive de novo sequencing of both modified and unmodified peptides.
{"title":"Modanovo: A Unified Model for Post-translational Modification-Aware De Novo Sequencing Using Experimental Spectra From In Vivo and Synthetic Peptides.","authors":"Daniela Klaproth-Andrade, Yanik Bruns, Wassim Gabriel, Christian Nix, Valter Bergant, Andreas Pichlmair, Mathias Wilhelm, Julien Gagneur","doi":"10.1016/j.mcpro.2025.101501","DOIUrl":"10.1016/j.mcpro.2025.101501","url":null,"abstract":"<p><p>Post-translational modifications (PTMs) play a central role in cellular regulation and are implicated in numerous diseases. Database searching remains the standard for identifying modified peptides from tandem mass spectra but is hindered by the combinatorial expansion of modification types and sites. De novo peptide sequencing offers an attractive alternative, yet existing methods remain limited to unmodified peptides or a narrow set of PTMs. Here, we curated a large dataset of spectra from endogenous and synthetic peptides from ProteomeTools spanning 19 biologically relevant amino acid-PTM combinations, covering phosphorylation, acetylation, and ubiquitination. We used this dataset to develop Modanovo, an extension of the Casanovo transformer architecture for de novo peptide sequencing. Modanovo achieved robust performance across these amino acid-PTM combinations (median area under the precision-coverage curve 0.92), while maintaining performance on unmodified peptides (0.93), nearly identical to Casanovo (0.94). The model outperformed π-PrimeNovo-PTM and InstaNovo-P and showed increased precision and complementarity to the database search tool MSFragger. Robustness was confirmed across independent datasets, particularly at peptide lengths frequently represented in the curated dataset. Applied to a phosphoproteomics dataset from monkeypox virus-infected cells, Modanovo recovered numerous confident peptides not reported by database search, including new viral phosphosites supported by spectral evidence, thereby demonstrating its complementarity to database-driven identification approaches. These results establish Modanovo as a broadly applicable model for comprehensive de novo sequencing of both modified and unmodified peptides.</p>","PeriodicalId":18712,"journal":{"name":"Molecular & Cellular Proteomics","volume":" ","pages":"101501"},"PeriodicalIF":5.5,"publicationDate":"2025-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145843870","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Investigating multiple protein post-translational modifications (PTMs) is critical for unraveling the complexities of protein regulation and the dynamic interplay among PTMs, a growing focus in proteomics. However, simultaneous analysis of diverse PTMs remains a significant technical challenge, as existing workflows struggle to balance throughput, sensitivity, and reproducibility, particularly when sample amounts are limited. To address these limitations, we present MoSAIC, a multi-PTM workflow integrating co-enrichment strategies, multiplexing, fractionation, hybrid data acquisition, and unified data analysis, optimized for clinically relevant biological samples. This approach targets phosphorylation, glycosylation, acetylation, and ubiquitination, enabling comprehensive interrogation of these modifications simultaneously. Compared to the traditional CPTAC workflow, MoSAIC doubles PTM coverage (4 vs. 2 PTMs) while maintaining the same instrument time (24 MS runs), achieving increased identifications of PTM-modified peptides. By leveraging fractionation and tandem mass tag (TMT) labeling, we achieved concurrent identification and quantification of PTM-specific peptides from the same sample, enhancing throughput and data consistency. This robust workflow addresses key limitations in multi-PTM proteomics, providing a cost-effective and efficient platform to advance biological and clinical research.
研究多种蛋白质翻译后修饰(PTMs)对于揭示蛋白质调控的复杂性和PTMs之间的动态相互作用至关重要,这是蛋白质组学日益关注的焦点。然而,同时分析多种ptm仍然是一个重大的技术挑战,因为现有的工作流程难以平衡吞吐量、灵敏度和可重复性,特别是当样品数量有限时。为了解决这些限制,我们提出了MoSAIC,这是一个多ptm工作流程,集成了共同富集策略、多路复用、分离、混合数据采集和统一数据分析,针对临床相关的生物样本进行了优化。这种方法针对磷酸化、糖基化、乙酰化和泛素化,能够同时对这些修饰进行全面的研究。与传统的CPTAC工作流程相比,MoSAIC在保持相同的仪器时间(24 MS运行)的同时,增加了PTM覆盖范围(4 vs 2 PTM),从而增加了PTM修饰肽的鉴定。通过利用分离和串联质量标签(TMT)标记,我们实现了来自同一样品的ptm特异性肽的同时鉴定和定量,提高了吞吐量和数据一致性。这个强大的工作流程解决了多ptm蛋白质组学的关键限制,为推进生物学和临床研究提供了一个经济高效的平台。
{"title":"MoSAIC: An Integrated and Modular Workflow for Confident Analysis of Protein Post-Translational Modification Landscapes.","authors":"Yuanwei Xu, Lijun Chen, T Mamie Lih, Yingwei Hu, Hui Zhang","doi":"10.1016/j.mcpro.2025.101502","DOIUrl":"https://doi.org/10.1016/j.mcpro.2025.101502","url":null,"abstract":"<p><p>Investigating multiple protein post-translational modifications (PTMs) is critical for unraveling the complexities of protein regulation and the dynamic interplay among PTMs, a growing focus in proteomics. However, simultaneous analysis of diverse PTMs remains a significant technical challenge, as existing workflows struggle to balance throughput, sensitivity, and reproducibility, particularly when sample amounts are limited. To address these limitations, we present MoSAIC, a multi-PTM workflow integrating co-enrichment strategies, multiplexing, fractionation, hybrid data acquisition, and unified data analysis, optimized for clinically relevant biological samples. This approach targets phosphorylation, glycosylation, acetylation, and ubiquitination, enabling comprehensive interrogation of these modifications simultaneously. Compared to the traditional CPTAC workflow, MoSAIC doubles PTM coverage (4 vs. 2 PTMs) while maintaining the same instrument time (24 MS runs), achieving increased identifications of PTM-modified peptides. By leveraging fractionation and tandem mass tag (TMT) labeling, we achieved concurrent identification and quantification of PTM-specific peptides from the same sample, enhancing throughput and data consistency. This robust workflow addresses key limitations in multi-PTM proteomics, providing a cost-effective and efficient platform to advance biological and clinical research.</p>","PeriodicalId":18712,"journal":{"name":"Molecular & Cellular Proteomics","volume":" ","pages":"101502"},"PeriodicalIF":5.5,"publicationDate":"2025-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145843855","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Post-translational modifications (PTMs) are pivotal in cellular regulations, and their crosstalk is related to various diseases such as cancer. Given the prevalence of PTM crosstalk within close amino acid ranges, identifying peptides with multiple PTMs is essential. However, this task is an NP-hard combinatorial problem with exponential complexity, posing significant challenges for existing analysis methods. Here, we introduce PIPI-C (PTM-Invariant Peptide Identification with a Combinatorial model), a novel search engine that addresses this challenge through a mixed integer linear programming (MILP) model, thereby overcoming the limitations of existing approaches that struggle with high-order PTM combinations. Rigorous validation across diverse datasets confirms PIPI-C's superior performance in detecting PTM combinations. When applied to over 72 million mass spectra of three human cancers-lung squamous cell carcinoma (LSCC), colorectal adenocarcinoma (COAD), and glioblastoma (GBM)-PIPI-C reveals significantly upregulated PTM combinations. In LSCC, 50% of 860 upregulated unique PTM site patterns (UPSPs) (when comparing cancer vs. normal samples) carried at least two PTMs, including literature-supported crosstalks such as di-methylation with trifluoroleucine substitution and amidation with proline-to-valine substitution. Similar findings in COAD and GBM highlight PIPI-C's utility in uncovering cancer-relevant PTM combination landscapes. Overall, PIPI-C provides a robust mathematical framework for decoding complex PTM patterns, advancing our understanding of PTM-driven cellular processes in diseases.
翻译后修饰(ptm)在细胞调控中起着至关重要的作用,它们之间的相互作用与癌症等多种疾病有关。鉴于PTM串扰在近氨基酸范围内的普遍性,鉴定具有多个PTM的肽是必要的。然而,该任务是一个具有指数复杂度的NP-hard组合问题,对现有的分析方法提出了重大挑战。在这里,我们介绍了PIPI-C (PTM- invariant Peptide Identification with a Combinatorial model),这是一种新的搜索引擎,通过混合整数线性规划(MILP)模型解决了这一挑战,从而克服了现有方法在高阶PTM组合方面的局限性。跨不同数据集的严格验证证实了PIPI-C在检测PTM组合方面的卓越性能。当将pipi - c应用于三种人类癌症(肺鳞状细胞癌(LSCC)、结直肠癌(COAD)和胶质母细胞瘤(GBM))的超过7200万个质谱时,pipi - c显示PTM组合显著上调。在LSCC中,860个上调的独特PTM位点模式(upsp)中有50%(当比较癌症和正常样本时)携带至少两个PTM,包括文献支持的串串,如三氟亮氨酸取代的二甲基化和脯氨酸-缬氨酸取代的酰胺化。在COAD和GBM中类似的发现突出了PIPI-C在发现癌症相关的PTM组合景观中的效用。总的来说,PIPI-C为解码复杂的PTM模式提供了一个强大的数学框架,促进了我们对PTM驱动的疾病细胞过程的理解。
{"title":"PIPI-C: A Combinatorial Optimization Framework for Identifying Post-translational Modification Hot-spots in Mass Spectrometry Data.","authors":"Shengzhi Lai, Shuaijian Dai, Peize Zhao, Chen Zhou, Ning Li, Weichuan Yu","doi":"10.1016/j.mcpro.2025.101494","DOIUrl":"10.1016/j.mcpro.2025.101494","url":null,"abstract":"<p><p>Post-translational modifications (PTMs) are pivotal in cellular regulations, and their crosstalk is related to various diseases such as cancer. Given the prevalence of PTM crosstalk within close amino acid ranges, identifying peptides with multiple PTMs is essential. However, this task is an NP-hard combinatorial problem with exponential complexity, posing significant challenges for existing analysis methods. Here, we introduce PIPI-C (PTM-Invariant Peptide Identification with a Combinatorial model), a novel search engine that addresses this challenge through a mixed integer linear programming (MILP) model, thereby overcoming the limitations of existing approaches that struggle with high-order PTM combinations. Rigorous validation across diverse datasets confirms PIPI-C's superior performance in detecting PTM combinations. When applied to over 72 million mass spectra of three human cancers-lung squamous cell carcinoma (LSCC), colorectal adenocarcinoma (COAD), and glioblastoma (GBM)-PIPI-C reveals significantly upregulated PTM combinations. In LSCC, 50% of 860 upregulated unique PTM site patterns (UPSPs) (when comparing cancer vs. normal samples) carried at least two PTMs, including literature-supported crosstalks such as di-methylation with trifluoroleucine substitution and amidation with proline-to-valine substitution. Similar findings in COAD and GBM highlight PIPI-C's utility in uncovering cancer-relevant PTM combination landscapes. Overall, PIPI-C provides a robust mathematical framework for decoding complex PTM patterns, advancing our understanding of PTM-driven cellular processes in diseases.</p>","PeriodicalId":18712,"journal":{"name":"Molecular & Cellular Proteomics","volume":" ","pages":"101494"},"PeriodicalIF":5.5,"publicationDate":"2025-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145828023","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-22DOI: 10.1016/j.mcpro.2025.101500
Araf Mahmud, Yingnan Song, Qi Zhou, Chen Huang
Translational errors (TEs) result in a mismatch between mRNA codons and the amino acids (AAs) of the corresponding protein. Unlike DNA mutations or RNA editing, where nucleotide sequences can be used to infer AA substitutions, TEs can only be detected at the protein level. Although high-throughput mass spectrometry (MS) proteomics offers the potential to resolve peptide sequences and could theoretically be used to identify TEs, the feasibility of current MS data analysis approaches for this application remains uncertain. Here, we utilize patient-derived xenograft proteomics data, which include both human and mouse peptides with identifiable cross-species AA variations, as a ground truth for benchmarking TE identification methods. By using high-confidence mouse peptides as surrogates for "TE-containing" peptides, we show that current open search approaches can achieve >65% overall sensitivity and >70% overall precision for high-quality samples. The intersection of different search strategies significantly enhances precision, albeit at the expense of reduced sensitivity. Notably, the evaluation metrics vary significantly across individual AA substitutions, suggesting that caution is warranted when detecting or interpreting specific AA substitutions. Moreover, closed searches targeting predefined AA changes exhibit poor precision, with post-translational modification mislocalization identified as a key bottleneck for this application. Overall, our study provides a first-of-its-kind benchmark for MS-based TE discovery and offers guidance for optimizing MS search strategies.
{"title":"Assessing the Performance of Mass Spectrometry Search Strategies in Identifying Translational Errors Using PDX Proteomics Data.","authors":"Araf Mahmud, Yingnan Song, Qi Zhou, Chen Huang","doi":"10.1016/j.mcpro.2025.101500","DOIUrl":"10.1016/j.mcpro.2025.101500","url":null,"abstract":"<p><p>Translational errors (TEs) result in a mismatch between mRNA codons and the amino acids (AAs) of the corresponding protein. Unlike DNA mutations or RNA editing, where nucleotide sequences can be used to infer AA substitutions, TEs can only be detected at the protein level. Although high-throughput mass spectrometry (MS) proteomics offers the potential to resolve peptide sequences and could theoretically be used to identify TEs, the feasibility of current MS data analysis approaches for this application remains uncertain. Here, we utilize patient-derived xenograft proteomics data, which include both human and mouse peptides with identifiable cross-species AA variations, as a ground truth for benchmarking TE identification methods. By using high-confidence mouse peptides as surrogates for \"TE-containing\" peptides, we show that current open search approaches can achieve >65% overall sensitivity and >70% overall precision for high-quality samples. The intersection of different search strategies significantly enhances precision, albeit at the expense of reduced sensitivity. Notably, the evaluation metrics vary significantly across individual AA substitutions, suggesting that caution is warranted when detecting or interpreting specific AA substitutions. Moreover, closed searches targeting predefined AA changes exhibit poor precision, with post-translational modification mislocalization identified as a key bottleneck for this application. Overall, our study provides a first-of-its-kind benchmark for MS-based TE discovery and offers guidance for optimizing MS search strategies.</p>","PeriodicalId":18712,"journal":{"name":"Molecular & Cellular Proteomics","volume":" ","pages":"101500"},"PeriodicalIF":5.5,"publicationDate":"2025-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145828011","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-22DOI: 10.1016/j.mcpro.2025.101499
Myoung Sup Shim, Aleks Grimsrud, Vaibhav Desikan, Mi Sun Sung, Paloma B Liton
We present the first integrated transcriptomic and proteomic profiling of the iridocorneal region in the spontaneous murine glaucoma model DBA/2J and DBA/2J-Gpnmb+/Sj controls to define molecular changes associated with ocular hypertension and glaucoma. Using RNA sequencing and label-free quantitative proteomics, we identified over 20,000 transcripts and 8500 proteins, creating a comprehensive molecular atlas of glaucoma-related alterations in DBA/2J mice. Principal component and differential expression analyses revealed distinct genotype-specific molecular signatures. In DBA/2J mice, upregulated genes were enriched in pathways related to extracellular matrix remodeling, collagen organization, TGF-β signaling, and inflammation. Proteomic data confirmed increased levels of complement components, antigen presentation proteins, and autophagy markers. Integrated analyses identified 29 genes upregulated at both transcript and protein levels, primarily involved in extracellular matrix structure and immune regulation. Downregulated genes were associated with melanocyte differentiation and pigment-organelle function, including Pmel, a gene implicated in pigmentary glaucoma. Cross-referencing with human genome-wide association studies data revealed overlap with glaucoma-associated genes (LTBP2, LOXL1, COL11A1, VCAM1), alongside reduced expression of Angpt and Lmx1b, linked to ocular hypertension. Together, these findings support the existence of an immune-fibrotic feed-forward loop and implicate collagen-elastic fiber dysfunction as a central mechanism in glaucoma pathogenesis.
{"title":"Comparative Multi-Omics Analysis of the Iridocorneal Angle Identifies an Immune-Fibrotic Profile in the DBA/2J Glaucoma Mouse Model.","authors":"Myoung Sup Shim, Aleks Grimsrud, Vaibhav Desikan, Mi Sun Sung, Paloma B Liton","doi":"10.1016/j.mcpro.2025.101499","DOIUrl":"10.1016/j.mcpro.2025.101499","url":null,"abstract":"<p><p>We present the first integrated transcriptomic and proteomic profiling of the iridocorneal region in the spontaneous murine glaucoma model DBA/2J and DBA/2J-Gpnmb<sup>+</sup>/Sj controls to define molecular changes associated with ocular hypertension and glaucoma. Using RNA sequencing and label-free quantitative proteomics, we identified over 20,000 transcripts and 8500 proteins, creating a comprehensive molecular atlas of glaucoma-related alterations in DBA/2J mice. Principal component and differential expression analyses revealed distinct genotype-specific molecular signatures. In DBA/2J mice, upregulated genes were enriched in pathways related to extracellular matrix remodeling, collagen organization, TGF-β signaling, and inflammation. Proteomic data confirmed increased levels of complement components, antigen presentation proteins, and autophagy markers. Integrated analyses identified 29 genes upregulated at both transcript and protein levels, primarily involved in extracellular matrix structure and immune regulation. Downregulated genes were associated with melanocyte differentiation and pigment-organelle function, including Pmel, a gene implicated in pigmentary glaucoma. Cross-referencing with human genome-wide association studies data revealed overlap with glaucoma-associated genes (LTBP2, LOXL1, COL11A1, VCAM1), alongside reduced expression of Angpt and Lmx1b, linked to ocular hypertension. Together, these findings support the existence of an immune-fibrotic feed-forward loop and implicate collagen-elastic fiber dysfunction as a central mechanism in glaucoma pathogenesis.</p>","PeriodicalId":18712,"journal":{"name":"Molecular & Cellular Proteomics","volume":" ","pages":"101499"},"PeriodicalIF":5.5,"publicationDate":"2025-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145828069","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}