Pub Date : 2026-03-20DOI: 10.1038/s44320-026-00203-y
Polina V Polunina, Wolfgang Maier, Alan F Rubin
Deep Mutational Scanning (DMS) assays can systematically assess the effects of amino acid substitutions on protein function, but many datasets have incomplete variant coverage due to technical constraints. We developed VEFill (Variant Effect Fill), a gradient boosting model for imputing missing DMS scores across protein domains. Trained on the Human Domainome 1, VEFill integrates ESM-1v sequence embeddings, evolutionary conservation (EVE scores), amino acid substitution matrices, and physicochemical descriptors. The model achieved robust predictive performance (Pearson r = 0.80) and generalized reliably to unseen proteins in stability-based datasets, while showing weaker performance on activity-based assays. Per-protein models confirmed VEFill's effectiveness under limited-data conditions and a reduced two-feature version performed comparably to the full model, suggesting an efficient alternative. Across multiple benchmarking settings, VEFill consistently outperformed baselines once ≥20% of experimental measurements were available. However, true zero-shot prediction without positional context remains challenging, particularly for functionally complex proteins. Overall, VEFill offers an interpretable, scalable framework for DMS score imputation, and enables systematic mutation prioritization including the design of sparse experimental libraries for variant effect studies.
深度突变扫描(DMS)分析可以系统地评估氨基酸取代对蛋白质功能的影响,但由于技术限制,许多数据集的变异覆盖不完整。我们开发了VEFill (Variant Effect Fill),这是一个梯度增强模型,用于在蛋白质结构域中输入缺失的DMS分数。VEFill以Human Domainome 1为基础,集成了ESM-1v序列嵌入、进化守恒(EVE分数)、氨基酸替代矩阵和物理化学描述符。该模型实现了稳健的预测性能(Pearson r = 0.80),并可靠地推广到基于稳定性的数据集中未见的蛋白质,而在基于活性的分析中表现较弱。单蛋白模型证实了VEFill在有限数据条件下的有效性,并且减少的双特征版本与完整模型的表现相当,表明了一种有效的替代方案。在多个基准测试设置中,VEFill在≥20%的实验测量值可用时始终优于基线。然而,没有位置背景的真正的零射击预测仍然具有挑战性,特别是对于功能复杂的蛋白质。总的来说,VEFill提供了一个可解释的、可扩展的DMS评分输入框架,并支持系统的突变优先级,包括设计用于变异效应研究的稀疏实验库。
{"title":"VEFill: accurate and generalizable deep mutational scanning score imputation across protein domains.","authors":"Polina V Polunina, Wolfgang Maier, Alan F Rubin","doi":"10.1038/s44320-026-00203-y","DOIUrl":"10.1038/s44320-026-00203-y","url":null,"abstract":"<p><p>Deep Mutational Scanning (DMS) assays can systematically assess the effects of amino acid substitutions on protein function, but many datasets have incomplete variant coverage due to technical constraints. We developed VEFill (Variant Effect Fill), a gradient boosting model for imputing missing DMS scores across protein domains. Trained on the Human Domainome 1, VEFill integrates ESM-1v sequence embeddings, evolutionary conservation (EVE scores), amino acid substitution matrices, and physicochemical descriptors. The model achieved robust predictive performance (Pearson r = 0.80) and generalized reliably to unseen proteins in stability-based datasets, while showing weaker performance on activity-based assays. Per-protein models confirmed VEFill's effectiveness under limited-data conditions and a reduced two-feature version performed comparably to the full model, suggesting an efficient alternative. Across multiple benchmarking settings, VEFill consistently outperformed baselines once ≥20% of experimental measurements were available. However, true zero-shot prediction without positional context remains challenging, particularly for functionally complex proteins. Overall, VEFill offers an interpretable, scalable framework for DMS score imputation, and enables systematic mutation prioritization including the design of sparse experimental libraries for variant effect studies.</p>","PeriodicalId":18906,"journal":{"name":"Molecular Systems Biology","volume":" ","pages":""},"PeriodicalIF":7.7,"publicationDate":"2026-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147491478","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-03-20DOI: 10.1038/s44320-026-00200-1
Rotem Fuchs, Ofir Schor, Bar Naim, Dafna Tussia-Cohen, Alessandra Mozzi, Diego Forni, Sivan Friedman, Zohar Haggai, Manuela Sironi, Tzachi Hagai
Viral proteins interact with host proteins to hijack cellular pathways important for viral replication. Viral mimics are proteins whose structural similarity to host-mimicked proteins allows them to interact with mutual host targets. This mimicry poses a challenge for the host-how to avoid mimics without compromising essential interactions with host-mimicked proteins. Despite the prevalence of mimicry, the evolutionary dynamics between host and viral mimics remain largely unknown. We address this by integrating structural modeling, host-virus interaction networks, and comprehensive evolutionary analyses of host and viral proteins. We show that host proteins targeted by mimics and host-mimicked proteins are highly conserved, and that this is related to functional constraints imposed on host proteins. Host interface residues that interact with both mimics and host-mimicked proteins evolve slowly, while residues that exclusively interact with mimics evolve significantly faster. Surprisingly, viral mimics do not evolve rapidly, instead displaying complex evolutionary patterns. Our analysis reveals host's limited capacity to escape mimicry and viral evolution to exploit this, and highlights how constraints lead to unexpectedly slow evolution of host-virus interaction networks.
{"title":"The evolutionary dynamics between viral mimics and host proteins.","authors":"Rotem Fuchs, Ofir Schor, Bar Naim, Dafna Tussia-Cohen, Alessandra Mozzi, Diego Forni, Sivan Friedman, Zohar Haggai, Manuela Sironi, Tzachi Hagai","doi":"10.1038/s44320-026-00200-1","DOIUrl":"https://doi.org/10.1038/s44320-026-00200-1","url":null,"abstract":"<p><p>Viral proteins interact with host proteins to hijack cellular pathways important for viral replication. Viral mimics are proteins whose structural similarity to host-mimicked proteins allows them to interact with mutual host targets. This mimicry poses a challenge for the host-how to avoid mimics without compromising essential interactions with host-mimicked proteins. Despite the prevalence of mimicry, the evolutionary dynamics between host and viral mimics remain largely unknown. We address this by integrating structural modeling, host-virus interaction networks, and comprehensive evolutionary analyses of host and viral proteins. We show that host proteins targeted by mimics and host-mimicked proteins are highly conserved, and that this is related to functional constraints imposed on host proteins. Host interface residues that interact with both mimics and host-mimicked proteins evolve slowly, while residues that exclusively interact with mimics evolve significantly faster. Surprisingly, viral mimics do not evolve rapidly, instead displaying complex evolutionary patterns. Our analysis reveals host's limited capacity to escape mimicry and viral evolution to exploit this, and highlights how constraints lead to unexpectedly slow evolution of host-virus interaction networks.</p>","PeriodicalId":18906,"journal":{"name":"Molecular Systems Biology","volume":" ","pages":""},"PeriodicalIF":7.7,"publicationDate":"2026-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147491472","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-03-16DOI: 10.1038/s44320-026-00202-z
Gabriel T Vercelli, Xingcheng Zhou, Stefany Moreno-Gámez, Rashi R Jeeda, Rachel Gregor, Jonasz Słomka, Akorfa Dagadu, Ariel L Furst, Otto X Cordero
Microbial surface functionalization is a powerful strategy for endowing microbes with novel, non-genetic functions. However, existing methods are often species-specific, limited in scope, and compromise cell viability. Here, we present a universal and modular platform for high-density, reproducible surface functionalization across diverse microbial species-including Gram-positive, Gram-negative, aerobic, and anaerobic bacteria-using multiple molecular classes such as fluorophores, enzymes, and nucleic acids. Our method preserves cell viability and achieves 50× higher functionalization efficiency than previous methods with a standardized protocol applicable to any azide-containing molecule. Applications of the method show reproducible and tunable phenotypic outcomes at the single-cell level: fluorophore labeling yielded adjustable fluorescence, β-lactamase conferred scalable antibiotic resistance, and DNA coatings modulated adhesion and aggregation. This platform provides quantitative, non-genetic control over microbial phenotypes and complements genetic engineering approaches. It enables new possibilities for microbial design in biotechnology, medicine, and environmental applications where genetic modification is impractical or undesirable.
{"title":"A universal surface functionalization technique to chemically enhance live microbial cells.","authors":"Gabriel T Vercelli, Xingcheng Zhou, Stefany Moreno-Gámez, Rashi R Jeeda, Rachel Gregor, Jonasz Słomka, Akorfa Dagadu, Ariel L Furst, Otto X Cordero","doi":"10.1038/s44320-026-00202-z","DOIUrl":"https://doi.org/10.1038/s44320-026-00202-z","url":null,"abstract":"<p><p>Microbial surface functionalization is a powerful strategy for endowing microbes with novel, non-genetic functions. However, existing methods are often species-specific, limited in scope, and compromise cell viability. Here, we present a universal and modular platform for high-density, reproducible surface functionalization across diverse microbial species-including Gram-positive, Gram-negative, aerobic, and anaerobic bacteria-using multiple molecular classes such as fluorophores, enzymes, and nucleic acids. Our method preserves cell viability and achieves 50× higher functionalization efficiency than previous methods with a standardized protocol applicable to any azide-containing molecule. Applications of the method show reproducible and tunable phenotypic outcomes at the single-cell level: fluorophore labeling yielded adjustable fluorescence, β-lactamase conferred scalable antibiotic resistance, and DNA coatings modulated adhesion and aggregation. This platform provides quantitative, non-genetic control over microbial phenotypes and complements genetic engineering approaches. It enables new possibilities for microbial design in biotechnology, medicine, and environmental applications where genetic modification is impractical or undesirable.</p>","PeriodicalId":18906,"journal":{"name":"Molecular Systems Biology","volume":" ","pages":""},"PeriodicalIF":7.7,"publicationDate":"2026-03-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147468848","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-03-11DOI: 10.1038/s44320-026-00199-5
Tetiana Serdiuk, Virginie Redeker, Jimmy Savistchenko, Sandesh Neupane, Walther Haenseler, Yanick Fleischmann, Viviane Reber, Sabrina Keller, Cinzia Tiberi, Ruxandra Bachmann-Gagescu, Matthias Gstaiger, Thomas Braun, Roland Riek, Steve Gentleman, Adriano Aguzzi, Natalie de Souza, Ronald Melki, Paola Picotti
The aggregation of the protein alpha-synuclein (αSyn) is a common feature of multiple neurodegenerative diseases collectively called synucleinopathies, for which the pathobiology is not well understood. The different phenotypic characteristics of the synucleinopathies Parkinson's disease (PD), Dementia with Lewy Bodies (DLB) and Multiple System Atrophy (MSA) have been proposed to originate from the distinct structures adopted by αSyn in its amyloid forms. Here, using covalent labeling and limited proteolysis coupled to mass spectrometry (LiP-MS) in vitro and in situ within neuronal cells and directly in native patient brain homogenates, we show that pathogenic αSyn from distinct synucleinopathies (PD, DLB and MSA) are structurally different. Further, we found that fibril structural differences are associated with different putative fibril interactomes and neuronal responses. We discovered disease-specific ubiquitination patterns and turnover profiles for pathogenic αSyn species, detected molecular pathways responding specifically to the uptake of different αSyn fibrillar polymorphs, and identified a subset of the involved proteins as putative interactors of αSyn. In particular, components of the ubiquitin-proteasomal System (UPS), including E3 ubiquitin ligases, chaperones, and deubiquitinating proteins, showed disease/polymorph-specific putative interaction patterns, possibly accounting for different resistance of patient-derived αSyn fibrils to degradation. Genetic modulation with CRISPR-based tools showed that members of the UPS degradation pathway (three E3 ligases: UBE3A, TRIM25, HUWE1 and the AAA+ ATPase VCP) reduced αSyn inclusions, in a strain-specific manner. LiP-MS also identified sets of proteins with altered protease susceptibility in postmortem brain homogenates of PD, DLB, and MSA patients. These sets were largely disease-specific and included proteins altered in cells treated with fibrils derived from patients with the matching disease. Our findings provide insight into cellular processes involved in the accumulation and turnover of αSyn pathogenic aggregates in PD, DLB and MSA in a disease/specific manner and constitutes a resource of potential novel drug targets in these synucleinopathies.
{"title":"Structure-function relationship of alpha-synuclein fibrillar polymorphs derived from distinct synucleinopathies.","authors":"Tetiana Serdiuk, Virginie Redeker, Jimmy Savistchenko, Sandesh Neupane, Walther Haenseler, Yanick Fleischmann, Viviane Reber, Sabrina Keller, Cinzia Tiberi, Ruxandra Bachmann-Gagescu, Matthias Gstaiger, Thomas Braun, Roland Riek, Steve Gentleman, Adriano Aguzzi, Natalie de Souza, Ronald Melki, Paola Picotti","doi":"10.1038/s44320-026-00199-5","DOIUrl":"https://doi.org/10.1038/s44320-026-00199-5","url":null,"abstract":"<p><p>The aggregation of the protein alpha-synuclein (αSyn) is a common feature of multiple neurodegenerative diseases collectively called synucleinopathies, for which the pathobiology is not well understood. The different phenotypic characteristics of the synucleinopathies Parkinson's disease (PD), Dementia with Lewy Bodies (DLB) and Multiple System Atrophy (MSA) have been proposed to originate from the distinct structures adopted by αSyn in its amyloid forms. Here, using covalent labeling and limited proteolysis coupled to mass spectrometry (LiP-MS) in vitro and in situ within neuronal cells and directly in native patient brain homogenates, we show that pathogenic αSyn from distinct synucleinopathies (PD, DLB and MSA) are structurally different. Further, we found that fibril structural differences are associated with different putative fibril interactomes and neuronal responses. We discovered disease-specific ubiquitination patterns and turnover profiles for pathogenic αSyn species, detected molecular pathways responding specifically to the uptake of different αSyn fibrillar polymorphs, and identified a subset of the involved proteins as putative interactors of αSyn. In particular, components of the ubiquitin-proteasomal System (UPS), including E3 ubiquitin ligases, chaperones, and deubiquitinating proteins, showed disease/polymorph-specific putative interaction patterns, possibly accounting for different resistance of patient-derived αSyn fibrils to degradation. Genetic modulation with CRISPR-based tools showed that members of the UPS degradation pathway (three E3 ligases: UBE3A, TRIM25, HUWE1 and the AAA+ ATPase VCP) reduced αSyn inclusions, in a strain-specific manner. LiP-MS also identified sets of proteins with altered protease susceptibility in postmortem brain homogenates of PD, DLB, and MSA patients. These sets were largely disease-specific and included proteins altered in cells treated with fibrils derived from patients with the matching disease. Our findings provide insight into cellular processes involved in the accumulation and turnover of αSyn pathogenic aggregates in PD, DLB and MSA in a disease/specific manner and constitutes a resource of potential novel drug targets in these synucleinopathies.</p>","PeriodicalId":18906,"journal":{"name":"Molecular Systems Biology","volume":" ","pages":""},"PeriodicalIF":7.7,"publicationDate":"2026-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147434406","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Tumor microenvironment (TME) is characterized by a milieu of distinct cell types that exist in heterogenous transcriptional states across tumors. Functional interactions among these cell states drive tumor progression and therapy response. Systematic characterization of functional cell-state interactions (CSIs) remains challenging due to the paucity of scRNA-seq cohorts with clinical information on one hand, and the lack of cellular context in bulk RNA-seq cohorts on the other. We present CSI-TME, a computational pipeline that extends the concept of gene interactions, such as synthetic lethality, to cell states, to infer prognostic CSIs by directly leveraging large cohorts of bulk transcriptomic datasets. Applied CSI-TME to IDH-mutant gliomas, we identified a highly reproducible cell-state interaction network (CSIN) that is predominantly pro-tumor and differentially activated in IDH-mut astrocytoma versus oligodendroglioma. Malignant cell states within the CSIN resemble multiple neuronal lineages, including astrocyte-like and oligodendrocyte-progenitor-like programs, and reveal key interactions between glioma stem cells and T cells. CSIN stratifies patient response to immune-checkpoint blockade therapy. Roughly 20% of CSIs involve direct ligand-receptor communication, and co-localize in spatial-transcriptomic datasets, most notably for a pro-tumorigenic interaction between tip-like endothelial cells and hypoxic malignant cells supported by multiple ligand-receptor interactions. Interestingly, anti-tumor CSIs correlated with oncogenic mutations are preferentially active in early stages of cancer, hinting at tissue homeostatic response. Overall, CSI-TME is a novel approach that, leveraging clinical bulk transcriptomic data, identifies prognostic CSIs and therapeutic ligand-receptor targets, while providing novel insight into how interactions among the cell states shape the TME in IDH-mutant glioma.
{"title":"Identifying clinically relevant cell state interactions in the tumor microenvironment of IDH-mutant gliomas using CSI-TME.","authors":"Arashdeep Singh, Bharati Mehani, Vishaka Gopalan, Sushant Puri, Kenneth Aldape, Sridhar Hannenhalli","doi":"10.1038/s44320-026-00201-0","DOIUrl":"https://doi.org/10.1038/s44320-026-00201-0","url":null,"abstract":"<p><p>Tumor microenvironment (TME) is characterized by a milieu of distinct cell types that exist in heterogenous transcriptional states across tumors. Functional interactions among these cell states drive tumor progression and therapy response. Systematic characterization of functional cell-state interactions (CSIs) remains challenging due to the paucity of scRNA-seq cohorts with clinical information on one hand, and the lack of cellular context in bulk RNA-seq cohorts on the other. We present CSI-TME, a computational pipeline that extends the concept of gene interactions, such as synthetic lethality, to cell states, to infer prognostic CSIs by directly leveraging large cohorts of bulk transcriptomic datasets. Applied CSI-TME to IDH-mutant gliomas, we identified a highly reproducible cell-state interaction network (CSIN) that is predominantly pro-tumor and differentially activated in IDH-mut astrocytoma versus oligodendroglioma. Malignant cell states within the CSIN resemble multiple neuronal lineages, including astrocyte-like and oligodendrocyte-progenitor-like programs, and reveal key interactions between glioma stem cells and T cells. CSIN stratifies patient response to immune-checkpoint blockade therapy. Roughly 20% of CSIs involve direct ligand-receptor communication, and co-localize in spatial-transcriptomic datasets, most notably for a pro-tumorigenic interaction between tip-like endothelial cells and hypoxic malignant cells supported by multiple ligand-receptor interactions. Interestingly, anti-tumor CSIs correlated with oncogenic mutations are preferentially active in early stages of cancer, hinting at tissue homeostatic response. Overall, CSI-TME is a novel approach that, leveraging clinical bulk transcriptomic data, identifies prognostic CSIs and therapeutic ligand-receptor targets, while providing novel insight into how interactions among the cell states shape the TME in IDH-mutant glioma.</p>","PeriodicalId":18906,"journal":{"name":"Molecular Systems Biology","volume":" ","pages":""},"PeriodicalIF":7.7,"publicationDate":"2026-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147434422","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-03-01Epub Date: 2026-01-19DOI: 10.1038/s44320-025-00186-2
Viola Hollek, Francisca Böhning, Catalina Florez Vargas, Anja Sieber, Markus Morkel, Nils Blüthgen
Oncogenic mutations shape colorectal cancer (CRC) biology, yet their impact on transcriptional phenotypes remains incompletely understood, and their individual prognostic value is limited. Here, we perform a pooled single-cell transcriptomic screen of over 100,000 CRC cells with a comprehensive barcoded library of oncogenic variants across genetically diverse CRC lines. Using a variational autoencoder-based interpretable factor model, we identify ten conserved oncogene-driven transcriptional modules (TMOs) representing core cancer phenotypes such as cellular plasticity, inflammatory response, replicative stress, and epithelial-to-mesenchymal transition. Engagement of these modules can be context-dependent, reflecting interactions between oncogene-induced driver pathways and background genetics. TMO activity in patient tumors stratifies CRC cohorts into high- and low-risk groups, improving relapse-free survival prediction beyond existing classification systems. Our study systematically links oncogenic signaling to transcriptional states and clinical outcomes, establishing a functional framework for module-based patient stratification in precision oncology.
{"title":"Pooled single-cell screen in colorectal cancer defines transcriptional modules linked to oncogenes.","authors":"Viola Hollek, Francisca Böhning, Catalina Florez Vargas, Anja Sieber, Markus Morkel, Nils Blüthgen","doi":"10.1038/s44320-025-00186-2","DOIUrl":"10.1038/s44320-025-00186-2","url":null,"abstract":"<p><p>Oncogenic mutations shape colorectal cancer (CRC) biology, yet their impact on transcriptional phenotypes remains incompletely understood, and their individual prognostic value is limited. Here, we perform a pooled single-cell transcriptomic screen of over 100,000 CRC cells with a comprehensive barcoded library of oncogenic variants across genetically diverse CRC lines. Using a variational autoencoder-based interpretable factor model, we identify ten conserved oncogene-driven transcriptional modules (TMOs) representing core cancer phenotypes such as cellular plasticity, inflammatory response, replicative stress, and epithelial-to-mesenchymal transition. Engagement of these modules can be context-dependent, reflecting interactions between oncogene-induced driver pathways and background genetics. TMO activity in patient tumors stratifies CRC cohorts into high- and low-risk groups, improving relapse-free survival prediction beyond existing classification systems. Our study systematically links oncogenic signaling to transcriptional states and clinical outcomes, establishing a functional framework for module-based patient stratification in precision oncology.</p>","PeriodicalId":18906,"journal":{"name":"Molecular Systems Biology","volume":" ","pages":"435-460"},"PeriodicalIF":7.7,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12954089/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146003796","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-03-01Epub Date: 2025-12-08DOI: 10.1038/s44320-025-00176-4
Christopher Thai, Amartya Singh, Daniel Herranz, Hossein Khiabanian
Single-cell RNA sequencing allows defining cellular identities based on transcriptional similarity using unsupervised clustering. However, a single clustering resolution may not yield groups of cells that represent both broad, well-defined populations and smaller subpopulations simultaneously. Therefore, when cell identities are not known prior to sequencing, robust comparison and annotation of inferred de novo clusters remains a challenge. Here, we introduce CANTAO, in which we propose the average overlap metric to define the distance between single-cell clusters by comparing ranked lists of differentially expressed genes in a top-weighted manner. We benchmark CANTAO in truth-known datasets comprised of similar yet distinct cell populations and show that evaluating clusters with average overlap results in a consistent, precise, and biologically meaningful recapitulation of true cell identities. We then analyze unsorted mouse thymocytes and characterize stages of T-cell development in the thymus, including minor populations of double-negative (CD4-CD8-) T cells that are difficult to confidently detect among unsorted single cells. We demonstrate that CANTAO enables robust, reproducible characterization of single-cell data and clarifies biological interpretation of underlying identities in homogeneous populations.
{"title":"CANTAO: guiding clustering and annotation in single-cell RNA sequencing using average overlap.","authors":"Christopher Thai, Amartya Singh, Daniel Herranz, Hossein Khiabanian","doi":"10.1038/s44320-025-00176-4","DOIUrl":"10.1038/s44320-025-00176-4","url":null,"abstract":"<p><p>Single-cell RNA sequencing allows defining cellular identities based on transcriptional similarity using unsupervised clustering. However, a single clustering resolution may not yield groups of cells that represent both broad, well-defined populations and smaller subpopulations simultaneously. Therefore, when cell identities are not known prior to sequencing, robust comparison and annotation of inferred de novo clusters remains a challenge. Here, we introduce CANTAO, in which we propose the average overlap metric to define the distance between single-cell clusters by comparing ranked lists of differentially expressed genes in a top-weighted manner. We benchmark CANTAO in truth-known datasets comprised of similar yet distinct cell populations and show that evaluating clusters with average overlap results in a consistent, precise, and biologically meaningful recapitulation of true cell identities. We then analyze unsorted mouse thymocytes and characterize stages of T-cell development in the thymus, including minor populations of double-negative (CD4-CD8-) T cells that are difficult to confidently detect among unsorted single cells. We demonstrate that CANTAO enables robust, reproducible characterization of single-cell data and clarifies biological interpretation of underlying identities in homogeneous populations.</p>","PeriodicalId":18906,"journal":{"name":"Molecular Systems Biology","volume":" ","pages":"461-475"},"PeriodicalIF":7.7,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12954110/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145708582","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-03-01Epub Date: 2026-01-03DOI: 10.1038/s44320-025-00174-6
Lina Wu, Qingqing Wang, Xinyi Hong, Xueer Cai, Litinghui Zhang, Min Li, Mingkai Wu, Thomas K Wood, Xiaomei Yan
Persisters represent a transient, antibiotic-tolerant subpopulation within isogenic bacterial populations, contributing to infection relapses. However, the mechanisms driving persister formation and resuscitation remain elusive. Here, we developed nano-flow cytometry (nFCM)-based methods for single-cell quantification of toxin (T) RelE and antitoxin (A) RelB levels, as well as for monitoring persister states through cell wall growth. We demonstrate that bacteria elevate the T/A ratio through two distinct TA expression modalities to withstand bacteriostatic antibiotic challenge, with T/A = 1.0 as a critical threshold. Intriguingly, single-cell resuscitation dynamics revealed that subinhibitory antibiotic exposure promotes entry into a deeper dormant state characterized by elevated T/A ratios, underscoring the importance of maximizing therapeutic antibiotic concentrations. Crucially, we uncovered a triphasic detoxification process during resuscitation where progressive toxin depletion drives T/A ratio reduction to a critical proliferation-permissive threshold. Proteomic profiling unveiled that persisters with high RelE production have increased transmembrane transporter levels linked to stress response and drug efflux. Our findings offer pivotal molecular insights underlying persister transitions and underscore the need for high-throughput, single-cell analysis of these heterogeneity phenotypes.
{"title":"Single-cell analysis reveals critical toxin/antitoxin ratio triggering persister resuscitation.","authors":"Lina Wu, Qingqing Wang, Xinyi Hong, Xueer Cai, Litinghui Zhang, Min Li, Mingkai Wu, Thomas K Wood, Xiaomei Yan","doi":"10.1038/s44320-025-00174-6","DOIUrl":"10.1038/s44320-025-00174-6","url":null,"abstract":"<p><p>Persisters represent a transient, antibiotic-tolerant subpopulation within isogenic bacterial populations, contributing to infection relapses. However, the mechanisms driving persister formation and resuscitation remain elusive. Here, we developed nano-flow cytometry (nFCM)-based methods for single-cell quantification of toxin (T) RelE and antitoxin (A) RelB levels, as well as for monitoring persister states through cell wall growth. We demonstrate that bacteria elevate the T/A ratio through two distinct TA expression modalities to withstand bacteriostatic antibiotic challenge, with T/A = 1.0 as a critical threshold. Intriguingly, single-cell resuscitation dynamics revealed that subinhibitory antibiotic exposure promotes entry into a deeper dormant state characterized by elevated T/A ratios, underscoring the importance of maximizing therapeutic antibiotic concentrations. Crucially, we uncovered a triphasic detoxification process during resuscitation where progressive toxin depletion drives T/A ratio reduction to a critical proliferation-permissive threshold. Proteomic profiling unveiled that persisters with high RelE production have increased transmembrane transporter levels linked to stress response and drug efflux. Our findings offer pivotal molecular insights underlying persister transitions and underscore the need for high-throughput, single-cell analysis of these heterogeneity phenotypes.</p>","PeriodicalId":18906,"journal":{"name":"Molecular Systems Biology","volume":" ","pages":"395-411"},"PeriodicalIF":7.7,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12953861/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145896166","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-03-01Epub Date: 2025-12-15DOI: 10.1038/s44320-025-00179-1
Patricia Skowronek, Anant Nawalgaria, Matthias Mann
{"title":"Multimodal AI agents for capturing and sharing proteomics laboratory practice.","authors":"Patricia Skowronek, Anant Nawalgaria, Matthias Mann","doi":"10.1038/s44320-025-00179-1","DOIUrl":"10.1038/s44320-025-00179-1","url":null,"abstract":"","PeriodicalId":18906,"journal":{"name":"Molecular Systems Biology","volume":" ","pages":"333-336"},"PeriodicalIF":7.7,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12954122/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145763248","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-03-01Epub Date: 2025-12-15DOI: 10.1038/s44320-025-00175-5
Serkan Sayin, Motasem ElGamel, Brittany Rosener, Michael Brehm, Andrew Mugler, Amir Mitchell
Bacterial colonization of tumors is widespread, yet the dynamics during colonization remain underexplored. Here we discover strong variability in the sizes of intratumor bacterial clones and use this variability to infer the mechanisms of colonization. We monitored bacterial population dynamics in murine tumors after introducing millions of genetically barcoded Escherichia coli cells. Results from intravenous injection revealed that roughly a hundred bacteria seeded a tumor and that colonizers underwent rapid, yet highly nonuniform growth. Within a day, bacteria reached a steady-state and then sustained load and clone diversity. Intratumor injections, circumventing colonization bottlenecks, revealed that the nonuniformity persists and that the sizes of bacterial progenies followed a scale-free distribution. Theory suggested that our observations are compatible with a growth model constrained by a local niche load, global resource competition, and noise. Our work provides the first dynamical model of tumor colonization and may allow distinguishing genuine tumor microbiomes from contamination.
{"title":"Bacterial population dynamics during colonization of solid tumors.","authors":"Serkan Sayin, Motasem ElGamel, Brittany Rosener, Michael Brehm, Andrew Mugler, Amir Mitchell","doi":"10.1038/s44320-025-00175-5","DOIUrl":"10.1038/s44320-025-00175-5","url":null,"abstract":"<p><p>Bacterial colonization of tumors is widespread, yet the dynamics during colonization remain underexplored. Here we discover strong variability in the sizes of intratumor bacterial clones and use this variability to infer the mechanisms of colonization. We monitored bacterial population dynamics in murine tumors after introducing millions of genetically barcoded Escherichia coli cells. Results from intravenous injection revealed that roughly a hundred bacteria seeded a tumor and that colonizers underwent rapid, yet highly nonuniform growth. Within a day, bacteria reached a steady-state and then sustained load and clone diversity. Intratumor injections, circumventing colonization bottlenecks, revealed that the nonuniformity persists and that the sizes of bacterial progenies followed a scale-free distribution. Theory suggested that our observations are compatible with a growth model constrained by a local niche load, global resource competition, and noise. Our work provides the first dynamical model of tumor colonization and may allow distinguishing genuine tumor microbiomes from contamination.</p>","PeriodicalId":18906,"journal":{"name":"Molecular Systems Biology","volume":" ","pages":"412-434"},"PeriodicalIF":7.7,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12953616/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145763277","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}