Pub Date : 2026-01-20eCollection Date: 2026-01-01DOI: 10.1371/journal.pcbi.1013831
Adam A Malik, Cecilia Krona, Soumi Kundu, Philip Gerlee, Sven Nelander
Patient-derived cells (PDC) mouse xenografts are increasingly important tools in glioblastoma (GBM) research, essential to investigate case-specific growth patterns and treatment responses. Despite the central role of xenograft models in the field, few good simulation models are available to probe the dynamics of tumor growth and to support therapy design. We therefore propose a new framework for the patient-specific simulation of GBM in the mouse brain. Unlike existing methods, our simulations leverage a high-resolution map of the mouse brain anatomy to yield patient-specific results that are in good agreement with experimental observations. To facilitate the fitting of our model to histological data, we use Approximate Bayesian Computation. Because our model uses few parameters, reflecting growth, invasion and niche dependencies, it is well suited for case comparisons and for probing treatment effects. We demonstrate how our model can be used to simulate different treatment by perturbing the different model parameters. We expect in silico replicates of mouse xenograft tumors can improve the assessment of therapeutic outcomes and boost the statistical power of preclinical GBM studies.
{"title":"Anatomically aware simulation of patient-specific glioblastoma xenografts.","authors":"Adam A Malik, Cecilia Krona, Soumi Kundu, Philip Gerlee, Sven Nelander","doi":"10.1371/journal.pcbi.1013831","DOIUrl":"10.1371/journal.pcbi.1013831","url":null,"abstract":"<p><p>Patient-derived cells (PDC) mouse xenografts are increasingly important tools in glioblastoma (GBM) research, essential to investigate case-specific growth patterns and treatment responses. Despite the central role of xenograft models in the field, few good simulation models are available to probe the dynamics of tumor growth and to support therapy design. We therefore propose a new framework for the patient-specific simulation of GBM in the mouse brain. Unlike existing methods, our simulations leverage a high-resolution map of the mouse brain anatomy to yield patient-specific results that are in good agreement with experimental observations. To facilitate the fitting of our model to histological data, we use Approximate Bayesian Computation. Because our model uses few parameters, reflecting growth, invasion and niche dependencies, it is well suited for case comparisons and for probing treatment effects. We demonstrate how our model can be used to simulate different treatment by perturbing the different model parameters. We expect in silico replicates of mouse xenograft tumors can improve the assessment of therapeutic outcomes and boost the statistical power of preclinical GBM studies.</p>","PeriodicalId":20241,"journal":{"name":"PLoS Computational Biology","volume":"22 1","pages":"e1013831"},"PeriodicalIF":3.6,"publicationDate":"2026-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12851450/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146011959","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-20eCollection Date: 2026-01-01DOI: 10.1371/journal.pcbi.1013896
Zhijian Hu, Yuzhen Wu, Tomas Freire, Erida Gjini, Kevin Wood
Drugs play a central role in limiting bacterial population spread, yet laboratory studies typically assume well-mixed environments when assessing microbial drug responses. In contrast, bacteria in the human body often occupy spatially structured habitats where drug concentrations vary. Understanding how this heterogeneity shapes growth and decline is therefore essential for controlling infections and mitigating resistance evolution. Here, we developed a minimal robot-automated system to study how spatial drug heterogeneity affects short-term population dynamics in E. faecalis, a Gram-positive opportunistic pathogen. This system was combined with a theoretical framework to interpret and explain the observed outcomes. We first recapitulated the classic critical-patch-size model result: in a spatially homogeneous environment, a population persists in a finite domain only when growth outpaces diffusive losses at the boundaries. In heterogeneous environments, we found certain conditions that population persistence can depend critically on the spatial arrangement of the drug, even when its total amount is fixed. Using theoretical and experimental approaches, we identified the arrangements that produce the strongest growth and the fastest decline, revealing the range of possible outcomes under drug heterogeneity. We further tested this framework in more complex environments, including ring-shaped communities, and observed consistent arrangement-dependent behavior. Overall, our results extend the classical growth-condition framework to general heterogeneous environments and demonstrate that spatial drug arrangement - not only total dose - can strongly influence bacterial population dynamics. These findings highlight the importance of spatially structured dosing strategies and motivate further theoretical and experimental investigation.
{"title":"Linking spatial drug heterogeneity to microbial growth dynamics in theory and experiment.","authors":"Zhijian Hu, Yuzhen Wu, Tomas Freire, Erida Gjini, Kevin Wood","doi":"10.1371/journal.pcbi.1013896","DOIUrl":"10.1371/journal.pcbi.1013896","url":null,"abstract":"<p><p>Drugs play a central role in limiting bacterial population spread, yet laboratory studies typically assume well-mixed environments when assessing microbial drug responses. In contrast, bacteria in the human body often occupy spatially structured habitats where drug concentrations vary. Understanding how this heterogeneity shapes growth and decline is therefore essential for controlling infections and mitigating resistance evolution. Here, we developed a minimal robot-automated system to study how spatial drug heterogeneity affects short-term population dynamics in E. faecalis, a Gram-positive opportunistic pathogen. This system was combined with a theoretical framework to interpret and explain the observed outcomes. We first recapitulated the classic critical-patch-size model result: in a spatially homogeneous environment, a population persists in a finite domain only when growth outpaces diffusive losses at the boundaries. In heterogeneous environments, we found certain conditions that population persistence can depend critically on the spatial arrangement of the drug, even when its total amount is fixed. Using theoretical and experimental approaches, we identified the arrangements that produce the strongest growth and the fastest decline, revealing the range of possible outcomes under drug heterogeneity. We further tested this framework in more complex environments, including ring-shaped communities, and observed consistent arrangement-dependent behavior. Overall, our results extend the classical growth-condition framework to general heterogeneous environments and demonstrate that spatial drug arrangement - not only total dose - can strongly influence bacterial population dynamics. These findings highlight the importance of spatially structured dosing strategies and motivate further theoretical and experimental investigation.</p>","PeriodicalId":20241,"journal":{"name":"PLoS Computational Biology","volume":"22 1","pages":"e1013896"},"PeriodicalIF":3.6,"publicationDate":"2026-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12863682/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146011964","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-20eCollection Date: 2026-01-01DOI: 10.1371/journal.pcbi.1013902
[This corrects the article DOI: 10.1371/journal.pcbi.1013452.].
[这更正了文章DOI: 10.1371/journal.pcbi.1013452.]。
{"title":"Correction: Simulation insights on the compound action potential in multifascicular nerves.","authors":"","doi":"10.1371/journal.pcbi.1013902","DOIUrl":"10.1371/journal.pcbi.1013902","url":null,"abstract":"<p><p>[This corrects the article DOI: 10.1371/journal.pcbi.1013452.].</p>","PeriodicalId":20241,"journal":{"name":"PLoS Computational Biology","volume":"22 1","pages":"e1013902"},"PeriodicalIF":3.6,"publicationDate":"2026-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12818681/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146011955","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-16eCollection Date: 2026-01-01DOI: 10.1371/journal.pcbi.1012987
Janis Shin, James M Carothers, Herbert M Sauro
Bayesian Metabolic Control Analysis (BMCA) is a promising framework for inferring metabolic control coefficients in data-limited scenarios, combining Bayesian inference with linear-logarithmic (lin-log) rate laws. These metabolic control coefficients quantify how changes in enzyme activities affect steady-state fluxes and metabolite concentrations across a metabolic network. However, its predictive accuracy and limitations remain underexplored. This study systematically evaluates BMCA's ability to infer elasticity values, flux control coefficients (FCC), and concentration control coefficients (CCC) under varying data availability conditions using three synthetic metabolic network models. We demonstrate that BMCA predictions are highly dependent on the inclusion of flux and enzyme concentration data, with the omission of these datasets leading to severe inaccuracies. In our synthetic, enzyme-perturbation datasets, external metabolite concentrations had minimal impact and, in some cases, their exclusion improved predictions; when external-nutrient perturbations were introduced and those concentrations were observed, gains were at most modest. Additionally, we find that posterior estimation with both ADVI and HMC can underestimate large-magnitude elasticities in our synthetic settings, with ADVI showing somewhat higher variance under strong up-regulation; thus, recovering |elasticity| [Formula: see text]1.5 remains challenging regardless of the inference engine. ADVI also fails to accurately infer allosteric interactions, even when regulatory effects are strong. While BMCA maintains reasonable accuracy in partially recovering the rankings of the highest FCC values, its estimates of absolute values remain constrained by prior assumptions and data limitations. Our findings reveal the BMCA algorithm's strengths and weaknesses, providing guidance on its application in metabolic engineering, and highlighting the need for methodological refinements to enhance its predictive capabilities.
{"title":"Evaluating the limitations of Bayesian metabolic control analysis.","authors":"Janis Shin, James M Carothers, Herbert M Sauro","doi":"10.1371/journal.pcbi.1012987","DOIUrl":"10.1371/journal.pcbi.1012987","url":null,"abstract":"<p><p>Bayesian Metabolic Control Analysis (BMCA) is a promising framework for inferring metabolic control coefficients in data-limited scenarios, combining Bayesian inference with linear-logarithmic (lin-log) rate laws. These metabolic control coefficients quantify how changes in enzyme activities affect steady-state fluxes and metabolite concentrations across a metabolic network. However, its predictive accuracy and limitations remain underexplored. This study systematically evaluates BMCA's ability to infer elasticity values, flux control coefficients (FCC), and concentration control coefficients (CCC) under varying data availability conditions using three synthetic metabolic network models. We demonstrate that BMCA predictions are highly dependent on the inclusion of flux and enzyme concentration data, with the omission of these datasets leading to severe inaccuracies. In our synthetic, enzyme-perturbation datasets, external metabolite concentrations had minimal impact and, in some cases, their exclusion improved predictions; when external-nutrient perturbations were introduced and those concentrations were observed, gains were at most modest. Additionally, we find that posterior estimation with both ADVI and HMC can underestimate large-magnitude elasticities in our synthetic settings, with ADVI showing somewhat higher variance under strong up-regulation; thus, recovering |elasticity| [Formula: see text]1.5 remains challenging regardless of the inference engine. ADVI also fails to accurately infer allosteric interactions, even when regulatory effects are strong. While BMCA maintains reasonable accuracy in partially recovering the rankings of the highest FCC values, its estimates of absolute values remain constrained by prior assumptions and data limitations. Our findings reveal the BMCA algorithm's strengths and weaknesses, providing guidance on its application in metabolic engineering, and highlighting the need for methodological refinements to enhance its predictive capabilities.</p>","PeriodicalId":20241,"journal":{"name":"PLoS Computational Biology","volume":"22 1","pages":"e1012987"},"PeriodicalIF":3.6,"publicationDate":"2026-01-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12844504/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145990164","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-16eCollection Date: 2026-01-01DOI: 10.1371/journal.pcbi.1013903
Muyiwa S Adegbaju, Oluwabuyikunmi Owo-Odusi, Eden T Wirtz, Olanrewaju B Morenikeji, Olusola Ojurongbe, Bolaji N Thomas
<p><p>The concern regarding H5N1 outbreak, particularly the accelerated mutagenesis of its core genomic elements, underscores the persistent threat of influenza to global health. Neuraminidase (NA), a pivotal sialidase integral to virion egress and propagation, comprises nine distinct isoforms, exhibiting unique evolutionary trajectories and structural adaptations. Despite extensive characterization of hemagglutinin subtypes, the functional divergence of the nine NA subtypes remains inadequately understood. To address this gap, we conducted a structural analysis of NA subtypes, employing structural superimposition and motif-guided sequence alignment to delineate subtype-specific residues. Hierarchical clustering stratified the nine NA subtypes into four distinct subgroups: NA2 (subgroup I), NA1 and NA4 (subgroup II), NA9/NA7/NA6/NA3 (subgroup III), and NA8/NA5 (subgroup 4). We identified 40 highly conserved and functionally significant amino acid loci, likely modulating enzymatic activity and substrate specificity across subtypes. To investigate the structural basis of adaptation in H5N1, we generated NA1 mutants by swapping family specific position (FSP) residues and analyzed their dynamics using Molecular Dynamics (MD) simulations, complemented by a deep phylogenetic analysis across six host reservoirs. MD simulation parameters reveal a dynamic paradox: the Wild-Type (WT) NA1 maintains a conserved global compactness Rg, which masks a complex, bi-modal switching mechanism essential for its catalytic function, validated by multi-basin free energy landscape (FEL) topography. We identify Lysine-207 (K207) as the master determinant of this switching mechanism and the enzyme's dynamic fate. Substitutions at this conserved nexus produced diametrically opposite outcomes: K207W imposed structural rigidification (abolishing the switch), K207H achieved dynamic preservation, and K207I drove expanded disorder and collapse. Furthermore, dynamic correlation analysis shows that these single-point substitutions function as molecular switches that significantly re-wire the enzyme's allosteric communication networks, extending far beyond the active site. To assess the role of NA1 in host tropism and adaptive evolution, we conducted a phylogenetic analysis of NA1 genes from H5N1 isolates across multiple host reservoirs; H. sapiens, G. gallus, Anser anser domesticus, M. gallopavo, B. taurus, and C. olor. Notably, we observed opposing selection pressures and diversification patterns: G. gallus isolates showed signatures of positive selection consistent with hyper-reassortment, while human isolates displayed highly diverse, sporadic spillover events. We conclude that the evolutionary contribution of NA1 to H5N1 host adaptation is not encoded in static structure, but certain residues such as K207 defines a pivotal mechanism for regulating the enzyme's function through dynamic states. Our MD data thus proposes a novel strategy for next-generation antivirals by targetin
{"title":"Structural analysis of antigenic variation and adaptive evolution of the H5N1 neuraminidase gene.","authors":"Muyiwa S Adegbaju, Oluwabuyikunmi Owo-Odusi, Eden T Wirtz, Olanrewaju B Morenikeji, Olusola Ojurongbe, Bolaji N Thomas","doi":"10.1371/journal.pcbi.1013903","DOIUrl":"10.1371/journal.pcbi.1013903","url":null,"abstract":"<p><p>The concern regarding H5N1 outbreak, particularly the accelerated mutagenesis of its core genomic elements, underscores the persistent threat of influenza to global health. Neuraminidase (NA), a pivotal sialidase integral to virion egress and propagation, comprises nine distinct isoforms, exhibiting unique evolutionary trajectories and structural adaptations. Despite extensive characterization of hemagglutinin subtypes, the functional divergence of the nine NA subtypes remains inadequately understood. To address this gap, we conducted a structural analysis of NA subtypes, employing structural superimposition and motif-guided sequence alignment to delineate subtype-specific residues. Hierarchical clustering stratified the nine NA subtypes into four distinct subgroups: NA2 (subgroup I), NA1 and NA4 (subgroup II), NA9/NA7/NA6/NA3 (subgroup III), and NA8/NA5 (subgroup 4). We identified 40 highly conserved and functionally significant amino acid loci, likely modulating enzymatic activity and substrate specificity across subtypes. To investigate the structural basis of adaptation in H5N1, we generated NA1 mutants by swapping family specific position (FSP) residues and analyzed their dynamics using Molecular Dynamics (MD) simulations, complemented by a deep phylogenetic analysis across six host reservoirs. MD simulation parameters reveal a dynamic paradox: the Wild-Type (WT) NA1 maintains a conserved global compactness Rg, which masks a complex, bi-modal switching mechanism essential for its catalytic function, validated by multi-basin free energy landscape (FEL) topography. We identify Lysine-207 (K207) as the master determinant of this switching mechanism and the enzyme's dynamic fate. Substitutions at this conserved nexus produced diametrically opposite outcomes: K207W imposed structural rigidification (abolishing the switch), K207H achieved dynamic preservation, and K207I drove expanded disorder and collapse. Furthermore, dynamic correlation analysis shows that these single-point substitutions function as molecular switches that significantly re-wire the enzyme's allosteric communication networks, extending far beyond the active site. To assess the role of NA1 in host tropism and adaptive evolution, we conducted a phylogenetic analysis of NA1 genes from H5N1 isolates across multiple host reservoirs; H. sapiens, G. gallus, Anser anser domesticus, M. gallopavo, B. taurus, and C. olor. Notably, we observed opposing selection pressures and diversification patterns: G. gallus isolates showed signatures of positive selection consistent with hyper-reassortment, while human isolates displayed highly diverse, sporadic spillover events. We conclude that the evolutionary contribution of NA1 to H5N1 host adaptation is not encoded in static structure, but certain residues such as K207 defines a pivotal mechanism for regulating the enzyme's function through dynamic states. Our MD data thus proposes a novel strategy for next-generation antivirals by targetin","PeriodicalId":20241,"journal":{"name":"PLoS Computational Biology","volume":"22 1","pages":"e1013903"},"PeriodicalIF":3.6,"publicationDate":"2026-01-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12826504/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145990204","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-16eCollection Date: 2026-01-01DOI: 10.1371/journal.pcbi.1013844
Gaspar Cano, Richard Kempter
From cortical synfire chains to hippocampal replay, the idea that neural populations can be activated sequentially with precise spike timing is thought to be essential for several brain functions. It has been shown that neuronal sequences with weak feedforward connectivity can be replayed due to amplification via intra-assembly recurrent connections. However, the mechanisms behind this phenomenon are still unclear. Here, we simulate spiking networks with different excitatory and inhibitory connectivity and find that an exclusively excitatory network is sufficient for this amplification to occur. To explain the spiking network behavior, we introduce a population model of membrane-potential distributions, and we analytically describe how different connectivity structures determine replay speed, with weaker feedforward connectivity generating slower and wider pulses that can be sustained by recurrent connections. Pulse propagation is facilitated if the neuronal membrane time constant is large compared to the pulse width. Together, our simulations and analytical results predict the conditions for replay of neuronal assemblies.
{"title":"Conditions for replay of neuronal assemblies.","authors":"Gaspar Cano, Richard Kempter","doi":"10.1371/journal.pcbi.1013844","DOIUrl":"10.1371/journal.pcbi.1013844","url":null,"abstract":"<p><p>From cortical synfire chains to hippocampal replay, the idea that neural populations can be activated sequentially with precise spike timing is thought to be essential for several brain functions. It has been shown that neuronal sequences with weak feedforward connectivity can be replayed due to amplification via intra-assembly recurrent connections. However, the mechanisms behind this phenomenon are still unclear. Here, we simulate spiking networks with different excitatory and inhibitory connectivity and find that an exclusively excitatory network is sufficient for this amplification to occur. To explain the spiking network behavior, we introduce a population model of membrane-potential distributions, and we analytically describe how different connectivity structures determine replay speed, with weaker feedforward connectivity generating slower and wider pulses that can be sustained by recurrent connections. Pulse propagation is facilitated if the neuronal membrane time constant is large compared to the pulse width. Together, our simulations and analytical results predict the conditions for replay of neuronal assemblies.</p>","PeriodicalId":20241,"journal":{"name":"PLoS Computational Biology","volume":"22 1","pages":"e1013844"},"PeriodicalIF":3.6,"publicationDate":"2026-01-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12829973/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145990194","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-16eCollection Date: 2026-01-01DOI: 10.1371/journal.pcbi.1013767
Koshlan Mayer-Blackwell, Anastasia Minervina, Mikhail Pogorelyy, Puneet Rawat, Melanie R Shapiro, Leeana D Peters, Emily S Ford, Amanda L Posgai, Kasi Vegesana, Samuel Minot, David M Koelle, Victor Greiff, Philip Bradley, Todd M Brusko, Paul G Thomas, Andrew Fiore-Gartland
T cell receptors (TCRs) recognize peptides presented by polymorphic human leukocyte antigen (HLA) molecules, but HLA genotype data are often missing from TCR repertoire sequencing studies. To address this, we developed TCR2HLA, an open-source tool that infers HLA genotypes from TCRβ repertoires. Expanding on work linking public TRBV-CDR3 sequences to HLA genotypes, we incorporated "quasi-public" metaclonotypes - composed of rarer TCRβ sequences with shared amino acid features - enriched by HLA genotypes. Using four TCRβseq datasets from 3,150 individuals, we applied TRBV gene partitioning and locality-sensitive hashing to identify ~96,000 TCRβ features strongly associated with specific HLA alleles from 71M input TCRs. Binary HLA classifiers built with these features achieved high balanced accuracy (>0.9) across common HLA-A (9/12), B (9/12), C (6/13), DRB1 (11/11) alleles and prevalent DPA1/DPB1 (6/10), DQA1/DQB1 (8/17) heterodimers. We also introduced a high-sensitivity calibration to support predictions in samples with as few as 5,000 unique clonotypes. Calibrated predictions with confidence filtering improved reliability. Beyond genotype imputation, TCR2HLA enables the discovery of novel HLA- and exposure-associated TCRs, as shown by the identification of SARS-CoV-2 related TCRs in a large COVID-19 dataset lacking HLA data. TCR2HLA provides a scalable framework for bridging the gap between TCRseq data and HLA genotype for biomarker discovery.
{"title":"TCR2HLA: Calibrated inference of HLA genotypes from TCR repertoires enables identification of immunologically relevant metaclonotypes.","authors":"Koshlan Mayer-Blackwell, Anastasia Minervina, Mikhail Pogorelyy, Puneet Rawat, Melanie R Shapiro, Leeana D Peters, Emily S Ford, Amanda L Posgai, Kasi Vegesana, Samuel Minot, David M Koelle, Victor Greiff, Philip Bradley, Todd M Brusko, Paul G Thomas, Andrew Fiore-Gartland","doi":"10.1371/journal.pcbi.1013767","DOIUrl":"10.1371/journal.pcbi.1013767","url":null,"abstract":"<p><p>T cell receptors (TCRs) recognize peptides presented by polymorphic human leukocyte antigen (HLA) molecules, but HLA genotype data are often missing from TCR repertoire sequencing studies. To address this, we developed TCR2HLA, an open-source tool that infers HLA genotypes from TCRβ repertoires. Expanding on work linking public TRBV-CDR3 sequences to HLA genotypes, we incorporated \"quasi-public\" metaclonotypes - composed of rarer TCRβ sequences with shared amino acid features - enriched by HLA genotypes. Using four TCRβseq datasets from 3,150 individuals, we applied TRBV gene partitioning and locality-sensitive hashing to identify ~96,000 TCRβ features strongly associated with specific HLA alleles from 71M input TCRs. Binary HLA classifiers built with these features achieved high balanced accuracy (>0.9) across common HLA-A (9/12), B (9/12), C (6/13), DRB1 (11/11) alleles and prevalent DPA1/DPB1 (6/10), DQA1/DQB1 (8/17) heterodimers. We also introduced a high-sensitivity calibration to support predictions in samples with as few as 5,000 unique clonotypes. Calibrated predictions with confidence filtering improved reliability. Beyond genotype imputation, TCR2HLA enables the discovery of novel HLA- and exposure-associated TCRs, as shown by the identification of SARS-CoV-2 related TCRs in a large COVID-19 dataset lacking HLA data. TCR2HLA provides a scalable framework for bridging the gap between TCRseq data and HLA genotype for biomarker discovery.</p>","PeriodicalId":20241,"journal":{"name":"PLoS Computational Biology","volume":"22 1","pages":"e1013767"},"PeriodicalIF":3.6,"publicationDate":"2026-01-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12810895/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145990156","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-15eCollection Date: 2026-01-01DOI: 10.1371/journal.pcbi.1013809
Josephine Solowiej-Wedderburn, Jennifer T Pentz, Ludvig Lizana, Bjoern O Schroeder, Peter A Lind, Eric Libby
[This corrects the article DOI: 10.1371/journal.pcbi.1013213.].
[这更正了文章DOI: 10.1371/journal.pcbi.1013213.]。
{"title":"Correction: Competition and cooperation: The plasticity of bacterial interactions across environments.","authors":"Josephine Solowiej-Wedderburn, Jennifer T Pentz, Ludvig Lizana, Bjoern O Schroeder, Peter A Lind, Eric Libby","doi":"10.1371/journal.pcbi.1013809","DOIUrl":"10.1371/journal.pcbi.1013809","url":null,"abstract":"<p><p>[This corrects the article DOI: 10.1371/journal.pcbi.1013213.].</p>","PeriodicalId":20241,"journal":{"name":"PLoS Computational Biology","volume":"22 1","pages":"e1013809"},"PeriodicalIF":3.6,"publicationDate":"2026-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12806839/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145985192","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-14eCollection Date: 2026-01-01DOI: 10.1371/journal.pcbi.1013880
Asmita Roy, Xianyang Zhang
In genome-wide epigenetic studies, determining how exposures (e.g., Single Nucleotide Polymorphisms) affect outcomes (e.g., gene expression) through intermediate variables, such as DNA methylation, is a key challenge. Mediation analysis provides a framework to identify these causal pathways; however, testing for mediation effects involves a complex composite null hypothesis. Existing methods, such as Sobel's test or the Max-P test, are often underpowered in this context because they rely on null distributions determined under only a subset of the null space and are not optimized for the multiple testing burden inherent in high-dimensional data. To address these limitations, we introduce MLFDR (Mediation Analysis using Local False Discovery Rates), a novel method for high-dimensional mediation analysis. MLFDR leverages local false discovery rates, calculated from the coefficients of structural equation models, to construct an optimal rejection region. We demonstrate theoretically and through simulation that MLFDR asymptotically controls the false discovery rate and achieves superior statistical power compared to recent high-dimensional mediation methods. In real data applications, MLFDR identified 20%-50% more significant mediators than existing methods, demonstrating its ability to uncover biological signals missed by conventional approaches.
{"title":"Powerful large scale inference in high dimensional mediation analysis.","authors":"Asmita Roy, Xianyang Zhang","doi":"10.1371/journal.pcbi.1013880","DOIUrl":"10.1371/journal.pcbi.1013880","url":null,"abstract":"<p><p>In genome-wide epigenetic studies, determining how exposures (e.g., Single Nucleotide Polymorphisms) affect outcomes (e.g., gene expression) through intermediate variables, such as DNA methylation, is a key challenge. Mediation analysis provides a framework to identify these causal pathways; however, testing for mediation effects involves a complex composite null hypothesis. Existing methods, such as Sobel's test or the Max-P test, are often underpowered in this context because they rely on null distributions determined under only a subset of the null space and are not optimized for the multiple testing burden inherent in high-dimensional data. To address these limitations, we introduce MLFDR (Mediation Analysis using Local False Discovery Rates), a novel method for high-dimensional mediation analysis. MLFDR leverages local false discovery rates, calculated from the coefficients of structural equation models, to construct an optimal rejection region. We demonstrate theoretically and through simulation that MLFDR asymptotically controls the false discovery rate and achieves superior statistical power compared to recent high-dimensional mediation methods. In real data applications, MLFDR identified 20%-50% more significant mediators than existing methods, demonstrating its ability to uncover biological signals missed by conventional approaches.</p>","PeriodicalId":20241,"journal":{"name":"PLoS Computational Biology","volume":"22 1","pages":"e1013880"},"PeriodicalIF":3.6,"publicationDate":"2026-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12829953/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145985187","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-14eCollection Date: 2026-01-01DOI: 10.1371/journal.pcbi.1013891
Mingming Lin, Kai Li, Xiaofan Wang, Juanjuan Sun, Kun Gong, Zhibin Wang, Pin Sun
<p><strong>Background: </strong>Heart failure with preserved ejection fraction (HFpEF) represents a heterogeneous syndrome with diverse pathophysiological mechanisms and limited therapeutic options. Peak strain dispersion (PSD) has emerged as a potential mediator in HFpEF pathophysiology. This study aimed to identify distinct HFpEF subtypes and investigate PSD's subtype-specific mediating pathways.</p><p><strong>Methods: </strong>This prospective single-center study included 150 HFpEF patients recruited from December 2023 to December 2024. Unsupervised K-means clustering was performed on the entire cohort to identify patient subtypes. For detailed analysis, rigorous data quality control was performed by removing cases with missing values in any of the 25 baseline features or outcome variables. Consequently, 84 patients with complete data were retained for analysis. Comprehensive clinical and echocardiographic data were collected, including PSD measured by speckle-tracking echocardiography and myocardial work parameters (global work waste and global work efficiency). Unsupervised K-means clustering was performed to identify distinct patient subtypes using eight key variables. Machine learning models with feature engineering (incorporating five clinically meaningful interaction terms: PSD_LVEF, age_HTN, eGFR_BNP, RWT_E/e', and GLS_LVMI) were developed to predict myocardial work parameters and assess feature importance using SHAP (SHapley Additive exPlanations) analysis. Nonlinear mediation analysis was conducted within each subtype to evaluate the mediating pathways through which clinical factors influence myocardial work outcomes.</p><p><strong>Results: </strong>Two distinct HFpEF subtypes were identified: Cluster 0 characterized by younger age (58.6 ± 13.2 years), severe renal dysfunction (eGFR 12.8[8.9-19.9] mL/min/1.73m²), higher PSD (56.0[48.0-64.5] ms), and lower global work efficiency; and Cluster 1 characterized by older age (71.2 ± 9.7 years), preserved renal function (eGFR 104.0[78.5-126.0] mL/min/1.73m²), lower PSD (41.0[35.0-49.0] ms), and higher GWE. Machine learning models achieved moderate to good predictive performance (R² = 0.58-0.61 for GWE and GWW). SHAP analysis revealed that PSD was the most important predictor, with the PSD×LVEF interaction term showing prominent importance in GWE prediction. Nonlinear mediation analysis demonstrated striking subtype-specific differences in mediation patterns.In Cluster 0, eGFR showed a trend toward mediating its effects on GWW through PSD (indirect effect = 0.313), reflecting complex cardiorenal interactions in younger patients with severe renal disease. In contrast, Cluster 1 demonstrated significant mediation effects: BNP's effect on GWW was significantly mediated through PSD (indirect effect = -0.4877, P < 0.05), and BNP's effect on GWE was entirely mediated through PSD (indirect effect = 0.5389, P < 0.05).</p><p><strong>Conclusion: </strong>This study identified two distinct HFpEF subtype
{"title":"Peak strain dispersion as a nonlinear mediator in HFpEF: Unraveling subtype-specific pathways via SHAP-augmented ensemble modeling.","authors":"Mingming Lin, Kai Li, Xiaofan Wang, Juanjuan Sun, Kun Gong, Zhibin Wang, Pin Sun","doi":"10.1371/journal.pcbi.1013891","DOIUrl":"10.1371/journal.pcbi.1013891","url":null,"abstract":"<p><strong>Background: </strong>Heart failure with preserved ejection fraction (HFpEF) represents a heterogeneous syndrome with diverse pathophysiological mechanisms and limited therapeutic options. Peak strain dispersion (PSD) has emerged as a potential mediator in HFpEF pathophysiology. This study aimed to identify distinct HFpEF subtypes and investigate PSD's subtype-specific mediating pathways.</p><p><strong>Methods: </strong>This prospective single-center study included 150 HFpEF patients recruited from December 2023 to December 2024. Unsupervised K-means clustering was performed on the entire cohort to identify patient subtypes. For detailed analysis, rigorous data quality control was performed by removing cases with missing values in any of the 25 baseline features or outcome variables. Consequently, 84 patients with complete data were retained for analysis. Comprehensive clinical and echocardiographic data were collected, including PSD measured by speckle-tracking echocardiography and myocardial work parameters (global work waste and global work efficiency). Unsupervised K-means clustering was performed to identify distinct patient subtypes using eight key variables. Machine learning models with feature engineering (incorporating five clinically meaningful interaction terms: PSD_LVEF, age_HTN, eGFR_BNP, RWT_E/e', and GLS_LVMI) were developed to predict myocardial work parameters and assess feature importance using SHAP (SHapley Additive exPlanations) analysis. Nonlinear mediation analysis was conducted within each subtype to evaluate the mediating pathways through which clinical factors influence myocardial work outcomes.</p><p><strong>Results: </strong>Two distinct HFpEF subtypes were identified: Cluster 0 characterized by younger age (58.6 ± 13.2 years), severe renal dysfunction (eGFR 12.8[8.9-19.9] mL/min/1.73m²), higher PSD (56.0[48.0-64.5] ms), and lower global work efficiency; and Cluster 1 characterized by older age (71.2 ± 9.7 years), preserved renal function (eGFR 104.0[78.5-126.0] mL/min/1.73m²), lower PSD (41.0[35.0-49.0] ms), and higher GWE. Machine learning models achieved moderate to good predictive performance (R² = 0.58-0.61 for GWE and GWW). SHAP analysis revealed that PSD was the most important predictor, with the PSD×LVEF interaction term showing prominent importance in GWE prediction. Nonlinear mediation analysis demonstrated striking subtype-specific differences in mediation patterns.In Cluster 0, eGFR showed a trend toward mediating its effects on GWW through PSD (indirect effect = 0.313), reflecting complex cardiorenal interactions in younger patients with severe renal disease. In contrast, Cluster 1 demonstrated significant mediation effects: BNP's effect on GWW was significantly mediated through PSD (indirect effect = -0.4877, P < 0.05), and BNP's effect on GWE was entirely mediated through PSD (indirect effect = 0.5389, P < 0.05).</p><p><strong>Conclusion: </strong>This study identified two distinct HFpEF subtype","PeriodicalId":20241,"journal":{"name":"PLoS Computational Biology","volume":"22 1","pages":"e1013891"},"PeriodicalIF":3.6,"publicationDate":"2026-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12829950/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145985166","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}