Kartik Singhal, Evelyn Schmidt, Susanna Kiwala, S Peter Goedegebuure, Christopher A Miller, Huiming Xia, Kelsy C Cotto, Jinglun Li, Jennie Yao, Luke Hendrickson, Miller M Richters, My H Hoang, Mariam Khanfar, Isabel Risch, Shelly O'Laughlin, Nancy Myers, Tammi Vickery, Sherri R Davies, Feiyu Du, Thomas B Mooney, Adam Coffman, Gue Su Chang, Jasreet Hundal, John E Garza, Michael D McLellan, Joshua F McMichael, John Maruska, William Blake Inabinett, William A Hoos, Rachel Karchin, Tanner M Johanns, Gavin P Dunn, Russel K Pachynski, Todd A Fehniger, Jeffrey P Ward, Jennifer A Foltz, William E Gillanders, Obi L Griffith, Malachi Griffith
Personalized neoantigen vaccines represent a promising immunotherapy approach that harnesses tumor-specific antigens to stimulate anti-tumor immune responses. However, the design of these vaccines requires sophisticated computational workflows to predict and prioritize neoantigen candidates from patient sequencing data, coupled with rigorous review to ensure candidate quality. While numerous computational tools exist for neoantigen prediction, to our knowledge, there are no established protocols detailing the complete process from raw sequencing data through systematic candidate selection. Here, we present ImmunoNX (Immunogenomics Neoantigen eXplorer), an end-to-end protocol for neoantigen prediction and vaccine design that has supported over 185 patients across 11 clinical trials. The workflow integrates tumor DNA/RNA and matched normal DNA sequencing data through a computational pipeline built with Workflow Definition Language (WDL) and executed via Cromwell on Google Cloud Platform. ImmunoNX employs consensus-based variant calling, in-silico HLA typing, and pVACtools for neoantigen prediction. Additionally, we describe a two-stage immunogenomics review process with prioritization of neoantigen candidates, enabled by pVACview, followed by manual assessment of variants using the Integrative Genomics Viewer (IGV). This workflow enables vaccine design in under three months. We demonstrate the protocol using the HCC1395 breast cancer cell line dataset, identifying 78 high-confidence neoantigen candidates from 322 initial predictions. Although demonstrated here for vaccine development, this workflow can be adapted for diverse neoantigen therapies and experiments. Therefore, this protocol provides the research community with a reproducible, version-controlled framework for designing personalized neoantigen vaccines, supported by detailed documentation, example datasets, and open-source code.
{"title":"ImmunoNX: a robust bioinformatics workflow to support personalized neoantigen vaccine trials.","authors":"Kartik Singhal, Evelyn Schmidt, Susanna Kiwala, S Peter Goedegebuure, Christopher A Miller, Huiming Xia, Kelsy C Cotto, Jinglun Li, Jennie Yao, Luke Hendrickson, Miller M Richters, My H Hoang, Mariam Khanfar, Isabel Risch, Shelly O'Laughlin, Nancy Myers, Tammi Vickery, Sherri R Davies, Feiyu Du, Thomas B Mooney, Adam Coffman, Gue Su Chang, Jasreet Hundal, John E Garza, Michael D McLellan, Joshua F McMichael, John Maruska, William Blake Inabinett, William A Hoos, Rachel Karchin, Tanner M Johanns, Gavin P Dunn, Russel K Pachynski, Todd A Fehniger, Jeffrey P Ward, Jennifer A Foltz, William E Gillanders, Obi L Griffith, Malachi Griffith","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Personalized neoantigen vaccines represent a promising immunotherapy approach that harnesses tumor-specific antigens to stimulate anti-tumor immune responses. However, the design of these vaccines requires sophisticated computational workflows to predict and prioritize neoantigen candidates from patient sequencing data, coupled with rigorous review to ensure candidate quality. While numerous computational tools exist for neoantigen prediction, to our knowledge, there are no established protocols detailing the complete process from raw sequencing data through systematic candidate selection. Here, we present ImmunoNX (Immunogenomics Neoantigen eXplorer), an end-to-end protocol for neoantigen prediction and vaccine design that has supported over 185 patients across 11 clinical trials. The workflow integrates tumor DNA/RNA and matched normal DNA sequencing data through a computational pipeline built with Workflow Definition Language (WDL) and executed via Cromwell on Google Cloud Platform. ImmunoNX employs consensus-based variant calling, in-silico HLA typing, and pVACtools for neoantigen prediction. Additionally, we describe a two-stage immunogenomics review process with prioritization of neoantigen candidates, enabled by pVACview, followed by manual assessment of variants using the Integrative Genomics Viewer (IGV). This workflow enables vaccine design in under three months. We demonstrate the protocol using the HCC1395 breast cancer cell line dataset, identifying 78 high-confidence neoantigen candidates from 322 initial predictions. Although demonstrated here for vaccine development, this workflow can be adapted for diverse neoantigen therapies and experiments. Therefore, this protocol provides the research community with a reproducible, version-controlled framework for designing personalized neoantigen vaccines, supported by detailed documentation, example datasets, and open-source code.</p>","PeriodicalId":93888,"journal":{"name":"ArXiv","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12709488/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145784042","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Timothy Keyes, Alison Callahan, Abby S Pandya, Nerissa Ambers, Juan M Banda, Miguel Fuentes, Carlene Lugtu, Pranav Masariya, Srikar Nallan, Connor O'Brien, Thomas Wang, Emily Alsentzer, Jonathan H Chen, Dev Dash, Matthew A Eisenberg, Patricia Garcia, Nikesh Kotecha, Anurang Revri, Michael A Pfeffer, Nigam H Shah, Sneha S Jain
Post-deployment monitoring of artificial intelligence (AI) systems in health care is essential to ensure their safety, quality, and sustained benefit-and to support governance decisions about which systems to update, modify, or decommission. Motivated by these needs, we developed a framework for monitoring deployed AI systems grounded in the mandate to take specific actions when they fail to behave as intended. This framework, which is now actively used at Stanford Health Care, is organized around three complementary principles: system integrity, performance, and impact. System integrity monitoring focuses on maximizing system uptime, detecting runtime errors, and identifying when changes to the surrounding IT ecosystem have unintended effects. Performance monitoring focuses on maintaining accurate system behavior in the face of changing health care practices (and thus input data) over time. Impact monitoring assesses whether a deployed system continues to have value in the form of benefit to clinicians and patients. Drawing on examples of deployed AI systems at our academic medical center, we provide practical guidance for creating monitoring plans based on these principles that specify which metrics to measure, when those metrics should be reviewed, who is responsible for acting when metrics change, and what concrete follow-up actions should be taken-for both traditional and generative AI. We also discuss challenges to implementing this framework, including the effort and cost of monitoring for health systems with limited resources and the difficulty of incorporating data-driven monitoring practices into complex organizations where conflicting priorities and definitions of success often coexist. This framework offers a practical template and starting point for health systems seeking to ensure that AI deployments remain safe and effective over time.
{"title":"Monitoring Deployed AI Systems in Health Care.","authors":"Timothy Keyes, Alison Callahan, Abby S Pandya, Nerissa Ambers, Juan M Banda, Miguel Fuentes, Carlene Lugtu, Pranav Masariya, Srikar Nallan, Connor O'Brien, Thomas Wang, Emily Alsentzer, Jonathan H Chen, Dev Dash, Matthew A Eisenberg, Patricia Garcia, Nikesh Kotecha, Anurang Revri, Michael A Pfeffer, Nigam H Shah, Sneha S Jain","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Post-deployment monitoring of artificial intelligence (AI) systems in health care is essential to ensure their safety, quality, and sustained benefit-and to support governance decisions about which systems to update, modify, or decommission. Motivated by these needs, we developed a framework for monitoring deployed AI systems grounded in the mandate to take specific actions when they fail to behave as intended. This framework, which is now actively used at Stanford Health Care, is organized around three complementary principles: system integrity, performance, and impact. System integrity monitoring focuses on maximizing system uptime, detecting runtime errors, and identifying when changes to the surrounding IT ecosystem have unintended effects. Performance monitoring focuses on maintaining accurate system behavior in the face of changing health care practices (and thus input data) over time. Impact monitoring assesses whether a deployed system continues to have value in the form of benefit to clinicians and patients. Drawing on examples of deployed AI systems at our academic medical center, we provide practical guidance for creating monitoring plans based on these principles that specify which metrics to measure, when those metrics should be reviewed, who is responsible for acting when metrics change, and what concrete follow-up actions should be taken-for both traditional and generative AI. We also discuss challenges to implementing this framework, including the effort and cost of monitoring for health systems with limited resources and the difficulty of incorporating data-driven monitoring practices into complex organizations where conflicting priorities and definitions of success often coexist. This framework offers a practical template and starting point for health systems seeking to ensure that AI deployments remain safe and effective over time.</p>","PeriodicalId":93888,"journal":{"name":"ArXiv","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12709494/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145784047","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Jonas A Gustafson, Jiadong Lin, Evan E Eichler, Danny E Miller
We present needLR, a structural variant (SV) annotation tool that can be used for filtering and prioritization of candidate pathogenic SVs from long-read sequencing data using population allele frequencies, annotations for genomic context, and gene-phenotype associations. When using population data from 500 presumably healthy individuals to evaluate nine test cases with known pathogenic SVs, needLR assigned allele frequencies to over 97.5% of all detected SVs and reduced the average number of novel genic SVs to 121 per case while retaining all known pathogenic variants.
{"title":"needLR: Long-read structural variant annotation with population-scale frequency estimation.","authors":"Jonas A Gustafson, Jiadong Lin, Evan E Eichler, Danny E Miller","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>We present needLR, a structural variant (SV) annotation tool that can be used for filtering and prioritization of candidate pathogenic SVs from long-read sequencing data using population allele frequencies, annotations for genomic context, and gene-phenotype associations. When using population data from 500 presumably healthy individuals to evaluate nine test cases with known pathogenic SVs, needLR assigned allele frequencies to over 97.5% of all detected SVs and reduced the average number of novel genic SVs to 121 per case while retaining all known pathogenic variants.</p>","PeriodicalId":93888,"journal":{"name":"ArXiv","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12709490/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145784078","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Ziyue Zheng, Loay J Jabre, Matthew McIlvin, Mak A Saito, Sangwon Hyun
Intracellular compartmentalization of proteins underpins their function and the metabolic processes they sustain. Various mass spectrometry-based proteomics methods (subcellular spatial proteomics) now allow high throughput subcellular protein localization. Yet, the curation, analysis and interpretation of these data remain challenging, particularly in non-model organisms where establishing reliable marker proteins is difficult, and in contexts where experimental replication and subcellular fractionation are constrained. Here, we develop FSPmix, a semi-supervised functional clustering method implemented as an open-source R package, which leverages partial annotations from a subset of marker proteins to predict protein subcellular localization from subcellular spatial proteomics data. This method explicitly assumes that protein signatures vary smoothly across subcellular fractions, enabling more robust inference under low signal-to-noise data regimes. We applied FSPmix to a subcellular proteomics dataset from a marine diatom, allowing us to assign probabilistic localizations to proteins and uncover potentially new protein functions. Altogether, this work lays the foundation for more robust statistical analysis and interpretation of subcellular proteomics datasets, particularly in understudied organisms.
{"title":"Subcellular proteome niche discovery using semi-supervised functional clustering.","authors":"Ziyue Zheng, Loay J Jabre, Matthew McIlvin, Mak A Saito, Sangwon Hyun","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Intracellular compartmentalization of proteins underpins their function and the metabolic processes they sustain. Various mass spectrometry-based proteomics methods (subcellular spatial proteomics) now allow high throughput subcellular protein localization. Yet, the curation, analysis and interpretation of these data remain challenging, particularly in non-model organisms where establishing reliable marker proteins is difficult, and in contexts where experimental replication and subcellular fractionation are constrained. Here, we develop FSPmix, a semi-supervised functional clustering method implemented as an open-source R package, which leverages partial annotations from a subset of marker proteins to predict protein subcellular localization from subcellular spatial proteomics data. This method explicitly assumes that protein signatures vary smoothly across subcellular fractions, enabling more robust inference under low signal-to-noise data regimes. We applied FSPmix to a subcellular proteomics dataset from a marine diatom, allowing us to assign probabilistic localizations to proteins and uncover potentially new protein functions. Altogether, this work lays the foundation for more robust statistical analysis and interpretation of subcellular proteomics datasets, particularly in understudied organisms.</p>","PeriodicalId":93888,"journal":{"name":"ArXiv","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12709497/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145784033","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
We studied the structural alterations between healthy and diseased brain tissues using a multiparametric framework combining fractal analysis, fractal functional transformation, multifractal analysis, and the Inverse Participation Ratio (IPR) analysis. Accurate characterization of brain tissue microstructure is crucial for early detection and diagnosis of cancer. By applying box-counting methods on brightfield microscopy images, we estimated the fractal dimension and its logarithmic and functional forms to highlight spatial irregularities in the tissue architecture. While and exhibited long-tailed distributions distinguishing healthy from cancer tissues, provided significantly improved differentiation by emphasizing local structural variations. Additionally, multifractal analysis revealed broader vs curves in cancerous samples, reflecting higher heterogeneity. IPR analysis based on light localization further demonstrated increased nanoscale variations in mass density, reflecting higher structural disorder in cancer tissues. Combining these complementary approaches creates a robust framework for measuring tissue complexity and holds great potential to improve microscopic diagnostic methods for brain cancer detection.
本研究采用分形分析、分形函数变换、多重分形分析和逆参与比(IPR)分析相结合的多参数框架研究了健康和患病脑组织的结构变化。脑组织微观结构的准确表征对于癌症的早期发现和诊断至关重要。通过对明场显微镜图像应用盒计数方法,我们估计了分形维数(Df)及其对数(ln(Df))和函数(ln(Dtf))形式,以突出组织结构中的空间不规则性。虽然Df和ln(Df)表现出区分健康组织和癌症组织的长尾分布,但ln(Dtf)通过强调局部结构变异,显著改善了分化。此外,多重分形分析显示,癌症样本的f(α) vs α曲线更宽,反映了更高的异质性。基于光定位的IPR分析进一步表明,质量密度的纳米级变化增加,反映了癌组织中更高的结构紊乱。将这些互补的方法结合起来,为测量组织复杂性创造了一个强大的框架,并具有改善脑癌检测的显微诊断方法的巨大潜力。
{"title":"Quantitative Characterization of Brain Tissue Alterations in Brain Cancer Using Fractal, Multifractal, and IPR Metrics.","authors":"Mousa Alrubayan, Santanu Maity, Prabhakar Pradhan","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>We studied the structural alterations between healthy and diseased brain tissues using a multiparametric framework combining fractal analysis, fractal functional transformation, multifractal analysis, and the Inverse Participation Ratio (IPR) analysis. Accurate characterization of brain tissue microstructure is crucial for early detection and diagnosis of cancer. By applying box-counting methods on brightfield microscopy images, we estimated the fractal dimension <math> <mrow> <mfenced> <mrow><msub><mi>D</mi> <mi>f</mi></msub> </mrow> </mfenced> </mrow> </math> and its logarithmic <math> <mrow> <mfenced><mrow><mi>l</mi> <mi>n</mi> <mfenced> <mrow><msub><mi>D</mi> <mi>f</mi></msub> </mrow> </mfenced> </mrow> </mfenced> </mrow> </math> and functional <math> <mrow> <mfenced><mrow><mi>l</mi> <mi>n</mi> <mfenced> <mrow><msub><mi>D</mi> <mrow><mi>t</mi> <mi>f</mi></mrow> </msub> </mrow> </mfenced> </mrow> </mfenced> </mrow> </math> forms to highlight spatial irregularities in the tissue architecture. While <math> <mrow><msub><mi>D</mi> <mi>f</mi></msub> </mrow> </math> and <math><mrow><mi>l</mi> <mi>n</mi> <mfenced> <mrow><msub><mi>D</mi> <mi>f</mi></msub> </mrow> </mfenced> </mrow> </math> exhibited long-tailed distributions distinguishing healthy from cancer tissues, <math><mrow><mi>l</mi> <mi>n</mi> <mfenced> <mrow><msub><mi>D</mi> <mrow><mi>t</mi> <mi>f</mi></mrow> </msub> </mrow> </mfenced> </mrow> </math> provided significantly improved differentiation by emphasizing local structural variations. Additionally, multifractal analysis revealed broader <math><mrow><mi>f</mi> <mo>(</mo> <mi>α</mi> <mo>)</mo></mrow> </math> vs <math><mi>α</mi></math> curves in cancerous samples, reflecting higher heterogeneity. IPR analysis based on light localization further demonstrated increased nanoscale variations in mass density, reflecting higher structural disorder in cancer tissues. Combining these complementary approaches creates a robust framework for measuring tissue complexity and holds great potential to improve microscopic diagnostic methods for brain cancer detection.</p>","PeriodicalId":93888,"journal":{"name":"ArXiv","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12709487/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145784081","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Santanu Maity, Mousa Alrubayan, Mohammad Moshahid Khan, Prabhakar Pradhan
Alzheimer's disease (AD) is characterized by progressive microstructural deterioration in brain tissue, yet conventional imaging and histopathology often lack the sensitivity needed to detect subtle early-stage changes. Here, we present a multiparametric framework combining fractal and multifractal analysis and their distributions to quantify structural alterations in human brain tissue affected by AD. Moreover, from the fractal and multifractal formalism, we introduced an innovative fractal functional distribution method, a novel technique that transforms fractal distribution into a Gaussian form. Statistically, these distribution parameters are easy to interpret and can distinguish between control and diseased tissues. Across samples, we identify pronounced threshold-dependent behavior of fractal and multifractal parameters, reflecting the intrinsic sparsity and heterogeneous intensity landscape of brain tissue. These threshold-sensitive signatures provide a framework for quantitative stage detection and may serve as biomarkers for early pathological transitions. In addition, we studied structural disorder and complexity using our established light localization technique, inverse participation ratio (IPR) analysis. IPR-based analysis demonstrates that increasing IPR pixel size highlights the elevation of structural alterations with disease progression. Together, these integrative analyses establish a robust, multi-scale quantitative framework for detecting microstructural alterations in AD, providing a promising foundation for early diagnosis and improved pathological assessment.
{"title":"Alterations of brain tissue structural complexity and disorder in Alzheimer's disease (AD): Fractal, multifractal, fractal transformation, and disorder strength analyses.","authors":"Santanu Maity, Mousa Alrubayan, Mohammad Moshahid Khan, Prabhakar Pradhan","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Alzheimer's disease (AD) is characterized by progressive microstructural deterioration in brain tissue, yet conventional imaging and histopathology often lack the sensitivity needed to detect subtle early-stage changes. Here, we present a multiparametric framework combining fractal and multifractal analysis and their distributions to quantify structural alterations in human brain tissue affected by AD. Moreover, from the fractal and multifractal formalism, we introduced an innovative fractal functional distribution method, a novel technique that transforms fractal distribution into a Gaussian form. Statistically, these distribution parameters are easy to interpret and can distinguish between control and diseased tissues. Across samples, we identify pronounced threshold-dependent behavior of fractal and multifractal parameters, reflecting the intrinsic sparsity and heterogeneous intensity landscape of brain tissue. These threshold-sensitive signatures provide a framework for quantitative stage detection and may serve as biomarkers for early pathological transitions. In addition, we studied structural disorder and complexity using our established light localization technique, inverse participation ratio (IPR) analysis. IPR-based analysis demonstrates that increasing IPR pixel size highlights the elevation of structural alterations with disease progression. Together, these integrative analyses establish a robust, multi-scale quantitative framework for detecting microstructural alterations in AD, providing a promising foundation for early diagnosis and improved pathological assessment.</p>","PeriodicalId":93888,"journal":{"name":"ArXiv","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12709484/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145783937","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Anqi Feng, Zhangxing Bian, Samuel W Remedios, Savannah P Hays, Blake E Dewey, Alexa Colinco, Jiachen Zhuo, Dan Benjamini, Jerry L Prince
Accurate segmentation of thalamic nuclei from magnetic resonance images is important due to the distinct roles of these nuclei in overall brain function and to their differential involvement in neurological and psychiatric disorders. However, segmentation remains challenging given the small size of many nuclei, limited intrathalamic contrast and image resolution, and inter-subject anatomical variability. In this work, we present CATNUS (Coordinate-Aware Thalamic Nuclei Segmentation), segmenting 13 thalamic nuclei (or nuclear groups) using a 3D U-Net architecture enhanced with coordinate convolution layers, which provide more precise localization of both large and small nuclei. To support broad clinical applicability, we provide pre-trained model variants that can operate on quantitative T1 maps as well as on widely used magnetization-prepared rapid gradient echo (MPRAGE) and fast gray matter acquisition T1 inversion recovery (FGATIR) sequences. We benchmarked CATNUS against established methods, including FreeSurfer, THOMAS and HIPS-THOMAS, demonstrating improved segmentation accuracy and robust test-retest reliability across multiple nuclei. Furthermore, CATNUS demonstrated strong out-of-distribution generalization on traveling-subject datasets spanning multiple scanners, field strengths, and vendors, producing reliable and anatomically coherent segmentations across diverse acquisition conditions. Overall, CATNUS provides an accurate and generalizable solution for thalamic nuclei segmentation, with strong potential to facilitate large-scale neuroimaging studies and support real-world clinical assessment.
{"title":"CATNUS: Coordinate-Aware Thalamic Nuclei Segmentation Using T1-Weighted MRI.","authors":"Anqi Feng, Zhangxing Bian, Samuel W Remedios, Savannah P Hays, Blake E Dewey, Alexa Colinco, Jiachen Zhuo, Dan Benjamini, Jerry L Prince","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Accurate segmentation of thalamic nuclei from magnetic resonance images is important due to the distinct roles of these nuclei in overall brain function and to their differential involvement in neurological and psychiatric disorders. However, segmentation remains challenging given the small size of many nuclei, limited intrathalamic contrast and image resolution, and inter-subject anatomical variability. In this work, we present CATNUS (Coordinate-Aware Thalamic Nuclei Segmentation), segmenting 13 thalamic nuclei (or nuclear groups) using a 3D U-Net architecture enhanced with coordinate convolution layers, which provide more precise localization of both large and small nuclei. To support broad clinical applicability, we provide pre-trained model variants that can operate on quantitative T1 maps as well as on widely used magnetization-prepared rapid gradient echo (MPRAGE) and fast gray matter acquisition T1 inversion recovery (FGATIR) sequences. We benchmarked CATNUS against established methods, including FreeSurfer, THOMAS and HIPS-THOMAS, demonstrating improved segmentation accuracy and robust test-retest reliability across multiple nuclei. Furthermore, CATNUS demonstrated strong out-of-distribution generalization on traveling-subject datasets spanning multiple scanners, field strengths, and vendors, producing reliable and anatomically coherent segmentations across diverse acquisition conditions. Overall, CATNUS provides an accurate and generalizable solution for thalamic nuclei segmentation, with strong potential to facilitate large-scale neuroimaging studies and support real-world clinical assessment.</p>","PeriodicalId":93888,"journal":{"name":"ArXiv","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12687853/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145727766","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The ongoing explosion of genome sequence data is transforming how we reconstruct and understand the histories of biological systems. Across biological scales, from individual cells to populations and species, trees-based models provide a common framework for representing ancestry. Once limited to species phylogenetics, "tree thinking" now extends deeply to population genomics and cell biology, revealing the genealogical structure of genetic and phenotypic variation within and across organisms. Recently, there have been great methodological and computational advances on tree-based methods, including methods for inferring ancestral recombination graphs in populations, phylogenetic frameworks for comparative genomics, and lineage-tracing techniques in developmental and cancer biology. Despite differences in data types and biological contexts, these approaches share core statistical and algorithmic challenges: efficiently inferring branching histories from genomic information, integrating temporal and spatial signals, and connecting genealogical structures to evolutionary and functional processes. Recognizing these shared foundations opens opportunities for cross-fertilization between fields that are traditionally studied in isolation. By examining how tree-based methods are applied across cellular, population, and species scales, we identify the conceptual parallels that unite them and the distinct challenges that each domain presents. These comparisons offer new perspectives that can inform algorithmic innovations and lead to more powerful inference strategies across the full spectrum of biological systems.
{"title":"Tree Thinking in the Genomic Era: Unifying Models Across Cells, Populations, and Species.","authors":"Yun Deng, Shing H Zhan, Yulin Zhang, Chao Zhang, Bingjie Chen","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>The ongoing explosion of genome sequence data is transforming how we reconstruct and understand the histories of biological systems. Across biological scales, from individual cells to populations and species, trees-based models provide a common framework for representing ancestry. Once limited to species phylogenetics, \"tree thinking\" now extends deeply to population genomics and cell biology, revealing the genealogical structure of genetic and phenotypic variation within and across organisms. Recently, there have been great methodological and computational advances on tree-based methods, including methods for inferring ancestral recombination graphs in populations, phylogenetic frameworks for comparative genomics, and lineage-tracing techniques in developmental and cancer biology. Despite differences in data types and biological contexts, these approaches share core statistical and algorithmic challenges: efficiently inferring branching histories from genomic information, integrating temporal and spatial signals, and connecting genealogical structures to evolutionary and functional processes. Recognizing these shared foundations opens opportunities for cross-fertilization between fields that are traditionally studied in isolation. By examining how tree-based methods are applied across cellular, population, and species scales, we identify the conceptual parallels that unite them and the distinct challenges that each domain presents. These comparisons offer new perspectives that can inform algorithmic innovations and lead to more powerful inference strategies across the full spectrum of biological systems.</p>","PeriodicalId":93888,"journal":{"name":"ArXiv","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12687851/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145727717","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The basic reproduction number ( ) is an epidemiological metric that represents the average number of new infections caused by a single infectious individual in a completely susceptible population. The methodology for calculating this metric is well-defined for numerous model types, including, most prominently, Ordinary Differential Equations (ODEs). The basic reproduction number is used in disease modeling to predict the potential of an outbreak and the transmissibility of a disease, as well as by governments to inform public health interventions and resource allocation for controlling the spread of diseases. A Petri Net (PN) is a directed bipartite graph where places, transitions, arcs, and the firing of the arcs determine the dynamic behavior of the system. Petri Net models have been an increasingly used tool within the epidemiology community. However, no generalized method for calculating directly from PN models has been established. Thus, in this paper, we establish a generalized computational framework for calculating directly from Petri Net models. We adapt the next-generation matrix method to be compatible with multiple Petri Net formalisms, including both deterministic Variable Arc Weight Petri Nets (VAPNs) and stochastic continuous-time Petri Nets (SPNs). We demonstrate the method's versatility on a range of complex epidemiological models, including those with multiple strains, asymptomatic states, and nonlinear dynamics. Crucially, we numerically validate our framework by demonstrating that the analytically derived values are in strong agreement with those estimated from simulation data, thereby confirming the method's accuracy and practical utility.
{"title":"The Basic Reproduction Number for Petri Net Models: A Next-Generation Matrix Approach.","authors":"Trevor Reckell, Beckett Sterner, Petar Jevtić","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>The basic reproduction number ( <math> <mrow><msub><mi>R</mi> <mn>0</mn></msub> </mrow> </math> ) is an epidemiological metric that represents the average number of new infections caused by a single infectious individual in a completely susceptible population. The methodology for calculating this metric is well-defined for numerous model types, including, most prominently, Ordinary Differential Equations (ODEs). The basic reproduction number is used in disease modeling to predict the potential of an outbreak and the transmissibility of a disease, as well as by governments to inform public health interventions and resource allocation for controlling the spread of diseases. A Petri Net (PN) is a directed bipartite graph where places, transitions, arcs, and the firing of the arcs determine the dynamic behavior of the system. Petri Net models have been an increasingly used tool within the epidemiology community. However, no generalized method for calculating <math> <mrow><msub><mi>R</mi> <mn>0</mn></msub> </mrow> </math> directly from PN models has been established. Thus, in this paper, we establish a generalized computational framework for calculating <math> <mrow><msub><mi>R</mi> <mn>0</mn></msub> </mrow> </math> directly from Petri Net models. We adapt the next-generation matrix method to be compatible with multiple Petri Net formalisms, including both deterministic Variable Arc Weight Petri Nets (VAPNs) and stochastic continuous-time Petri Nets (SPNs). We demonstrate the method's versatility on a range of complex epidemiological models, including those with multiple strains, asymptomatic states, and nonlinear dynamics. Crucially, we numerically validate our framework by demonstrating that the analytically derived <math> <mrow><msub><mi>R</mi> <mn>0</mn></msub> </mrow> </math> values are in strong agreement with those estimated from simulation data, thereby confirming the method's accuracy and practical utility.</p>","PeriodicalId":93888,"journal":{"name":"ArXiv","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12687848/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145727755","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Kevin Keomanee-Dizon, Yaakov Clenman, Alejandra Duran, Sergey Ryabichko, Pauline Hansen, Tohn Borjigin, Richard Thornton, Jared E Toettcher, Harold M McNamara
High-numerical-aperture (NA) oblique plane microscopy enables noninvasive fluorescence imaging of subcellular dynamics without requiring radical sample modification. However, performance degrades at depth in multicellular specimens as scattering and refractive-index heterogeneity raise out-of-focus background. We report a two-photon oblique plane microscope that improves resolution at depth by combining high-NA single-objective detection with multiphoton plane illumination. The microscope achieves $sim!300$ nm lateral and $sim!650$ nm axial resolution, with single-molecule sensitivity in vivo. Compared with two-photon point scanning, the lower illumination NA delivers an order of magnitude lower peak intensity, enabling $>!5times$ faster volumetric acquisition (up to $3.25 times 10^6$ voxels s$^{-1}$) with reduced photodamage. In multicellular contexts, near-infrared nonlinear excitation enhances contrast throughout the illumination depth by $sim!2times$ and restores volumetric resolving power by $>!2times$ relative to linear excitation. We demonstrate these capabilities through molecular imaging of epithelial tissue, stem-cell-derived gastruloids, and living fruit fly embryos, including multicolor transcription-factor dynamics, optogenetic subcellular control, and single-mRNA tracking, all using standard glass-based mounting.
{"title":"Depth-enhanced molecular imaging with two-photon oblique plane microscopy.","authors":"Kevin Keomanee-Dizon, Yaakov Clenman, Alejandra Duran, Sergey Ryabichko, Pauline Hansen, Tohn Borjigin, Richard Thornton, Jared E Toettcher, Harold M McNamara","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>High-numerical-aperture (NA) oblique plane microscopy enables noninvasive fluorescence imaging of subcellular dynamics without requiring radical sample modification. However, performance degrades at depth in multicellular specimens as scattering and refractive-index heterogeneity raise out-of-focus background. We report a two-photon oblique plane microscope that improves resolution at depth by combining high-NA single-objective detection with multiphoton plane illumination. The microscope achieves $sim!300$ nm lateral and $sim!650$ nm axial resolution, with single-molecule sensitivity in vivo. Compared with two-photon point scanning, the lower illumination NA delivers an order of magnitude lower peak intensity, enabling $>!5times$ faster volumetric acquisition (up to $3.25 times 10^6$ voxels s$^{-1}$) with reduced photodamage. In multicellular contexts, near-infrared nonlinear excitation enhances contrast throughout the illumination depth by $sim!2times$ and restores volumetric resolving power by $>!2times$ relative to linear excitation. We demonstrate these capabilities through molecular imaging of epithelial tissue, stem-cell-derived gastruloids, and living fruit fly embryos, including multicolor transcription-factor dynamics, optogenetic subcellular control, and single-mRNA tracking, all using standard glass-based mounting.</p>","PeriodicalId":93888,"journal":{"name":"ArXiv","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12642766/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145607864","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}