Pub Date : 2026-01-14Epub Date: 2025-11-13DOI: 10.1016/j.xgen.2025.101062
Haitao Xiang, Xiangyu Guan, Yaohua Wei, Shuzhen Luo, Haibo Zhang, Fanyu Bu, Yixin Yan, Yunyun Fu, Yijian Li, Qumiao Xu, Penghui Lin, Dongbing Liu, Xinlan Zhou, Feng Gao, Tai Chen, Guangjun Nie, Kui Wu, Ying Gu, Longqi Liu, Ziqing Ye, Xiaojian Wu, Ruifang Zhao, Siqi Liu, Xuan Dong
Tumor-specific antigens (TSAs) are crucial for activating T cells against cancer, but traditional discovery methods focusing on exonic mutations overlook non-canonical TSAs from non-coding regions. We employed an integrative proteogenomic strategy combining whole-genome and RNA sequencing with immunoprecipitation mass spectrometry to comprehensively explore TSA generation in colorectal cancer patients. Analysis of 10 paired tumor samples identified 96 mutated major histocompatibility complex class I-presented neo-epitopes, with 80.21% originating from non-coding regions. In hypermutated tumors with high mutational burden, neo-epitopes predominantly arose from intergenic and intronic areas, while in non-hypermutated tumors with low mutational burden, they mainly stemmed from coding variations and alternative splicing events. Functional validation in mouse models demonstrated that mutated non-canonical neo-epitopes effectively activated CD8+ T cells and significantly suppressed tumor growth. These findings underscore the importance of considering the entire genomic landscape in TSA discovery, suggesting new avenues for personalized immunotherapy.
{"title":"Predominant mutated non-canonical tumor-specific antigens identified by proteogenomics demonstrate immunogenicity and tumor suppression in CRC.","authors":"Haitao Xiang, Xiangyu Guan, Yaohua Wei, Shuzhen Luo, Haibo Zhang, Fanyu Bu, Yixin Yan, Yunyun Fu, Yijian Li, Qumiao Xu, Penghui Lin, Dongbing Liu, Xinlan Zhou, Feng Gao, Tai Chen, Guangjun Nie, Kui Wu, Ying Gu, Longqi Liu, Ziqing Ye, Xiaojian Wu, Ruifang Zhao, Siqi Liu, Xuan Dong","doi":"10.1016/j.xgen.2025.101062","DOIUrl":"10.1016/j.xgen.2025.101062","url":null,"abstract":"<p><p>Tumor-specific antigens (TSAs) are crucial for activating T cells against cancer, but traditional discovery methods focusing on exonic mutations overlook non-canonical TSAs from non-coding regions. We employed an integrative proteogenomic strategy combining whole-genome and RNA sequencing with immunoprecipitation mass spectrometry to comprehensively explore TSA generation in colorectal cancer patients. Analysis of 10 paired tumor samples identified 96 mutated major histocompatibility complex class I-presented neo-epitopes, with 80.21% originating from non-coding regions. In hypermutated tumors with high mutational burden, neo-epitopes predominantly arose from intergenic and intronic areas, while in non-hypermutated tumors with low mutational burden, they mainly stemmed from coding variations and alternative splicing events. Functional validation in mouse models demonstrated that mutated non-canonical neo-epitopes effectively activated CD8<sup>+</sup> T cells and significantly suppressed tumor growth. These findings underscore the importance of considering the entire genomic landscape in TSA discovery, suggesting new avenues for personalized immunotherapy.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"101062"},"PeriodicalIF":11.1,"publicationDate":"2026-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145524524","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-14Epub Date: 2025-11-10DOI: 10.1016/j.xgen.2025.101061
Zepeng Mu, Haley E Randolph, Raúl Aguirre-Gamboa, Ellen Ketter, Anne Dumaine, Veronica Locher, Cary Brandolino, Xuanyao Liu, Daniel E Kaufmann, Luis B Barreiro, Yang I Li
Only one-third of immune-associated genome-wide association study (GWAS) loci colocalize with expression quantitative trait loci (eQTLs), leaving most mechanisms unresolved. To address this, we created a unified single-cell chromatin accessibility (scATAC) map of ∼280,000 peripheral immune cells from 48 individuals, including 20 COVID-19 patients. Topic modeling of scATAC data identified continuous cell states and revealed disease-relevant cellular contexts. We identified 37,390 chromatin accessibility QTLs (caQTLs) at 10% false discovery rate and observed extensive sharing of caQTLs, with <20% confined to a single context. Notably, caQTLs explained ∼50% more GWAS loci compared to eQTLs, nominating putative causal genes for some unexplained loci. Yet most GWAS-colocalizing caQTLs lacked eQTL support, limiting causal inference from chromatin data alone. Thus, while caQTLs can improve GWAS interpretation, robust mechanistic insights require integration with gene expression and other functional evidence. Our work underscores that cellular context is critical for regulatory variant interpretation and emphasizes the need to map genetic effects in disease-relevant cell states.
{"title":"Impact of disease-associated chromatin accessibility QTLs across immune cell types and contexts.","authors":"Zepeng Mu, Haley E Randolph, Raúl Aguirre-Gamboa, Ellen Ketter, Anne Dumaine, Veronica Locher, Cary Brandolino, Xuanyao Liu, Daniel E Kaufmann, Luis B Barreiro, Yang I Li","doi":"10.1016/j.xgen.2025.101061","DOIUrl":"10.1016/j.xgen.2025.101061","url":null,"abstract":"<p><p>Only one-third of immune-associated genome-wide association study (GWAS) loci colocalize with expression quantitative trait loci (eQTLs), leaving most mechanisms unresolved. To address this, we created a unified single-cell chromatin accessibility (scATAC) map of ∼280,000 peripheral immune cells from 48 individuals, including 20 COVID-19 patients. Topic modeling of scATAC data identified continuous cell states and revealed disease-relevant cellular contexts. We identified 37,390 chromatin accessibility QTLs (caQTLs) at 10% false discovery rate and observed extensive sharing of caQTLs, with <20% confined to a single context. Notably, caQTLs explained ∼50% more GWAS loci compared to eQTLs, nominating putative causal genes for some unexplained loci. Yet most GWAS-colocalizing caQTLs lacked eQTL support, limiting causal inference from chromatin data alone. Thus, while caQTLs can improve GWAS interpretation, robust mechanistic insights require integration with gene expression and other functional evidence. Our work underscores that cellular context is critical for regulatory variant interpretation and emphasizes the need to map genetic effects in disease-relevant cell states.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"101061"},"PeriodicalIF":11.1,"publicationDate":"2026-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145496864","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-14DOI: 10.1016/j.xgen.2025.101135
Jeffrey Rogers
The human complement of chromosomes differs from our closest primate relatives by virtue of a unique chromosome fusion event. In this issue of Cell Genomics, Yang et al. provide the first detailed analysis of the site of chromosome fusion and reconstruct the complex evolutionary relationships among the genomic elements within the human fusion site and their related sequences in our great ape relatives.
{"title":"Resolution of a human chromosomal mystery: Evolutionary complexity revealed.","authors":"Jeffrey Rogers","doi":"10.1016/j.xgen.2025.101135","DOIUrl":"https://doi.org/10.1016/j.xgen.2025.101135","url":null,"abstract":"<p><p>The human complement of chromosomes differs from our closest primate relatives by virtue of a unique chromosome fusion event. In this issue of Cell Genomics, Yang et al. provide the first detailed analysis of the site of chromosome fusion and reconstruct the complex evolutionary relationships among the genomic elements within the human fusion site and their related sequences in our great ape relatives.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":"6 1","pages":"101135"},"PeriodicalIF":11.1,"publicationDate":"2026-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145992100","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-12DOI: 10.1016/j.xgen.2025.101125
Aaron M Allen, Megan C Neville, Tetsuya Nojima, Faredin Alejevski, Stephen F Goodwin
Sex differences in behaviors arise from variations in female and male nervous systems, yet the cellular and molecular bases of these differences remain poorly defined. Here, we employ an unbiased, single-cell transcriptomic approach to investigate how sex influences the adult Drosophila melanogaster brain. We demonstrate that sex differences do not result from large-scale transcriptional reprogramming, but rather from selective modifications within shared developmental lineages mediated by the sex-differentiating transcription factors Doublesex and Fruitless. We reveal, with unprecedented resolution, the extraordinary genetic diversity within these sexually dimorphic cell types and find that birth order represents a novel axis of sexual differentiation. Neuronal identity in the adult reflects spatiotemporal patterning and sex-specific survival, with female-biased neurons emerging early and male-biased neurons arising later. This pattern reframes dimorphic neurons as "paralogous" rather than "orthologous," suggesting sex leverages distinct developmental windows to build behavioral circuits, and highlights a role for exaptation in diversifying the brain.
{"title":"Differential neuronal survival defines a novel axis of sexual dimorphism in the Drosophila brain.","authors":"Aaron M Allen, Megan C Neville, Tetsuya Nojima, Faredin Alejevski, Stephen F Goodwin","doi":"10.1016/j.xgen.2025.101125","DOIUrl":"10.1016/j.xgen.2025.101125","url":null,"abstract":"<p><p>Sex differences in behaviors arise from variations in female and male nervous systems, yet the cellular and molecular bases of these differences remain poorly defined. Here, we employ an unbiased, single-cell transcriptomic approach to investigate how sex influences the adult Drosophila melanogaster brain. We demonstrate that sex differences do not result from large-scale transcriptional reprogramming, but rather from selective modifications within shared developmental lineages mediated by the sex-differentiating transcription factors Doublesex and Fruitless. We reveal, with unprecedented resolution, the extraordinary genetic diversity within these sexually dimorphic cell types and find that birth order represents a novel axis of sexual differentiation. Neuronal identity in the adult reflects spatiotemporal patterning and sex-specific survival, with female-biased neurons emerging early and male-biased neurons arising later. This pattern reframes dimorphic neurons as \"paralogous\" rather than \"orthologous,\" suggesting sex leverages distinct developmental windows to build behavioral circuits, and highlights a role for exaptation in diversifying the brain.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"101125"},"PeriodicalIF":11.1,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145968107","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-31DOI: 10.1016/j.xgen.2025.101108
Elisabeth Rebboah, Ryan Weber, Elnaz Abdollahzadeh, Nikhila Swarna, Delaney K Sullivan, Diane Trout, Fairlie Reese, Heidi Yahan Liang, Ghassan Filimban, Parvin Mahdipoor, Margaret Duffield, Romina Mojaverzargar, Erisa Taghizadeh, Negar Fattahi, Negar Mojgani, Haoran Zhang, Rebekah K Loving, Maria Carilli, A Sina Booeshaghi, Shimako Kawauchi, Ingileif B Hallgrímsdóttir, Brian A Williams, Grant R MacGregor, Lior Pachter, Barbara J Wold, Ali Mortazavi
Mapping the impact of genomic variation on gene expression provides insight into the molecular basis of complex phenotypic traits and disease predisposition. Mouse models offer a controlled framework to capture genomic diversity across tissues. As part of the IGVF consortium, we profiled the transcriptomes of eight tissues from each founder strain of the Collaborative Cross using single-nucleus RNA sequencing. The resulting "8-cube" dataset contains 5.2 million nuclei across 106 cell types and cell states. Transcriptome variation correlated with genetic divergence, highest in the wild-derived strains. Heart and brain were relatively similar, whereas gonads, adrenal gland, skeletal muscle, kidney, and liver showed greater divergence. Variation often concentrated in specific cell types and states, especially those linked to immune and metabolic traits. The founder 8-cube dataset provides rich transcriptome signatures to help explain strain-specific traits and disease states and serves as a foundation for deeper analysis of these tissues across diverse mouse genotypes.
{"title":"Systematic cell-type resolved transcriptomes of 8 tissues in 8 lab and wild-derived mouse strains capture global and local expression variation.","authors":"Elisabeth Rebboah, Ryan Weber, Elnaz Abdollahzadeh, Nikhila Swarna, Delaney K Sullivan, Diane Trout, Fairlie Reese, Heidi Yahan Liang, Ghassan Filimban, Parvin Mahdipoor, Margaret Duffield, Romina Mojaverzargar, Erisa Taghizadeh, Negar Fattahi, Negar Mojgani, Haoran Zhang, Rebekah K Loving, Maria Carilli, A Sina Booeshaghi, Shimako Kawauchi, Ingileif B Hallgrímsdóttir, Brian A Williams, Grant R MacGregor, Lior Pachter, Barbara J Wold, Ali Mortazavi","doi":"10.1016/j.xgen.2025.101108","DOIUrl":"10.1016/j.xgen.2025.101108","url":null,"abstract":"<p><p>Mapping the impact of genomic variation on gene expression provides insight into the molecular basis of complex phenotypic traits and disease predisposition. Mouse models offer a controlled framework to capture genomic diversity across tissues. As part of the IGVF consortium, we profiled the transcriptomes of eight tissues from each founder strain of the Collaborative Cross using single-nucleus RNA sequencing. The resulting \"8-cube\" dataset contains 5.2 million nuclei across 106 cell types and cell states. Transcriptome variation correlated with genetic divergence, highest in the wild-derived strains. Heart and brain were relatively similar, whereas gonads, adrenal gland, skeletal muscle, kidney, and liver showed greater divergence. Variation often concentrated in specific cell types and states, especially those linked to immune and metabolic traits. The founder 8-cube dataset provides rich transcriptome signatures to help explain strain-specific traits and disease states and serves as a foundation for deeper analysis of these tissues across diverse mouse genotypes.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"101108"},"PeriodicalIF":11.1,"publicationDate":"2025-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145890528","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-29DOI: 10.1016/j.xgen.2025.101107
Jia-Yu Xue, Cheng-Ao Yang, Shuaiya Hu, Hai-Yun Fan, Yan-Mei Zhang, Zhu-Qing Shao, Linzhou Li, Sibo Wang, Tong Wei, Shanshan Dong, Yang Liu, Zhen Li, Yves Van de Peer
Protein domains are fundamental units determining protein functions. This study identified all protein domains and domain combinations from 446 genomes across all major plant lineages. We discovered more domains and domain combinations in land plants than in algae. Many novel "core" protein domains were acquired in the early evolution of streptophytes, substantially enriching the genomic toolkit that enabled plants to shift from unicellular to multicellular organization and to adapt to terrestrial life. After conquering the land, the number of ancestral core domains kept decreasing in land plants; in contrast, an increasing number of non-core domains were acquired, which, together with enhanced activity of domain shuffling, generated various novel domain combinations and expanded protein diversity. We speculate that losing existing genetic elements (core domains) is not always detrimental, as it may have reduced evolutionary constraint upon species, paving the way for biological innovation (speciation) and adaptation to changing environments.
{"title":"Evolution of protein domains and protein domain combinations provides insights into the origin and diversification of land plants.","authors":"Jia-Yu Xue, Cheng-Ao Yang, Shuaiya Hu, Hai-Yun Fan, Yan-Mei Zhang, Zhu-Qing Shao, Linzhou Li, Sibo Wang, Tong Wei, Shanshan Dong, Yang Liu, Zhen Li, Yves Van de Peer","doi":"10.1016/j.xgen.2025.101107","DOIUrl":"10.1016/j.xgen.2025.101107","url":null,"abstract":"<p><p>Protein domains are fundamental units determining protein functions. This study identified all protein domains and domain combinations from 446 genomes across all major plant lineages. We discovered more domains and domain combinations in land plants than in algae. Many novel \"core\" protein domains were acquired in the early evolution of streptophytes, substantially enriching the genomic toolkit that enabled plants to shift from unicellular to multicellular organization and to adapt to terrestrial life. After conquering the land, the number of ancestral core domains kept decreasing in land plants; in contrast, an increasing number of non-core domains were acquired, which, together with enhanced activity of domain shuffling, generated various novel domain combinations and expanded protein diversity. We speculate that losing existing genetic elements (core domains) is not always detrimental, as it may have reduced evolutionary constraint upon species, paving the way for biological innovation (speciation) and adaptation to changing environments.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"101107"},"PeriodicalIF":11.1,"publicationDate":"2025-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145866766","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The ecological persistence of Bifidobacterium breve across life stages reflects adaptive strategies beyond the classical infant- versus adult-type dichotomy, historically attributed to differential nutrient utilization. Here, comparative genomics revealed no major differences in shared carbohydrate-related genes or accessory genome content between infant- and adult-derived strains. Instead, a distinct type III lanthipeptide bacteriocin cluster, lanKC, was specifically detected in adult-derived isolates. Functional assays combining gene knockout, in vitro co-cultivation, and human intervention demonstrated that lanKC enhances strain-level competitive fitness and promotes community stability. Phylogenetic and metagenomic analyses of 5,475 lanKC homologs and 6,122 infant gut metagenomes further suggested a possible early-life acquisition via intra-genus horizontal gene transfer. These findings uncover a previously unrecognized genetic basis underlying B. breve adaptation to the gut environment and support a multi-factorial model in which metabolic flexibility and interference competition jointly sustain bifidobacterial persistence and host-microbe symbiosis throughout life.
{"title":"Bacteriocin gene-mediated ecological adaptation of Bifidobacterium breve in the adult human gut.","authors":"Jingyu Wang, Xin Qian, Qing Li, Zhiying Jin, Na Liu, Jianxin Zhao, Wei Chen, Shaopu Wang, Peijun Tian","doi":"10.1016/j.xgen.2025.101106","DOIUrl":"https://doi.org/10.1016/j.xgen.2025.101106","url":null,"abstract":"<p><p>The ecological persistence of Bifidobacterium breve across life stages reflects adaptive strategies beyond the classical infant- versus adult-type dichotomy, historically attributed to differential nutrient utilization. Here, comparative genomics revealed no major differences in shared carbohydrate-related genes or accessory genome content between infant- and adult-derived strains. Instead, a distinct type III lanthipeptide bacteriocin cluster, lanKC, was specifically detected in adult-derived isolates. Functional assays combining gene knockout, in vitro co-cultivation, and human intervention demonstrated that lanKC enhances strain-level competitive fitness and promotes community stability. Phylogenetic and metagenomic analyses of 5,475 lanKC homologs and 6,122 infant gut metagenomes further suggested a possible early-life acquisition via intra-genus horizontal gene transfer. These findings uncover a previously unrecognized genetic basis underlying B. breve adaptation to the gut environment and support a multi-factorial model in which metabolic flexibility and interference competition jointly sustain bifidobacterial persistence and host-microbe symbiosis throughout life.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"101106"},"PeriodicalIF":11.1,"publicationDate":"2025-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145800927","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-19DOI: 10.1016/j.xgen.2025.101103
Aaron M Allen, Megan C Neville, Tetsuya Nojima, Faredin Alejevski, Devika Agarwal, David Sims, Stephen F Goodwin
Gene expression shapes the nervous system at every biological level, from molecular and cellular processes defining neuronal identity and function to systems-level wiring and circuit dynamics underlying behavior. Here, we generate the first high-resolution, single-cell transcriptomic atlas of the adult Drosophila melanogaster central brain by integrating multiple datasets, achieving an unprecedented 10-fold coverage of every neuron in this complex tissue. We show that a neuron's genetic identity overwhelmingly reflects its developmental origin, preserving a genetic address based on both lineage and birth order. We reveal foundational rules linking neurogenesis to transcriptional identity and provide a framework for systematically defining neuronal types. This atlas provides a powerful resource for mapping the cellular substrates of behavior by integrating annotations of hemilineage, cell types/subtypes, and molecular signatures of underlying physiological properties. It lays the groundwork for a long-sought bridge between developmental processes and the functional circuits that give rise to behavior.
{"title":"A high-resolution atlas of the brain predicts lineage and birth order underlying neuronal identity.","authors":"Aaron M Allen, Megan C Neville, Tetsuya Nojima, Faredin Alejevski, Devika Agarwal, David Sims, Stephen F Goodwin","doi":"10.1016/j.xgen.2025.101103","DOIUrl":"10.1016/j.xgen.2025.101103","url":null,"abstract":"<p><p>Gene expression shapes the nervous system at every biological level, from molecular and cellular processes defining neuronal identity and function to systems-level wiring and circuit dynamics underlying behavior. Here, we generate the first high-resolution, single-cell transcriptomic atlas of the adult Drosophila melanogaster central brain by integrating multiple datasets, achieving an unprecedented 10-fold coverage of every neuron in this complex tissue. We show that a neuron's genetic identity overwhelmingly reflects its developmental origin, preserving a genetic address based on both lineage and birth order. We reveal foundational rules linking neurogenesis to transcriptional identity and provide a framework for systematically defining neuronal types. This atlas provides a powerful resource for mapping the cellular substrates of behavior by integrating annotations of hemilineage, cell types/subtypes, and molecular signatures of underlying physiological properties. It lays the groundwork for a long-sought bridge between developmental processes and the functional circuits that give rise to behavior.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"101103"},"PeriodicalIF":11.1,"publicationDate":"2025-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7618732/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145800933","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-19DOI: 10.1016/j.xgen.2025.101104
Camille A Daniels, Adetola A Abdulkadir, Megan H Cleveland, Jennifer H McDaniel, David Jáspez, Luis Alberto Rubio-Rodríguez, Adrián Muñoz-Barrera, José Miguel Lorenzo-Salazar, Carlos Flores, Byunggil Yoo, Sayed Mohammad Ebrahim Sahraeian, Yina Wang, Massimiliano Rossi, Arun Visvanath, Lisa Murray, Wei-Ting Chen, Severine Catreux, James Han, Rami Mehio, Gavin Parnaby, Andrew Carroll, Pi-Chuan Chang, Kishwar Shafin, Daniel Cook, Alexey Kolesnikov, Lucas Brambrink, Mohammed Faizal Eeman Mootor, Yash Patel, Takafumi N Yamaguchi, Paul C Boutros, Karolina Sienkiewicz, Jonathan Foox, Christopher E Mason, Bryan R Lajoie, Carlos A Ruiz-Perez, Semyon Kruglyak, Justin M Zook, Nathan D Olson
We developed a benchmark set of subclonal variants in the Genome in a Bottle (GIAB) Consortium HG002 reference material (RM) DNA for evaluating lower-frequency variant callsets. We used a somatic variant caller with high-coverage (300×) whole-genome sequencing data from the GIAB Ashkenazi Jewish trio to identify potential subclonal variants in the HG002 RM DNA. Using orthogonal sequencing data and manual curation, we defined a benchmark set with 85 high-confidence subclonal single-nucleotide variants (SNVs) (allele frequency [AF] > 5%) and a benchmark region covering 2.45 Gbp of the autosomes. External validation supported that it can be used to reliably identify both false negatives and false positives for a variety of sequencing technologies and variant callers. By adding our characterization of mosaic SNVs in this widely used cell line, we have expanded the scope of bioinformatic and sequencing applications for which the HG002 GIAB RM can be used to include benchmarking subclonal SNVs.
{"title":"Characterization of subclonal variants in HG002 Genome in a Bottle reference material as a resource for benchmarking variant callers.","authors":"Camille A Daniels, Adetola A Abdulkadir, Megan H Cleveland, Jennifer H McDaniel, David Jáspez, Luis Alberto Rubio-Rodríguez, Adrián Muñoz-Barrera, José Miguel Lorenzo-Salazar, Carlos Flores, Byunggil Yoo, Sayed Mohammad Ebrahim Sahraeian, Yina Wang, Massimiliano Rossi, Arun Visvanath, Lisa Murray, Wei-Ting Chen, Severine Catreux, James Han, Rami Mehio, Gavin Parnaby, Andrew Carroll, Pi-Chuan Chang, Kishwar Shafin, Daniel Cook, Alexey Kolesnikov, Lucas Brambrink, Mohammed Faizal Eeman Mootor, Yash Patel, Takafumi N Yamaguchi, Paul C Boutros, Karolina Sienkiewicz, Jonathan Foox, Christopher E Mason, Bryan R Lajoie, Carlos A Ruiz-Perez, Semyon Kruglyak, Justin M Zook, Nathan D Olson","doi":"10.1016/j.xgen.2025.101104","DOIUrl":"https://doi.org/10.1016/j.xgen.2025.101104","url":null,"abstract":"<p><p>We developed a benchmark set of subclonal variants in the Genome in a Bottle (GIAB) Consortium HG002 reference material (RM) DNA for evaluating lower-frequency variant callsets. We used a somatic variant caller with high-coverage (300×) whole-genome sequencing data from the GIAB Ashkenazi Jewish trio to identify potential subclonal variants in the HG002 RM DNA. Using orthogonal sequencing data and manual curation, we defined a benchmark set with 85 high-confidence subclonal single-nucleotide variants (SNVs) (allele frequency [AF] > 5%) and a benchmark region covering 2.45 Gbp of the autosomes. External validation supported that it can be used to reliably identify both false negatives and false positives for a variety of sequencing technologies and variant callers. By adding our characterization of mosaic SNVs in this widely used cell line, we have expanded the scope of bioinformatic and sequencing applications for which the HG002 GIAB RM can be used to include benchmarking subclonal SNVs.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"101104"},"PeriodicalIF":11.1,"publicationDate":"2025-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145800926","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Cellular identity emerges from the dynamic coordination of context-aware gene programs that encode biological functions across molecular layers. To decode this complexity, we present SSpMosaic, a computational framework that establishes metaprograms (higher-order, cross-dataset aligned gene program representations) as universal anchors for biological state representation. Leveraging these metaprograms, SSpMosaic enables consistent, accurate integration across batches, modalities, and species. Critically, SSpMosaic accurately annotates cell types within query datasets, enabling discovery and annotation of novel cell states through metaprogram-based transfer learning. The framework achieves resolution-agnostic spatial transcriptomics deconvolution, precisely mapping cell-type distributions from spot-level (Visium) to subcellular scales (CosMx/Visium HD). As a paradigm-shifting application, we integrate single-nucleus transcriptomics, chromatin accessibility, and spatial transcriptomics to resolve multi-stage spatial domain dynamics across tissue slices. Finally, SSpMosaic enables reference-free spatial characterization, identifying conserved spatial ecotypes across tissue slices and annotating cellular niches without requiring matched single-cell data.
{"title":"Robust integration and annotation of single-cell and spatial omics data using interpretable gene programs.","authors":"Yuelei Zhang, Wenxuan Ming, Bianjiong Yu, Lele Wang, Kaiyan Lu, Lei Xu, Yanhong Ni, Runzhi Deng, Dijun Chen","doi":"10.1016/j.xgen.2025.101105","DOIUrl":"https://doi.org/10.1016/j.xgen.2025.101105","url":null,"abstract":"<p><p>Cellular identity emerges from the dynamic coordination of context-aware gene programs that encode biological functions across molecular layers. To decode this complexity, we present SSpMosaic, a computational framework that establishes metaprograms (higher-order, cross-dataset aligned gene program representations) as universal anchors for biological state representation. Leveraging these metaprograms, SSpMosaic enables consistent, accurate integration across batches, modalities, and species. Critically, SSpMosaic accurately annotates cell types within query datasets, enabling discovery and annotation of novel cell states through metaprogram-based transfer learning. The framework achieves resolution-agnostic spatial transcriptomics deconvolution, precisely mapping cell-type distributions from spot-level (Visium) to subcellular scales (CosMx/Visium HD). As a paradigm-shifting application, we integrate single-nucleus transcriptomics, chromatin accessibility, and spatial transcriptomics to resolve multi-stage spatial domain dynamics across tissue slices. Finally, SSpMosaic enables reference-free spatial characterization, identifying conserved spatial ecotypes across tissue slices and annotating cellular niches without requiring matched single-cell data.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"101105"},"PeriodicalIF":11.1,"publicationDate":"2025-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145800954","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}