Pub Date : 2025-12-29DOI: 10.1016/j.xgen.2025.101107
Jia-Yu Xue, Cheng-Ao Yang, Shuaiya Hu, Hai-Yun Fan, Yan-Mei Zhang, Zhu-Qing Shao, Linzhou Li, Sibo Wang, Tong Wei, Shanshan Dong, Yang Liu, Zhen Li, Yves Van de Peer
Protein domains are fundamental units determining protein functions. This study identified all protein domains and domain combinations from 446 genomes across all major plant lineages. We discovered more domains and domain combinations in land plants than in algae. Many novel "core" protein domains were acquired in the early evolution of streptophytes, substantially enriching the genomic toolkit that enabled plants to shift from unicellular to multicellular organization and to adapt to terrestrial life. After conquering the land, the number of ancestral core domains kept decreasing in land plants; in contrast, an increasing number of non-core domains were acquired, which, together with enhanced activity of domain shuffling, generated various novel domain combinations and expanded protein diversity. We speculate that losing existing genetic elements (core domains) is not always detrimental, as it may have reduced evolutionary constraint upon species, paving the way for biological innovation (speciation) and adaptation to changing environments.
{"title":"Evolution of protein domains and protein domain combinations provides insights into the origin and diversification of land plants.","authors":"Jia-Yu Xue, Cheng-Ao Yang, Shuaiya Hu, Hai-Yun Fan, Yan-Mei Zhang, Zhu-Qing Shao, Linzhou Li, Sibo Wang, Tong Wei, Shanshan Dong, Yang Liu, Zhen Li, Yves Van de Peer","doi":"10.1016/j.xgen.2025.101107","DOIUrl":"10.1016/j.xgen.2025.101107","url":null,"abstract":"<p><p>Protein domains are fundamental units determining protein functions. This study identified all protein domains and domain combinations from 446 genomes across all major plant lineages. We discovered more domains and domain combinations in land plants than in algae. Many novel \"core\" protein domains were acquired in the early evolution of streptophytes, substantially enriching the genomic toolkit that enabled plants to shift from unicellular to multicellular organization and to adapt to terrestrial life. After conquering the land, the number of ancestral core domains kept decreasing in land plants; in contrast, an increasing number of non-core domains were acquired, which, together with enhanced activity of domain shuffling, generated various novel domain combinations and expanded protein diversity. We speculate that losing existing genetic elements (core domains) is not always detrimental, as it may have reduced evolutionary constraint upon species, paving the way for biological innovation (speciation) and adaptation to changing environments.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"101107"},"PeriodicalIF":11.1,"publicationDate":"2025-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145866766","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The ecological persistence of Bifidobacterium breve across life stages reflects adaptive strategies beyond the classical infant- versus adult-type dichotomy, historically attributed to differential nutrient utilization. Here, comparative genomics revealed no major differences in shared carbohydrate-related genes or accessory genome content between infant- and adult-derived strains. Instead, a distinct type III lanthipeptide bacteriocin cluster, lanKC, was specifically detected in adult-derived isolates. Functional assays combining gene knockout, in vitro co-cultivation, and human intervention demonstrated that lanKC enhances strain-level competitive fitness and promotes community stability. Phylogenetic and metagenomic analyses of 5,475 lanKC homologs and 6,122 infant gut metagenomes further suggested a possible early-life acquisition via intra-genus horizontal gene transfer. These findings uncover a previously unrecognized genetic basis underlying B. breve adaptation to the gut environment and support a multi-factorial model in which metabolic flexibility and interference competition jointly sustain bifidobacterial persistence and host-microbe symbiosis throughout life.
{"title":"Bacteriocin gene-mediated ecological adaptation of Bifidobacterium breve in the adult human gut.","authors":"Jingyu Wang, Xin Qian, Qing Li, Zhiying Jin, Na Liu, Jianxin Zhao, Wei Chen, Shaopu Wang, Peijun Tian","doi":"10.1016/j.xgen.2025.101106","DOIUrl":"https://doi.org/10.1016/j.xgen.2025.101106","url":null,"abstract":"<p><p>The ecological persistence of Bifidobacterium breve across life stages reflects adaptive strategies beyond the classical infant- versus adult-type dichotomy, historically attributed to differential nutrient utilization. Here, comparative genomics revealed no major differences in shared carbohydrate-related genes or accessory genome content between infant- and adult-derived strains. Instead, a distinct type III lanthipeptide bacteriocin cluster, lanKC, was specifically detected in adult-derived isolates. Functional assays combining gene knockout, in vitro co-cultivation, and human intervention demonstrated that lanKC enhances strain-level competitive fitness and promotes community stability. Phylogenetic and metagenomic analyses of 5,475 lanKC homologs and 6,122 infant gut metagenomes further suggested a possible early-life acquisition via intra-genus horizontal gene transfer. These findings uncover a previously unrecognized genetic basis underlying B. breve adaptation to the gut environment and support a multi-factorial model in which metabolic flexibility and interference competition jointly sustain bifidobacterial persistence and host-microbe symbiosis throughout life.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"101106"},"PeriodicalIF":11.1,"publicationDate":"2025-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145800927","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-19DOI: 10.1016/j.xgen.2025.101103
Aaron M Allen, Megan C Neville, Tetsuya Nojima, Faredin Alejevski, Devika Agarwal, David Sims, Stephen F Goodwin
Gene expression shapes the nervous system at every biological level, from molecular and cellular processes defining neuronal identity and function to systems-level wiring and circuit dynamics underlying behavior. Here, we generate the first high-resolution, single-cell transcriptomic atlas of the adult Drosophila melanogaster central brain by integrating multiple datasets, achieving an unprecedented 10-fold coverage of every neuron in this complex tissue. We show that a neuron's genetic identity overwhelmingly reflects its developmental origin, preserving a genetic address based on both lineage and birth order. We reveal foundational rules linking neurogenesis to transcriptional identity and provide a framework for systematically defining neuronal types. This atlas provides a powerful resource for mapping the cellular substrates of behavior by integrating annotations of hemilineage, cell types/subtypes, and molecular signatures of underlying physiological properties. It lays the groundwork for a long-sought bridge between developmental processes and the functional circuits that give rise to behavior.
{"title":"A high-resolution atlas of the brain predicts lineage and birth order underlying neuronal identity.","authors":"Aaron M Allen, Megan C Neville, Tetsuya Nojima, Faredin Alejevski, Devika Agarwal, David Sims, Stephen F Goodwin","doi":"10.1016/j.xgen.2025.101103","DOIUrl":"https://doi.org/10.1016/j.xgen.2025.101103","url":null,"abstract":"<p><p>Gene expression shapes the nervous system at every biological level, from molecular and cellular processes defining neuronal identity and function to systems-level wiring and circuit dynamics underlying behavior. Here, we generate the first high-resolution, single-cell transcriptomic atlas of the adult Drosophila melanogaster central brain by integrating multiple datasets, achieving an unprecedented 10-fold coverage of every neuron in this complex tissue. We show that a neuron's genetic identity overwhelmingly reflects its developmental origin, preserving a genetic address based on both lineage and birth order. We reveal foundational rules linking neurogenesis to transcriptional identity and provide a framework for systematically defining neuronal types. This atlas provides a powerful resource for mapping the cellular substrates of behavior by integrating annotations of hemilineage, cell types/subtypes, and molecular signatures of underlying physiological properties. It lays the groundwork for a long-sought bridge between developmental processes and the functional circuits that give rise to behavior.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"101103"},"PeriodicalIF":11.1,"publicationDate":"2025-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145800933","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-19DOI: 10.1016/j.xgen.2025.101104
Camille A Daniels, Adetola A Abdulkadir, Megan H Cleveland, Jennifer H McDaniel, David Jáspez, Luis Alberto Rubio-Rodríguez, Adrián Muñoz-Barrera, José Miguel Lorenzo-Salazar, Carlos Flores, Byunggil Yoo, Sayed Mohammad Ebrahim Sahraeian, Yina Wang, Massimiliano Rossi, Arun Visvanath, Lisa Murray, Wei-Ting Chen, Severine Catreux, James Han, Rami Mehio, Gavin Parnaby, Andrew Carroll, Pi-Chuan Chang, Kishwar Shafin, Daniel Cook, Alexey Kolesnikov, Lucas Brambrink, Mohammed Faizal Eeman Mootor, Yash Patel, Takafumi N Yamaguchi, Paul C Boutros, Karolina Sienkiewicz, Jonathan Foox, Christopher E Mason, Bryan R Lajoie, Carlos A Ruiz-Perez, Semyon Kruglyak, Justin M Zook, Nathan D Olson
We developed a benchmark set of subclonal variants in the Genome in a Bottle (GIAB) Consortium HG002 reference material (RM) DNA for evaluating lower-frequency variant callsets. We used a somatic variant caller with high-coverage (300×) whole-genome sequencing data from the GIAB Ashkenazi Jewish trio to identify potential subclonal variants in the HG002 RM DNA. Using orthogonal sequencing data and manual curation, we defined a benchmark set with 85 high-confidence subclonal single-nucleotide variants (SNVs) (allele frequency [AF] > 5%) and a benchmark region covering 2.45 Gbp of the autosomes. External validation supported that it can be used to reliably identify both false negatives and false positives for a variety of sequencing technologies and variant callers. By adding our characterization of mosaic SNVs in this widely used cell line, we have expanded the scope of bioinformatic and sequencing applications for which the HG002 GIAB RM can be used to include benchmarking subclonal SNVs.
{"title":"Characterization of subclonal variants in HG002 Genome in a Bottle reference material as a resource for benchmarking variant callers.","authors":"Camille A Daniels, Adetola A Abdulkadir, Megan H Cleveland, Jennifer H McDaniel, David Jáspez, Luis Alberto Rubio-Rodríguez, Adrián Muñoz-Barrera, José Miguel Lorenzo-Salazar, Carlos Flores, Byunggil Yoo, Sayed Mohammad Ebrahim Sahraeian, Yina Wang, Massimiliano Rossi, Arun Visvanath, Lisa Murray, Wei-Ting Chen, Severine Catreux, James Han, Rami Mehio, Gavin Parnaby, Andrew Carroll, Pi-Chuan Chang, Kishwar Shafin, Daniel Cook, Alexey Kolesnikov, Lucas Brambrink, Mohammed Faizal Eeman Mootor, Yash Patel, Takafumi N Yamaguchi, Paul C Boutros, Karolina Sienkiewicz, Jonathan Foox, Christopher E Mason, Bryan R Lajoie, Carlos A Ruiz-Perez, Semyon Kruglyak, Justin M Zook, Nathan D Olson","doi":"10.1016/j.xgen.2025.101104","DOIUrl":"https://doi.org/10.1016/j.xgen.2025.101104","url":null,"abstract":"<p><p>We developed a benchmark set of subclonal variants in the Genome in a Bottle (GIAB) Consortium HG002 reference material (RM) DNA for evaluating lower-frequency variant callsets. We used a somatic variant caller with high-coverage (300×) whole-genome sequencing data from the GIAB Ashkenazi Jewish trio to identify potential subclonal variants in the HG002 RM DNA. Using orthogonal sequencing data and manual curation, we defined a benchmark set with 85 high-confidence subclonal single-nucleotide variants (SNVs) (allele frequency [AF] > 5%) and a benchmark region covering 2.45 Gbp of the autosomes. External validation supported that it can be used to reliably identify both false negatives and false positives for a variety of sequencing technologies and variant callers. By adding our characterization of mosaic SNVs in this widely used cell line, we have expanded the scope of bioinformatic and sequencing applications for which the HG002 GIAB RM can be used to include benchmarking subclonal SNVs.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"101104"},"PeriodicalIF":11.1,"publicationDate":"2025-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145800926","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Cellular identity emerges from the dynamic coordination of context-aware gene programs that encode biological functions across molecular layers. To decode this complexity, we present SSpMosaic, a computational framework that establishes metaprograms (higher-order, cross-dataset aligned gene program representations) as universal anchors for biological state representation. Leveraging these metaprograms, SSpMosaic enables consistent, accurate integration across batches, modalities, and species. Critically, SSpMosaic accurately annotates cell types within query datasets, enabling discovery and annotation of novel cell states through metaprogram-based transfer learning. The framework achieves resolution-agnostic spatial transcriptomics deconvolution, precisely mapping cell-type distributions from spot-level (Visium) to subcellular scales (CosMx/Visium HD). As a paradigm-shifting application, we integrate single-nucleus transcriptomics, chromatin accessibility, and spatial transcriptomics to resolve multi-stage spatial domain dynamics across tissue slices. Finally, SSpMosaic enables reference-free spatial characterization, identifying conserved spatial ecotypes across tissue slices and annotating cellular niches without requiring matched single-cell data.
{"title":"Robust integration and annotation of single-cell and spatial omics data using interpretable gene programs.","authors":"Yuelei Zhang, Wenxuan Ming, Bianjiong Yu, Lele Wang, Kaiyan Lu, Lei Xu, Yanhong Ni, Runzhi Deng, Dijun Chen","doi":"10.1016/j.xgen.2025.101105","DOIUrl":"https://doi.org/10.1016/j.xgen.2025.101105","url":null,"abstract":"<p><p>Cellular identity emerges from the dynamic coordination of context-aware gene programs that encode biological functions across molecular layers. To decode this complexity, we present SSpMosaic, a computational framework that establishes metaprograms (higher-order, cross-dataset aligned gene program representations) as universal anchors for biological state representation. Leveraging these metaprograms, SSpMosaic enables consistent, accurate integration across batches, modalities, and species. Critically, SSpMosaic accurately annotates cell types within query datasets, enabling discovery and annotation of novel cell states through metaprogram-based transfer learning. The framework achieves resolution-agnostic spatial transcriptomics deconvolution, precisely mapping cell-type distributions from spot-level (Visium) to subcellular scales (CosMx/Visium HD). As a paradigm-shifting application, we integrate single-nucleus transcriptomics, chromatin accessibility, and spatial transcriptomics to resolve multi-stage spatial domain dynamics across tissue slices. Finally, SSpMosaic enables reference-free spatial characterization, identifying conserved spatial ecotypes across tissue slices and annotating cellular niches without requiring matched single-cell data.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"101105"},"PeriodicalIF":11.1,"publicationDate":"2025-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145800954","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-11DOI: 10.1016/j.xgen.2025.101101
Andrew Liao, Zehao Zhang, Andras Sziraki, Abdulraouf Abdulraouf, Abid Rehman, Zihan Xu, Ziyu Lu, Weirong Jiang, Alia Arya, Jasper Lee, Manolis Maragkakis, Wei Zhou, Junyue Cao
Large-scale single-cell atlases have revealed many aging- and disease-associated cell types, yet these populations are often underrepresented in heterogeneous tissues, limiting detailed molecular analyses. To address this, we developed EnrichSci-a scalable, microfluidics-free platform that combines hybridization chain reaction RNA fluorescence in situ hybridization (FISH) with combinatorial indexing to profile single-nucleus transcriptomes of target cell types with full gene-body coverage. Applied to oligodendrocytes in the aging mouse brain, EnrichSci uncovered aging-associated molecular dynamics across distinct oligodendrocyte subtypes, revealing both shared and subtype-specific gene expression changes. Additionally, we identified aging-associated exon-level signatures missed by conventional gene-level analyses, highlighting post-transcriptional regulation as a critical dimension of cell-state dynamics in aging. By coupling transcript-guided enrichment with a scalable sequencing workflow, EnrichSci provides a versatile approach to decode dynamic regulatory landscapes in diverse cell types from complex tissues.
{"title":"Transcript-guided targeted cell enrichment for scalable single-nucleus RNA sequencing.","authors":"Andrew Liao, Zehao Zhang, Andras Sziraki, Abdulraouf Abdulraouf, Abid Rehman, Zihan Xu, Ziyu Lu, Weirong Jiang, Alia Arya, Jasper Lee, Manolis Maragkakis, Wei Zhou, Junyue Cao","doi":"10.1016/j.xgen.2025.101101","DOIUrl":"10.1016/j.xgen.2025.101101","url":null,"abstract":"<p><p>Large-scale single-cell atlases have revealed many aging- and disease-associated cell types, yet these populations are often underrepresented in heterogeneous tissues, limiting detailed molecular analyses. To address this, we developed EnrichSci-a scalable, microfluidics-free platform that combines hybridization chain reaction RNA fluorescence in situ hybridization (FISH) with combinatorial indexing to profile single-nucleus transcriptomes of target cell types with full gene-body coverage. Applied to oligodendrocytes in the aging mouse brain, EnrichSci uncovered aging-associated molecular dynamics across distinct oligodendrocyte subtypes, revealing both shared and subtype-specific gene expression changes. Additionally, we identified aging-associated exon-level signatures missed by conventional gene-level analyses, highlighting post-transcriptional regulation as a critical dimension of cell-state dynamics in aging. By coupling transcript-guided enrichment with a scalable sequencing workflow, EnrichSci provides a versatile approach to decode dynamic regulatory landscapes in diverse cell types from complex tissues.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"101101"},"PeriodicalIF":11.1,"publicationDate":"2025-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145745882","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Genome-wide association studies (GWASs) have identified over 50 lung cancer risk loci; however, the precise cellular context of these genetic mechanisms remains unclear due to limitations in bulk tissue expression quantitative trait locus (eQTL) analyses. Here, we present the largest single-cell eQTL (sc-eQTL) atlas of human lung tissue to date, profiling 222 donors using multiplexed single-cell RNA sequencing (scRNA-seq). We identified 4,341 independent eQTLs across 17 cell types, with over 60% of sc-eQTLs and 51% of eGenes being cell-type specific, and fewer than 52% were detectable in paired bulk datasets. Integration with GWASs for non-small cell lung cancer highlighted epithelial and immune cells as key contributors to genetic susceptibility, identifying 28 candidate genes within known risk loci and 24 in novel regions. Notably, 47% of established non-small cell lung cancer (NSCLC) susceptibility loci exhibited cell-type-specific pleiotropic genetic regulation. This study provides a valuable resource of lung sc-eQTLs and illuminates how genetic variation modulates gene expression in a cell-type-specific fashion, contributing to lung cancer susceptibility.
{"title":"Single-cell eQTL mapping reveals cell-type-specific genetic regulation in lung cancer.","authors":"Yating Fu, Yi Wang, Chen Jin, Chang Zhang, Jiaying Cai, Linnan Gong, Chenying Jin, Chen Ji, Yuanlin Mou, Caochen Zhang, Shihao Wu, Xinyuan Ge, Yahui Dai, Sunan Miao, Huimin Ma, Xiaoyang Ma, Mengping Wang, Lijun Bian, Erbao Zhang, Juncheng Dai, Zhibin Hu, Guangfu Jin, Meng Zhu, Hongbing Shen, Hongxia Ma","doi":"10.1016/j.xgen.2025.101100","DOIUrl":"10.1016/j.xgen.2025.101100","url":null,"abstract":"<p><p>Genome-wide association studies (GWASs) have identified over 50 lung cancer risk loci; however, the precise cellular context of these genetic mechanisms remains unclear due to limitations in bulk tissue expression quantitative trait locus (eQTL) analyses. Here, we present the largest single-cell eQTL (sc-eQTL) atlas of human lung tissue to date, profiling 222 donors using multiplexed single-cell RNA sequencing (scRNA-seq). We identified 4,341 independent eQTLs across 17 cell types, with over 60% of sc-eQTLs and 51% of eGenes being cell-type specific, and fewer than 52% were detectable in paired bulk datasets. Integration with GWASs for non-small cell lung cancer highlighted epithelial and immune cells as key contributors to genetic susceptibility, identifying 28 candidate genes within known risk loci and 24 in novel regions. Notably, 47% of established non-small cell lung cancer (NSCLC) susceptibility loci exhibited cell-type-specific pleiotropic genetic regulation. This study provides a valuable resource of lung sc-eQTLs and illuminates how genetic variation modulates gene expression in a cell-type-specific fashion, contributing to lung cancer susceptibility.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"101100"},"PeriodicalIF":11.1,"publicationDate":"2025-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145745940","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-10DOI: 10.1016/j.xgen.2025.101081
Zilin Li, Xihao Li
In this issue of Cell Genomics, Xihao Li (X.L.), Zilin Li (Z.L.), and colleagues present the annotated genomic data structure (aGDS) format to streamline genomic analyses that use biobank-scale whole-genome sequencing data. Both authors have a research focus in statistical genetics/genomics, and here they highlight their latest work and the benefits of their aGDS approach.
{"title":"Meet the authors: Zilin Li and Xihao Li.","authors":"Zilin Li, Xihao Li","doi":"10.1016/j.xgen.2025.101081","DOIUrl":"https://doi.org/10.1016/j.xgen.2025.101081","url":null,"abstract":"<p><p>In this issue of Cell Genomics, Xihao Li (X.L.), Zilin Li (Z.L.), and colleagues present the annotated genomic data structure (aGDS) format to streamline genomic analyses that use biobank-scale whole-genome sequencing data. Both authors have a research focus in statistical genetics/genomics, and here they highlight their latest work and the benefits of their aGDS approach.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":"5 12","pages":"101081"},"PeriodicalIF":11.1,"publicationDate":"2025-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145745893","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-10DOI: 10.1016/j.xgen.2025.101098
Maxime M C Tortora, Geoffrey Fudenberg
Cohesin drives genome organization via loop extrusion, orchestrated by the dynamic exchange of multiple essential accessory proteins. Although these regulators bind the core cohesin complex only transiently, their disruption can dramatically alter loop-extrusion dynamics and chromosome morphology. Still, a quantitative theory of cohesin regulation and its interplay with genome folding is still elusive. Here, we derive a chemical-reaction network model of loop-extrusion regulation from first principles that is fully specified by available in vivo measurements. This "bursty extrusion model" untangles the distinct roles of regulators, whose exchange coincides with intermittent periods of motor activity. By incorporating bursty extrusion in polymer simulations, we reveal how variations in regulatory protein abundance can alter chromatin architecture across length and timescales. Our results are corroborated by in vivo and in vitro observations, bridging the gap between cohesin-regulator dynamics at the molecular scale and their genome-wide consequences on chromosome organization.
{"title":"The physical chemistry of interphase loop extrusion.","authors":"Maxime M C Tortora, Geoffrey Fudenberg","doi":"10.1016/j.xgen.2025.101098","DOIUrl":"10.1016/j.xgen.2025.101098","url":null,"abstract":"<p><p>Cohesin drives genome organization via loop extrusion, orchestrated by the dynamic exchange of multiple essential accessory proteins. Although these regulators bind the core cohesin complex only transiently, their disruption can dramatically alter loop-extrusion dynamics and chromosome morphology. Still, a quantitative theory of cohesin regulation and its interplay with genome folding is still elusive. Here, we derive a chemical-reaction network model of loop-extrusion regulation from first principles that is fully specified by available in vivo measurements. This \"bursty extrusion model\" untangles the distinct roles of regulators, whose exchange coincides with intermittent periods of motor activity. By incorporating bursty extrusion in polymer simulations, we reveal how variations in regulatory protein abundance can alter chromatin architecture across length and timescales. Our results are corroborated by in vivo and in vitro observations, bridging the gap between cohesin-regulator dynamics at the molecular scale and their genome-wide consequences on chromosome organization.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"101098"},"PeriodicalIF":11.1,"publicationDate":"2025-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145745931","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-10Epub Date: 2025-10-02DOI: 10.1016/j.xgen.2025.101031
Tamara A Potapova, Paxton Kostos, Sean McKinney, Matthew Borchers, Jeff Haug, Andrea Guarracino, Steven J Solar, Mark Mattingly, Graciela Monfort Anez, Leonardo Gomes de Lima, Yan Wang, Chongbei Zhao, Kate Hall, Madelaine Gogol, Sophie Hoffman, Dmitry Antipov, Arang Rhie, Monika Cechova, Karen H Miga, Erik Garrison, Adam M Phillippy, Jennifer L Gerton
Ribosomal RNA (rRNA) genes are organized in tandem arrays known as ribosomal DNA (rDNA) on multiple chromosomes in Hominidae genomes. We measured copy number and transcriptional activity status of rRNA gene arrays across multiple individual genomes, revealing an identifiable fingerprint of rDNA copy number and activity. In some cases, entire arrays were transcriptionally silent, characterized by high DNA methylation across the rRNA gene, inaccessible chromatin, and the absence of transcription factors and transcripts. Silent arrays showed reduced association with the nucleolus and decreased interchromosomal interactions, consistent with the model that nucleolar organizer function depends on transcriptional activity. Removing rDNA methylation activated silent arrays. Array activity status remained stable through induced pluripotent stem cell reprogramming and differentiation into cerebral and intestinal organoids. Haplotype tracing in two unrelated family trios showed paternal transmission of silent arrays. We propose that the epigenetic state buffers rRNA gene dosage, specifies nucleolar organizer function, and can propagate transgenerationally.
{"title":"Chromosome-specific epigenetic control and transmission of ribosomal DNA arrays in Hominidae genomes.","authors":"Tamara A Potapova, Paxton Kostos, Sean McKinney, Matthew Borchers, Jeff Haug, Andrea Guarracino, Steven J Solar, Mark Mattingly, Graciela Monfort Anez, Leonardo Gomes de Lima, Yan Wang, Chongbei Zhao, Kate Hall, Madelaine Gogol, Sophie Hoffman, Dmitry Antipov, Arang Rhie, Monika Cechova, Karen H Miga, Erik Garrison, Adam M Phillippy, Jennifer L Gerton","doi":"10.1016/j.xgen.2025.101031","DOIUrl":"10.1016/j.xgen.2025.101031","url":null,"abstract":"<p><p>Ribosomal RNA (rRNA) genes are organized in tandem arrays known as ribosomal DNA (rDNA) on multiple chromosomes in Hominidae genomes. We measured copy number and transcriptional activity status of rRNA gene arrays across multiple individual genomes, revealing an identifiable fingerprint of rDNA copy number and activity. In some cases, entire arrays were transcriptionally silent, characterized by high DNA methylation across the rRNA gene, inaccessible chromatin, and the absence of transcription factors and transcripts. Silent arrays showed reduced association with the nucleolus and decreased interchromosomal interactions, consistent with the model that nucleolar organizer function depends on transcriptional activity. Removing rDNA methylation activated silent arrays. Array activity status remained stable through induced pluripotent stem cell reprogramming and differentiation into cerebral and intestinal organoids. Haplotype tracing in two unrelated family trios showed paternal transmission of silent arrays. We propose that the epigenetic state buffers rRNA gene dosage, specifies nucleolar organizer function, and can propagate transgenerationally.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"101031"},"PeriodicalIF":11.1,"publicationDate":"2025-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12802695/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145226383","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}