Pub Date : 2025-12-10Epub Date: 2025-09-17DOI: 10.1016/j.xgen.2025.101007
Paul M McKeever, Aiden M Sababi, Raghav Sharma, Zhiyu Xu, Shangxi Xiao, Philip McGoldrick, Troy Ketela, Christine Sato, Danielle Moreno, Naomi Visanji, Gabor G Kovacs, Julia Keith, Lorne Zinman, Ekaterina Rogaeva, Hani Goodarzi, Gary D Bader, Janice Robertson
Amyotrophic lateral sclerosis (ALS) and frontotemporal lobar degeneration (FTLD) are fatal neurodegenerative diseases sharing clinical and pathological features. Both involve complex neuron-glia interactions, but cell-type-specific alterations remain poorly defined. We performed single-nucleus RNA sequencing of the frontal cortex from C9orf72-related ALS (with and without FTLD) and sporadic ALS (sALS). Neurons showed prominent changes in mitochondrial function, protein homeostasis, and chromatin remodeling. Comparison with independent datasets from other cortical regions revealed consistent pathway alterations, including upregulation of STMN2 and NEFL across brain regions and subtypes. We further examined dysregulation of alternative polyadenylation (APA), an understudied post-transcriptional mechanism, uncovering cell-type-specific APA patterns. To investigate its regulation, we developed the alternative polyadenylation network (APA-Net), a multi-modal deep learning model integrating transcript sequences and RNA-binding protein (RBP) expression profiles to predict APA. This atlas advances our understanding of ALS/FTLD molecular pathology and provides a valuable resource for future mechanistic studies.
{"title":"Single-nucleus transcriptome atlas of orbitofrontal cortex in ALS with a deep learning-based decoding of alternative polyadenylation mechanisms.","authors":"Paul M McKeever, Aiden M Sababi, Raghav Sharma, Zhiyu Xu, Shangxi Xiao, Philip McGoldrick, Troy Ketela, Christine Sato, Danielle Moreno, Naomi Visanji, Gabor G Kovacs, Julia Keith, Lorne Zinman, Ekaterina Rogaeva, Hani Goodarzi, Gary D Bader, Janice Robertson","doi":"10.1016/j.xgen.2025.101007","DOIUrl":"10.1016/j.xgen.2025.101007","url":null,"abstract":"<p><p>Amyotrophic lateral sclerosis (ALS) and frontotemporal lobar degeneration (FTLD) are fatal neurodegenerative diseases sharing clinical and pathological features. Both involve complex neuron-glia interactions, but cell-type-specific alterations remain poorly defined. We performed single-nucleus RNA sequencing of the frontal cortex from C9orf72-related ALS (with and without FTLD) and sporadic ALS (sALS). Neurons showed prominent changes in mitochondrial function, protein homeostasis, and chromatin remodeling. Comparison with independent datasets from other cortical regions revealed consistent pathway alterations, including upregulation of STMN2 and NEFL across brain regions and subtypes. We further examined dysregulation of alternative polyadenylation (APA), an understudied post-transcriptional mechanism, uncovering cell-type-specific APA patterns. To investigate its regulation, we developed the alternative polyadenylation network (APA-Net), a multi-modal deep learning model integrating transcript sequences and RNA-binding protein (RBP) expression profiles to predict APA. This atlas advances our understanding of ALS/FTLD molecular pathology and provides a valuable resource for future mechanistic studies.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"101007"},"PeriodicalIF":11.1,"publicationDate":"2025-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12802741/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145088500","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-10Epub Date: 2025-10-13DOI: 10.1016/j.xgen.2025.101034
Quanyi Zhao, Albert Pedroza, Disha Sharma, Wenduo Gu, Alex Dalal, Chad Weldy, William Jackson, Daniel Yuhang Li, Yana Ryan, Trieu Nguyen, Rohan Shad, Brian T Palmisano, João P Monteiro, Matthew Worssam, Alexa Berezowitz, Meghana Iyer, Huitong Shi, Ramendra Kundu, Lasemahang Limbu, Juyong Brian Kim, Anshul Kundaje, Michael Fischbein, Robert Wirka, Thomas Quertermous, Paul Cheng
Arterial segments show differing disease propensities, yet mechanisms remain unknown. We compiled a transcriptomic and spatial atlas of healthy human arterial cells across multiple segments to understand these differences. Arteries demonstrated a stereotyped pattern of cell-specific, segmental heterogeneity not captured by common marker genes. Arterial identities are encoded in fibroblast and smooth muscle cell (SMC) transcriptomes. Differentially expressed genes enrich for disease loci. Fibroblast gene expression enriches for a disproportionate number of disease loci, highlighting an underrecognized role for fibroblasts in disease risk. Cells of different segments cluster more by embryonic origin than anatomy. Global analysis of disease regulons in fibroblasts and SMCs identified developmental transcription factors that persist into adulthood, suggesting a functional role of these factors in disease. Lastly, the heterogeneity of non-coding transcriptomes rivals that of protein-coding transcriptomes. Differentially expressed lncRNAs enrich for genetic signals for vascular diseases, suggesting a role for lncRNAs in vascular disease.
{"title":"A cell and transcriptome atlas of human arterial vasculature.","authors":"Quanyi Zhao, Albert Pedroza, Disha Sharma, Wenduo Gu, Alex Dalal, Chad Weldy, William Jackson, Daniel Yuhang Li, Yana Ryan, Trieu Nguyen, Rohan Shad, Brian T Palmisano, João P Monteiro, Matthew Worssam, Alexa Berezowitz, Meghana Iyer, Huitong Shi, Ramendra Kundu, Lasemahang Limbu, Juyong Brian Kim, Anshul Kundaje, Michael Fischbein, Robert Wirka, Thomas Quertermous, Paul Cheng","doi":"10.1016/j.xgen.2025.101034","DOIUrl":"10.1016/j.xgen.2025.101034","url":null,"abstract":"<p><p>Arterial segments show differing disease propensities, yet mechanisms remain unknown. We compiled a transcriptomic and spatial atlas of healthy human arterial cells across multiple segments to understand these differences. Arteries demonstrated a stereotyped pattern of cell-specific, segmental heterogeneity not captured by common marker genes. Arterial identities are encoded in fibroblast and smooth muscle cell (SMC) transcriptomes. Differentially expressed genes enrich for disease loci. Fibroblast gene expression enriches for a disproportionate number of disease loci, highlighting an underrecognized role for fibroblasts in disease risk. Cells of different segments cluster more by embryonic origin than anatomy. Global analysis of disease regulons in fibroblasts and SMCs identified developmental transcription factors that persist into adulthood, suggesting a functional role of these factors in disease. Lastly, the heterogeneity of non-coding transcriptomes rivals that of protein-coding transcriptomes. Differentially expressed lncRNAs enrich for genetic signals for vascular diseases, suggesting a role for lncRNAs in vascular disease.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"101034"},"PeriodicalIF":11.1,"publicationDate":"2025-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12802643/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145294673","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-10Epub Date: 2025-09-18DOI: 10.1016/j.xgen.2025.101009
Xihao Li, Andrew R Wood, Yuxin Yuan, Manrui Zhang, Yushu Huang, Gareth Hawkes, Robin N Beaumont, Michael N Weedon, Wenyuan Li, Xiaoyu Li, Xihong Lin, Zilin Li
Biobank-scale whole-genome sequencing (WGS) studies are increasingly pivotal in unraveling the genetic bases of diverse health outcomes. However, managing and analyzing these datasets' sheer volume and complexity presents significant challenges. We highlight the annotated genomic data structure (aGDS) format, substantially reducing the WGS data file size while enabling seamless integration of genomic and functional information for comprehensive WGS analyses. The aGDS format yielded 23 chromosome-specific files for the UK Biobank 500k WGS dataset, occupying only 1.10 tebibytes of storage. We develop the vcf2agds toolkit that streamlines the conversion of WGS data from VCF to aGDS format. Additionally, the STAARpipeline equipped with the aGDS files enabled scalable, comprehensive, and functionally informed WGS analysis, facilitating the detection of common and rare coding and noncoding phenotype-genotype associations. Overall, the vcf2agds toolkit and STAARpipeline provide a streamlined solution that facilitates efficient data management and analysis of biobank-scale WGS data across hundreds of thousands of samples.
{"title":"Streamlining large-scale genomic data management: Insights from the UK Biobank whole-genome sequencing data.","authors":"Xihao Li, Andrew R Wood, Yuxin Yuan, Manrui Zhang, Yushu Huang, Gareth Hawkes, Robin N Beaumont, Michael N Weedon, Wenyuan Li, Xiaoyu Li, Xihong Lin, Zilin Li","doi":"10.1016/j.xgen.2025.101009","DOIUrl":"10.1016/j.xgen.2025.101009","url":null,"abstract":"<p><p>Biobank-scale whole-genome sequencing (WGS) studies are increasingly pivotal in unraveling the genetic bases of diverse health outcomes. However, managing and analyzing these datasets' sheer volume and complexity presents significant challenges. We highlight the annotated genomic data structure (aGDS) format, substantially reducing the WGS data file size while enabling seamless integration of genomic and functional information for comprehensive WGS analyses. The aGDS format yielded 23 chromosome-specific files for the UK Biobank 500k WGS dataset, occupying only 1.10 tebibytes of storage. We develop the vcf2agds toolkit that streamlines the conversion of WGS data from VCF to aGDS format. Additionally, the STAARpipeline equipped with the aGDS files enabled scalable, comprehensive, and functionally informed WGS analysis, facilitating the detection of common and rare coding and noncoding phenotype-genotype associations. Overall, the vcf2agds toolkit and STAARpipeline provide a streamlined solution that facilitates efficient data management and analysis of biobank-scale WGS data across hundreds of thousands of samples.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"101009"},"PeriodicalIF":11.1,"publicationDate":"2025-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12802598/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145093020","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-10Epub Date: 2025-09-24DOI: 10.1016/j.xgen.2025.101011
Petr Šulc, Andrea Di Gioacchino, Alexander Solovyov, Siyu Sun, Stephen Martis, Sajid A Marhon, Håvard T Lindholm, Raymond Chen, Amir Hosseini, Hua Jiang, Bao-Han Ly, Martin S Taylor, Parinaz Mehdipour, Omar Abdel-Wahab, Nicole Rusk, Nicolas Vabret, John LaCava, Daniel D De Carvalho, Rémi Monasson, Simona Cocco, Benjamin D Greenbaum
An emerging hallmark of many human diseases is transcription of typically silenced repetitive DNA containing pathogen-associated molecular patterns (PAMPs). These PAMPs engage the innate immune system via pattern recognition receptors (PRRs)-a phenomenon known as viral mimicry. We propose a statistical physics framework to quantify viral mimicry by measuring "selective forces" that enrich PAMPs compared to a genome-wide reference distribution. We validate our predictions by identifying repeats that bind different PRRs and show potential viral mimics in different repeat families across eukaryotic genomes, suggesting shared mechanisms drive emergence and retention. We propose two non-exclusive evolutionary hypotheses. The first "repeat-centric" hypothesis posits PAMPs are integral to the repeat life cycle and are therefore enriched as they mediate repeat expansion. The second "organism-centric" hypothesis proposes viral mimicry functions as a cell-intrinsic feedback mechanism for sensing and reacting to transcriptional dysregulation, which provides a selective pressure to maintain PAMPs in genomes.
{"title":"Repeats mimic pathogen-associated patterns across a vast evolutionary landscape.","authors":"Petr Šulc, Andrea Di Gioacchino, Alexander Solovyov, Siyu Sun, Stephen Martis, Sajid A Marhon, Håvard T Lindholm, Raymond Chen, Amir Hosseini, Hua Jiang, Bao-Han Ly, Martin S Taylor, Parinaz Mehdipour, Omar Abdel-Wahab, Nicole Rusk, Nicolas Vabret, John LaCava, Daniel D De Carvalho, Rémi Monasson, Simona Cocco, Benjamin D Greenbaum","doi":"10.1016/j.xgen.2025.101011","DOIUrl":"10.1016/j.xgen.2025.101011","url":null,"abstract":"<p><p>An emerging hallmark of many human diseases is transcription of typically silenced repetitive DNA containing pathogen-associated molecular patterns (PAMPs). These PAMPs engage the innate immune system via pattern recognition receptors (PRRs)-a phenomenon known as viral mimicry. We propose a statistical physics framework to quantify viral mimicry by measuring \"selective forces\" that enrich PAMPs compared to a genome-wide reference distribution. We validate our predictions by identifying repeats that bind different PRRs and show potential viral mimics in different repeat families across eukaryotic genomes, suggesting shared mechanisms drive emergence and retention. We propose two non-exclusive evolutionary hypotheses. The first \"repeat-centric\" hypothesis posits PAMPs are integral to the repeat life cycle and are therefore enriched as they mediate repeat expansion. The second \"organism-centric\" hypothesis proposes viral mimicry functions as a cell-intrinsic feedback mechanism for sensing and reacting to transcriptional dysregulation, which provides a selective pressure to maintain PAMPs in genomes.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"101011"},"PeriodicalIF":11.1,"publicationDate":"2025-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12802575/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145151953","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-10DOI: 10.1016/j.xgen.2025.101080
Daniel Sultanov, Andreas Hochwagen
To support the high demand for ribosomes, ribosomal RNA (rRNA) is produced by hundreds of highly expressed gene copies. In this issue of Cell Genomics, Potapova et al. reveal chromosome-specific organization and heritable transcriptional activity of rRNA clusters in humans and other hominids.
{"title":"Shining a light on ribosomal RNA genes, one chromosome at a time.","authors":"Daniel Sultanov, Andreas Hochwagen","doi":"10.1016/j.xgen.2025.101080","DOIUrl":"10.1016/j.xgen.2025.101080","url":null,"abstract":"<p><p>To support the high demand for ribosomes, ribosomal RNA (rRNA) is produced by hundreds of highly expressed gene copies. In this issue of Cell Genomics, Potapova et al. reveal chromosome-specific organization and heritable transcriptional activity of rRNA clusters in humans and other hominids.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":"5 12","pages":"101080"},"PeriodicalIF":11.1,"publicationDate":"2025-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12802687/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145745938","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-11-25DOI: 10.1016/j.xgen.2025.101075
Kenneth E Westerman, Julie E Gervis, Luke J O'Connor, Miriam S Udler, Alisa K Manning
Polygenic scores (PGSs) that can predict response to interventions can facilitate precision medicine and are detectable in observational datasets as PGS-by-exposure (PGS×E) interactions. PGSs based on interactions (iPGSs) or variance effects (vPGSs) may be more powerful than standard PGSs for detecting PGS×E, but these have yet to be systematically compared. We describe a generalized pipeline for developing and comparing these PGS types and apply it to detect genetic modification of the relationship between adiposity (measured by BMI) and a broad set of cardiometabolic risk factors. Our applied analysis in the UK Biobank identified significant PGS×BMI for 16/20 risk factors, most consistently for the iPGS approach. Many interactions replicated in All of Us (AoU); for example, we observed a 72% larger BMI-alanine aminotransferase association in the top iPGS decile in AoU. Our study provides a framework for the comparison of PGS×E strategies and informs efforts toward clinically useful response-focused PGSs.
多基因评分(pgs)可以预测对干预措施的反应,可以促进精准医疗,并在观察数据集中作为pgs -暴露(PGS×E)相互作用进行检测。基于相互作用(ipgs)或方差效应(vpgs)的pgs在检测PGS×E方面可能比标准pgs更强大,但这些还没有被系统地比较。我们描述了一个开发和比较这些PGS类型的通用管道,并将其应用于检测肥胖(由BMI测量)与一系列广泛的心脏代谢危险因素之间关系的遗传修饰。我们在英国生物银行的应用分析确定了16/20个风险因素的显著PGS×BMI,最一致的是iPGS方法。在《All of Us》(AoU)中复制了许多互动;例如,我们观察到,在AoU的iPGS前十分位数中,bmi -丙氨酸转氨酶的关联要大72%。我们的研究为PGS×E策略的比较提供了一个框架,并为临床有用的以反应为重点的pgs提供了信息。
{"title":"Polygenic scores capture genetic modification of the adiposity-cardiometabolic risk factor relationship.","authors":"Kenneth E Westerman, Julie E Gervis, Luke J O'Connor, Miriam S Udler, Alisa K Manning","doi":"10.1016/j.xgen.2025.101075","DOIUrl":"10.1016/j.xgen.2025.101075","url":null,"abstract":"<p><p>Polygenic scores (PGSs) that can predict response to interventions can facilitate precision medicine and are detectable in observational datasets as PGS-by-exposure (PGS×E) interactions. PGSs based on interactions (iPGSs) or variance effects (vPGSs) may be more powerful than standard PGSs for detecting PGS×E, but these have yet to be systematically compared. We describe a generalized pipeline for developing and comparing these PGS types and apply it to detect genetic modification of the relationship between adiposity (measured by BMI) and a broad set of cardiometabolic risk factors. Our applied analysis in the UK Biobank identified significant PGS×BMI for 16/20 risk factors, most consistently for the iPGS approach. Many interactions replicated in All of Us (AoU); for example, we observed a 72% larger BMI-alanine aminotransferase association in the top iPGS decile in AoU. Our study provides a framework for the comparison of PGS×E strategies and informs efforts toward clinically useful response-focused PGSs.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"101075"},"PeriodicalIF":11.1,"publicationDate":"2025-11-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145642999","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-11-19DOI: 10.1016/j.xgen.2025.101069
Gregory R Keele, Monika Dzieciatkowska, Ariel M Hay, Matthew Vincent, Callan O'Connor, Daniel Stephenson, Julie A Reisz, Travis Nemkov, Kirk C Hansen, Grier P Page, James C Zimring, Gary A Churchill, Angelo D'Alessandro
Red blood cells (RBCs) transport oxygen but accumulate oxidative damage over time, reducing function in vivo and during storage, critical for transfusions. To explore the genetics of RBC resilience, we profiled proteins, metabolites, and lipids from fresh and stored RBCs from 350 genetically diverse mice. Our analysis identified over 6,000 quantitative trait loci (QTLs). Compared to other tissues, the prevalence of trans genetic effects over cis ones reflects the absence of de novo protein synthesis in anucleated RBCs. QTL hotspots at Hbb, Hba, Mon1a, and (storage-specific) Steap3 linked ferroptosis to hemolysis. Proteasome QTLs clustered at multiple loci, underscoring the importance of degrading oxidized proteins. Post-translational modification (PTM) QTLs mapped predominantly to hemoglobins, including cysteine residues. The loss of reactive C93 in humanized mice (hemoglobulin beta [HBB] C93A) disrupted redox balance, glutathione pools, glutathionylation, and redox PTMs. These findings highlight genetic regulation of RBC oxidation, with implications for transfusion biology and oxidative-stress-dependent hemolytic disorders.
{"title":"Genetic architecture of the murine red blood cell proteome reveals central role of hemoglobin beta cysteine 93 in maintaining redox balance.","authors":"Gregory R Keele, Monika Dzieciatkowska, Ariel M Hay, Matthew Vincent, Callan O'Connor, Daniel Stephenson, Julie A Reisz, Travis Nemkov, Kirk C Hansen, Grier P Page, James C Zimring, Gary A Churchill, Angelo D'Alessandro","doi":"10.1016/j.xgen.2025.101069","DOIUrl":"10.1016/j.xgen.2025.101069","url":null,"abstract":"<p><p>Red blood cells (RBCs) transport oxygen but accumulate oxidative damage over time, reducing function in vivo and during storage, critical for transfusions. To explore the genetics of RBC resilience, we profiled proteins, metabolites, and lipids from fresh and stored RBCs from 350 genetically diverse mice. Our analysis identified over 6,000 quantitative trait loci (QTLs). Compared to other tissues, the prevalence of trans genetic effects over cis ones reflects the absence of de novo protein synthesis in anucleated RBCs. QTL hotspots at Hbb, Hba, Mon1a, and (storage-specific) Steap3 linked ferroptosis to hemolysis. Proteasome QTLs clustered at multiple loci, underscoring the importance of degrading oxidized proteins. Post-translational modification (PTM) QTLs mapped predominantly to hemoglobins, including cysteine residues. The loss of reactive C93 in humanized mice (hemoglobulin beta [HBB] C93A) disrupted redox balance, glutathione pools, glutathionylation, and redox PTMs. These findings highlight genetic regulation of RBC oxidation, with implications for transfusion biology and oxidative-stress-dependent hemolytic disorders.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"101069"},"PeriodicalIF":11.1,"publicationDate":"2025-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145566329","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-11-12Epub Date: 2025-08-12DOI: 10.1016/j.xgen.2025.100969
Ander Díaz-Navarro, Xindi Zhang, Wei Jiao, Bo Wang, Lincoln Stein
Understanding how genomic alterations drive cancer is key to advancing precision oncology. To detect these alterations, accurate algorithms are used; however, due to privacy concerns, few deeply sequenced cancer genomes can be shared, limiting benchmarking and representing a major obstacle to the improvement of analytic tools. To address this, we developed OncoGAN, a generative AI model combining adversarial networks and variational autoencoders to create realistic synthetic cancer genomes. Trained on large-scale genomic datasets, OncoGAN accurately reproduces somatic mutations, copy number alterations, and structural variants across cancer types while preserving donors' privacy. The synthetic genomes reflect tumor-specific mutational signatures and positional mutation patterns. Using DeepTumour, we validated the synthetic data's fidelity, showing high concordance between generated and predicted tumors. Moreover, augmenting the training data with synthetic genomes improved DeepTumour's accuracy, underscoring OncoGAN's potential to generate shareable datasets with known ground truths for benchmarking and enhancement of cancer genome analysis tools.
{"title":"In silico generation of synthetic cancer genomes using generative AI.","authors":"Ander Díaz-Navarro, Xindi Zhang, Wei Jiao, Bo Wang, Lincoln Stein","doi":"10.1016/j.xgen.2025.100969","DOIUrl":"10.1016/j.xgen.2025.100969","url":null,"abstract":"<p><p>Understanding how genomic alterations drive cancer is key to advancing precision oncology. To detect these alterations, accurate algorithms are used; however, due to privacy concerns, few deeply sequenced cancer genomes can be shared, limiting benchmarking and representing a major obstacle to the improvement of analytic tools. To address this, we developed OncoGAN, a generative AI model combining adversarial networks and variational autoencoders to create realistic synthetic cancer genomes. Trained on large-scale genomic datasets, OncoGAN accurately reproduces somatic mutations, copy number alterations, and structural variants across cancer types while preserving donors' privacy. The synthetic genomes reflect tumor-specific mutational signatures and positional mutation patterns. Using DeepTumour, we validated the synthetic data's fidelity, showing high concordance between generated and predicted tumors. Moreover, augmenting the training data with synthetic genomes improved DeepTumour's accuracy, underscoring OncoGAN's potential to generate shareable datasets with known ground truths for benchmarking and enhancement of cancer genome analysis tools.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"100969"},"PeriodicalIF":11.1,"publicationDate":"2025-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12648103/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144849956","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-11-12Epub Date: 2025-09-09DOI: 10.1016/j.xgen.2025.100985
Sebastian Palacios, Simone Bruno, Ron Weiss, Elia Salibi, Isabella Goodchild-Michelman, Andrew Kane, Katherine Ilia, Domitilla Del Vecchio
Cells store information by means of chromatin modifications that persist through cell divisions and can hold gene expression silenced over generations. However, how these modifications may maintain other gene expression states has remained unclear. This study shows that chromatin modifications can maintain a wide range of gene expression levels over time, thus uncovering analog epigenetic memory. By engineering a genomic reporter and epigenetic effectors, we tracked the gene expression dynamics following targeted perturbations to the chromatin state. We found that distinct grades of DNA methylation led to corresponding, persistent gene expression levels. Altering the DNA methylation grade, in turn, resulted in permanent loss of gene expression memory. Consistent with experiments, our chromatin modification model indicates that analog memory arises when the positive feedback between DNA methylation and repressive histone modifications is lacking. This discovery will lead to a deeper understanding of epigenetic memory and to new tools for synthetic biology.
{"title":"Analog epigenetic memory revealed by targeted chromatin editing.","authors":"Sebastian Palacios, Simone Bruno, Ron Weiss, Elia Salibi, Isabella Goodchild-Michelman, Andrew Kane, Katherine Ilia, Domitilla Del Vecchio","doi":"10.1016/j.xgen.2025.100985","DOIUrl":"10.1016/j.xgen.2025.100985","url":null,"abstract":"<p><p>Cells store information by means of chromatin modifications that persist through cell divisions and can hold gene expression silenced over generations. However, how these modifications may maintain other gene expression states has remained unclear. This study shows that chromatin modifications can maintain a wide range of gene expression levels over time, thus uncovering analog epigenetic memory. By engineering a genomic reporter and epigenetic effectors, we tracked the gene expression dynamics following targeted perturbations to the chromatin state. We found that distinct grades of DNA methylation led to corresponding, persistent gene expression levels. Altering the DNA methylation grade, in turn, resulted in permanent loss of gene expression memory. Consistent with experiments, our chromatin modification model indicates that analog memory arises when the positive feedback between DNA methylation and repressive histone modifications is lacking. This discovery will lead to a deeper understanding of epigenetic memory and to new tools for synthetic biology.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"100985"},"PeriodicalIF":11.1,"publicationDate":"2025-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12648113/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145034782","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-11-12DOI: 10.1016/j.xgen.2025.101064
Tianang Leng, Cesar de la Fuente-Nunez
Unmodified class II bacteriocins promise precision antimicrobials that spare bystander microbes. Zhang and colleagues introduce IIBacFinder, a genomics-guided pipeline that detects precursor and context genes with a curated pHMM library, infers leader-peptide cleavage, and triages candidates by meta-omics signals. The authors apply it across bacterial genomes, including an atlas of ∼280,000 human-gut genomes, and recover a vast reservoir of narrow-spectrum peptides and prioritize gut-resident candidates for synthesis. Of the 26 synthesized, 16 display activity in vitro, largely via membrane perturbation and with additive effects alongside vancomycin, while ex vivo assays show minimal compositional disruption of fecal communities compared with antibiotic controls. These results position unmodified class II bacteriocins as tractable, microbiome-sparing agents and illustrate how genome-scale mining coupled to meta-omics can bridge sequence to function in complex ecosystems.
{"title":"The gut's hidden arsenal: A genomics-guided atlas of class II bacteriocins.","authors":"Tianang Leng, Cesar de la Fuente-Nunez","doi":"10.1016/j.xgen.2025.101064","DOIUrl":"10.1016/j.xgen.2025.101064","url":null,"abstract":"<p><p>Unmodified class II bacteriocins promise precision antimicrobials that spare bystander microbes. Zhang and colleagues introduce IIBacFinder, a genomics-guided pipeline that detects precursor and context genes with a curated pHMM library, infers leader-peptide cleavage, and triages candidates by meta-omics signals. The authors apply it across bacterial genomes, including an atlas of ∼280,000 human-gut genomes, and recover a vast reservoir of narrow-spectrum peptides and prioritize gut-resident candidates for synthesis. Of the 26 synthesized, 16 display activity in vitro, largely via membrane perturbation and with additive effects alongside vancomycin, while ex vivo assays show minimal compositional disruption of fecal communities compared with antibiotic controls. These results position unmodified class II bacteriocins as tractable, microbiome-sparing agents and illustrate how genome-scale mining coupled to meta-omics can bridge sequence to function in complex ecosystems.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":"5 11","pages":"101064"},"PeriodicalIF":11.1,"publicationDate":"2025-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12648078/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145514600","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}