Pub Date : 2025-12-01DOI: 10.1016/j.xgen.2025.101074
Mohab Helmy, Jin U Li, Xinyu F Yan, Rachel K Meade, Elizabeth Anderson, Patrick B Chen, Anne M Czechanski, Tomás Di Domenico, Jonathan Flint, Erik Garrison, Marco T P Gontijo, Andrea Guarracino, Leanne Haggerty, Edith Heard, Kerstin Howe, Narendra Meena, Fergal J Martin, Eric A Miska, Isabell Rall, Navin B Ramakrishna, Alexandra Sapetschnig, Swati Sinha, Diandian Sun, Francesca F Tricomi, Runjia Qu, Jonathan M D Wood, Tianzhen Wu, Dian J Zhou, Laura Reinholdt, David J Adams, Clare M Smith, Jingtao Lilue, Thomas M Keane
We present a collection of 17 high-quality long-read inbred mouse strain genomes with complete annotation (contig N50s of 0.8-33.9 Mbp). This collection includes 12 widely used classical laboratory strains and 5 wild-derived strains. We have resolved previously incomplete genomic regions, including the major histocompatibility complex (MHC), defensin cluster, T cell receptor, and Ly49 complexes. Hundreds of non-reference genes from previous publications not found in GRCm39, such as Defa1, Raet1a, and Klra20 (Ly49T), were localized in the new reference genomes. We conducted a genome-wide scan of variable number tandem repeats (VNTRs) within the coding regions, identifying over 400 genes with VNTR polymorphisms with up to 600 repeat copies and repeat units reaching 990 nucleotides. Our strain-specific annotations enhance RNA sequencing (RNA-seq) analyses, as demonstrated in PWK/PhJ, where we observed a 5.1% improvement in read mapping and expression-level differences in 2.1% of coding genes compared to using GRCm39.
{"title":"High-quality mouse reference genomes reveal the structural complexity of the murine protein-coding landscape.","authors":"Mohab Helmy, Jin U Li, Xinyu F Yan, Rachel K Meade, Elizabeth Anderson, Patrick B Chen, Anne M Czechanski, Tomás Di Domenico, Jonathan Flint, Erik Garrison, Marco T P Gontijo, Andrea Guarracino, Leanne Haggerty, Edith Heard, Kerstin Howe, Narendra Meena, Fergal J Martin, Eric A Miska, Isabell Rall, Navin B Ramakrishna, Alexandra Sapetschnig, Swati Sinha, Diandian Sun, Francesca F Tricomi, Runjia Qu, Jonathan M D Wood, Tianzhen Wu, Dian J Zhou, Laura Reinholdt, David J Adams, Clare M Smith, Jingtao Lilue, Thomas M Keane","doi":"10.1016/j.xgen.2025.101074","DOIUrl":"https://doi.org/10.1016/j.xgen.2025.101074","url":null,"abstract":"<p><p>We present a collection of 17 high-quality long-read inbred mouse strain genomes with complete annotation (contig N50s of 0.8-33.9 Mbp). This collection includes 12 widely used classical laboratory strains and 5 wild-derived strains. We have resolved previously incomplete genomic regions, including the major histocompatibility complex (MHC), defensin cluster, T cell receptor, and Ly49 complexes. Hundreds of non-reference genes from previous publications not found in GRCm39, such as Defa1, Raet1a, and Klra20 (Ly49T), were localized in the new reference genomes. We conducted a genome-wide scan of variable number tandem repeats (VNTRs) within the coding regions, identifying over 400 genes with VNTR polymorphisms with up to 600 repeat copies and repeat units reaching 990 nucleotides. Our strain-specific annotations enhance RNA sequencing (RNA-seq) analyses, as demonstrated in PWK/PhJ, where we observed a 5.1% improvement in read mapping and expression-level differences in 2.1% of coding genes compared to using GRCm39.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"101074"},"PeriodicalIF":11.1,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145662898","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-01DOI: 10.1016/j.xgen.2025.101071
Mingrui Yu, Qian Zhang, Kai Yuan, Aleksejs Sazonovs, Christine R Stevens, Laura Fachal, Christopher A Lamb, Carl A Anderson, Mark J Daly, Hailiang Huang
Genetic mutations that yield a defective cystic fibrosis (CF) transmembrane regulator (CFTR) protein cause CF, a life-limiting autosomal-recessive Mendelian disorder. A protective role of CFTR loss-of-function mutations in inflammatory bowel disease (IBD) has been suggested, but its evidence has been inconclusive and contradictory. Here, leveraging a large IBD exome sequencing dataset comprising 38,558 cases and 66,945 controls of European ancestry in the discovery stage and a combined total of 42,475 cases and 192,050 controls across diverse ancestry groups in the replication stage, we established a protective role of CF-risk variants against IBD based on the association test of CFTR deltaF508 (p = 8.96E-11) and the gene-based burden test of CF-risk variants (p = 3.9E-07). Furthermore, we assessed variant prioritization methods, including AlphaMissense, using clinically annotated CF-risk variants as the gold standard. Our findings highlight the critical and unmet need for effective variant prioritization in gene-based burden tests.
{"title":"Cystic fibrosis risk variants confer protection against inflammatory bowel disease.","authors":"Mingrui Yu, Qian Zhang, Kai Yuan, Aleksejs Sazonovs, Christine R Stevens, Laura Fachal, Christopher A Lamb, Carl A Anderson, Mark J Daly, Hailiang Huang","doi":"10.1016/j.xgen.2025.101071","DOIUrl":"10.1016/j.xgen.2025.101071","url":null,"abstract":"<p><p>Genetic mutations that yield a defective cystic fibrosis (CF) transmembrane regulator (CFTR) protein cause CF, a life-limiting autosomal-recessive Mendelian disorder. A protective role of CFTR loss-of-function mutations in inflammatory bowel disease (IBD) has been suggested, but its evidence has been inconclusive and contradictory. Here, leveraging a large IBD exome sequencing dataset comprising 38,558 cases and 66,945 controls of European ancestry in the discovery stage and a combined total of 42,475 cases and 192,050 controls across diverse ancestry groups in the replication stage, we established a protective role of CF-risk variants against IBD based on the association test of CFTR deltaF508 (p = 8.96E-11) and the gene-based burden test of CF-risk variants (p = 3.9E-07). Furthermore, we assessed variant prioritization methods, including AlphaMissense, using clinically annotated CF-risk variants as the gold standard. Our findings highlight the critical and unmet need for effective variant prioritization in gene-based burden tests.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"101071"},"PeriodicalIF":11.1,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145662956","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-11-25DOI: 10.1016/j.xgen.2025.101075
Kenneth E Westerman, Julie E Gervis, Luke J O'Connor, Miriam S Udler, Alisa K Manning
Polygenic scores (PGSs) that can predict response to interventions can facilitate precision medicine and are detectable in observational datasets as PGS-by-exposure (PGS×E) interactions. PGSs based on interactions (iPGSs) or variance effects (vPGSs) may be more powerful than standard PGSs for detecting PGS×E, but these have yet to be systematically compared. We describe a generalized pipeline for developing and comparing these PGS types and apply it to detect genetic modification of the relationship between adiposity (measured by BMI) and a broad set of cardiometabolic risk factors. Our applied analysis in the UK Biobank identified significant PGS×BMI for 16/20 risk factors, most consistently for the iPGS approach. Many interactions replicated in All of Us (AoU); for example, we observed a 72% larger BMI-alanine aminotransferase association in the top iPGS decile in AoU. Our study provides a framework for the comparison of PGS×E strategies and informs efforts toward clinically useful response-focused PGSs.
多基因评分(pgs)可以预测对干预措施的反应,可以促进精准医疗,并在观察数据集中作为pgs -暴露(PGS×E)相互作用进行检测。基于相互作用(ipgs)或方差效应(vpgs)的pgs在检测PGS×E方面可能比标准pgs更强大,但这些还没有被系统地比较。我们描述了一个开发和比较这些PGS类型的通用管道,并将其应用于检测肥胖(由BMI测量)与一系列广泛的心脏代谢危险因素之间关系的遗传修饰。我们在英国生物银行的应用分析确定了16/20个风险因素的显著PGS×BMI,最一致的是iPGS方法。在《All of Us》(AoU)中复制了许多互动;例如,我们观察到,在AoU的iPGS前十分位数中,bmi -丙氨酸转氨酶的关联要大72%。我们的研究为PGS×E策略的比较提供了一个框架,并为临床有用的以反应为重点的pgs提供了信息。
{"title":"Polygenic scores capture genetic modification of the adiposity-cardiometabolic risk factor relationship.","authors":"Kenneth E Westerman, Julie E Gervis, Luke J O'Connor, Miriam S Udler, Alisa K Manning","doi":"10.1016/j.xgen.2025.101075","DOIUrl":"10.1016/j.xgen.2025.101075","url":null,"abstract":"<p><p>Polygenic scores (PGSs) that can predict response to interventions can facilitate precision medicine and are detectable in observational datasets as PGS-by-exposure (PGS×E) interactions. PGSs based on interactions (iPGSs) or variance effects (vPGSs) may be more powerful than standard PGSs for detecting PGS×E, but these have yet to be systematically compared. We describe a generalized pipeline for developing and comparing these PGS types and apply it to detect genetic modification of the relationship between adiposity (measured by BMI) and a broad set of cardiometabolic risk factors. Our applied analysis in the UK Biobank identified significant PGS×BMI for 16/20 risk factors, most consistently for the iPGS approach. Many interactions replicated in All of Us (AoU); for example, we observed a 72% larger BMI-alanine aminotransferase association in the top iPGS decile in AoU. Our study provides a framework for the comparison of PGS×E strategies and informs efforts toward clinically useful response-focused PGSs.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"101075"},"PeriodicalIF":11.1,"publicationDate":"2025-11-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145642999","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-11-19DOI: 10.1016/j.xgen.2025.101065
Tajda Klobučar, Jona Novljan, Ira A Iosub, Boštjan Kokot, Iztok Urbančič, D Marc Jones, Anob M Chakrabarti, Nicholas M Luscombe, Jernej Ule, Miha Modic
Complex RNA-protein networks play a pivotal role in the formation of many types of biomolecular condensates. How RNA features contribute to condensate formation, however, remains incompletely understood. Here, we integrate tailored transcriptomics assays to identify a distinct class of developmental condensation-prone RNAs termed "smOOPs" (semi-extractable, orthogonal-organic-phase-separation-enriched RNAs). These transcripts localize to larger intracellular foci, form denser RNA subnetworks than expected, and are heavily bound by RNA-binding proteins (RBPs). Using an explainable deep learning framework, we reveal that smOOPs harbor characteristic sequence composition, with lower sequence complexity, increased intramolecular folding, and specific RBP-binding patterns. Intriguingly, these RNAs encode proteins bearing extensive intrinsically disordered regions and are highly predicted to be involved in biomolecular condensates, indicating an interplay between RNA- and protein-based features in phase separation. This work advances our understanding of condensation-prone RNAs and provides a versatile resource to further investigate RNA-driven condensation principles.
{"title":"Integrative profiling of condensation-prone RNAs during early development.","authors":"Tajda Klobučar, Jona Novljan, Ira A Iosub, Boštjan Kokot, Iztok Urbančič, D Marc Jones, Anob M Chakrabarti, Nicholas M Luscombe, Jernej Ule, Miha Modic","doi":"10.1016/j.xgen.2025.101065","DOIUrl":"https://doi.org/10.1016/j.xgen.2025.101065","url":null,"abstract":"<p><p>Complex RNA-protein networks play a pivotal role in the formation of many types of biomolecular condensates. How RNA features contribute to condensate formation, however, remains incompletely understood. Here, we integrate tailored transcriptomics assays to identify a distinct class of developmental condensation-prone RNAs termed \"smOOPs\" (semi-extractable, orthogonal-organic-phase-separation-enriched RNAs). These transcripts localize to larger intracellular foci, form denser RNA subnetworks than expected, and are heavily bound by RNA-binding proteins (RBPs). Using an explainable deep learning framework, we reveal that smOOPs harbor characteristic sequence composition, with lower sequence complexity, increased intramolecular folding, and specific RBP-binding patterns. Intriguingly, these RNAs encode proteins bearing extensive intrinsically disordered regions and are highly predicted to be involved in biomolecular condensates, indicating an interplay between RNA- and protein-based features in phase separation. This work advances our understanding of condensation-prone RNAs and provides a versatile resource to further investigate RNA-driven condensation principles.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"101065"},"PeriodicalIF":11.1,"publicationDate":"2025-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145566346","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-11-19DOI: 10.1016/j.xgen.2025.101069
Gregory R Keele, Monika Dzieciatkowska, Ariel M Hay, Matthew Vincent, Callan O'Connor, Daniel Stephenson, Julie A Reisz, Travis Nemkov, Kirk C Hansen, Grier P Page, James C Zimring, Gary A Churchill, Angelo D'Alessandro
Red blood cells (RBCs) transport oxygen but accumulate oxidative damage over time, reducing function in vivo and during storage, critical for transfusions. To explore the genetics of RBC resilience, we profiled proteins, metabolites, and lipids from fresh and stored RBCs from 350 genetically diverse mice. Our analysis identified over 6,000 quantitative trait loci (QTLs). Compared to other tissues, the prevalence of trans genetic effects over cis ones reflects the absence of de novo protein synthesis in anucleated RBCs. QTL hotspots at Hbb, Hba, Mon1a, and (storage-specific) Steap3 linked ferroptosis to hemolysis. Proteasome QTLs clustered at multiple loci, underscoring the importance of degrading oxidized proteins. Post-translational modification (PTM) QTLs mapped predominantly to hemoglobins, including cysteine residues. The loss of reactive C93 in humanized mice (hemoglobulin beta [HBB] C93A) disrupted redox balance, glutathione pools, glutathionylation, and redox PTMs. These findings highlight genetic regulation of RBC oxidation, with implications for transfusion biology and oxidative-stress-dependent hemolytic disorders.
{"title":"Genetic architecture of the murine red blood cell proteome reveals central role of hemoglobin beta cysteine 93 in maintaining redox balance.","authors":"Gregory R Keele, Monika Dzieciatkowska, Ariel M Hay, Matthew Vincent, Callan O'Connor, Daniel Stephenson, Julie A Reisz, Travis Nemkov, Kirk C Hansen, Grier P Page, James C Zimring, Gary A Churchill, Angelo D'Alessandro","doi":"10.1016/j.xgen.2025.101069","DOIUrl":"10.1016/j.xgen.2025.101069","url":null,"abstract":"<p><p>Red blood cells (RBCs) transport oxygen but accumulate oxidative damage over time, reducing function in vivo and during storage, critical for transfusions. To explore the genetics of RBC resilience, we profiled proteins, metabolites, and lipids from fresh and stored RBCs from 350 genetically diverse mice. Our analysis identified over 6,000 quantitative trait loci (QTLs). Compared to other tissues, the prevalence of trans genetic effects over cis ones reflects the absence of de novo protein synthesis in anucleated RBCs. QTL hotspots at Hbb, Hba, Mon1a, and (storage-specific) Steap3 linked ferroptosis to hemolysis. Proteasome QTLs clustered at multiple loci, underscoring the importance of degrading oxidized proteins. Post-translational modification (PTM) QTLs mapped predominantly to hemoglobins, including cysteine residues. The loss of reactive C93 in humanized mice (hemoglobulin beta [HBB] C93A) disrupted redox balance, glutathione pools, glutathionylation, and redox PTMs. These findings highlight genetic regulation of RBC oxidation, with implications for transfusion biology and oxidative-stress-dependent hemolytic disorders.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"101069"},"PeriodicalIF":11.1,"publicationDate":"2025-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145566329","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-11-19DOI: 10.1016/j.xgen.2025.101068
Seon-Kyeong Jang, Zitian Wang, Richard Border, Dinh Tuan, Angela Wei, Ulzee An, Sriram Sankararaman, Vasilis Ntranos, Jonathan Flint, Noah Zaitlen
Protein language models (PLMs) improve variant effect predictions, but their role in gene discovery for complex traits remains unclear. We introduce an allelic series-based regression test that uses PLM-derived variant effect predictions as proxies for effect sizes, identifying ∼46% more associations than standard burden tests. Extending this to isoform-level analysis, we find 26 gene-trait pairs with stronger associations in non-canonical versus canonical transcripts, highlighting isoform-specific effects. Finally, we identify evolutionary plausible variants (EPVs), missense variants assigned higher likelihoods than the wild-type alleles by PLMs, representing 0.45% of missense variants. EPVs show higher allele frequencies than synonymous variants, consistent with differential selection pressures, and are linked to nine traits, including protective associations with low-density lipoprotein (LDL) and bone mineral density. Together, our results demonstrate how PLMs can enhance rare-variant interpretation and gene-trait association discovery in exome data.
{"title":"Leveraging protein language models to identify complex trait associations with previously inaccessible classes of functional rare variants.","authors":"Seon-Kyeong Jang, Zitian Wang, Richard Border, Dinh Tuan, Angela Wei, Ulzee An, Sriram Sankararaman, Vasilis Ntranos, Jonathan Flint, Noah Zaitlen","doi":"10.1016/j.xgen.2025.101068","DOIUrl":"10.1016/j.xgen.2025.101068","url":null,"abstract":"<p><p>Protein language models (PLMs) improve variant effect predictions, but their role in gene discovery for complex traits remains unclear. We introduce an allelic series-based regression test that uses PLM-derived variant effect predictions as proxies for effect sizes, identifying ∼46% more associations than standard burden tests. Extending this to isoform-level analysis, we find 26 gene-trait pairs with stronger associations in non-canonical versus canonical transcripts, highlighting isoform-specific effects. Finally, we identify evolutionary plausible variants (EPVs), missense variants assigned higher likelihoods than the wild-type alleles by PLMs, representing 0.45% of missense variants. EPVs show higher allele frequencies than synonymous variants, consistent with differential selection pressures, and are linked to nine traits, including protective associations with low-density lipoprotein (LDL) and bone mineral density. Together, our results demonstrate how PLMs can enhance rare-variant interpretation and gene-trait association discovery in exome data.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"101068"},"PeriodicalIF":11.1,"publicationDate":"2025-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145566307","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-11-12Epub Date: 2025-08-12DOI: 10.1016/j.xgen.2025.100969
Ander Díaz-Navarro, Xindi Zhang, Wei Jiao, Bo Wang, Lincoln Stein
Understanding how genomic alterations drive cancer is key to advancing precision oncology. To detect these alterations, accurate algorithms are used; however, due to privacy concerns, few deeply sequenced cancer genomes can be shared, limiting benchmarking and representing a major obstacle to the improvement of analytic tools. To address this, we developed OncoGAN, a generative AI model combining adversarial networks and variational autoencoders to create realistic synthetic cancer genomes. Trained on large-scale genomic datasets, OncoGAN accurately reproduces somatic mutations, copy number alterations, and structural variants across cancer types while preserving donors' privacy. The synthetic genomes reflect tumor-specific mutational signatures and positional mutation patterns. Using DeepTumour, we validated the synthetic data's fidelity, showing high concordance between generated and predicted tumors. Moreover, augmenting the training data with synthetic genomes improved DeepTumour's accuracy, underscoring OncoGAN's potential to generate shareable datasets with known ground truths for benchmarking and enhancement of cancer genome analysis tools.
{"title":"In silico generation of synthetic cancer genomes using generative AI.","authors":"Ander Díaz-Navarro, Xindi Zhang, Wei Jiao, Bo Wang, Lincoln Stein","doi":"10.1016/j.xgen.2025.100969","DOIUrl":"10.1016/j.xgen.2025.100969","url":null,"abstract":"<p><p>Understanding how genomic alterations drive cancer is key to advancing precision oncology. To detect these alterations, accurate algorithms are used; however, due to privacy concerns, few deeply sequenced cancer genomes can be shared, limiting benchmarking and representing a major obstacle to the improvement of analytic tools. To address this, we developed OncoGAN, a generative AI model combining adversarial networks and variational autoencoders to create realistic synthetic cancer genomes. Trained on large-scale genomic datasets, OncoGAN accurately reproduces somatic mutations, copy number alterations, and structural variants across cancer types while preserving donors' privacy. The synthetic genomes reflect tumor-specific mutational signatures and positional mutation patterns. Using DeepTumour, we validated the synthetic data's fidelity, showing high concordance between generated and predicted tumors. Moreover, augmenting the training data with synthetic genomes improved DeepTumour's accuracy, underscoring OncoGAN's potential to generate shareable datasets with known ground truths for benchmarking and enhancement of cancer genome analysis tools.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"100969"},"PeriodicalIF":11.1,"publicationDate":"2025-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12648103/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144849956","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-11-12Epub Date: 2025-09-09DOI: 10.1016/j.xgen.2025.100985
Sebastian Palacios, Simone Bruno, Ron Weiss, Elia Salibi, Isabella Goodchild-Michelman, Andrew Kane, Katherine Ilia, Domitilla Del Vecchio
Cells store information by means of chromatin modifications that persist through cell divisions and can hold gene expression silenced over generations. However, how these modifications may maintain other gene expression states has remained unclear. This study shows that chromatin modifications can maintain a wide range of gene expression levels over time, thus uncovering analog epigenetic memory. By engineering a genomic reporter and epigenetic effectors, we tracked the gene expression dynamics following targeted perturbations to the chromatin state. We found that distinct grades of DNA methylation led to corresponding, persistent gene expression levels. Altering the DNA methylation grade, in turn, resulted in permanent loss of gene expression memory. Consistent with experiments, our chromatin modification model indicates that analog memory arises when the positive feedback between DNA methylation and repressive histone modifications is lacking. This discovery will lead to a deeper understanding of epigenetic memory and to new tools for synthetic biology.
{"title":"Analog epigenetic memory revealed by targeted chromatin editing.","authors":"Sebastian Palacios, Simone Bruno, Ron Weiss, Elia Salibi, Isabella Goodchild-Michelman, Andrew Kane, Katherine Ilia, Domitilla Del Vecchio","doi":"10.1016/j.xgen.2025.100985","DOIUrl":"10.1016/j.xgen.2025.100985","url":null,"abstract":"<p><p>Cells store information by means of chromatin modifications that persist through cell divisions and can hold gene expression silenced over generations. However, how these modifications may maintain other gene expression states has remained unclear. This study shows that chromatin modifications can maintain a wide range of gene expression levels over time, thus uncovering analog epigenetic memory. By engineering a genomic reporter and epigenetic effectors, we tracked the gene expression dynamics following targeted perturbations to the chromatin state. We found that distinct grades of DNA methylation led to corresponding, persistent gene expression levels. Altering the DNA methylation grade, in turn, resulted in permanent loss of gene expression memory. Consistent with experiments, our chromatin modification model indicates that analog memory arises when the positive feedback between DNA methylation and repressive histone modifications is lacking. This discovery will lead to a deeper understanding of epigenetic memory and to new tools for synthetic biology.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"100985"},"PeriodicalIF":11.1,"publicationDate":"2025-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12648113/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145034782","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-11-12DOI: 10.1016/j.xgen.2025.101064
Tianang Leng, Cesar de la Fuente-Nunez
Unmodified class II bacteriocins promise precision antimicrobials that spare bystander microbes. Zhang and colleagues introduce IIBacFinder, a genomics-guided pipeline that detects precursor and context genes with a curated pHMM library, infers leader-peptide cleavage, and triages candidates by meta-omics signals. The authors apply it across bacterial genomes, including an atlas of ∼280,000 human-gut genomes, and recover a vast reservoir of narrow-spectrum peptides and prioritize gut-resident candidates for synthesis. Of the 26 synthesized, 16 display activity in vitro, largely via membrane perturbation and with additive effects alongside vancomycin, while ex vivo assays show minimal compositional disruption of fecal communities compared with antibiotic controls. These results position unmodified class II bacteriocins as tractable, microbiome-sparing agents and illustrate how genome-scale mining coupled to meta-omics can bridge sequence to function in complex ecosystems.
{"title":"The gut's hidden arsenal: A genomics-guided atlas of class II bacteriocins.","authors":"Tianang Leng, Cesar de la Fuente-Nunez","doi":"10.1016/j.xgen.2025.101064","DOIUrl":"10.1016/j.xgen.2025.101064","url":null,"abstract":"<p><p>Unmodified class II bacteriocins promise precision antimicrobials that spare bystander microbes. Zhang and colleagues introduce IIBacFinder, a genomics-guided pipeline that detects precursor and context genes with a curated pHMM library, infers leader-peptide cleavage, and triages candidates by meta-omics signals. The authors apply it across bacterial genomes, including an atlas of ∼280,000 human-gut genomes, and recover a vast reservoir of narrow-spectrum peptides and prioritize gut-resident candidates for synthesis. Of the 26 synthesized, 16 display activity in vitro, largely via membrane perturbation and with additive effects alongside vancomycin, while ex vivo assays show minimal compositional disruption of fecal communities compared with antibiotic controls. These results position unmodified class II bacteriocins as tractable, microbiome-sparing agents and illustrate how genome-scale mining coupled to meta-omics can bridge sequence to function in complex ecosystems.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":"5 11","pages":"101064"},"PeriodicalIF":11.1,"publicationDate":"2025-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12648078/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145514600","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-11-12Epub Date: 2025-09-09DOI: 10.1016/j.xgen.2025.100981
Taylor N LaFlam, Christian B Billesbølle, Tuan Dinh, Finn D Wolfreys, Erick Lu, Tomas Matteson, Jinping An, Ying Xu, Arushi Singhal, Nadav Brandes, Vasilis Ntranos, Aashish Manglik, Jason G Cyster, Chun Jimmie Ye
Missense variants can have pleiotropic effects on protein function, and predicting these effects can be difficult. We performed near-saturation deep mutational scanning of P2RY8, a G protein-coupled receptor that promotes germinal center B cell confinement. We assayed the effect of each variant on surface expression, migration, and proliferation. We delineated variants that affected both expression and function, affected function independently of expression, and discrepantly affected migration and proliferation. We also used cryo-electron microscopy to determine the structure of activated, ligand-bound P2RY8, providing structural insights into the effects of variants on ligand binding and signal transmission. We applied the deep mutational scanning results to both improve computational variant effect predictions and to characterize the phenotype of germline variants and lymphoma-associated variants. Together, our results demonstrate the power of integrating deep mutational scanning, structure determination, and in silico prediction to advance the understanding of a receptor important in human health.
{"title":"Phenotypic pleiotropy of missense variants in human B cell confinement receptor P2RY8.","authors":"Taylor N LaFlam, Christian B Billesbølle, Tuan Dinh, Finn D Wolfreys, Erick Lu, Tomas Matteson, Jinping An, Ying Xu, Arushi Singhal, Nadav Brandes, Vasilis Ntranos, Aashish Manglik, Jason G Cyster, Chun Jimmie Ye","doi":"10.1016/j.xgen.2025.100981","DOIUrl":"10.1016/j.xgen.2025.100981","url":null,"abstract":"<p><p>Missense variants can have pleiotropic effects on protein function, and predicting these effects can be difficult. We performed near-saturation deep mutational scanning of P2RY8, a G protein-coupled receptor that promotes germinal center B cell confinement. We assayed the effect of each variant on surface expression, migration, and proliferation. We delineated variants that affected both expression and function, affected function independently of expression, and discrepantly affected migration and proliferation. We also used cryo-electron microscopy to determine the structure of activated, ligand-bound P2RY8, providing structural insights into the effects of variants on ligand binding and signal transmission. We applied the deep mutational scanning results to both improve computational variant effect predictions and to characterize the phenotype of germline variants and lymphoma-associated variants. Together, our results demonstrate the power of integrating deep mutational scanning, structure determination, and in silico prediction to advance the understanding of a receptor important in human health.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"100981"},"PeriodicalIF":11.1,"publicationDate":"2025-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12648108/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145034846","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}