Pub Date : 2026-02-03DOI: 10.1186/s13059-026-03987-2
Yuwei Hua, Shiyu Li, Yong Zhang
Nuclear morphology encodes rich phenotypic information critical for understanding cellular states, yet its full potential remains untapped in biomedical analysis. This study introduces NuSPIRe, a self-supervised deep learning model designed to analyze nuclear morphology using DAPI-stained images. Pretrained on 15.52 million cell nucleus images, NuSPIRe performs robustly in cell type identification and perturbation detection, even with limited annotations. Moreover, NuSPIRe integrates nuclear morphology with spatial omics data, uncovering significant correlations between cellular structure and gene expression. Notably, NuSPIRe further enables AI-driven experimental optimization for region-of-interest identification and field-of-view selection, enhancing data efficiency in spatial omics and molecular cell biology.
{"title":"Self-supervised pretraining with NuSPIRe unlocks nuclear morphology-driven insights in spatial omics.","authors":"Yuwei Hua, Shiyu Li, Yong Zhang","doi":"10.1186/s13059-026-03987-2","DOIUrl":"10.1186/s13059-026-03987-2","url":null,"abstract":"<p><p>Nuclear morphology encodes rich phenotypic information critical for understanding cellular states, yet its full potential remains untapped in biomedical analysis. This study introduces NuSPIRe, a self-supervised deep learning model designed to analyze nuclear morphology using DAPI-stained images. Pretrained on 15.52 million cell nucleus images, NuSPIRe performs robustly in cell type identification and perturbation detection, even with limited annotations. Moreover, NuSPIRe integrates nuclear morphology with spatial omics data, uncovering significant correlations between cellular structure and gene expression. Notably, NuSPIRe further enables AI-driven experimental optimization for region-of-interest identification and field-of-view selection, enhancing data efficiency in spatial omics and molecular cell biology.</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":" ","pages":""},"PeriodicalIF":12.3,"publicationDate":"2026-02-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12958554/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146114766","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-02-03DOI: 10.1186/s13059-026-03980-9
Pei-Xuan Xiao, Lei Tan, Jianke Dong, Jing Huang, Yuhong Huang, Jia-Bao He, Handong Su, Botao Song, Wen-Biao Jiao
Background: Potato (Solanum tuberosum) breeding is severely hindered by its highly heterozygous autotetraploid genome, where complex allelic interactions impede precise trait selection. Reconstructing complete haplotype-resolved assemblies is crucial for genome-assisted breeding. However, current assembly methods for autopolyploids often generate fragmented sequences, haplotype-switch errors, and gaps in complex regions such as centromeres.
Results: To address these challenges, we develop PHap, a haplotype assembly pipeline tailored for autopolyploids, using only standard sequencing data, including long-reads and Hi-C. Applying PHap to the autotetraploid potato cultivar HuaShu4, we generate a haplotype-resolved, near telomere-to-telomere assembly of 3.12 Gb with an N50 of 32.7 Mb and 99.7% haplotype accuracy. Comparisons with alternative methods and existing assemblies highlight PHap's advantages in assembly quality and cost-effectiveness. Integration of transcriptomic and epigenomic data demonstrates that the genomic and methylation divergence across haplotypes drives substantial allelic expression differentiation. Time-course RNA-seq further reveals, for the first time, that 55% of genes exhibit divergent allelic expression, with dynamic shifts in dominant or suppressed alleles during tuber development. Additionally, our assembly resolves high-resolution haplotype-specific structures in centromeres and subtelomeres, as well as haplotype divergence of structural rearrangements. It also shows neocentromere formation via the expansion of megabase-scale satellite arrays.
Conclusions: These findings provide insights into the architecture of autopolyploid genomes and establish a foundation for genomics-assisted breeding of polyploid potatoes.
{"title":"Haplotype-resolved and near telomere-to-telomere assembly of the autotetraploid potato genome.","authors":"Pei-Xuan Xiao, Lei Tan, Jianke Dong, Jing Huang, Yuhong Huang, Jia-Bao He, Handong Su, Botao Song, Wen-Biao Jiao","doi":"10.1186/s13059-026-03980-9","DOIUrl":"10.1186/s13059-026-03980-9","url":null,"abstract":"<p><strong>Background: </strong>Potato (Solanum tuberosum) breeding is severely hindered by its highly heterozygous autotetraploid genome, where complex allelic interactions impede precise trait selection. Reconstructing complete haplotype-resolved assemblies is crucial for genome-assisted breeding. However, current assembly methods for autopolyploids often generate fragmented sequences, haplotype-switch errors, and gaps in complex regions such as centromeres.</p><p><strong>Results: </strong>To address these challenges, we develop PHap, a haplotype assembly pipeline tailored for autopolyploids, using only standard sequencing data, including long-reads and Hi-C. Applying PHap to the autotetraploid potato cultivar HuaShu4, we generate a haplotype-resolved, near telomere-to-telomere assembly of 3.12 Gb with an N50 of 32.7 Mb and 99.7% haplotype accuracy. Comparisons with alternative methods and existing assemblies highlight PHap's advantages in assembly quality and cost-effectiveness. Integration of transcriptomic and epigenomic data demonstrates that the genomic and methylation divergence across haplotypes drives substantial allelic expression differentiation. Time-course RNA-seq further reveals, for the first time, that 55% of genes exhibit divergent allelic expression, with dynamic shifts in dominant or suppressed alleles during tuber development. Additionally, our assembly resolves high-resolution haplotype-specific structures in centromeres and subtelomeres, as well as haplotype divergence of structural rearrangements. It also shows neocentromere formation via the expansion of megabase-scale satellite arrays.</p><p><strong>Conclusions: </strong>These findings provide insights into the architecture of autopolyploid genomes and establish a foundation for genomics-assisted breeding of polyploid potatoes.</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":" ","pages":""},"PeriodicalIF":12.3,"publicationDate":"2026-02-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12955163/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146114792","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-02-03DOI: 10.1186/s13059-026-03977-4
Hyeong-Cheol Oh, Yeonseung Han, Yoojin Chang, Hyongbum Henry Kim
Background: ALK gene fusions are key oncogenic drivers in cancers such as non-small cell lung cancer, where they define a molecular subtype responsive to ALK tyrosine kinase inhibitors (TKIs). However, resistance commonly arises due to single nucleotide variants (SNVs) within the ALK tyrosine kinase domain, many of which remain variants of uncertain significance (VUSs).
Results: To systematically profile resistance, we use prime editing to generate and assess 3,208 ALK variants covering 99% of all possible SNVs across exons 20-28, along with intronic variants. We evaluate drug resistance across three generations of ALK TKIs: alectinib, lorlatinib, and zotizalkib. These high-resolution resistance landscapes validate known resistance mutations (e.g., G1202R, L1196M), identify previously uncharacterized resistance-associated VUSs, and reveal distinct patterns of drug-specific and shared resistance across inhibitors. Structural mapping further contextualizes resistance-associated variants relative to the ATP-binding pocket and distal regions associated with resistance.
Conclusions: This study provides a comprehensive functional atlas of ALK tyrosine kinase domain variants under TKI selection, offering a valuable experimental framework for interpreting resistance-associated variants. Although derived from in vitro models and therefore context dependent, this resource complements existing clinical and genomic knowledge and may aid in the functional interpretation of ALK variants observed in ALK-driven cancers.
{"title":"A comprehensive functional atlas of ALK kinase domain variants reveals resistance landscape to ALK inhibitors.","authors":"Hyeong-Cheol Oh, Yeonseung Han, Yoojin Chang, Hyongbum Henry Kim","doi":"10.1186/s13059-026-03977-4","DOIUrl":"10.1186/s13059-026-03977-4","url":null,"abstract":"<p><strong>Background: </strong>ALK gene fusions are key oncogenic drivers in cancers such as non-small cell lung cancer, where they define a molecular subtype responsive to ALK tyrosine kinase inhibitors (TKIs). However, resistance commonly arises due to single nucleotide variants (SNVs) within the ALK tyrosine kinase domain, many of which remain variants of uncertain significance (VUSs).</p><p><strong>Results: </strong>To systematically profile resistance, we use prime editing to generate and assess 3,208 ALK variants covering 99% of all possible SNVs across exons 20-28, along with intronic variants. We evaluate drug resistance across three generations of ALK TKIs: alectinib, lorlatinib, and zotizalkib. These high-resolution resistance landscapes validate known resistance mutations (e.g., G1202R, L1196M), identify previously uncharacterized resistance-associated VUSs, and reveal distinct patterns of drug-specific and shared resistance across inhibitors. Structural mapping further contextualizes resistance-associated variants relative to the ATP-binding pocket and distal regions associated with resistance.</p><p><strong>Conclusions: </strong>This study provides a comprehensive functional atlas of ALK tyrosine kinase domain variants under TKI selection, offering a valuable experimental framework for interpreting resistance-associated variants. Although derived from in vitro models and therefore context dependent, this resource complements existing clinical and genomic knowledge and may aid in the functional interpretation of ALK variants observed in ALK-driven cancers.</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":" ","pages":""},"PeriodicalIF":12.3,"publicationDate":"2026-02-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12958575/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146107851","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-02-02DOI: 10.1186/s13059-026-03976-5
Stijn Hawinkel, Xilan Yang, Ward Poelmans, Hans Motte, Tom Beeckman, Steven Maere
Spatial omics technologies localize individual molecules at subcellular resolution, yet growing numbers of molecules, features and replicates set analysis challenges. We present smoppix, a nonparametric analysis method based on the probabilistic index, to test for several uni- and bivariate localization patterns. It exploits the high-dimensionality of the data for variance weighting and for providing a background null distribution, unique for every molecule. Moreover, smoppix sidesteps segmentation, edge correction, warping and density estimation, and is scalable thanks to an exact permutation null distribution. We unearth spatial patterns in datasets from four kingdoms, and validate some findings experimentally on spikemoss roots.
{"title":"smoppix: unified nonparametric analysis of single-molecule spatial omics data using probabilistic indices.","authors":"Stijn Hawinkel, Xilan Yang, Ward Poelmans, Hans Motte, Tom Beeckman, Steven Maere","doi":"10.1186/s13059-026-03976-5","DOIUrl":"10.1186/s13059-026-03976-5","url":null,"abstract":"<p><p>Spatial omics technologies localize individual molecules at subcellular resolution, yet growing numbers of molecules, features and replicates set analysis challenges. We present smoppix, a nonparametric analysis method based on the probabilistic index, to test for several uni- and bivariate localization patterns. It exploits the high-dimensionality of the data for variance weighting and for providing a background null distribution, unique for every molecule. Moreover, smoppix sidesteps segmentation, edge correction, warping and density estimation, and is scalable thanks to an exact permutation null distribution. We unearth spatial patterns in datasets from four kingdoms, and validate some findings experimentally on spikemoss roots.</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":" ","pages":""},"PeriodicalIF":12.3,"publicationDate":"2026-02-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12951914/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146108072","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-02-02DOI: 10.1186/s13059-026-03961-y
Mateusz Baca, Barbara Bujalska, Danijela Popović, Michał Golubiński, Paulo C Alves, Edouard Bard, Claudio Berto, Gloria Cuenca-Bescós, Love Dalén, Helen Fewlass, Tatyana Fadeeva, Jeremy Herman, Ivan Horáček, Magdalena Krajcarz, Matthew Law, Anna Lemanik, Juan Manuel López-García, Elisa Luzi, Xabier Murelaga, Ahmad Mahmoudi, Marco Peresani, Simon Parfitt, Joana Pauperio, Svetlana V Pavlova, Piroska Pazonyi, Iván Rey Rodríguez, Jeremy B Searle, Joanna Stojak, Tatyana Strukova, Jan M Wójcik, Adam Nadachowski
Background: The field vole, an abundant and widespread microtine rodent, is a complex comprised of three cryptic species: the short-tailed field vole (Microtus agrestis) which is present over much of Eurasia, the Mediterranean field vole (Microtus lavernedii) in southern Europe, and the Portuguese field vole (Microtus rozianus) in western Spain and Portugal. Previous research has shown high genomic differentiation of these three lineages. However, the details of the process underlying their divergence remain unknown.
Results: We analyse 70 mitogenomes and 16 nuclear genomes of modern specimens, and 83 mitogenomes and 12 nuclear genomes of ancient specimens spanning the last 75 thousand years (ka). We estimate the divergence of Portuguese from short-tailed and Mediterranean field voles to be ca. 220 ka ago and of the latter two species to be ca. 110 ka ago, earlier than previous estimates involving only modern sequences. The divergence times we obtain match those between major mitochondrial lineages of cold-adapted and steppe rodents in Europe. We find signatures of gene flow within and between field vole lineages, with some analyses suggesting a hybrid origin of the Mediterranean lineage. Ancient specimens from the Italian Peninsula reveal a previously unrecognised lineage that show evidence of genetic exchange with other populations.
Conclusions: The pattern of genetic variation in the field vole species complex demonstrates the impact of stadial-interstadial cycles in generating recurrent episodes of allopatry and connectivity of populations, a situation which could only be revealed by our dense genomic sampling over time.
{"title":"The evolutionary history of the field vole species complex revealed by modern and ancient genomes.","authors":"Mateusz Baca, Barbara Bujalska, Danijela Popović, Michał Golubiński, Paulo C Alves, Edouard Bard, Claudio Berto, Gloria Cuenca-Bescós, Love Dalén, Helen Fewlass, Tatyana Fadeeva, Jeremy Herman, Ivan Horáček, Magdalena Krajcarz, Matthew Law, Anna Lemanik, Juan Manuel López-García, Elisa Luzi, Xabier Murelaga, Ahmad Mahmoudi, Marco Peresani, Simon Parfitt, Joana Pauperio, Svetlana V Pavlova, Piroska Pazonyi, Iván Rey Rodríguez, Jeremy B Searle, Joanna Stojak, Tatyana Strukova, Jan M Wójcik, Adam Nadachowski","doi":"10.1186/s13059-026-03961-y","DOIUrl":"10.1186/s13059-026-03961-y","url":null,"abstract":"<p><strong>Background: </strong>The field vole, an abundant and widespread microtine rodent, is a complex comprised of three cryptic species: the short-tailed field vole (Microtus agrestis) which is present over much of Eurasia, the Mediterranean field vole (Microtus lavernedii) in southern Europe, and the Portuguese field vole (Microtus rozianus) in western Spain and Portugal. Previous research has shown high genomic differentiation of these three lineages. However, the details of the process underlying their divergence remain unknown.</p><p><strong>Results: </strong>We analyse 70 mitogenomes and 16 nuclear genomes of modern specimens, and 83 mitogenomes and 12 nuclear genomes of ancient specimens spanning the last 75 thousand years (ka). We estimate the divergence of Portuguese from short-tailed and Mediterranean field voles to be ca. 220 ka ago and of the latter two species to be ca. 110 ka ago, earlier than previous estimates involving only modern sequences. The divergence times we obtain match those between major mitochondrial lineages of cold-adapted and steppe rodents in Europe. We find signatures of gene flow within and between field vole lineages, with some analyses suggesting a hybrid origin of the Mediterranean lineage. Ancient specimens from the Italian Peninsula reveal a previously unrecognised lineage that show evidence of genetic exchange with other populations.</p><p><strong>Conclusions: </strong>The pattern of genetic variation in the field vole species complex demonstrates the impact of stadial-interstadial cycles in generating recurrent episodes of allopatry and connectivity of populations, a situation which could only be revealed by our dense genomic sampling over time.</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":" ","pages":""},"PeriodicalIF":12.3,"publicationDate":"2026-02-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12955023/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146108074","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-02-02DOI: 10.1186/s13059-026-03974-7
Nikoletta Y Papaioannou, Petros Patsali, Julia Klermund, Panayiota L Papasavva, Geoffroy Andrieux, Lola Koniali, Basma Naiisseh, Soteroula Christou, Maria Sitarou, Marina Kleanthous, Toni Cathomen, Carsten W Lederer
Background: Beta-thalassemia is among the most common monogenic disorders, posing a major global health challenge. Editing of genetic modifiers, such as BCL11A erythroid enhancer and HBG promoters, enhances fetal hemoglobin expression and confers major therapeutic potential. Double-strand-break (DSB)-independent genome editing tools, such as base editors (BE), are potentially safer and better suited for multiplexed application than DSB-dependent CRISPR/Cas technology. However, harmful on- and off-target events remain a concern and must be excluded before clinical application, including chromosomal rearrangements invisible to standard detection technologies.
Results: Using primary patient-derived CD34+ cells from three donors, we investigate simplex and duplex BE-based disruption of the BCL11A erythroid enhancer and the BCL11A binding site (-115 bp) on the HBG promoter for DNA-level and functional studies at the RNA, protein, and morphological level. Analyses include direct comparison to DSB-based editing, the current clinically applied standard, and CAST-seq to assess recombination events, allowing wider inferences on relative safety. RNA-seq analyses for clones of primary CD34+ cells across all treatments confirm peak HBG induction for duplex BE and comparable effects on apoptotic and immune response signatures. Overall, duplex BE produces robust γ-globin and fetal hemoglobin induction, improves functional correction over simplex editing and results in low incidence of genomic alterations in both target loci.
Conclusions: Duplex BE targeting both BCL11A erythroid enhancer and HBG promoter enables functional correction and genome integrity. Our study highlights the efficacy, safety, and therapeutic potential of the present duplex BE approach.
{"title":"Functional correction and genome integrity with duplex base editing of β-thalassemic hematopoietic stem cells.","authors":"Nikoletta Y Papaioannou, Petros Patsali, Julia Klermund, Panayiota L Papasavva, Geoffroy Andrieux, Lola Koniali, Basma Naiisseh, Soteroula Christou, Maria Sitarou, Marina Kleanthous, Toni Cathomen, Carsten W Lederer","doi":"10.1186/s13059-026-03974-7","DOIUrl":"10.1186/s13059-026-03974-7","url":null,"abstract":"<p><strong>Background: </strong>Beta-thalassemia is among the most common monogenic disorders, posing a major global health challenge. Editing of genetic modifiers, such as BCL11A erythroid enhancer and HBG promoters, enhances fetal hemoglobin expression and confers major therapeutic potential. Double-strand-break (DSB)-independent genome editing tools, such as base editors (BE), are potentially safer and better suited for multiplexed application than DSB-dependent CRISPR/Cas technology. However, harmful on- and off-target events remain a concern and must be excluded before clinical application, including chromosomal rearrangements invisible to standard detection technologies.</p><p><strong>Results: </strong>Using primary patient-derived CD34<sup>+</sup> cells from three donors, we investigate simplex and duplex BE-based disruption of the BCL11A erythroid enhancer and the BCL11A binding site (-115 bp) on the HBG promoter for DNA-level and functional studies at the RNA, protein, and morphological level. Analyses include direct comparison to DSB-based editing, the current clinically applied standard, and CAST-seq to assess recombination events, allowing wider inferences on relative safety. RNA-seq analyses for clones of primary CD34<sup>+</sup> cells across all treatments confirm peak HBG induction for duplex BE and comparable effects on apoptotic and immune response signatures. Overall, duplex BE produces robust γ-globin and fetal hemoglobin induction, improves functional correction over simplex editing and results in low incidence of genomic alterations in both target loci.</p><p><strong>Conclusions: </strong>Duplex BE targeting both BCL11A erythroid enhancer and HBG promoter enables functional correction and genome integrity. Our study highlights the efficacy, safety, and therapeutic potential of the present duplex BE approach.</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":" ","pages":""},"PeriodicalIF":12.3,"publicationDate":"2026-02-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12952134/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146108064","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-31DOI: 10.1186/s13059-026-03963-w
Wenbin Ye, Xin Cheng, Xingyu Bi, Xiaohui Wu
Alternative polyadenylation (APA) is a pervasive RNA-processing mechanism in eukaryotes that significantly promotes transcriptome and proteome diversity. Here we proposed PolyAseqTrap, an R package for probing polyA sites from diverse 3' sequencing data. PolyAseqTrap implements a polyA read prioritization strategy to determine precise positions of polyA sites. Particularly, it incorporates a transferrable cross-species deep learning model for mitigating the long-pending internal priming problem. Moreover, PolyAseqTrap employs a weighted density peak clustering method to reducing microheterogeneity impact in different species. We evaluated PolyAseqTrap using data from 16 different 3' sequencing techniques across multiple species, demonstrating the effectiveness and robustness of PolyAseqTrap.
{"title":"PolyAseqTrap: a universal tool for genome-wide identification and quantification of polyadenylation sites from different 3' end sequencing data.","authors":"Wenbin Ye, Xin Cheng, Xingyu Bi, Xiaohui Wu","doi":"10.1186/s13059-026-03963-w","DOIUrl":"10.1186/s13059-026-03963-w","url":null,"abstract":"<p><p>Alternative polyadenylation (APA) is a pervasive RNA-processing mechanism in eukaryotes that significantly promotes transcriptome and proteome diversity. Here we proposed PolyAseqTrap, an R package for probing polyA sites from diverse 3' sequencing data. PolyAseqTrap implements a polyA read prioritization strategy to determine precise positions of polyA sites. Particularly, it incorporates a transferrable cross-species deep learning model for mitigating the long-pending internal priming problem. Moreover, PolyAseqTrap employs a weighted density peak clustering method to reducing microheterogeneity impact in different species. We evaluated PolyAseqTrap using data from 16 different 3' sequencing techniques across multiple species, demonstrating the effectiveness and robustness of PolyAseqTrap.</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":" ","pages":""},"PeriodicalIF":12.3,"publicationDate":"2026-01-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12947541/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146097522","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-31DOI: 10.1186/s13059-026-03967-6
Majid Pahlevan Kakhki, Fatemeh Rangani, Ewoud Ewing, Chiara Starvaggi Cucuzza, Galina Zheleznyakova, Maria Kalomoiri, Lea Kenny, Anika Raghavan, Chandana Rao Prakash, Gabe van den Hoeven, Tejaswi Venkata S Badam, Ruxandra Covacu, Ioanna Andreou, Maria Needhamsen, Lara Kular, Maja Jagodic
Background: CRISPR/dCas9-based epigenome editing systems, including DNA methylation epimodifiers, have greatly advanced molecular functional studies, revolutionizing their precision and applicability. Despite their promise, challenges such as the magnitude and stability of the on-target editing and unwanted off-target effects underscore the need for improved tool characterization and design.
Results: We systematically compare specific targeting and genome-wide off-target effects of available and novel dCas9-based DNA methylation editing tools over time. We demonstrate that multimerization of the catalytic domain of DNA methyltransferase 3A enhances editing potency but also induces widespread, early methylation deposition at low-to-medium methylated promoter-related regions with specific gRNAs and also with non-targeting gRNAs. A small fraction of the methylation changes associated with transcriptional dysregulation and mapped predominantly to bivalent chromatin associating both with transcriptional repression and activation. Additionally, specific non-targeting control gRNAs cause pervasive and long-lasting methylation-independent transcriptional alterations particularly in genes linked to RNA and energy metabolism. CRISPRoff emerges as the most efficient tool for stable promoter targeting, with fewer and less stable off-target effects compared to other epimodifiers but with persistent transcriptome alterations.
Conclusions: Our findings highlight the delicate balance between potency and specificity of epigenome editing and provide critical insights into the design and application of future tools to improve their precision and minimize unintended consequences.
{"title":"Comprehensive profiling of CRISPR/dCas9 epigenome editors indicates a complex link between on and off target effects.","authors":"Majid Pahlevan Kakhki, Fatemeh Rangani, Ewoud Ewing, Chiara Starvaggi Cucuzza, Galina Zheleznyakova, Maria Kalomoiri, Lea Kenny, Anika Raghavan, Chandana Rao Prakash, Gabe van den Hoeven, Tejaswi Venkata S Badam, Ruxandra Covacu, Ioanna Andreou, Maria Needhamsen, Lara Kular, Maja Jagodic","doi":"10.1186/s13059-026-03967-6","DOIUrl":"10.1186/s13059-026-03967-6","url":null,"abstract":"<p><strong>Background: </strong>CRISPR/dCas9-based epigenome editing systems, including DNA methylation epimodifiers, have greatly advanced molecular functional studies, revolutionizing their precision and applicability. Despite their promise, challenges such as the magnitude and stability of the on-target editing and unwanted off-target effects underscore the need for improved tool characterization and design.</p><p><strong>Results: </strong>We systematically compare specific targeting and genome-wide off-target effects of available and novel dCas9-based DNA methylation editing tools over time. We demonstrate that multimerization of the catalytic domain of DNA methyltransferase 3A enhances editing potency but also induces widespread, early methylation deposition at low-to-medium methylated promoter-related regions with specific gRNAs and also with non-targeting gRNAs. A small fraction of the methylation changes associated with transcriptional dysregulation and mapped predominantly to bivalent chromatin associating both with transcriptional repression and activation. Additionally, specific non-targeting control gRNAs cause pervasive and long-lasting methylation-independent transcriptional alterations particularly in genes linked to RNA and energy metabolism. CRISPRoff emerges as the most efficient tool for stable promoter targeting, with fewer and less stable off-target effects compared to other epimodifiers but with persistent transcriptome alterations.</p><p><strong>Conclusions: </strong>Our findings highlight the delicate balance between potency and specificity of epigenome editing and provide critical insights into the design and application of future tools to improve their precision and minimize unintended consequences.</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":" ","pages":""},"PeriodicalIF":12.3,"publicationDate":"2026-01-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12924462/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146097512","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-31DOI: 10.1186/s13059-026-03946-x
Michelle Almeida da Paz, Umut Yildiz, Minyoung Kim, Víctor Campos-Fornés, Marina Pinkasz, Thomas Dahlet, Kyung-Min Noh, Leila Taher
Background: Long dismissed as mere genomic parasites, transposable elements (TEs) are now recognized as major drivers of genome evolution. TEs serve as a source of cell-type specific cis-regulatory elements, influencing gene expression and observable phenotypes. However, the precise TE regulatory roles in different contexts remain largely unexplored and the impact of TEs on transcriptional regulatory networks and contribution to disease risk is likely deeply underestimated.
Results: Using a multimapper-aware strategy, we systematically characterize the epigenetic profile of TEs in human cell systems modeling neural development. This analysis reveals that MER57E3, a primate-specific TE subfamily, exhibits strong enrichment for active, and absence of repressive, histone modifications across six cultured human neural cell types. MER57E3 copies are predominantly located near zinc finger genes and enriched for homeodomain motifs recognized by brain-specific transcription factors, including GBX1 and BSX. Upon CRISPR interference (CRISPRi) targeting specific MER57E3 copies, RNA-seq analysis demonstrates downregulation of the key neurogenesis-related genes PAX6 and NEUROG2.
Conclusions: Our data indicate that members of the MER57E3 TE subfamily regulate the expression of critical neurogenesis genes during neural progenitor cell (NPC) development. Moreover, this study emphasizes the importance of investigating TEs, offering new insights into how their epigenetic dysregulation may contribute to pathogenesis of neurodevelopmental disorders.
{"title":"MER57E3 transposable elements regulate gene expression in a human cell model of neural development.","authors":"Michelle Almeida da Paz, Umut Yildiz, Minyoung Kim, Víctor Campos-Fornés, Marina Pinkasz, Thomas Dahlet, Kyung-Min Noh, Leila Taher","doi":"10.1186/s13059-026-03946-x","DOIUrl":"10.1186/s13059-026-03946-x","url":null,"abstract":"<p><strong>Background: </strong>Long dismissed as mere genomic parasites, transposable elements (TEs) are now recognized as major drivers of genome evolution. TEs serve as a source of cell-type specific cis-regulatory elements, influencing gene expression and observable phenotypes. However, the precise TE regulatory roles in different contexts remain largely unexplored and the impact of TEs on transcriptional regulatory networks and contribution to disease risk is likely deeply underestimated.</p><p><strong>Results: </strong>Using a multimapper-aware strategy, we systematically characterize the epigenetic profile of TEs in human cell systems modeling neural development. This analysis reveals that MER57E3, a primate-specific TE subfamily, exhibits strong enrichment for active, and absence of repressive, histone modifications across six cultured human neural cell types. MER57E3 copies are predominantly located near zinc finger genes and enriched for homeodomain motifs recognized by brain-specific transcription factors, including GBX1 and BSX. Upon CRISPR interference (CRISPRi) targeting specific MER57E3 copies, RNA-seq analysis demonstrates downregulation of the key neurogenesis-related genes PAX6 and NEUROG2.</p><p><strong>Conclusions: </strong>Our data indicate that members of the MER57E3 TE subfamily regulate the expression of critical neurogenesis genes during neural progenitor cell (NPC) development. Moreover, this study emphasizes the importance of investigating TEs, offering new insights into how their epigenetic dysregulation may contribute to pathogenesis of neurodevelopmental disorders.</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":" ","pages":""},"PeriodicalIF":12.3,"publicationDate":"2026-01-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12947407/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146097432","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Background: Trimethylation of histone H3 lysine 27 (H3K27me3) confers a repressive chromatin state and is dynamically deposited and removed to regulate gene expression throughout plant growth and development. Nevertheless, the three-dimensional epigenome architecture linked to H3K27me3 and its regulatory role in controlling rice development remain unclear.
Results: Here, we employ long-read ChIA-PET to map the genome topology associated with H3K27me3 in rice, identifying hundreds of repressive chromatin spatial clusters within chromosomes. The T-DNA insertional mutagenesis of EMBRYONIC FLOWER 2b (OsEMF2b) leads to disruption of H3K27me3-associated chromatin interaction networks. Notably, we discover that the three key flowering time loci, Early heading date 1 (Ehd1) and Heading date 3a (Hd3a)/Rice Flowering Locus T1 (RFT1) form an inter-chromosomal repressive chromatin spatial cluster as a cis-interaction hub. Further investigation reveals that Ghd7, Ghd8, and Hd1 undergo liquid-liquid phase separation and bind to Ehd1-Hd3a/RFT1 repressive chromatin spatial cluster as trans flowering regulators.
Conclusion: Our findings elucidate a multi-dimensional topological framework of H3K27me3-associated loci in rice and identify a spatial flowering gene cluster regulated by the phase-separated repressor complex of Ghd7, Ghd8, and Hd1. These results expand our knowledge of the structural basis of the three-dimensional genome governing the transcriptional coordination of florigens in rice.
{"title":"3D epigenome architecture orchestrates cis and trans regulation of flowering time in rice.","authors":"Yue Li, Zhanyi Zhang, Leilei Qiu, Ruiqin Zheng, Hongmeijuan Li, Zhilin Cao, Minrong Guo, Shuangqi Wang, Liang Xie, Pengpeng Guan, Xingyu Huang, Cuicui Shen, Hong Wu, Wenjun Li, Weizhi Ouyang, Ying Zhang, Guoting Chen, Mohamed Foda, Jiapei Yan, Changyin Wu, Xinhao Ouyang, Guoliang Li, Yongzhong Xing, Xingwang Li","doi":"10.1186/s13059-026-03940-3","DOIUrl":"10.1186/s13059-026-03940-3","url":null,"abstract":"<p><strong>Background: </strong>Trimethylation of histone H3 lysine 27 (H3K27me3) confers a repressive chromatin state and is dynamically deposited and removed to regulate gene expression throughout plant growth and development. Nevertheless, the three-dimensional epigenome architecture linked to H3K27me3 and its regulatory role in controlling rice development remain unclear.</p><p><strong>Results: </strong>Here, we employ long-read ChIA-PET to map the genome topology associated with H3K27me3 in rice, identifying hundreds of repressive chromatin spatial clusters within chromosomes. The T-DNA insertional mutagenesis of EMBRYONIC FLOWER 2b (OsEMF2b) leads to disruption of H3K27me3-associated chromatin interaction networks. Notably, we discover that the three key flowering time loci, Early heading date 1 (Ehd1) and Heading date 3a (Hd3a)/Rice Flowering Locus T1 (RFT1) form an inter-chromosomal repressive chromatin spatial cluster as a cis-interaction hub. Further investigation reveals that Ghd7, Ghd8, and Hd1 undergo liquid-liquid phase separation and bind to Ehd1-Hd3a/RFT1 repressive chromatin spatial cluster as trans flowering regulators.</p><p><strong>Conclusion: </strong>Our findings elucidate a multi-dimensional topological framework of H3K27me3-associated loci in rice and identify a spatial flowering gene cluster regulated by the phase-separated repressor complex of Ghd7, Ghd8, and Hd1. These results expand our knowledge of the structural basis of the three-dimensional genome governing the transcriptional coordination of florigens in rice.</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":"27 1","pages":""},"PeriodicalIF":12.3,"publicationDate":"2026-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12930761/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147277529","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}