Pub Date : 2026-01-09DOI: 10.1186/s13059-025-03929-4
Tongxuan Lv, Quan Han, Yilin Li, Chen Liang, Zhonghao Ruan, Haoyu Chao, Ming Chen, Dijun Chen
Background: The regulation of gene expression in plants is governed by complex interactions between cis-regulatory elements and epigenetic modifications such as histone marks. While deep learning models have achieved success in predicting regulatory features from DNA sequence, their cross-species generalizability in plants remains largely unexplored.
Results: We systematically evaluate the ability of deep learning models to predict histone modifications across plant species using a multi-stage framework based on the Sei architecture. We train species-specific models for Arabidopsis (Arabidopsis thaliana), rice (Oryza sativa), and maize (Zea mays), achieving high within-species predictive performance and strong agreement between predictions and experimental ChIP-seq profiles. However, cross-species predictions show reduced performance with increasing phylogenetic distance, highlighting limited model transferability between monocots and dicots. To improve generalization, we construct a Poaceae family-level model by jointly training on rice and maize, and an Arabidopsis-trained model based solely on Arabidopsis. These models demonstrate robust predictive power in completely unprofiled species that are not used in training set, highlighting the model's adaptability to novel plant genomes based solely on conserved regulatory syntax. In contrast, cross-family models produce less consistent results, with reliable performance only in species sharing conserved regulatory features. We also develop an easy-to-use pipeline that predicts genome-wide chromatin signals directly from DNA sequences.
Conclusions: Our findings demonstrate that phylogenetically informed model training significantly improves cross-species epigenomic prediction, offering a scalable computational strategy for functional annotation in non-model and agriculturally important plants.
{"title":"Cross-species prediction of histone modifications in plants via deep learning.","authors":"Tongxuan Lv, Quan Han, Yilin Li, Chen Liang, Zhonghao Ruan, Haoyu Chao, Ming Chen, Dijun Chen","doi":"10.1186/s13059-025-03929-4","DOIUrl":"10.1186/s13059-025-03929-4","url":null,"abstract":"<p><strong>Background: </strong>The regulation of gene expression in plants is governed by complex interactions between cis-regulatory elements and epigenetic modifications such as histone marks. While deep learning models have achieved success in predicting regulatory features from DNA sequence, their cross-species generalizability in plants remains largely unexplored.</p><p><strong>Results: </strong>We systematically evaluate the ability of deep learning models to predict histone modifications across plant species using a multi-stage framework based on the Sei architecture. We train species-specific models for Arabidopsis (Arabidopsis thaliana), rice (Oryza sativa), and maize (Zea mays), achieving high within-species predictive performance and strong agreement between predictions and experimental ChIP-seq profiles. However, cross-species predictions show reduced performance with increasing phylogenetic distance, highlighting limited model transferability between monocots and dicots. To improve generalization, we construct a Poaceae family-level model by jointly training on rice and maize, and an Arabidopsis-trained model based solely on Arabidopsis. These models demonstrate robust predictive power in completely unprofiled species that are not used in training set, highlighting the model's adaptability to novel plant genomes based solely on conserved regulatory syntax. In contrast, cross-family models produce less consistent results, with reliable performance only in species sharing conserved regulatory features. We also develop an easy-to-use pipeline that predicts genome-wide chromatin signals directly from DNA sequences.</p><p><strong>Conclusions: </strong>Our findings demonstrate that phylogenetically informed model training significantly improves cross-species epigenomic prediction, offering a scalable computational strategy for functional annotation in non-model and agriculturally important plants.</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":" ","pages":"20"},"PeriodicalIF":12.3,"publicationDate":"2026-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12879380/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145946704","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Background: Root rot disease caused by fungal pathogens of wine grapevines poses a serious threat to their growth and results in a substantial economic impact on grape industry. The rhizosphere microbiome recruited to plants is critical for mitigating soil-borne pathogens. However, how beneficial microbes influence disease resistance remains unclear.
Results: We investigate the composition and gene functions of microorganisms in wine grapevines with root rot disease and healthy controls by amplicon and metagenomic sequencing. We use culturomics and in vivo experiments to verify the pathogen and beneficial strains to improve plant health. We find that root rot disease in grapevines significantly affects rhizosphere microbiome diversity and composition. The microbial interkingdom network indicates that the disease destabilizes the bacteria-fungi co-occurrence network. We find that plants recruit the potentially beneficial bacteria Pseudomonas, Bacillus and Streptomyces in healthy rhizosphere soil. By culturomics, we confirm that Fusarium solani is the main pathogen causing root rot disease. We further observe that these three key beneficial bacteria from the co-occurrence networks enhance the resistance of grapevines to pathogens. Furthermore, metagenomic analysis reveals that beneficial bacterial strains suppress pathogens by enriching potential functional genes in pathways involved in disease resistance.
Conclusions: Our findings highlight the critical role of disease resistance pathways of potentially beneficial microorganisms in fighting disease and supporting plant health, offering new insight for the exploration of beneficial microbial resources and providing a basis for the development of biological control of grape root rot disease.
{"title":"Core microbiota recruited by healthy grapevines enhance resistance against root rot disease.","authors":"Ruotong Wang, Wenyu Zhang, Zhishan He, Yao Zhou, Cheng Chen, Kaibo Song, Qingwu Shang, Yunfeng Wu, Peiwen Gu, Duntao Shu, Lei Zhao","doi":"10.1186/s13059-025-03905-y","DOIUrl":"10.1186/s13059-025-03905-y","url":null,"abstract":"<p><strong>Background: </strong>Root rot disease caused by fungal pathogens of wine grapevines poses a serious threat to their growth and results in a substantial economic impact on grape industry. The rhizosphere microbiome recruited to plants is critical for mitigating soil-borne pathogens. However, how beneficial microbes influence disease resistance remains unclear.</p><p><strong>Results: </strong>We investigate the composition and gene functions of microorganisms in wine grapevines with root rot disease and healthy controls by amplicon and metagenomic sequencing. We use culturomics and in vivo experiments to verify the pathogen and beneficial strains to improve plant health. We find that root rot disease in grapevines significantly affects rhizosphere microbiome diversity and composition. The microbial interkingdom network indicates that the disease destabilizes the bacteria-fungi co-occurrence network. We find that plants recruit the potentially beneficial bacteria Pseudomonas, Bacillus and Streptomyces in healthy rhizosphere soil. By culturomics, we confirm that Fusarium solani is the main pathogen causing root rot disease. We further observe that these three key beneficial bacteria from the co-occurrence networks enhance the resistance of grapevines to pathogens. Furthermore, metagenomic analysis reveals that beneficial bacterial strains suppress pathogens by enriching potential functional genes in pathways involved in disease resistance.</p><p><strong>Conclusions: </strong>Our findings highlight the critical role of disease resistance pathways of potentially beneficial microorganisms in fighting disease and supporting plant health, offering new insight for the exploration of beneficial microbial resources and providing a basis for the development of biological control of grape root rot disease.</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":" ","pages":"13"},"PeriodicalIF":12.3,"publicationDate":"2026-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12857107/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145907182","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-05DOI: 10.1186/s13059-025-03913-y
Tessa R MacNish, Thomas Bergmann, David Edwards
There is an urgent need to increase sustainable crop production. The application of molecular marker technologies such as genomic selection and machine learning based approaches are aiding accelerated crop improvement. Conventional molecular marker technologies use single nucleotide polymorphisms to predict traits, however these do not capture local epistasis and can be challenging for machine learning applications. With the growth of genome sequence data, it is possible to define haplotypes that can account for local epistatic effects and are more suitable for machine learning models. This review discusses the different methods for defining haplotype blocks and their application in plant breeding.
{"title":"Haplotype applications in genomic selection.","authors":"Tessa R MacNish, Thomas Bergmann, David Edwards","doi":"10.1186/s13059-025-03913-y","DOIUrl":"10.1186/s13059-025-03913-y","url":null,"abstract":"<p><p>There is an urgent need to increase sustainable crop production. The application of molecular marker technologies such as genomic selection and machine learning based approaches are aiding accelerated crop improvement. Conventional molecular marker technologies use single nucleotide polymorphisms to predict traits, however these do not capture local epistasis and can be challenging for machine learning applications. With the growth of genome sequence data, it is possible to define haplotypes that can account for local epistatic effects and are more suitable for machine learning models. This review discusses the different methods for defining haplotype blocks and their application in plant breeding.</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":" ","pages":"18"},"PeriodicalIF":12.3,"publicationDate":"2026-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12870298/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145906700","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-05DOI: 10.1186/s13059-025-03924-9
Yicheng Huang, Enlai Guan, Shipeng Song, Dal-Hoe Koo, Monica A Schmidt, Handong Su, Chunli Chen, Jianwei Zhang
Background: Centromere function is fundamental and conserved across eukaryotes, despite highly divergent DNA sequences, even among closely related species. These regions often contain rapidly evolving repeats and retrotransposons, yet play a crucial role in chromosome segregation. Soybean, which harbors two distinct types of centromeric satellite repeats, is an ideal model for studying centromeric repeat organization and function.
Results: Here we generate the complete map of centromeric satellite repeats revealing the organizational patterns of different types of centromeric satellite repeats within centromeres. These maps are constructed using three recently available telomere-to-telomere soybean genomes. We find that certain centromeric satellite repeats exhibit chromosome-specific evolutionary trajectories and may serve distinct functional roles in centromere activity. We further analyze the potential relationship between centromere-specific histones H3 (CENH3) and centromeric satellite repeats, identifying consensus motifs associated with CENH3-binding sites. We also analyze the higher-order tandem repeats of the centromere and propose a hypothetical model of centromeric DNA replication.
Conclusions: We conclude that CentGm-1 and CentGm-4 evolve independently. The observation that completely identical CentGm-4 sequences consistently appear on the same chromosome across different soybean varieties indicates a stronger chromosome-specific preference for CentGm-4. We propose a model in which replication templates within the centromere region originate from multiple CENH3-nucleosome complexes bound to CentGm sequences. Both CentGm-1 and CentGm-4 contain similar motifs with the potential to bind CENH3 protein. The findings provide a new insight into the mechanisms behind centromere diversity and dynamics.
{"title":"Genetic diversity and architectural dynamics of soybean centromeres.","authors":"Yicheng Huang, Enlai Guan, Shipeng Song, Dal-Hoe Koo, Monica A Schmidt, Handong Su, Chunli Chen, Jianwei Zhang","doi":"10.1186/s13059-025-03924-9","DOIUrl":"10.1186/s13059-025-03924-9","url":null,"abstract":"<p><strong>Background: </strong>Centromere function is fundamental and conserved across eukaryotes, despite highly divergent DNA sequences, even among closely related species. These regions often contain rapidly evolving repeats and retrotransposons, yet play a crucial role in chromosome segregation. Soybean, which harbors two distinct types of centromeric satellite repeats, is an ideal model for studying centromeric repeat organization and function.</p><p><strong>Results: </strong>Here we generate the complete map of centromeric satellite repeats revealing the organizational patterns of different types of centromeric satellite repeats within centromeres. These maps are constructed using three recently available telomere-to-telomere soybean genomes. We find that certain centromeric satellite repeats exhibit chromosome-specific evolutionary trajectories and may serve distinct functional roles in centromere activity. We further analyze the potential relationship between centromere-specific histones H3 (CENH3) and centromeric satellite repeats, identifying consensus motifs associated with CENH3-binding sites. We also analyze the higher-order tandem repeats of the centromere and propose a hypothetical model of centromeric DNA replication.</p><p><strong>Conclusions: </strong>We conclude that CentGm-1 and CentGm-4 evolve independently. The observation that completely identical CentGm-4 sequences consistently appear on the same chromosome across different soybean varieties indicates a stronger chromosome-specific preference for CentGm-4. We propose a model in which replication templates within the centromere region originate from multiple CENH3-nucleosome complexes bound to CentGm sequences. Both CentGm-1 and CentGm-4 contain similar motifs with the potential to bind CENH3 protein. The findings provide a new insight into the mechanisms behind centromere diversity and dynamics.</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":" ","pages":"17"},"PeriodicalIF":12.3,"publicationDate":"2026-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12870325/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145907212","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Understanding human blood metabolites is essential for deciphering systemic physiology and disease mechanisms, yet remains challenging due to diverse origins and dynamic regulation. In this study, we develop HUBMet ( https://hubmet.app.bio-it.tech/home ), an open-access web server that includes 3,950 metabolites and 129,814 metabolite-protein associations, with four analytical modules: Over-Representation Analysis (ORA) for enrichment analysis; Metabolite Set Enrichment Analysis (MSEA) for quantitative data analysis; Tissue Specificity Analysis (TSA) for assessing metabolite-tissue relevance; Metabolite-Protein Network Analysis (MPNet) for identifying key metabolite-protein associations and functional modules. HUBMet's utility is demonstrated through a COVID-19 case study revealing metabolic signatures associated with disease severity.
{"title":"HUBMet: an integrative database and analytical platform for human blood metabolites and metabolite-protein associations.","authors":"Xingyue Wang, Xiangyu Qiao, Alberto Zenere, Swapnali Barde, Jing Wang, Wen Zhong","doi":"10.1186/s13059-025-03922-x","DOIUrl":"10.1186/s13059-025-03922-x","url":null,"abstract":"<p><p>Understanding human blood metabolites is essential for deciphering systemic physiology and disease mechanisms, yet remains challenging due to diverse origins and dynamic regulation. In this study, we develop HUBMet ( https://hubmet.app.bio-it.tech/home ), an open-access web server that includes 3,950 metabolites and 129,814 metabolite-protein associations, with four analytical modules: Over-Representation Analysis (ORA) for enrichment analysis; Metabolite Set Enrichment Analysis (MSEA) for quantitative data analysis; Tissue Specificity Analysis (TSA) for assessing metabolite-tissue relevance; Metabolite-Protein Network Analysis (MPNet) for identifying key metabolite-protein associations and functional modules. HUBMet's utility is demonstrated through a COVID-19 case study revealing metabolic signatures associated with disease severity.</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":" ","pages":"7"},"PeriodicalIF":12.3,"publicationDate":"2025-12-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145846997","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-27DOI: 10.1186/s13059-025-03920-z
Sutton Tennant, Erik J Amezquita, Yaohua Li, Benjamin Smith, Sai Subhash Mahamkali Venkata Subramanyam, Sergio Alan Cervantes-Pérez, Sandra Thibivilliers, Samik Bhattacharya, Jasper Klaver, Marc Libault
Background: Our understanding of gene function is often driven by its expression and, a fortiori, based on its RNA abundance in a cell, a tissue, or an organ. This assumption often neglects the limited correlation between RNA and protein abundance, largely due to post-transcriptional and pre-translational regulation. Among these regulatory processes, the spatial distribution of RNA molecules within cells has been reported as a major contributor of cellular function in microbial and animal systems. However, our understanding of the differential cellular distribution of transcripts in plants is very limited.
Results: In this manuscript, using Molecular Cartography™ and Xenium, two high-resolution and sensitive spatial transcriptomic technologies, we comprehensively analyze the differential mapping of millions of plant transcripts in the nuclear and cytoplasmic compartments of various soybean nodule cell types. Our analysis reveals distinct distributions of transcripts between the nuclear and the cytoplasmic compartments of the soybean nodule cell. We also detect variability in cytoplasmic distribution among transcripts encoded by different genes and across cell types.
Conclusions: Our findings reveal the strong diversity in the spatial distribution of transcripts in and between differentiated plant cells. It suggests that transcript localization serves as an additional regulatory layer beyond transcriptional control. By modulating nuclear export and cytoplasmic positioning, plant cells may fine-tune translational efficiency and gene function. This study underscores the importance of incorporating spatial information into transcriptomic analyses and provides new insights into the regulatory architecture of plant RNA biology.
{"title":"The differential subcellular localization of soybean transcripts, an additional regulatory mechanism of gene activity.","authors":"Sutton Tennant, Erik J Amezquita, Yaohua Li, Benjamin Smith, Sai Subhash Mahamkali Venkata Subramanyam, Sergio Alan Cervantes-Pérez, Sandra Thibivilliers, Samik Bhattacharya, Jasper Klaver, Marc Libault","doi":"10.1186/s13059-025-03920-z","DOIUrl":"10.1186/s13059-025-03920-z","url":null,"abstract":"<p><strong>Background: </strong>Our understanding of gene function is often driven by its expression and, a fortiori, based on its RNA abundance in a cell, a tissue, or an organ. This assumption often neglects the limited correlation between RNA and protein abundance, largely due to post-transcriptional and pre-translational regulation. Among these regulatory processes, the spatial distribution of RNA molecules within cells has been reported as a major contributor of cellular function in microbial and animal systems. However, our understanding of the differential cellular distribution of transcripts in plants is very limited.</p><p><strong>Results: </strong>In this manuscript, using Molecular Cartography™ and Xenium, two high-resolution and sensitive spatial transcriptomic technologies, we comprehensively analyze the differential mapping of millions of plant transcripts in the nuclear and cytoplasmic compartments of various soybean nodule cell types. Our analysis reveals distinct distributions of transcripts between the nuclear and the cytoplasmic compartments of the soybean nodule cell. We also detect variability in cytoplasmic distribution among transcripts encoded by different genes and across cell types.</p><p><strong>Conclusions: </strong>Our findings reveal the strong diversity in the spatial distribution of transcripts in and between differentiated plant cells. It suggests that transcript localization serves as an additional regulatory layer beyond transcriptional control. By modulating nuclear export and cytoplasmic positioning, plant cells may fine-tune translational efficiency and gene function. This study underscores the importance of incorporating spatial information into transcriptomic analyses and provides new insights into the regulatory architecture of plant RNA biology.</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":" ","pages":"9"},"PeriodicalIF":12.3,"publicationDate":"2025-12-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12853825/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145844393","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Mutational signatures provide key insights into cancer mutational processes, but the availability of signature catalogues generated by different groups using distinct methodologies underscores a need for standardization. We introduce a Bayesian framework that offers a systematic approach to expanding existing signature catalogues for any type of mutational signature while grouping patients based on shared signature patterns. We demonstrate that this approach can identify both known and novel molecular subtypes across nearly 8000 samples spanning six cancer types and show that stratifications derived from signature yield prognostic groups, further enhancing the translational potential of mutational signatures.
{"title":"BASCULE: bayesian inference and clustering of mutational signatures leveraging biological priors.","authors":"Elena Buscaroli, Azad Sadr, Riccardo Bergamin, Salvatore Milite, Edith Natalia Villegas Garcia, Arianna Tasciotti, Alessio Ansuini, Daniele Ramazzotti, Nicola Calonaci, Giulio Caravagna","doi":"10.1186/s13059-025-03835-9","DOIUrl":"10.1186/s13059-025-03835-9","url":null,"abstract":"<p><p>Mutational signatures provide key insights into cancer mutational processes, but the availability of signature catalogues generated by different groups using distinct methodologies underscores a need for standardization. We introduce a Bayesian framework that offers a systematic approach to expanding existing signature catalogues for any type of mutational signature while grouping patients based on shared signature patterns. We demonstrate that this approach can identify both known and novel molecular subtypes across nearly 8000 samples spanning six cancer types and show that stratifications derived from signature yield prognostic groups, further enhancing the translational potential of mutational signatures.</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":" ","pages":"15"},"PeriodicalIF":12.3,"publicationDate":"2025-12-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12857155/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145846985","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-24DOI: 10.1186/s13059-025-03915-w
Tereza Bojdová, Lucie Hloušková, Kateřina Holušová, Radim Svačina, Eva Hřibová, Iva Ilíková, Johannes Thiel, Gihwan Kim, Roman Pleskot, Andreas Houben, Jan Bartoš, Miroslava Karafiátová
Background: Selective DNA elimination occurs across diverse species and plays a crucial role in evolution and development. This process encompasses small deletions, complete removal of chromosomes, or even the elimination of entire parental genomes. Despite its importance, the molecular mechanisms governing selective DNA elimination remain poorly understood. Our study focuses on the tissue-specific elimination of Sorghum purpureosericeum B chromosomes during embryo development.
Results: In situ B chromosome visualisation, complemented by transcriptomic profiling and gene-enrichment analysis, allows us to identify 28 candidate genes potentially linked to chromosome elimination. We show that elimination is a developmentally programmed process, peaking during mid-embryogenesis and nearly completed at later stages, leaving B chromosomes only in restricted meristematic regions. Genome sequencing reveals that the sorghum B chromosome is of multi-A chromosomal origin, has reduced gene density, is enriched in repetitive sequences, and carries a novel centromeric repeat, SpuCL166. Transcriptome analyses identify B-specific variants of kinetochore, cohesion, and checkpoint genes that are expressed during active elimination, while structural modeling of CENH3 and CENP-C indicates functional divergence at the kinetochore interface.
Conclusions: Here, we provide the first comprehensive genomic and transcriptomic characterization of B chromosome and its elimination in Sorghum purpureosericeum. Our findings suggest that B chromosomes express modified mitotic machinery to control their own fate. By establishing a framework of candidate genes, this study opens new avenues for dissecting the molecular mechanisms of chromosome elimination and provides a critical foundation for understanding how genomes evolve to regulate and tolerate supernumerary chromosomal elements.
{"title":"Sorghum embryos undergoing B chromosome elimination express B-variants of mitotic-related genes.","authors":"Tereza Bojdová, Lucie Hloušková, Kateřina Holušová, Radim Svačina, Eva Hřibová, Iva Ilíková, Johannes Thiel, Gihwan Kim, Roman Pleskot, Andreas Houben, Jan Bartoš, Miroslava Karafiátová","doi":"10.1186/s13059-025-03915-w","DOIUrl":"10.1186/s13059-025-03915-w","url":null,"abstract":"<p><strong>Background: </strong>Selective DNA elimination occurs across diverse species and plays a crucial role in evolution and development. This process encompasses small deletions, complete removal of chromosomes, or even the elimination of entire parental genomes. Despite its importance, the molecular mechanisms governing selective DNA elimination remain poorly understood. Our study focuses on the tissue-specific elimination of Sorghum purpureosericeum B chromosomes during embryo development.</p><p><strong>Results: </strong>In situ B chromosome visualisation, complemented by transcriptomic profiling and gene-enrichment analysis, allows us to identify 28 candidate genes potentially linked to chromosome elimination. We show that elimination is a developmentally programmed process, peaking during mid-embryogenesis and nearly completed at later stages, leaving B chromosomes only in restricted meristematic regions. Genome sequencing reveals that the sorghum B chromosome is of multi-A chromosomal origin, has reduced gene density, is enriched in repetitive sequences, and carries a novel centromeric repeat, SpuCL166. Transcriptome analyses identify B-specific variants of kinetochore, cohesion, and checkpoint genes that are expressed during active elimination, while structural modeling of CENH3 and CENP-C indicates functional divergence at the kinetochore interface.</p><p><strong>Conclusions: </strong>Here, we provide the first comprehensive genomic and transcriptomic characterization of B chromosome and its elimination in Sorghum purpureosericeum. Our findings suggest that B chromosomes express modified mitotic machinery to control their own fate. By establishing a framework of candidate genes, this study opens new avenues for dissecting the molecular mechanisms of chromosome elimination and provides a critical foundation for understanding how genomes evolve to regulate and tolerate supernumerary chromosomal elements.</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":" ","pages":"8"},"PeriodicalIF":12.3,"publicationDate":"2025-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12849586/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145821760","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-23DOI: 10.1186/s13059-025-03912-z
Adam Staadig, Maja Krzewińska, Maja Sidstedt, Daniel Kling, Siri Aili Fagerholm, Ricky Ansell, Anders Götherström, Andreas Tillmar
Background: The fields of ancient DNA research and forensic genetics share both methodological similarities and common challenges, particularly in the analysis of degraded DNA. Leveraging these overlaps, this study evaluates three single nucleotide polymorphisms (SNP)-based genotyping assays for analyzing challenging forensic samples: the FORCE-QIAseq SNP panel, the Twist ancient DNA hybridization capture panel, and whole-genome sequencing.
Results: We analyze twenty skeletal bone and tooth samples from authentic missing person cases, where almost all samples are severely degraded and contain exceptionally low amounts of endogenous DNA, reflected by both reduced quantifiable DNA concentrations and lower proportions of human DNA reads than typically obtained from high-quality forensic samples. Despite these challenging sample characteristics, both the FORCE and Twist assays successfully generate a substantial number of genotypes across many samples, while whole-genome sequencing yields fewer SNP calls. However, techniques like probabilistic genotyping, increase sequencing depth or genotype imputation can further enhance the utility of WGS for forensic use.
Conclusions: This study highlights the effectiveness of incorporating ancient DNA methods into forensic genetics for the analysis of degraded samples. The findings are broadly applicable to both forensic and ancient DNA research disciplines, offering valuable insights into assay selection based on sample condition and investigative goals.
{"title":"Comparative assessment of SNP genotyping assays for challenging forensic samples utilizing ancient DNA methods.","authors":"Adam Staadig, Maja Krzewińska, Maja Sidstedt, Daniel Kling, Siri Aili Fagerholm, Ricky Ansell, Anders Götherström, Andreas Tillmar","doi":"10.1186/s13059-025-03912-z","DOIUrl":"10.1186/s13059-025-03912-z","url":null,"abstract":"<p><strong>Background: </strong>The fields of ancient DNA research and forensic genetics share both methodological similarities and common challenges, particularly in the analysis of degraded DNA. Leveraging these overlaps, this study evaluates three single nucleotide polymorphisms (SNP)-based genotyping assays for analyzing challenging forensic samples: the FORCE-QIAseq SNP panel, the Twist ancient DNA hybridization capture panel, and whole-genome sequencing.</p><p><strong>Results: </strong>We analyze twenty skeletal bone and tooth samples from authentic missing person cases, where almost all samples are severely degraded and contain exceptionally low amounts of endogenous DNA, reflected by both reduced quantifiable DNA concentrations and lower proportions of human DNA reads than typically obtained from high-quality forensic samples. Despite these challenging sample characteristics, both the FORCE and Twist assays successfully generate a substantial number of genotypes across many samples, while whole-genome sequencing yields fewer SNP calls. However, techniques like probabilistic genotyping, increase sequencing depth or genotype imputation can further enhance the utility of WGS for forensic use.</p><p><strong>Conclusions: </strong>This study highlights the effectiveness of incorporating ancient DNA methods into forensic genetics for the analysis of degraded samples. The findings are broadly applicable to both forensic and ancient DNA research disciplines, offering valuable insights into assay selection based on sample condition and investigative goals.</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":" ","pages":"433"},"PeriodicalIF":12.3,"publicationDate":"2025-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12723910/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145811860","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-23DOI: 10.1186/s13059-025-03923-w
Seunghyuk Choi, Bing Zhang
Determining tumor-specificity of MHC-bound peptides is crucial for cancer immunotherapy development, yet current methods struggle with class II peptides and non-reference sequences. We introduce PepQueryMHC, an ultra-fast tool that integrates MHC-bound peptide sequences with translated RNA-seq reads for efficient tumor antigen prioritization. We demonstrate its versatility in prioritizing class I and II tumor antigens, mapping the cellular origins of presented peptides, and resolving uncertainties surrounding the prevalence of proteasome-spliced peptides.
{"title":"PepQueryMHC: rapid and comprehensive tumor antigen prioritization from immunopeptidomics data.","authors":"Seunghyuk Choi, Bing Zhang","doi":"10.1186/s13059-025-03923-w","DOIUrl":"10.1186/s13059-025-03923-w","url":null,"abstract":"<p><p>Determining tumor-specificity of MHC-bound peptides is crucial for cancer immunotherapy development, yet current methods struggle with class II peptides and non-reference sequences. We introduce PepQueryMHC, an ultra-fast tool that integrates MHC-bound peptide sequences with translated RNA-seq reads for efficient tumor antigen prioritization. We demonstrate its versatility in prioritizing class I and II tumor antigens, mapping the cellular origins of presented peptides, and resolving uncertainties surrounding the prevalence of proteasome-spliced peptides.</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":"26 1","pages":"434"},"PeriodicalIF":12.3,"publicationDate":"2025-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12723928/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145821775","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}