首页 > 最新文献

Genome Biology最新文献

英文 中文
Dynamic and cell-type specific transcriptional reprogramming underlies the floral transition in the maize shoot apical meristem. 动态和细胞类型特异性转录重编程是玉米茎尖分生组织成花转变的基础。
IF 12.3 1区 生物学 Q1 Agricultural and Biological Sciences Pub Date : 2026-03-18 DOI: 10.1186/s13059-026-04033-x
Liang Dong, Yonghao Sun, Lu Kang, Zichao Li, Yameng Liang, Huangjun Sheng, Feng Tian, David Jackson, Fang Yang

Background: The floral transition in maize represents a pivotal developmental switch that determines flowering time, environmental adaptation, and yield-related traits. However, the molecular mechanisms governing shoot apical meristem reprogramming and cell identity changes during this process remain poorly understood.

Results: By integrating time-course bulk RNA-seq, single-cell transcriptomics, chromatin accessibility, and transcription factor binding profiles, we construct a spatiotemporal molecular framework of the maize shoot apical meristem floral transition. Our analyses reveal global transcriptional reprogramming accompanied by pronounced cell type-specific regulation dynamics. At a global level, our transcriptional-level inference suggests that pathways associated with chromatin remodeling, environmental response, and reproductive development are sequentially activated. We further identify a ZmMADS69-ZmRap2.7-ZMM4 regulatory module that fine-tunes the floral transition within the shoot apical meristem. At single-cell resolution, we find that the floral transition is not driven by a uniform transcriptional switch, but instead emerges from the coordinated action of spatially distinct shoot apical meristem domains. Through differential expression, trajectory, and co-expression module analyses, we further identify previously unrecognized roles for the inflorescence regulators UNBRANCHED2 and UNBRANCHED3 in promoting the floral transition, suggesting that they coordinate floral induction with subsequent inflorescence development.

Conclusions: Our study establishes a comprehensive spatiotemporal regulatory framework for the maize floral transition, providing mechanistic insights into shoot apical meristem reprogramming and offering a foundation for identifying new regulators to improve maize adaptation and yield.

背景:玉米的花过渡是一个关键的发育开关,它决定了开花时间、环境适应性和产量相关性状。然而,在此过程中调控茎尖分生组织重编程和细胞身份改变的分子机制尚不清楚。结果:通过整合大体积rna序列、单细胞转录组学、染色质可及性和转录因子结合谱,我们构建了玉米茎尖分生组织成花转变的时空分子框架。我们的分析揭示了全球转录重编程伴随着明显的细胞类型特异性调控动力学。在全球水平上,我们的转录水平推断表明,与染色质重塑、环境反应和生殖发育相关的途径依次被激活。我们进一步鉴定了一个ZmMADS69-ZmRap2.7-ZMM4调控模块,该模块在茎尖分生组织中微调花的转变。在单细胞分辨率下,我们发现花的转变不是由一个统一的转录开关驱动的,而是由空间上不同的茎尖分生组织域的协调作用产生的。通过差异表达、轨迹分析和共表达模块分析,我们进一步发现了之前未被认识到的花序调控因子UNBRANCHED2和UNBRANCHED3在促进成花转变中的作用,表明它们协调了成花诱导和随后的花序发育。结论:本研究建立了一个完整的玉米花过渡时空调控框架,为研究茎尖分生组织重编程提供了机制,并为发现新的调控因子提高玉米的适应性和产量提供了基础。
{"title":"Dynamic and cell-type specific transcriptional reprogramming underlies the floral transition in the maize shoot apical meristem.","authors":"Liang Dong, Yonghao Sun, Lu Kang, Zichao Li, Yameng Liang, Huangjun Sheng, Feng Tian, David Jackson, Fang Yang","doi":"10.1186/s13059-026-04033-x","DOIUrl":"https://doi.org/10.1186/s13059-026-04033-x","url":null,"abstract":"<p><strong>Background: </strong>The floral transition in maize represents a pivotal developmental switch that determines flowering time, environmental adaptation, and yield-related traits. However, the molecular mechanisms governing shoot apical meristem reprogramming and cell identity changes during this process remain poorly understood.</p><p><strong>Results: </strong>By integrating time-course bulk RNA-seq, single-cell transcriptomics, chromatin accessibility, and transcription factor binding profiles, we construct a spatiotemporal molecular framework of the maize shoot apical meristem floral transition. Our analyses reveal global transcriptional reprogramming accompanied by pronounced cell type-specific regulation dynamics. At a global level, our transcriptional-level inference suggests that pathways associated with chromatin remodeling, environmental response, and reproductive development are sequentially activated. We further identify a ZmMADS69-ZmRap2.7-ZMM4 regulatory module that fine-tunes the floral transition within the shoot apical meristem. At single-cell resolution, we find that the floral transition is not driven by a uniform transcriptional switch, but instead emerges from the coordinated action of spatially distinct shoot apical meristem domains. Through differential expression, trajectory, and co-expression module analyses, we further identify previously unrecognized roles for the inflorescence regulators UNBRANCHED2 and UNBRANCHED3 in promoting the floral transition, suggesting that they coordinate floral induction with subsequent inflorescence development.</p><p><strong>Conclusions: </strong>Our study establishes a comprehensive spatiotemporal regulatory framework for the maize floral transition, providing mechanistic insights into shoot apical meristem reprogramming and offering a foundation for identifying new regulators to improve maize adaptation and yield.</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":" ","pages":""},"PeriodicalIF":12.3,"publicationDate":"2026-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147482102","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IFDlong: a model-based isoform and fusion detector for accurate annotation and quantification of long-read RNA-seq data. IFDlong:基于模型的异构体和融合检测器,用于准确注释和定量长读RNA-seq数据。
IF 12.3 1区 生物学 Q1 Agricultural and Biological Sciences Pub Date : 2026-03-18 DOI: 10.1186/s13059-026-04023-z
Wenjia Wang, Jia-Jun Liu, Yuzhen Li, Sungjin Ko, Ning Feng, Manling Zhang, Qingqi Lin, Mengying Xia, Yan P Yu, Jian-Hua Luo, Pedro L Baldoni, George C Tseng, Silvia Liu

Long-read transcriptome sequencing (long-RNA-seq) revolutionizes transcriptome research by enabling full-length transcript analysis for comprehensive exploration of isoform diversity. We developed IFDlong, a probabilistic framework and software suite for detecting isoform and fusion transcripts from bulk or single-cell long-RNA-seq data. IFDlong annotates each long read, identifies novel isoforms, quantifies expression via an expectation-maximization algorithm, and profiles fusion transcripts. In large-scale simulation and real data analyses, IFDlong outperforms existing tools and demonstrated high accuracy and robustness across multiple in-house and public datasets, including healthy tissues, cell lines, and different diseases.

长读转录组测序(long-RNA-seq)通过实现全长转录组分析来全面探索同种异构体多样性,从而彻底改变了转录组研究。我们开发了IFDlong,这是一个概率框架和软件套件,用于从大量或单细胞长rna -seq数据中检测异构体和融合转录物。IFDlong注释每个长读,识别新的异构体,通过期望最大化算法量化表达,并描述融合转录本。在大规模模拟和真实数据分析中,IFDlong优于现有工具,并在多个内部和公共数据集(包括健康组织、细胞系和不同疾病)中表现出高精度和鲁棒性。
{"title":"IFDlong: a model-based isoform and fusion detector for accurate annotation and quantification of long-read RNA-seq data.","authors":"Wenjia Wang, Jia-Jun Liu, Yuzhen Li, Sungjin Ko, Ning Feng, Manling Zhang, Qingqi Lin, Mengying Xia, Yan P Yu, Jian-Hua Luo, Pedro L Baldoni, George C Tseng, Silvia Liu","doi":"10.1186/s13059-026-04023-z","DOIUrl":"10.1186/s13059-026-04023-z","url":null,"abstract":"<p><p>Long-read transcriptome sequencing (long-RNA-seq) revolutionizes transcriptome research by enabling full-length transcript analysis for comprehensive exploration of isoform diversity. We developed IFDlong, a probabilistic framework and software suite for detecting isoform and fusion transcripts from bulk or single-cell long-RNA-seq data. IFDlong annotates each long read, identifies novel isoforms, quantifies expression via an expectation-maximization algorithm, and profiles fusion transcripts. In large-scale simulation and real data analyses, IFDlong outperforms existing tools and demonstrated high accuracy and robustness across multiple in-house and public datasets, including healthy tissues, cell lines, and different diseases.</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":" ","pages":""},"PeriodicalIF":12.3,"publicationDate":"2026-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147482097","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Systematic evaluation of single-cell multimodal data integration enhances cell type resolution and discovery of clinically relevant states in complex tissues. 单细胞多模态数据整合的系统评估提高了复杂组织中细胞类型的分辨率和临床相关状态的发现。
IF 12.3 1区 生物学 Q1 Agricultural and Biological Sciences Pub Date : 2026-03-13 DOI: 10.1186/s13059-026-04002-4
Mario Acera-Mateos, Xian Adiconis, Jessica-Kanglin Li, Domenica Marchese, Ginevra Caratù, Chung-Chau Hon, Prabha Tiwari, Miki Kojima, Beate Vieth, Michael A Murphy, Sean K Simmons, Thomas Lefevre, Irene Claes, Christopher L O'Connor, Rajasree Menon, Edgar A Otto, Yoshinari Ando, Katy Vandereyken, Matthias Kretzler, Markus Bitzer, Ernest Fraenkel, Thierry Voet, Wolfgang Enard, Piero Carninci, Holger Heyn, Joshua Z Levin, Elisabetta Mereu

Background: The integration of multimodal single-cell data enables comprehensive organ reference atlases, yet its impact remains largely unexplored, particularly in complex tissues. Using the kidney as an emblematic example of a complex organ, we perform a systematic evaluation of multimodal single-cell integration strategies, with heart tissue used for additional methodological validation.

Results: We generate a benchmarking dataset for the renal cortex by integrating 3' and 5' scRNA-seq with joint snRNA-seq and snATAC-seq, profiling 119,744 high-quality nuclei/cells from 19 donors. To align cell identities and enable consistent comparisons, we develop the interpretable machine learning tool scOMM (single-cell Omics Multimodal Mapping) and systematically assess integration strategies. "Horizontal" integration of scRNA and snRNA-seq improves cell-type identification, while "vertical" integration of snRNA-seq and snATAC-seq has an additive effect, enhancing resolution in homogeneous populations and difficult-to-identify states. Global integration is especially effective in identifying adaptive states and rare cell types, including WFDC2-expressing Thick Ascending Limb and Norn cells, previously undetected in kidney atlases.

Conclusions: Our work establishes a robust framework for multimodal reference atlas generation, advancing single-cell analysis and extending its applicability to diverse tissues.

背景:多模态单细胞数据的整合可以实现全面的器官参考图谱,但其影响在很大程度上仍未被探索,特别是在复杂组织中。使用肾脏作为复杂器官的典型例子,我们对多模式单细胞整合策略进行了系统评估,并使用心脏组织进行了额外的方法验证。结果:通过整合3‘和5’ snRNA-seq以及联合snRNA-seq和snATAC-seq,我们生成了肾皮质的基准数据集,分析了来自19个供者的119,744个高质量细胞核/细胞。为了对齐细胞身份并实现一致的比较,我们开发了可解释的机器学习工具scOMM(单细胞组学多模态映射)并系统地评估整合策略。scRNA和snRNA-seq的“水平”整合提高了细胞类型的识别,而snRNA-seq和snATAC-seq的“垂直”整合具有叠加效应,提高了同质群体和难以识别状态的分辨率。全球整合在识别适应状态和罕见细胞类型方面尤其有效,包括表达wfdc2的厚升肢细胞和Norn细胞,这些细胞以前在肾脏图谱中未被检测到。结论:我们的工作为多模态参考图谱的生成建立了一个强大的框架,推进了单细胞分析并扩展了其对不同组织的适用性。
{"title":"Systematic evaluation of single-cell multimodal data integration enhances cell type resolution and discovery of clinically relevant states in complex tissues.","authors":"Mario Acera-Mateos, Xian Adiconis, Jessica-Kanglin Li, Domenica Marchese, Ginevra Caratù, Chung-Chau Hon, Prabha Tiwari, Miki Kojima, Beate Vieth, Michael A Murphy, Sean K Simmons, Thomas Lefevre, Irene Claes, Christopher L O'Connor, Rajasree Menon, Edgar A Otto, Yoshinari Ando, Katy Vandereyken, Matthias Kretzler, Markus Bitzer, Ernest Fraenkel, Thierry Voet, Wolfgang Enard, Piero Carninci, Holger Heyn, Joshua Z Levin, Elisabetta Mereu","doi":"10.1186/s13059-026-04002-4","DOIUrl":"10.1186/s13059-026-04002-4","url":null,"abstract":"<p><strong>Background: </strong>The integration of multimodal single-cell data enables comprehensive organ reference atlases, yet its impact remains largely unexplored, particularly in complex tissues. Using the kidney as an emblematic example of a complex organ, we perform a systematic evaluation of multimodal single-cell integration strategies, with heart tissue used for additional methodological validation.</p><p><strong>Results: </strong>We generate a benchmarking dataset for the renal cortex by integrating 3' and 5' scRNA-seq with joint snRNA-seq and snATAC-seq, profiling 119,744 high-quality nuclei/cells from 19 donors. To align cell identities and enable consistent comparisons, we develop the interpretable machine learning tool scOMM (single-cell Omics Multimodal Mapping) and systematically assess integration strategies. \"Horizontal\" integration of scRNA and snRNA-seq improves cell-type identification, while \"vertical\" integration of snRNA-seq and snATAC-seq has an additive effect, enhancing resolution in homogeneous populations and difficult-to-identify states. Global integration is especially effective in identifying adaptive states and rare cell types, including WFDC2-expressing Thick Ascending Limb and Norn cells, previously undetected in kidney atlases.</p><p><strong>Conclusions: </strong>Our work establishes a robust framework for multimodal reference atlas generation, advancing single-cell analysis and extending its applicability to diverse tissues.</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":"27 1","pages":""},"PeriodicalIF":12.3,"publicationDate":"2026-03-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12983708/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147444747","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Recurrent sex chromosome turnover mediated by distinct ARR17 and PISTILLATA duplications in willows. 柳树中不同的ARR17和PISTILLATA重复介导的反复性染色体更替。
IF 12.3 1区 生物学 Q1 Agricultural and Biological Sciences Pub Date : 2026-03-12 DOI: 10.1186/s13059-026-04026-w
Yuàn Wang, Zhi-Qing Xue, Ren-Gang Zhang, Zhi-Ying Zhu, Elvira Hörandl, Xiao-Ru Wang, Yan-Fei Mao, Deborah Charlesworth, Li He

Background: Sex chromosome turnovers evolve via translocation or duplication of established sex-determining genes, or their replacement by newly evolved ones. Few cases of replacements by new factors have been documented in dioecious plants, but are suspected in Salix, in which both XY and ZW systems occur, with sex-linked regions (SLRs) of different species on various chromosomes. The male-determining genes in XY species' SLRs are partial duplicates of autosomal ARR17-like genes and regulate the expression of downstream genes involved in stamen development by producing small RNAs that suppress the expression of intact copies.

Results: Here we describe phased chromosomal assemblies of three Salix species with a ZW system derived from an XY system, including four lineages of the Salix polyclona complex (six assemblies in total). Their SLRs are within the same repeat-rich pericentromeric region of chromosome 15 as in the willows with XY system. Although these Z- and W- SLRs carry intact and/or partial ARR17 duplicates, few RNA products are detectable in our sampled tissues. However, the W-SLRs include partial duplicates of PISTILLATA (PI), a stamen development gene. These are arranged in inverted repeats and express small interfering RNAs targeting the autosomal intact Salix PI gene, suggesting that they reduce its expression, and therefore act as maleness-suppressing factors.

Conclusions: The turnover events involving intact ARR17 and partial PI duplications in the 15ZW clade I species involve pericentromeric regions that recombine rarely, making changes possible in these Salix species that would be unlikely in other genome regions.

背景:性染色体翻转是通过已建立的性别决定基因的易位或重复,或它们被新进化的基因取代而进化的。在雌雄异株植物中,被新因子替代的案例很少,但在柳树中被怀疑,XY和ZW系统都发生,不同物种在不同染色体上有性别连锁区(SLRs)。XY物种slr中的雄性决定基因是常染色体arr17样基因的部分副本,并通过产生抑制完整副本表达的小rna来调节参与雄蕊发育的下游基因的表达。结果:在这里,我们描述了三个柳属物种的分阶段染色体组合,其中ZW系统衍生自XY系统,包括四个谱系的柳多克隆复合体(共6个组合)。它们的单反区位于与XY系统柳树相同的15号染色体富含重复序列的中心周围区域。尽管这些Z-和W-单反携带完整和/或部分的ARR17重复序列,但在我们的样本组织中检测到的RNA产物很少。然而,w - slr包含雄蕊发育基因pisttillata (PI)的部分重复。这些基因以反向重复排列,并表达针对常染色体完整的Salix PI基因的小干扰rna,这表明它们降低了该基因的表达,因此起到了抑制雄性的作用。结论:在15ZW进化支I种中涉及完整的ARR17和部分PI重复的周转事件涉及很少重组的中心点周围区域,这使得这些柳树物种可能发生在其他基因组区域中不可能发生的变化。
{"title":"Recurrent sex chromosome turnover mediated by distinct ARR17 and PISTILLATA duplications in willows.","authors":"Yuàn Wang, Zhi-Qing Xue, Ren-Gang Zhang, Zhi-Ying Zhu, Elvira Hörandl, Xiao-Ru Wang, Yan-Fei Mao, Deborah Charlesworth, Li He","doi":"10.1186/s13059-026-04026-w","DOIUrl":"https://doi.org/10.1186/s13059-026-04026-w","url":null,"abstract":"<p><strong>Background: </strong>Sex chromosome turnovers evolve via translocation or duplication of established sex-determining genes, or their replacement by newly evolved ones. Few cases of replacements by new factors have been documented in dioecious plants, but are suspected in Salix, in which both XY and ZW systems occur, with sex-linked regions (SLRs) of different species on various chromosomes. The male-determining genes in XY species' SLRs are partial duplicates of autosomal ARR17-like genes and regulate the expression of downstream genes involved in stamen development by producing small RNAs that suppress the expression of intact copies.</p><p><strong>Results: </strong>Here we describe phased chromosomal assemblies of three Salix species with a ZW system derived from an XY system, including four lineages of the Salix polyclona complex (six assemblies in total). Their SLRs are within the same repeat-rich pericentromeric region of chromosome 15 as in the willows with XY system. Although these Z- and W- SLRs carry intact and/or partial ARR17 duplicates, few RNA products are detectable in our sampled tissues. However, the W-SLRs include partial duplicates of PISTILLATA (PI), a stamen development gene. These are arranged in inverted repeats and express small interfering RNAs targeting the autosomal intact Salix PI gene, suggesting that they reduce its expression, and therefore act as maleness-suppressing factors.</p><p><strong>Conclusions: </strong>The turnover events involving intact ARR17 and partial PI duplications in the 15ZW clade I species involve pericentromeric regions that recombine rarely, making changes possible in these Salix species that would be unlikely in other genome regions.</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":" ","pages":""},"PeriodicalIF":12.3,"publicationDate":"2026-03-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147445669","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The genomic history of Streptococcus mutans from the Mesolithic until modern times. 从中石器时代到现代的变形链球菌的基因组历史。
IF 12.3 1区 生物学 Q1 Agricultural and Biological Sciences Pub Date : 2026-03-12 DOI: 10.1186/s13059-026-04018-w
Vincent F D F Thygesen, Motahare Feizabadi Farahani, Sofie Holtsmark Nielsen, Florentin Constancias, Michael Givskov, Jacqueline Abranches, Gabriele Scorrano, Marie Louise S Jørkov, Ghader Ebrahimi, Julio C Bendezu-Sarmiento, Fabrice Demeter, Kristian Kristiansen, Daniel Belstrøm, Martin Sikora

Background: Streptococcus mutans is a member of the human oral microbiota and is considered one of the most important cariogenic organisms. Previous studies have suggested an expansion of S. mutans populations about 10,000 years ago with the onset of agriculture, yet direct molecular evidence of its presence from ancient DNA remains sparse.

Results: Here, we present population genomic analyses of 25 ancient S. mutans genomes (average read depth 0.1X - 387X) recovered from archaeological remains across Eurasia spanning ~ 8,000 years of human evolution. Recombination-corrected phylogenomic analyses using Gubbins show a star-like phylogeny indicative of an early radiation, with the ancient genomes falling within the genomic diversity of modern isolates but restricted to one of the major clades of the phylogeny (D). Analyses of genes encoding present day virulence factors reveals that the presence of the mutanobactin operon involved in oxygen tolerance is restricted to specific subclades (A & B) and absent among the ancient samples. Using the MEGAHIT assembler followed by binning of contigs with CONCOCT, we recover metagenome-assembled genomes (MAG) of 7 high-coverage ancient S. mutans strains, including a 7,500-year-old sample from an early European Neolithic farmer. Pangenome analysis with modern isolates using the anvi'o's suite revealed the presence of specific functional genes in the ancient isolates, which were lost through time.

Conclusions: Our study demonstrates that Streptococcus mutans DNA is well preserved in tooth samples from archaeological remains and show that it formed part of the human oral microbiota already before the onset of agriculture, consistent with a radiation and population expansion well before 8,000 years ago.

背景:变形链球菌是人类口腔微生物群的一员,被认为是最重要的致龋微生物之一。先前的研究表明,大约在1万年前,随着农业的兴起,变形链球菌的种群数量出现了扩张,但从古代DNA中获得的直接分子证据仍然很少。结果:在这里,我们对25个古代变形链球菌基因组(平均读取深度0.1X - 387X)进行了种群基因组分析,这些基因组来自欧亚大陆的考古遗址,跨越了约8000年的人类进化。利用Gubbins进行的重组校正系统基因组分析显示了一个星状系统发育,表明早期辐射,古代基因组属于现代分离株的基因组多样性,但仅限于系统发育的一个主要分支(D)。对编码当今毒力因子的基因的分析表明,参与氧耐受性的突变杆菌操纵子的存在仅限于特定亚支系(A和B),而在古代样本中不存在。利用MEGAHIT组装器和CONCOCT组合,我们恢复了7个高覆盖率的古代变形链球菌菌株的宏基因组组装基因组(MAG),其中包括来自欧洲新石器时代早期农民的7500年前的样本。使用anvi'o套件对现代分离株进行的泛基因组分析显示,随着时间的推移,古代分离株中存在特定的功能基因。结论:我们的研究表明,变形链球菌的DNA在考古遗迹的牙齿样本中保存得很好,并表明它在农业开始之前就已经形成了人类口腔微生物群的一部分,这与8000年前的辐射和人口扩张相一致。
{"title":"The genomic history of Streptococcus mutans from the Mesolithic until modern times.","authors":"Vincent F D F Thygesen, Motahare Feizabadi Farahani, Sofie Holtsmark Nielsen, Florentin Constancias, Michael Givskov, Jacqueline Abranches, Gabriele Scorrano, Marie Louise S Jørkov, Ghader Ebrahimi, Julio C Bendezu-Sarmiento, Fabrice Demeter, Kristian Kristiansen, Daniel Belstrøm, Martin Sikora","doi":"10.1186/s13059-026-04018-w","DOIUrl":"https://doi.org/10.1186/s13059-026-04018-w","url":null,"abstract":"<p><strong>Background: </strong>Streptococcus mutans is a member of the human oral microbiota and is considered one of the most important cariogenic organisms. Previous studies have suggested an expansion of S. mutans populations about 10,000 years ago with the onset of agriculture, yet direct molecular evidence of its presence from ancient DNA remains sparse.</p><p><strong>Results: </strong>Here, we present population genomic analyses of 25 ancient S. mutans genomes (average read depth 0.1X - 387X) recovered from archaeological remains across Eurasia spanning ~ 8,000 years of human evolution. Recombination-corrected phylogenomic analyses using Gubbins show a star-like phylogeny indicative of an early radiation, with the ancient genomes falling within the genomic diversity of modern isolates but restricted to one of the major clades of the phylogeny (D). Analyses of genes encoding present day virulence factors reveals that the presence of the mutanobactin operon involved in oxygen tolerance is restricted to specific subclades (A & B) and absent among the ancient samples. Using the MEGAHIT assembler followed by binning of contigs with CONCOCT, we recover metagenome-assembled genomes (MAG) of 7 high-coverage ancient S. mutans strains, including a 7,500-year-old sample from an early European Neolithic farmer. Pangenome analysis with modern isolates using the anvi'o's suite revealed the presence of specific functional genes in the ancient isolates, which were lost through time.</p><p><strong>Conclusions: </strong>Our study demonstrates that Streptococcus mutans DNA is well preserved in tooth samples from archaeological remains and show that it formed part of the human oral microbiota already before the onset of agriculture, consistent with a radiation and population expansion well before 8,000 years ago.</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":" ","pages":""},"PeriodicalIF":12.3,"publicationDate":"2026-03-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147444188","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Blood proteomics: insights from public data. 血液蛋白质组学:来自公共数据的见解。
IF 12.3 1区 生物学 Q1 Agricultural and Biological Sciences Pub Date : 2026-03-12 DOI: 10.1186/s13059-026-04027-9
Asier Larrea-Sebal, Chengxin Dai, Alejandro J Brenes, Kathrin Korff, Benjamin A Neely, Philipp E Geyer, Laura F Dagley, Richard D Unwin, Alexandra Naba, Michael J MacCoss, Tiannan Guo, Eric W Deutsch, Cesar Martin, Jochen M Schwenk, Yasset Perez-Riverol

The circulating blood proteome comprises soluble and cellular components that reflect physiological and pathological states across tissues. Advances in mass spectrometry and affinity-based proteomics have improved sensitivity and throughput, enabling the generation of public blood proteomics resources. However, comprehensive assessments of these datasets remain limited. This work reviews the cellular and molecular complexity of publicly available blood proteomics data, recent methodological developments, and the complementarity of diverse data sources across the abundance range, while outlining remaining challenges for translating blood proteomics into personalized medicine.

循环血液蛋白质组包括反映跨组织生理和病理状态的可溶性和细胞组分。质谱和基于亲和力的蛋白质组学的进步提高了灵敏度和通量,使公共血液蛋白质组学资源的产生成为可能。然而,对这些数据集的全面评估仍然有限。这项工作回顾了可公开获得的血液蛋白质组学数据的细胞和分子复杂性,最近的方法发展,以及丰富范围内不同数据源的互补性,同时概述了将血液蛋白质组学转化为个性化医疗的剩余挑战。
{"title":"Blood proteomics: insights from public data.","authors":"Asier Larrea-Sebal, Chengxin Dai, Alejandro J Brenes, Kathrin Korff, Benjamin A Neely, Philipp E Geyer, Laura F Dagley, Richard D Unwin, Alexandra Naba, Michael J MacCoss, Tiannan Guo, Eric W Deutsch, Cesar Martin, Jochen M Schwenk, Yasset Perez-Riverol","doi":"10.1186/s13059-026-04027-9","DOIUrl":"10.1186/s13059-026-04027-9","url":null,"abstract":"<p><p>The circulating blood proteome comprises soluble and cellular components that reflect physiological and pathological states across tissues. Advances in mass spectrometry and affinity-based proteomics have improved sensitivity and throughput, enabling the generation of public blood proteomics resources. However, comprehensive assessments of these datasets remain limited. This work reviews the cellular and molecular complexity of publicly available blood proteomics data, recent methodological developments, and the complementarity of diverse data sources across the abundance range, while outlining remaining challenges for translating blood proteomics into personalized medicine.</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":"27 1","pages":""},"PeriodicalIF":12.3,"publicationDate":"2026-03-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12980870/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147444238","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Benchmarking Hi-C scaffolders using reference genomes and de novo assemblies. 利用参考基因组和从头组装对Hi-C支架进行基准测试。
IF 12.3 1区 生物学 Q1 Agricultural and Biological Sciences Pub Date : 2026-03-10 DOI: 10.1186/s13059-026-03978-3
Aakash Sur, William Stafford Noble, Peter J Myler

Background: Studying a new species using high-throughput sequencing requires a high-quality reference genome. However, assembling chromosome length sequences remains challenging. Recent advances in chromatin conformation capture (Hi-C) have provided a new approach to scaffolding genome assemblies, and the last ten years have seen a proliferation of such methods. However, to our knowledge no comprehensive benchmarking of Hi-C scaffolders has been conducted to date.

Results: Through a literature review we identify the most popular Hi-C scaffolders - Lachesis, HiRise, 3d-dna, SALSA, and AllHiC. We test their ability to scaffold four well studied genomes - S. cerevisiae, L. tarentolae, A. thaliana, and H. sapiens. Scaffolders are tasked with both scaffolding fragmented versions of the reference genome as well as de novo assemblies derived from long read datasets. We find that all scaffolders can exceed 80% accuracy under ideal circumstances but that their performance quickly deteriorates under more challenging conditions. Surprisingly, many scaffolders also show poor performance on the best assemblies, where contigs are near chromosome length. Overall, we find that HiRise and Lachesis offer the best performance on average across all conditions.

Conclusions: We compare the performance of five Hi-C scaffolders using multiple reference species under both ideal and real-life conditions, thereby illuminating their strengths and weaknesses.

背景:利用高通量测序研究新物种需要高质量的参考基因组。然而,组装染色体长度序列仍然具有挑战性。染色质构象捕获(Hi-C)的最新进展为脚手架基因组组装提供了一种新的方法,并且在过去的十年中已经看到了这种方法的扩散。然而,据我们所知,到目前为止还没有对Hi-C支架进行全面的基准测试。结果:通过文献回顾,我们确定了最流行的Hi-C支架- Lachesis, HiRise, 3d-dna, SALSA和allic。我们测试了它们支撑四种已被充分研究过的基因组的能力——酿酒葡萄球菌、猪链菌、拟南芥和智人。脚手架的任务是脚手架片段版本的参考基因组,以及从长读数据集衍生的从头组装。我们发现,在理想情况下,所有支架的精度都可以超过80%,但在更具有挑战性的条件下,它们的性能会迅速恶化。令人惊讶的是,许多支架在最佳组装上也表现不佳,其中contigs接近染色体长度。总的来说,我们发现HiRise和Lachesis在所有条件下的平均表现都是最好的。结论:我们比较了5种不同参考物种的Hi-C支架在理想和现实条件下的性能,从而阐明了它们的优缺点。
{"title":"Benchmarking Hi-C scaffolders using reference genomes and de novo assemblies.","authors":"Aakash Sur, William Stafford Noble, Peter J Myler","doi":"10.1186/s13059-026-03978-3","DOIUrl":"10.1186/s13059-026-03978-3","url":null,"abstract":"<p><strong>Background: </strong>Studying a new species using high-throughput sequencing requires a high-quality reference genome. However, assembling chromosome length sequences remains challenging. Recent advances in chromatin conformation capture (Hi-C) have provided a new approach to scaffolding genome assemblies, and the last ten years have seen a proliferation of such methods. However, to our knowledge no comprehensive benchmarking of Hi-C scaffolders has been conducted to date.</p><p><strong>Results: </strong>Through a literature review we identify the most popular Hi-C scaffolders - Lachesis, HiRise, 3d-dna, SALSA, and AllHiC. We test their ability to scaffold four well studied genomes - S. cerevisiae, L. tarentolae, A. thaliana, and H. sapiens. Scaffolders are tasked with both scaffolding fragmented versions of the reference genome as well as de novo assemblies derived from long read datasets. We find that all scaffolders can exceed 80% accuracy under ideal circumstances but that their performance quickly deteriorates under more challenging conditions. Surprisingly, many scaffolders also show poor performance on the best assemblies, where contigs are near chromosome length. Overall, we find that HiRise and Lachesis offer the best performance on average across all conditions.</p><p><strong>Conclusions: </strong>We compare the performance of five Hi-C scaffolders using multiple reference species under both ideal and real-life conditions, thereby illuminating their strengths and weaknesses.</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":" ","pages":""},"PeriodicalIF":12.3,"publicationDate":"2026-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147391591","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Tribolium madens satellitome reveals a network of highly abundant satellite DNAs in megabase-sized regions hallmarked by macro-dyad symmetries. Tribolium madens卫星组揭示了在以宏观双对称为特征的兆酶大小区域中高度丰富的卫星dna网络。
IF 12.3 1区 生物学 Q1 Agricultural and Biological Sciences Pub Date : 2026-03-07 DOI: 10.1186/s13059-026-04022-0
Damira Veseljak, Evelin Despot-Slade, Marin Volarić, Lucija Horvat, Nevenka Meštrović, Brankica Mravinac

Background: Tandemly repeated satellite DNAs (satDNAs) are among the most copious sequences of eukaryotic genomes. They often reside in centromeric regions, but their diversity among different organisms obscures the properties that centromere-competent satDNAs should possess.

Results: Here, we explore the satellitome of the satDNA-rich flour beetle Tribolium madens. By combining short-read Illumina and long-read PacBio HiFi sequencing, we identify 124 satDNAs comprising 41.4% of the genome. We find that 38% of the genome sequence originates from a ~ 110 bp element that gives rise to two distinct satDNAs, the major and minor satellites, which occupy multi-megabase regions likely encompassing (peri)centromeres of all chromosomes. Fine-scale analysis of long-range organization reveals that intermingled arrays of the major and minor satDNAs are arranged in macro-dyad symmetries with the potential to form hairpin or cruciform structures spanning tens of kilobases. The inversion sites within macro-dyad symmetries and the transition zones between the major and minor satDNA arrays are highly conserved, indicating structural significance. The organization of the Tribolium madens putative (peri)centromeric satDNAs is comparable to that of the closely related Tribolium freemani and Tribolium castaneum, whose completely different dominant satDNAs also incline toward macro-dyad symmetries.

Conclusions: We propose that satDNA-related macro-dyad symmetries may affect the organization of (peri)centromeric chromatin, potentially also influencing centromere specification. The analogous pattern in congeners suggests that such symmetries are an intrinsic feature of Tribolium (peri)centromeric regions, implying that repeat organization and potential non-canonical DNA structures could be functionally more significant than the primary sequence of satDNA repeats.

串联重复卫星dna (satdna)是真核生物基因组中最丰富的序列之一。它们通常位于着丝粒区域,但它们在不同生物体中的多样性模糊了着丝粒能力的satdna应该具有的特性。结果:本研究探索了富含satdna的面粉甲虫Tribolium madens的卫星组。通过结合短读Illumina和长读PacBio HiFi测序,我们鉴定出124个satdna,占基因组的41.4%。我们发现38%的基因组序列起源于一个~ 110 bp的元件,该元件产生两个不同的卫星dna,主要和次要卫星,它们占据可能包含所有染色体的着丝粒的多碱基区域。远程组织的精细分析表明,主要和次要satdna的混合阵列以宏观双对称排列,有可能形成跨越数十个碱基的发夹或十字形结构。大双体对称内的反转位点和主、副卫星dna阵列间的过渡区高度保守,具有重要的结构意义。Tribolium madens推定的(近)着丝性卫星dna的组织与密切相关的Tribolium freemani和Tribolium castaneum相当,它们完全不同的优势卫星dna也倾向于宏观双体对称。结论:我们认为与satdna相关的宏观双体对称性可能影响着丝粒染色质的组织,也可能影响着丝粒的规格。同系物中的类似模式表明,这种对称性是Tribolium (perii)着丝点区域的内在特征,这意味着重复序列组织和潜在的非规范DNA结构在功能上可能比satDNA重复序列的初级序列更重要。
{"title":"Tribolium madens satellitome reveals a network of highly abundant satellite DNAs in megabase-sized regions hallmarked by macro-dyad symmetries.","authors":"Damira Veseljak, Evelin Despot-Slade, Marin Volarić, Lucija Horvat, Nevenka Meštrović, Brankica Mravinac","doi":"10.1186/s13059-026-04022-0","DOIUrl":"https://doi.org/10.1186/s13059-026-04022-0","url":null,"abstract":"<p><strong>Background: </strong>Tandemly repeated satellite DNAs (satDNAs) are among the most copious sequences of eukaryotic genomes. They often reside in centromeric regions, but their diversity among different organisms obscures the properties that centromere-competent satDNAs should possess.</p><p><strong>Results: </strong>Here, we explore the satellitome of the satDNA-rich flour beetle Tribolium madens. By combining short-read Illumina and long-read PacBio HiFi sequencing, we identify 124 satDNAs comprising 41.4% of the genome. We find that 38% of the genome sequence originates from a ~ 110 bp element that gives rise to two distinct satDNAs, the major and minor satellites, which occupy multi-megabase regions likely encompassing (peri)centromeres of all chromosomes. Fine-scale analysis of long-range organization reveals that intermingled arrays of the major and minor satDNAs are arranged in macro-dyad symmetries with the potential to form hairpin or cruciform structures spanning tens of kilobases. The inversion sites within macro-dyad symmetries and the transition zones between the major and minor satDNA arrays are highly conserved, indicating structural significance. The organization of the Tribolium madens putative (peri)centromeric satDNAs is comparable to that of the closely related Tribolium freemani and Tribolium castaneum, whose completely different dominant satDNAs also incline toward macro-dyad symmetries.</p><p><strong>Conclusions: </strong>We propose that satDNA-related macro-dyad symmetries may affect the organization of (peri)centromeric chromatin, potentially also influencing centromere specification. The analogous pattern in congeners suggests that such symmetries are an intrinsic feature of Tribolium (peri)centromeric regions, implying that repeat organization and potential non-canonical DNA structures could be functionally more significant than the primary sequence of satDNA repeats.</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":" ","pages":""},"PeriodicalIF":12.3,"publicationDate":"2026-03-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147373292","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Augmented CENH3 loading is accompanied by transcriptional and epigenetic reprogramming at rice centromeres during meiosis. 在减数分裂期间,水稻着丝粒上的CENH3负荷增加伴随着转录和表观遗传重编程。
IF 12.3 1区 生物学 Q1 Agricultural and Biological Sciences Pub Date : 2026-03-05 DOI: 10.1186/s13059-026-04020-2
Lei Cao, Yangzi Zhao, Hanli You, Huixin Liu, Bingxin Wang, Yonghang Run, Han Yang, Minsi Wen, Jiawei Chen, Pengtao Zhao, Yi Shen, Yafei Li, Wenli Zhang, Zhukuan Cheng

Background: Centromere identity in eukaryotes is defined epigenetically by CENH3 (CENPA), a specialized histone H3 variant essential for kinetochore establishment and faithful chromosome segregation. However, the regulatory mechanisms governing CENH3 loading during meiosis and how they differ from mitotic patterns remain insufficiently elucidated.

Results: Here we characterize the dynamics of CENH3 deposition across meiosis and compare them with mitotic loading in rice. Quantitative fluorescence imaging reveals a pronounced increase of CENH3 signal during meiotic prophase I, coinciding with increased accumulation of multiple kinetochore components. Super-resolution stimulated emission depletion microscopy further confirms a distinct peak of CENH3 loading at zygotene. Through low-input ChIP-seq integrated with multi-omics profiling of purified meiocytes, we find that this meiosis-specific enrichment reflects both expanded and intensified CENH3 deposition, predominantly at Ty3-Gypsy retrotransposons. This remodeling is accompanied by reduced transcription of mRNAs and small RNAs, along with a reduction in CHG methylation and H3K9me2 heterochromatin marks.

Conclusion: Our findings uncover a previously unrecognized, meiotic-specific pattern of CENH3 loading in rice and highlight a coordinated regulatory network linking centromeric chromatin reorganization, transcriptional repression, and epigenetic modification during early meiotic progression.

真核生物的着丝粒身份是由CENH3 (CENPA)表观遗传决定的,CENH3是一种特殊的组蛋白H3变体,对着丝粒的建立和忠实的染色体分离至关重要。然而,在减数分裂过程中控制CENH3负载的调节机制以及它们与有丝分裂模式的区别仍然没有得到充分阐明。结果:本文描述了减数分裂过程中CENH3沉积的动态,并将其与水稻有丝分裂负荷进行了比较。定量荧光成像显示,CENH3信号在减数分裂前期I明显增加,与多个着丝点组分的积累增加相一致。超分辨率受激发射耗尽显微镜进一步证实了在zygotene处明显的CENH3负载峰。通过低输入ChIP-seq结合纯化减数细胞的多组学分析,我们发现这种减数分裂特异性富集反映了扩大和增强的CENH3沉积,主要在Ty3-Gypsy反转录转座子上。这种重塑伴随着mrna和小rna转录的减少,以及CHG甲基化和H3K9me2异染色质标记的减少。结论:我们的研究结果揭示了一种以前未被认识到的、水稻减数分裂特异性的CENH3装载模式,并强调了在减数分裂早期进程中,一个连接着着丝粒染色质重组、转录抑制和表观遗传修饰的协调调节网络。
{"title":"Augmented CENH3 loading is accompanied by transcriptional and epigenetic reprogramming at rice centromeres during meiosis.","authors":"Lei Cao, Yangzi Zhao, Hanli You, Huixin Liu, Bingxin Wang, Yonghang Run, Han Yang, Minsi Wen, Jiawei Chen, Pengtao Zhao, Yi Shen, Yafei Li, Wenli Zhang, Zhukuan Cheng","doi":"10.1186/s13059-026-04020-2","DOIUrl":"https://doi.org/10.1186/s13059-026-04020-2","url":null,"abstract":"<p><strong>Background: </strong>Centromere identity in eukaryotes is defined epigenetically by CENH3 (CENPA), a specialized histone H3 variant essential for kinetochore establishment and faithful chromosome segregation. However, the regulatory mechanisms governing CENH3 loading during meiosis and how they differ from mitotic patterns remain insufficiently elucidated.</p><p><strong>Results: </strong>Here we characterize the dynamics of CENH3 deposition across meiosis and compare them with mitotic loading in rice. Quantitative fluorescence imaging reveals a pronounced increase of CENH3 signal during meiotic prophase I, coinciding with increased accumulation of multiple kinetochore components. Super-resolution stimulated emission depletion microscopy further confirms a distinct peak of CENH3 loading at zygotene. Through low-input ChIP-seq integrated with multi-omics profiling of purified meiocytes, we find that this meiosis-specific enrichment reflects both expanded and intensified CENH3 deposition, predominantly at Ty3-Gypsy retrotransposons. This remodeling is accompanied by reduced transcription of mRNAs and small RNAs, along with a reduction in CHG methylation and H3K9me2 heterochromatin marks.</p><p><strong>Conclusion: </strong>Our findings uncover a previously unrecognized, meiotic-specific pattern of CENH3 loading in rice and highlight a coordinated regulatory network linking centromeric chromatin reorganization, transcriptional repression, and epigenetic modification during early meiotic progression.</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":" ","pages":""},"PeriodicalIF":12.3,"publicationDate":"2026-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147367039","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-context seeds enable fast and high-accuracy read mapping. 多上下文种子实现快速和高精度的读取映射。
IF 12.3 1区 生物学 Q1 Agricultural and Biological Sciences Pub Date : 2026-02-28 DOI: 10.1186/s13059-026-04017-x
Ivan Tolstoganov, Marcel Martin, Nicolas Buchin, Kristoffer Sahlin

A key step in sequence similarity search is to identify shared seeds between a query and a reference sequence. A well-known tradeoff is that longer seeds offer fast searches but reduce sensitivity in variable regions. We introduce multi-context seeds (MCS), which allow the storage of seeds with different lengths in the same index structure, thus retaining the advantages of both short and long seeds. We demonstrate the applicability of MCS by implementing them in strobealign. Strobealign with MCS substantially improves accuracy compared to the previous version with little cost in runtime and no memory overhead.

序列相似性搜索的关键步骤是识别查询序列和参考序列之间的共享种子。一个众所周知的权衡是,较长的种子提供了快速搜索,但降低了在可变区域的灵敏度。我们引入了多上下文种子(multi-context seed, MCS),允许在同一索引结构中存储不同长度的种子,从而保留了短种子和长种子的优点。我们通过在频闪校准中实现MCS来证明其适用性。与以前的版本相比,与MCS相比,Strobealign的准确性大大提高,运行时成本很小,没有内存开销。
{"title":"Multi-context seeds enable fast and high-accuracy read mapping.","authors":"Ivan Tolstoganov, Marcel Martin, Nicolas Buchin, Kristoffer Sahlin","doi":"10.1186/s13059-026-04017-x","DOIUrl":"https://doi.org/10.1186/s13059-026-04017-x","url":null,"abstract":"<p><p>A key step in sequence similarity search is to identify shared seeds between a query and a reference sequence. A well-known tradeoff is that longer seeds offer fast searches but reduce sensitivity in variable regions. We introduce multi-context seeds (MCS), which allow the storage of seeds with different lengths in the same index structure, thus retaining the advantages of both short and long seeds. We demonstrate the applicability of MCS by implementing them in strobealign. Strobealign with MCS substantially improves accuracy compared to the previous version with little cost in runtime and no memory overhead.</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":" ","pages":""},"PeriodicalIF":12.3,"publicationDate":"2026-02-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147322367","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Genome Biology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1