Genome research最新文献_第6页

Statistically rigorous and computationally efficient chromatin stripe detection with Quagga 统计严谨和计算高效的染色质条纹检测与斑驴

IF 7 2区生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY

Genome research

Pub Date : 2025-11-12 DOI: 10.1101/gr.280132.124

Fan Feng, Sean Moran, Anders Hansen, Xiaotian Zhang, Jie Liu

Chromatin stripes are architectural chromatin features where a singular loop anchor interacts with a contiguous region of DNA so, at the bulk sequencing level, it appears as a long stripe on chromatin contact matrices. Stripes are thought to play an important role in gene regulation and have been implicated in regulating a cell's lineage determination. Therefore, integrated analysis of stripes with genomic and epigenomic features at a genome-wide scale shows vast potential in understanding the cooperation between regulatory elements in 3D nucleome. To this end, we present Quagga, a computational tool for detection and statistical verification of genomic architectural stripes from Hi-C or Micro-C chromatin contact maps, which relies on robust image processing techniques and rigorous statistical tests for enrichment. Quagga outperforms other stripe detection methods in accuracy and is highly versatile, working with Hi-C, Micro-C, and other chromatin conformation capture data. By reporting on all tools' performance in classifying CTCF-cohesin anchored stripes, enhancer-promoter interacting stripes, and indeterminate stripes, we also demonstrate a thorough, integrated analysis to determine the output stripes' quality. Our work provides a flexible and convenient tool to help scientists explore the relationships between chromatin architectural stripes and important biological questions.

染色质条纹是染色质结构特征，其中单个环锚与DNA的连续区域相互作用，因此，在批量测序水平上，它在染色质接触矩阵上表现为长条纹。条纹被认为在基因调控中起着重要的作用，并与调节细胞的谱系决定有关。因此，在全基因组范围内对具有基因组和表观基因组特征的条纹进行综合分析，对于理解三维核组中调控元件之间的合作具有巨大的潜力。为此，我们提出了Quagga，这是一种用于检测和统计验证来自Hi-C或Micro-C染色质接触图谱的基因组结构条纹的计算工具，它依赖于强大的图像处理技术和严格的统计测试来富集。Quagga在准确性上优于其他条纹检测方法，并且具有高度通用性，可与Hi-C， Micro-C和其他染色质构象捕获数据一起工作。通过报告所有工具在分类ctcf -黏结剂锚定条纹、增强剂-促进剂相互作用条纹和不确定条纹方面的性能，我们还展示了一个全面的综合分析，以确定输出条纹的质量。我们的工作提供了一个灵活和方便的工具，帮助科学家探索染色质结构条纹和重要生物学问题之间的关系。

{"title":"Statistically rigorous and computationally efficient chromatin stripe detection with Quagga","authors":"Fan Feng, Sean Moran, Anders Hansen, Xiaotian Zhang, Jie Liu","doi":"10.1101/gr.280132.124","DOIUrl":"https://doi.org/10.1101/gr.280132.124","url":null,"abstract":"Chromatin stripes are architectural chromatin features where a singular loop anchor interacts with a contiguous region of DNA so, at the bulk sequencing level, it appears as a long stripe on chromatin contact matrices. Stripes are thought to play an important role in gene regulation and have been implicated in regulating a cell's lineage determination. Therefore, integrated analysis of stripes with genomic and epigenomic features at a genome-wide scale shows vast potential in understanding the cooperation between regulatory elements in 3D nucleome. To this end, we present Quagga, a computational tool for detection and statistical verification of genomic architectural stripes from Hi-C or Micro-C chromatin contact maps, which relies on robust image processing techniques and rigorous statistical tests for enrichment. Quagga outperforms other stripe detection methods in accuracy and is highly versatile, working with Hi-C, Micro-C, and other chromatin conformation capture data. By reporting on all tools' performance in classifying CTCF-cohesin anchored stripes, enhancer-promoter interacting stripes, and indeterminate stripes, we also demonstrate a thorough, integrated analysis to determine the output stripes' quality. Our work provides a flexible and convenient tool to help scientists explore the relationships between chromatin architectural stripes and important biological questions.","PeriodicalId":12678,"journal":{"name":"Genome research","volume":"19 1","pages":""},"PeriodicalIF":7.0,"publicationDate":"2025-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145492622","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Partitioned Multi-MUM finding for scalable pangenomics with MumemtoM 用MumemtoM寻找可扩展泛基因组学的分区多mum

IF 7 2区生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY

Genome research

Pub Date : 2025-11-07 DOI: 10.1101/gr.280940.125

Vikram S Shivakumar, Ben Langmead

Pangenome collections are growing to hundreds of high-quality genomes. This necessitates scalable methods for constructing pangenome alignments that can incorporate newly-sequenced assemblies. We previously developed Mumemto, which computes maximal unique matches (multi-MUMs) across pangenomes using compressed indexing. In this work, we introduce MumemtoM (Mumemto Merge), comprising two new partitioning and merging strategies. Both strategies enable highly parallel, memory efficient, and updateable computation of multi-MUMs. One of the strategies, called string-based merging, is also capable of conducting the merges in a way that follows the shape of a phylogenetic tree, naturally yielding the multi-MUM for the tree's internal nodes as well as the root. With these strategies, Mumemto now scales to 474 human haplotypes, the only multi-MUM method able to do so. It also introduces a time-memory tradeoff that allows Mumemto to be tailored to more scenarios, including in resource-limited settings.

泛基因组收集正在增长到数百个高质量基因组。这需要可扩展的方法来构建泛基因组比对，可以纳入新测序的组装。我们之前开发了Mumemto，它使用压缩索引计算跨泛基因组的最大唯一匹配（多mum）。在这项工作中，我们引入了MumemtoM (Mumemto Merge)，它包括两种新的分区和合并策略。这两种策略都可以实现多mum的高度并行、内存高效和可更新的计算。其中一种策略，称为基于字符串的合并，也能够以遵循系统发育树形状的方式进行合并，自然地为树的内部节点和根产生多mum。通过这些策略，Mumemto现在扩展到474个人类单倍型，这是唯一能够做到这一点的多mum方法。它还引入了时间-内存权衡，允许Mumemto针对更多场景进行定制，包括在资源有限的设置中。

引用次数: 0

Epigenomics of embryogenesis in turbot 大菱鲆胚胎发生的表观基因组学

IF 7 2区生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY

Genome research

Pub Date : 2025-11-04 DOI: 10.1101/gr.280355.124

Oscar Aramburu, Belen G Pardo, Ada Jimenez-Gonzalez, Andrés Blanco-Hortas, Daniel Macqueen, Carmen Bouza, Paulino Martinez

Embryogenesis is the foundational step of ontogeny, where a complex organism emerges from a single totipotent cell. This process is orchestrated by changes in transcriptional regulation, influenced by chromatin accessibility and epigenetic modifications, enabling transcription factor accessibility. Epigenomic regulation of embryogenesis has been studied in model fish, but little attention has been paid to farmed fish - where relevant traits to aquaculture rely on early developmental processes. This study reports a regulatory atlas of turbot (Scophthalmus maximus) embryogenesis. 14,560 active genes were identified in the embryonic transcriptome with > 90% showing differential expression across consecutive stages. By integrating multi-histone ChIP-seq with ATAC-seq, we built a genome-wide chromatin state model, defining promoter and enhancer activity across stages. Diverse transcription factor binding motifs were detected within regulatory elements showing differential accessibility at distinct developmental stages. Strong shifts in chromatin accessibility across stages, notably during the transition from shield to early segmentation, suggest profound chromatin reorganization underpinning somitogenesis and early organogenesis. Regardless, most changes in chromatin accessibility did not affect promoters of differentially expressed genes, suggesting that their accessibility precedes gene transcription changes. Comparative analyses with zebrafish revealed a global transcriptomic correlation of single-copy orthologs at matched stages. While conserved expression dynamics were revealed for many orthologous Hox genes, notable cross-species differences were identified from pre-ZGA leading up to hatching. This multi-omics investigation provides a novel atlas of noncoding regulatory elements controlling turbot development, with key applications for flatfish biology and sustainable aquaculture.

胚胎发生是个体发生的基础步骤，一个复杂的有机体从一个单一的全能细胞出现。这一过程是由转录调控的变化精心安排的，受染色质可及性和表观遗传修饰的影响，使转录因子可及性成为可能。胚胎发生的表观基因组调控已经在模式鱼中进行了研究，但很少关注养殖鱼，因为养殖的相关性状依赖于早期发育过程。本研究报道了大菱鲆（Scophthalmus maximus）胚胎发生的调控图谱。在胚胎转录组中鉴定出14,560个活性基因，其中>； 90%在连续的阶段中表现出差异表达。通过整合多组蛋白ChIP-seq和ATAC-seq，我们建立了全基因组染色质状态模型，定义了不同阶段的启动子和增强子活性。不同的转录因子结合基序在调控元件中被检测到，在不同的发育阶段表现出不同的可及性。染色质可及性在各个阶段的强烈变化，特别是在从屏蔽到早期分节的过渡期间，表明染色质的深刻重组是体细胞发生和早期器官发生的基础。无论如何，大多数染色质可接近性的变化不会影响差异表达基因的启动子，这表明它们的可接近性先于基因转录变化。与斑马鱼的比较分析揭示了匹配阶段单拷贝同源物的全球转录组学相关性。虽然许多同源Hox基因的保守表达动态被揭示，但从zga前到孵化，发现了显着的跨物种差异。这项多组学研究提供了控制大比目鱼发育的非编码调控元件的新图谱，对比目鱼生物学和可持续水产养殖具有重要应用价值。

{"title":"Epigenomics of embryogenesis in turbot","authors":"Oscar Aramburu, Belen G Pardo, Ada Jimenez-Gonzalez, Andrés Blanco-Hortas, Daniel Macqueen, Carmen Bouza, Paulino Martinez","doi":"10.1101/gr.280355.124","DOIUrl":"https://doi.org/10.1101/gr.280355.124","url":null,"abstract":"Embryogenesis is the foundational step of ontogeny, where a complex organism emerges from a single totipotent cell. This process is orchestrated by changes in transcriptional regulation, influenced by chromatin accessibility and epigenetic modifications, enabling transcription factor accessibility. Epigenomic regulation of embryogenesis has been studied in model fish, but little attention has been paid to farmed fish - where relevant traits to aquaculture rely on early developmental processes. This study reports a regulatory atlas of turbot (Scophthalmus maximus) embryogenesis. 14,560 active genes were identified in the embryonic transcriptome with > 90% showing differential expression across consecutive stages. By integrating multi-histone ChIP-seq with ATAC-seq, we built a genome-wide chromatin state model, defining promoter and enhancer activity across stages. Diverse transcription factor binding motifs were detected within regulatory elements showing differential accessibility at distinct developmental stages. Strong shifts in chromatin accessibility across stages, notably during the transition from shield to early segmentation, suggest profound chromatin reorganization underpinning somitogenesis and early organogenesis. Regardless, most changes in chromatin accessibility did not affect promoters of differentially expressed genes, suggesting that their accessibility precedes gene transcription changes. Comparative analyses with zebrafish revealed a global transcriptomic correlation of single-copy orthologs at matched stages. While conserved expression dynamics were revealed for many orthologous Hox genes, notable cross-species differences were identified from pre-ZGA leading up to hatching. This multi-omics investigation provides a novel atlas of noncoding regulatory elements controlling turbot development, with key applications for flatfish biology and sustainable aquaculture.","PeriodicalId":12678,"journal":{"name":"Genome research","volume":"38 1","pages":""},"PeriodicalIF":7.0,"publicationDate":"2025-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145441309","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Dynamic metabolic and molecular changes during seasonal shrinking in Sorex araneus 季节萎缩过程中的动态代谢和分子变化

IF 7 2区生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY

Genome research

Pub Date : 2025-11-04 DOI: 10.1101/gr.280639.125

William R. Thomas, Cecilia Baldoni, Yuanyuan Zeng, David Carlson, Julie Holm-Jacobsen, Marion Muturi, Dominik von Elverfeldt, Tue B. Bennike, Dina Dechmann, John Nieland, Angelique Corthals, Liliana M Davalos

To meet the challenge of wintering in place, many high-latitude small mammals reduce energy demands through hibernation. In contrast, short-lived Eurasian common shrews, Sorex araneus, remain active and shrink, including energy-intensive organs in winter, regrowing in spring in an evolved strategy called Dehnel's phenomenon. How this size change is linked to metabolic and regulatory changes to sustain their high metabolism is unknown. We analyzed metabolic, proteomic, and gene expression profiles spanning the entirety of Dehnel's seasonal cycle in wild shrews. We show regulatory changes to oxidative phosphorylation and increased fatty acid metabolism during autumn-to-winter shrinkage, as previously found in hibernating species. But in shrews we also found upregulated winter expression of genes involved in gluconeogenesis: the biosynthesis of glucose from noncarbohydrate substrates. Coexpression models revealed changes in size and metabolic gene expression interconnect via FOXO signaling, whose overexpression reduces size and extends lifespan in many model organisms. We propose that while shifts in gluconeogenesis meet the challenge posed by high metabolic rate and active winter lifestyle, FOXO signaling is central to Dehnel's phenomenon, with spring downregulation limiting lifespan in these shrews.

为了应对在原地过冬的挑战，许多高纬度小型哺乳动物通过冬眠来减少能量需求。相比之下，寿命较短的欧亚普通鼩鼱（Sorex araneus）保持活跃，并在冬季收缩，包括能量密集的器官，在春天重新生长，这是一种被称为德内尔现象的进化策略。这种大小变化如何与维持高代谢的代谢和调节变化联系在一起尚不清楚。我们分析了野生鼩鼱在整个Dehnel季节周期中的代谢、蛋白质组学和基因表达谱。我们发现，在秋冬收缩期间，氧化磷酸化的调节变化和脂肪酸代谢的增加，正如之前在冬眠物种中发现的那样。但在鼩鼱中，我们还发现冬季与糖异生有关的基因表达上调：糖异生是指从非碳水化合物底物中生物合成葡萄糖。共表达模型揭示了大小和代谢基因表达的变化通过FOXO信号相互联系，其过表达在许多模式生物中减小了大小并延长了寿命。我们认为，虽然糖异生的变化应对了高代谢率和活跃的冬季生活方式带来的挑战，但FOXO信号是Dehnel现象的核心，春季下调限制了这些鼩鼠的寿命。

{"title":"Dynamic metabolic and molecular changes during seasonal shrinking in Sorex araneus","authors":"William R. Thomas, Cecilia Baldoni, Yuanyuan Zeng, David Carlson, Julie Holm-Jacobsen, Marion Muturi, Dominik von Elverfeldt, Tue B. Bennike, Dina Dechmann, John Nieland, Angelique Corthals, Liliana M Davalos","doi":"10.1101/gr.280639.125","DOIUrl":"https://doi.org/10.1101/gr.280639.125","url":null,"abstract":"To meet the challenge of wintering in place, many high-latitude small mammals reduce energy demands through hibernation. In contrast, short-lived Eurasian common shrews, Sorex araneus, remain active and shrink, including energy-intensive organs in winter, regrowing in spring in an evolved strategy called Dehnel's phenomenon. How this size change is linked to metabolic and regulatory changes to sustain their high metabolism is unknown. We analyzed metabolic, proteomic, and gene expression profiles spanning the entirety of Dehnel's seasonal cycle in wild shrews. We show regulatory changes to oxidative phosphorylation and increased fatty acid metabolism during autumn-to-winter shrinkage, as previously found in hibernating species. But in shrews we also found upregulated winter expression of genes involved in gluconeogenesis: the biosynthesis of glucose from noncarbohydrate substrates. Coexpression models revealed changes in size and metabolic gene expression interconnect via FOXO signaling, whose overexpression reduces size and extends lifespan in many model organisms. We propose that while shifts in gluconeogenesis meet the challenge posed by high metabolic rate and active winter lifestyle, FOXO signaling is central to Dehnel's phenomenon, with spring downregulation limiting lifespan in these shrews.","PeriodicalId":12678,"journal":{"name":"Genome research","volume":"21 1","pages":""},"PeriodicalIF":7.0,"publicationDate":"2025-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145441275","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Iterative improvement of deep learning models using synthetic regulatory genomics. 基于合成调控基因组学的深度学习模型迭代改进。

IF 5.5 2区生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY

Genome research

Pub Date : 2025-11-03 DOI: 10.1101/gr.280540.125

André M Ribeiro-Dos-Santos, Matthew T Maurano

Deep learning models can accurately reconstruct genome-wide epigenetic tracks from the reference genome sequence alone. But it is unclear what predictive power they have on sequence diverging from the reference, such as disease- and trait-associated variants or engineered sequences. Recent work has applied synthetic regulatory genomics to characterized dozens of deletions, inversions, and rearrangements of DNase I hypersensitive sites (DHSs). Here, we use the state-of-the-art model Enformer to predict DNA accessibility and RNA transcription across these engineered sequences when delivered at their endogenous loci. At a high level, we observe a good correlation between accessibility predicted by Enformer and experimental data. But model performance is best for sequences that more resembled the reference, such as single deletions or combinations of multiple DHSs. Predictive power is poorer for rearrangements affecting DHS order or orientation. We use these data to fine-tune Enformer, yielding significant reduction in prediction error. We show that this fine-tuning retains strong predictive performance for other tracks. Our results show that current deep learning models perform poorly when presented with novel sequences diverging in certain critical features from their training set. Thus, an iterative approach incorporating profiling of synthetic constructs can improve model generalizability and ultimately enable functional classification of regulatory variants identified by population studies.

深度学习模型可以从参考基因组序列中准确地重建全基因组的表观遗传轨迹。但目前尚不清楚它们对偏离参考序列的预测能力有多大，比如与疾病和性状相关的变异或工程序列。最近的工作已经应用合成调控基因组学来表征dna酶I超敏感位点（DHSs）的缺失、倒置和重排。在这里，我们使用最先进的Enformer模型来预测这些工程序列在其内源性位点传递时的DNA可及性和RNA转录。在较高的水平上，Enformer预测的可达性与实验数据具有较好的相关性。但是，模型性能最好的是与参考序列更相似的序列，例如单个缺失或多个dhs的组合。对于影响国土安全部秩序或方向的重新安排，预测能力较差。我们使用这些数据对Enformer进行微调，显著降低了预测误差。我们表明，这种微调对其他轨道保持了强大的预测性能。我们的研究结果表明，当前的深度学习模型在面对与训练集在某些关键特征上偏离的新序列时表现不佳。因此，结合合成结构分析的迭代方法可以提高模型的通用性，并最终实现种群研究确定的调节变异的功能分类。

{"title":"Iterative improvement of deep learning models using synthetic regulatory genomics.","authors":"André M Ribeiro-Dos-Santos, Matthew T Maurano","doi":"10.1101/gr.280540.125","DOIUrl":"10.1101/gr.280540.125","url":null,"abstract":"Deep learning models can accurately reconstruct genome-wide epigenetic tracks from the reference genome sequence alone. But it is unclear what predictive power they have on sequence diverging from the reference, such as disease- and trait-associated variants or engineered sequences. Recent work has applied synthetic regulatory genomics to characterized dozens of deletions, inversions, and rearrangements of DNase I hypersensitive sites (DHSs). Here, we use the state-of-the-art model Enformer to predict DNA accessibility and RNA transcription across these engineered sequences when delivered at their endogenous loci. At a high level, we observe a good correlation between accessibility predicted by Enformer and experimental data. But model performance is best for sequences that more resembled the reference, such as single deletions or combinations of multiple DHSs. Predictive power is poorer for rearrangements affecting DHS order or orientation. We use these data to fine-tune Enformer, yielding significant reduction in prediction error. We show that this fine-tuning retains strong predictive performance for other tracks. Our results show that current deep learning models perform poorly when presented with novel sequences diverging in certain critical features from their training set. Thus, an iterative approach incorporating profiling of synthetic constructs can improve model generalizability and ultimately enable functional classification of regulatory variants identified by population studies.","PeriodicalId":12678,"journal":{"name":"Genome research","volume":" ","pages":"2539-2549"},"PeriodicalIF":5.5,"publicationDate":"2025-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12581950/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145344931","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Kinetic measurement of gene-specific RNA polymerase II transcription elongation rates. 基因特异性RNA聚合酶II转录延伸率的动力学测量。

IF 5.5 2区生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY

Genome research

Pub Date : 2025-11-03 DOI: 10.1101/gr.280852.125

Haiyue Liu, Lea H Gregersen

Transcription is regulated at multiple levels, including initiation, elongation, and termination. Whereas much research has focused on the initiation of transcription, regulation of elongation plays an important role not only in transcription dynamics but also in cotranscriptional RNA processing and genome stability. Despite advances in high-throughput approaches for global quantification of RNA polymerase II (RNAPII) speed, RNAPII elongation rate studies have been limited to a relatively small number of long genes or to velocity estimates inferred indirectly from RNAPII occupancy data. Here, we present DRB/TT_chem-seq2, a modified version of the DRB/TT_chem-seq method, to directly measure gene-specific elongation rates of more than 3000 genes. By combining short time point sampling after synchronized RNAPII release into the gene body and a new computational framework to track the distance traveled by RNAPII, we greatly increase the number of genes for which it is possible to obtain elongation rates. Our direct RNAPII elongation rate quantification reveals that elongation rates vary not only among genes but also within genes. Additionally, we describe how specific histone modifications and elongation factor occupancy correlate with subclasses of genes based on their elongation rates. Together, we present a robust and powerful method for RNAPII transcription elongation rate measurement.

转录在多个水平上受到调控，包括起始、延伸和终止。尽管许多研究都集中在转录的起始，但延伸调控不仅在转录动力学中起着重要作用，而且在共转录RNA加工和基因组稳定性中也起着重要作用。尽管全球定量RNA聚合酶II （RNAPII）速度的高通量方法取得了进展，但RNAPII延伸率的研究仅限于相对较少的长基因或从RNAPII占用数据间接推断的速度估计。在这里，我们提出了DRB/TTchem-seq2，这是DRB/TTchem-seq方法的改进版本，可以直接测量3000多个基因的基因特异性延伸率。通过结合同步RNAPII释放到基因体后的短时间点采样和跟踪RNAPII传播距离的新计算框架，我们大大增加了可能获得延伸率的基因数量。我们的直接RNAPII延伸率量化表明，延伸率不仅在基因之间而且在基因内变化。此外，我们描述了特定组蛋白修饰和延伸因子占用如何与基于其延伸率的基因亚类相关。我们共同提出了一种强大的RNAPII转录延伸率测量方法。

{"title":"Kinetic measurement of gene-specific RNA polymerase II transcription elongation rates.","authors":"Haiyue Liu, Lea H Gregersen","doi":"10.1101/gr.280852.125","DOIUrl":"10.1101/gr.280852.125","url":null,"abstract":"Transcription is regulated at multiple levels, including initiation, elongation, and termination. Whereas much research has focused on the initiation of transcription, regulation of elongation plays an important role not only in transcription dynamics but also in cotranscriptional RNA processing and genome stability. Despite advances in high-throughput approaches for global quantification of RNA polymerase II (RNAPII) speed, RNAPII elongation rate studies have been limited to a relatively small number of long genes or to velocity estimates inferred indirectly from RNAPII occupancy data. Here, we present DRB/TTchem-seq2, a modified version of the DRB/TTchem-seq method, to directly measure gene-specific elongation rates of more than 3000 genes. By combining short time point sampling after synchronized RNAPII release into the gene body and a new computational framework to track the distance traveled by RNAPII, we greatly increase the number of genes for which it is possible to obtain elongation rates. Our direct RNAPII elongation rate quantification reveals that elongation rates vary not only among genes but also within genes. Additionally, we describe how specific histone modifications and elongation factor occupancy correlate with subclasses of genes based on their elongation rates. Together, we present a robust and powerful method for RNAPII transcription elongation rate measurement.","PeriodicalId":12678,"journal":{"name":"Genome research","volume":" ","pages":"2550-2562"},"PeriodicalIF":5.5,"publicationDate":"2025-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12581951/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145344916","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Polyomino reconstructs spatial transcriptomic profiles with single-cell resolution via a region-allocation method. Polyomino通过区域分配方法重建单细胞分辨率的空间转录组谱。

IF 5.5 2区生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY

Genome research

Pub Date : 2025-11-03 DOI: 10.1101/gr.280532.125

Quanyou Cai, Lihui Lin, Xin Liu, Jiekai Chen

Integration of single-cell and spatial transcriptomes represents a fundamental strategy to enhance spatial data quality. However, existing methods for mapping single-cell data to spatial coordinates struggle with large-scale data sets comprising millions of cells. Here, we introduce Polyomino, an intelligent region-allocation method inspired by the region-of-interest (ROI) concept from image processing. By using gradient descent, Polyomino allocates cells to structured spatial regions that match the most significant biological information, optimizing the integration of data and improving speed and accuracy. Polyomino excels in integrating data even in the presence of various sequencing artifacts, such as cell segmentation errors and imbalanced cell-type representations. Polyomino outperforms state-of-the-art methods by 10 to 1000 times in speed, and it is the only approach capable of integrating data sets containing millions of cells in a single run. As a result, Polyomino uncovers originally hidden gene expression patterns in brain sections and offers new insights into organogenesis and tumor microenvironments, all with exceptional efficiency and accuracy.

单细胞和空间转录组的整合是提高空间数据质量的基本策略。然而，现有的将单细胞数据映射到空间坐标的方法很难处理包含数百万细胞的大规模数据集。本文介绍了一种基于图像处理中感兴趣区域（ROI）概念的智能区域分配方法Polyomino。通过使用梯度下降，Polyomino将细胞分配到与最重要的生物信息匹配的结构化空间区域，优化数据集成，提高速度和准确性。Polyomino在整合数据方面表现出色，即使存在各种测序伪影，如细胞分割错误和不平衡的细胞类型表示。Polyomino的速度比目前最先进的方法快10到1000倍，它是唯一能够在一次运行中集成包含数百万个单元格的数据集的方法。因此，Polyomino揭示了最初隐藏在脑切片中的基因表达模式，并为器官发生和肿瘤微环境提供了新的见解，所有这些都具有卓越的效率和准确性。

{"title":"Polyomino reconstructs spatial transcriptomic profiles with single-cell resolution via a region-allocation method.","authors":"Quanyou Cai, Lihui Lin, Xin Liu, Jiekai Chen","doi":"10.1101/gr.280532.125","DOIUrl":"10.1101/gr.280532.125","url":null,"abstract":"Integration of single-cell and spatial transcriptomes represents a fundamental strategy to enhance spatial data quality. However, existing methods for mapping single-cell data to spatial coordinates struggle with large-scale data sets comprising millions of cells. Here, we introduce Polyomino, an intelligent region-allocation method inspired by the region-of-interest (ROI) concept from image processing. By using gradient descent, Polyomino allocates cells to structured spatial regions that match the most significant biological information, optimizing the integration of data and improving speed and accuracy. Polyomino excels in integrating data even in the presence of various sequencing artifacts, such as cell segmentation errors and imbalanced cell-type representations. Polyomino outperforms state-of-the-art methods by 10 to 1000 times in speed, and it is the only approach capable of integrating data sets containing millions of cells in a single run. As a result, Polyomino uncovers originally hidden gene expression patterns in brain sections and offers new insights into organogenesis and tumor microenvironments, all with exceptional efficiency and accuracy.","PeriodicalId":12678,"journal":{"name":"Genome research","volume":" ","pages":"2513-2526"},"PeriodicalIF":5.5,"publicationDate":"2025-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12581947/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145344944","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A gene regulatory element modulates myosin expression and controls cardiomyocyte response to stress. 一种基因调控元件调节肌球蛋白的表达并控制心肌细胞对应激的反应。

IF 5.5 2区生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY

Genome research

Pub Date : 2025-11-03 DOI: 10.1101/gr.280825.125

Taylor Anglen, Irene M Kaplow, Baekgyu Choi, Enya Dewars, Robin M Perelli, Kevin T Hagy, Duc Tran, Megan E Ramaker, Svati H Shah, Inkyung Jung, Andrew P Landstrom, Ravi Karra, Yarui Diao, Charles A Gersbach

A hallmark of heart disease is gene dysregulation and reactivation of fetal gene programs. Reactivation of these fetal programs has compensatory effects during heart failure, depending on the type and stage of the underlying cardiomyopathy. Thousands of putative cardiac gene regulatory elements have been identified that may control these programs, but their functions are largely unknown. Here, we profile genome-wide changes to gene expression and chromatin structure in cardiomyocytes derived from human pluripotent stem cells. We identify and characterize a gene regulatory element essential for regulating MYH6 expression, which encodes human fetal myosin. Using chromatin conformation assays in combination with epigenome editing, we find that gene regulation is mediated by a direct interaction between MYH6 and the enhancer. We also find that enhancer activation alters cardiomyocyte response to the hypertrophy-inducing peptide endothelin-1. Enhancer activation prevents polyploidization as well as changes in calcium dynamics and metabolism following stress with endothelin-1. Collectively, these results identify regulatory mechanisms of cardiac gene programs that modulate cardiomyocyte maturation, affect cellular stress response, and could serve as potential therapeutic targets.

心脏病的一个标志是基因失调和胎儿基因程序的重新激活。这些胎儿程序的重新激活在心力衰竭期间具有代偿作用，这取决于潜在心肌病的类型和阶段。已经确定了数千种可能控制这些程序的假定的心脏基因调控元件，但它们的功能在很大程度上是未知的。在这里，我们分析了人类多能干细胞衍生的心肌细胞基因表达和染色质结构的全基因组变化。我们鉴定并鉴定了一个基因调控元件，该元件对调节MYH6的表达至关重要，MYH6编码人胎儿肌球蛋白。利用染色质构象分析结合表观基因组编辑，我们发现基因调控是由MYH6和增强子之间的直接相互作用介导的。我们还发现增强子的激活改变了心肌细胞对诱导肥大的肽内皮素-1的反应。增强子激活可防止多倍体以及内皮素-1应激后钙动力学和代谢的变化。总的来说，这些结果确定了调节心肌细胞成熟、影响细胞应激反应的心脏基因程序的调节机制，并可能作为潜在的治疗靶点。

{"title":"A gene regulatory element modulates myosin expression and controls cardiomyocyte response to stress.","authors":"Taylor Anglen, Irene M Kaplow, Baekgyu Choi, Enya Dewars, Robin M Perelli, Kevin T Hagy, Duc Tran, Megan E Ramaker, Svati H Shah, Inkyung Jung, Andrew P Landstrom, Ravi Karra, Yarui Diao, Charles A Gersbach","doi":"10.1101/gr.280825.125","DOIUrl":"10.1101/gr.280825.125","url":null,"abstract":"A hallmark of heart disease is gene dysregulation and reactivation of fetal gene programs. Reactivation of these fetal programs has compensatory effects during heart failure, depending on the type and stage of the underlying cardiomyopathy. Thousands of putative cardiac gene regulatory elements have been identified that may control these programs, but their functions are largely unknown. Here, we profile genome-wide changes to gene expression and chromatin structure in cardiomyocytes derived from human pluripotent stem cells. We identify and characterize a gene regulatory element essential for regulating MYH6 expression, which encodes human fetal myosin. Using chromatin conformation assays in combination with epigenome editing, we find that gene regulation is mediated by a direct interaction between MYH6 and the enhancer. We also find that enhancer activation alters cardiomyocyte response to the hypertrophy-inducing peptide endothelin-1. Enhancer activation prevents polyploidization as well as changes in calcium dynamics and metabolism following stress with endothelin-1. Collectively, these results identify regulatory mechanisms of cardiac gene programs that modulate cardiomyocyte maturation, affect cellular stress response, and could serve as potential therapeutic targets.","PeriodicalId":12678,"journal":{"name":"Genome research","volume":" ","pages":"2418-2432"},"PeriodicalIF":5.5,"publicationDate":"2025-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12581865/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145344943","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

PoreMeth2 for decoding the evolution of methylome alterations with nanopore sequencing. 通过纳米孔测序解码甲基组改变的进化。

IF 5.5 2区生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY

Genome research

Pub Date : 2025-11-03 DOI: 10.1101/gr.280259.124

Gianluca Mattei, Marta Baragli, Barbara Gega, Alessandra Mingrino, Martina Chieca, Tommaso Ducci, Gianmaria Frigè, Luca Mazzarella, Romina D'Aurizio, Francesco De Logu, Romina Nassini, Pier Giuseppe Pelicci, Alberto Magi

In epigenetic analysis, the identification of differentially methylated regions (DMRs) typically involves the detection of consecutive CpGs groups that show significant changes in their average methylation levels. However, the methylation state of a genomic region can also be characterized by a mixture of patterns (epialleles) with variable frequencies, and the relative proportions of such patterns can provide insights into its mechanisms of formation. Traditional methods based on bisulfite conversion and high-throughput sequencing, such as Illumina, owing to the read size (150 bp) allow epiallele frequency analysis only in high CpG density regions, limiting differential methylation studies to just 50% of the human methylome. Nanopore sequencing, with its long reads, enables the analysis of epiallele frequency across both high and low CpG density regions. Here, we introduce a novel computational approach, PoreMeth2, an R library that integrates epiallelic diversity and methylation frequency changes from nanopore data to identify DMRs, providing insights into their possible mechanisms of formation, and annotate them to genic and regulatory elements. We apply PoreMeth2 to cancer and glial cell data sets, providing evidence of its advance over other state-of-the-art methods and demonstrating its ability to distinguish epigenomic alterations with a strong impact on gene expression from those with weaker effects on transcriptional activity.

在表观遗传分析中，差异甲基化区（DMRs）的鉴定通常涉及检测连续的CpGs组，这些CpGs组在平均甲基化水平上表现出显著的变化。然而，基因组区域的甲基化状态也可以通过不同频率的模式（外显子）的混合来表征，并且这些模式的相对比例可以提供对其形成机制的见解。基于亚硫酸盐转化和高通量测序的传统方法，如Illumina，由于读取大小（150 bp），只能在高CpG密度区域进行外等位基因频率分析，将差异甲基化研究限制在人类甲基组的50%。纳米孔测序具有较长的读数，可以分析高和低CpG密度区域的外等位基因频率。在这里，我们引入了一种新的计算方法，PoreMeth2，这是一个R文库，它集成了纳米孔数据的外显子多样性和甲基化频率变化，以识别DMRs，提供对其可能形成机制的见解，并将其注释为基因和调控元件。我们将PoreMeth2应用于癌症和神经胶质细胞数据集，提供了其优于其他最先进方法的证据，并证明了其区分对基因表达有强烈影响的表观基因组改变与对转录活性影响较弱的表观基因组改变的能力。

{"title":"PoreMeth2 for decoding the evolution of methylome alterations with nanopore sequencing.","authors":"Gianluca Mattei, Marta Baragli, Barbara Gega, Alessandra Mingrino, Martina Chieca, Tommaso Ducci, Gianmaria Frigè, Luca Mazzarella, Romina D'Aurizio, Francesco De Logu, Romina Nassini, Pier Giuseppe Pelicci, Alberto Magi","doi":"10.1101/gr.280259.124","DOIUrl":"10.1101/gr.280259.124","url":null,"abstract":"In epigenetic analysis, the identification of differentially methylated regions (DMRs) typically involves the detection of consecutive CpGs groups that show significant changes in their average methylation levels. However, the methylation state of a genomic region can also be characterized by a mixture of patterns (epialleles) with variable frequencies, and the relative proportions of such patterns can provide insights into its mechanisms of formation. Traditional methods based on bisulfite conversion and high-throughput sequencing, such as Illumina, owing to the read size (150 bp) allow epiallele frequency analysis only in high CpG density regions, limiting differential methylation studies to just 50% of the human methylome. Nanopore sequencing, with its long reads, enables the analysis of epiallele frequency across both high and low CpG density regions. Here, we introduce a novel computational approach, PoreMeth2, an R library that integrates epiallelic diversity and methylation frequency changes from nanopore data to identify DMRs, providing insights into their possible mechanisms of formation, and annotate them to genic and regulatory elements. We apply PoreMeth2 to cancer and glial cell data sets, providing evidence of its advance over other state-of-the-art methods and demonstrating its ability to distinguish epigenomic alterations with a strong impact on gene expression from those with weaker effects on transcriptional activity.","PeriodicalId":12678,"journal":{"name":"Genome research","volume":" ","pages":"2501-2512"},"PeriodicalIF":5.5,"publicationDate":"2025-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12581943/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145336859","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Real-time analysis and visualization of nanopore metagenomic samples with MARTi. 利用MARTi对纳米孔宏基因组样品进行实时分析和可视化。

IF 5.5 2区生物学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY

Genome research

Pub Date : 2025-11-03 DOI: 10.1101/gr.280550.125

Ned Peel, Samuel Martin, Darren Heavens, Douglas W Yu, Matthew D Clark, Richard M Leggett

The emergence of nanopore sequencing technology has the potential to transform metagenomics by offering low-cost, portable, and long-read sequencing capabilities. Furthermore, these platforms enable real-time data generation, which could significantly reduce the time from sample collection to result, a crucial factor for point-of-care diagnostics and biosurveillance. However, the full potential of real-time metagenomics remains largely unfulfilled due to a lack of accessible, open-source bioinformatic tools. We present Metagenomic Analysis in Real-Time (MARTi), an innovative open-source software designed for the real-time analysis, visualization, and exploration of metagenomic data. MARTi supports various classification methods, including BLAST, Centrifuge, and Kraken2, letting users customize parameters and utilize their own databases for taxonomic classification and antimicrobial resistance analysis. With a user-friendly, browser-based graphical interface, MARTi provides dynamic, real-time updates on community composition and AMR gene identification. MARTi's architecture and operational flexibility make it suitable for diverse research applications, ranging from in-field analysis to large-scale metagenomic studies. Using both simulated and real-world data, we demonstrate MARTi's performance in read classification, taxon detection, and relative abundance estimation. By bridging the gap between sequencing and actionable insights, MARTi marks a significant advance in the accessibility and functionality of real-time metagenomic analysis.

纳米孔测序技术的出现有可能通过提供低成本、便携和长读取的测序能力来改变宏基因组学。此外，这些平台能够实时生成数据，这可以显着减少从样本收集到结果的时间，这是即时诊断和生物监测的关键因素。然而，由于缺乏可获得的、开源的生物信息学工具，实时宏基因组学的全部潜力在很大程度上仍未实现。我们介绍了实时宏基因组分析（MARTi），这是一个创新的开源软件，旨在实时分析、可视化和探索宏基因组数据。MARTi支持多种分类方法，包括BLAST、离心机和Kraken2，允许用户自定义参数并利用自己的数据库进行分类分类和抗菌素耐药性分析。通过一个用户友好的、基于浏览器的图形界面，MARTi提供了动态的、实时的群落组成和AMR基因鉴定的更新。MARTi的架构和操作灵活性使其适用于从现场分析到大规模宏基因组研究的各种研究应用。通过模拟和真实数据，我们展示了MARTi在读取分类、分类群检测和相对丰度估计方面的性能。通过弥合测序和可操作见解之间的差距，MARTi标志着实时宏基因组分析的可及性和功能的重大进步。

{"title":"Real-time analysis and visualization of nanopore metagenomic samples with MARTi.","authors":"Ned Peel, Samuel Martin, Darren Heavens, Douglas W Yu, Matthew D Clark, Richard M Leggett","doi":"10.1101/gr.280550.125","DOIUrl":"10.1101/gr.280550.125","url":null,"abstract":"The emergence of nanopore sequencing technology has the potential to transform metagenomics by offering low-cost, portable, and long-read sequencing capabilities. Furthermore, these platforms enable real-time data generation, which could significantly reduce the time from sample collection to result, a crucial factor for point-of-care diagnostics and biosurveillance. However, the full potential of real-time metagenomics remains largely unfulfilled due to a lack of accessible, open-source bioinformatic tools. We present Metagenomic Analysis in Real-Time (MARTi), an innovative open-source software designed for the real-time analysis, visualization, and exploration of metagenomic data. MARTi supports various classification methods, including BLAST, Centrifuge, and Kraken2, letting users customize parameters and utilize their own databases for taxonomic classification and antimicrobial resistance analysis. With a user-friendly, browser-based graphical interface, MARTi provides dynamic, real-time updates on community composition and AMR gene identification. MARTi's architecture and operational flexibility make it suitable for diverse research applications, ranging from in-field analysis to large-scale metagenomic studies. Using both simulated and real-world data, we demonstrate MARTi's performance in read classification, taxon detection, and relative abundance estimation. By bridging the gap between sequencing and actionable insights, MARTi marks a significant advance in the accessibility and functionality of real-time metagenomic analysis.","PeriodicalId":12678,"journal":{"name":"Genome research","volume":" ","pages":"2488-2500"},"PeriodicalIF":5.5,"publicationDate":"2025-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12581910/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145377083","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0