首页 > 最新文献

Cell genomics最新文献

英文 中文
ChromBERT: A foundation model for learning interpretable representations for context-specific transcriptional regulatory networks. 一个学习情境特异性转录调控网络的可解释表征的基础模型。
IF 11.1 Q1 CELL BIOLOGY Pub Date : 2026-01-26 DOI: 10.1016/j.xgen.2025.101130
Zhaowei Yu, Dongxu Yang, Qianqian Chen, Yuxuan Zhang, Zhanhao Li, Yucheng Wang, Chenfei Wang, Yong Zhang

Gene expression is shaped by transcriptional regulatory networks (TRNs), where transcription regulators interact within regulatory elements in a context-specific manner. Deciphering context-specific TRNs has long been constrained by the severe sparsity of cell-type-specific chromatin immunoprecipitation sequencing (ChIP-seq) profiles. Here, we present ChromBERT, a foundation model pre-trained on large-scale human ChIP-seq datasets covering ∼1,000 transcription regulators. ChromBERT learns the genome-wide syntax of regulatory cooperation and generates interpretable TRN representations. After prompt-enhanced fine-tuning, it outperforms existing methods for imputing unseen cistromes. Moreover, lightweight fine-tuning on cell-type-specific downstream tasks adapts the TRN representations to capture regulatory effects and dynamics within any given cellular context. The resulting context-specific representations can then be interpreted to infer regulatory roles of transcription regulators underlying these cell-type-specific regulatory outcomes without requiring additional ChIP-seq experiments. By overcoming the limitations of sparse transcription regulator data, ChromBERT significantly enhances our ability to model and interpret transcriptional regulation across a wide range of biological contexts.

基因表达是由转录调控网络(trn)塑造的,其中转录调控因子以特定环境的方式在调控元件中相互作用。长期以来,细胞类型特异性染色质免疫沉淀测序(ChIP-seq)谱的严重稀缺性限制了对上下文特异性trn的破译。在这里,我们提出了ChromBERT,这是一个在覆盖约1,000个转录调控因子的大规模人类ChIP-seq数据集上预先训练的基础模型。ChromBERT学习调控合作的全基因组语法,并生成可解释的TRN表示。经过即时增强的微调,它优于现有的计算未见云的方法。此外,对细胞类型特异性下游任务的轻量级微调使TRN表示适应于捕捉任何给定细胞环境中的调节效应和动态。由此产生的上下文特异性表征可以被解释为推断转录调节剂在这些细胞类型特异性调节结果基础上的调节作用,而不需要额外的ChIP-seq实验。通过克服稀疏转录调控数据的局限性,ChromBERT显著增强了我们在广泛的生物学背景下建模和解释转录调控的能力。
{"title":"ChromBERT: A foundation model for learning interpretable representations for context-specific transcriptional regulatory networks.","authors":"Zhaowei Yu, Dongxu Yang, Qianqian Chen, Yuxuan Zhang, Zhanhao Li, Yucheng Wang, Chenfei Wang, Yong Zhang","doi":"10.1016/j.xgen.2025.101130","DOIUrl":"https://doi.org/10.1016/j.xgen.2025.101130","url":null,"abstract":"<p><p>Gene expression is shaped by transcriptional regulatory networks (TRNs), where transcription regulators interact within regulatory elements in a context-specific manner. Deciphering context-specific TRNs has long been constrained by the severe sparsity of cell-type-specific chromatin immunoprecipitation sequencing (ChIP-seq) profiles. Here, we present ChromBERT, a foundation model pre-trained on large-scale human ChIP-seq datasets covering ∼1,000 transcription regulators. ChromBERT learns the genome-wide syntax of regulatory cooperation and generates interpretable TRN representations. After prompt-enhanced fine-tuning, it outperforms existing methods for imputing unseen cistromes. Moreover, lightweight fine-tuning on cell-type-specific downstream tasks adapts the TRN representations to capture regulatory effects and dynamics within any given cellular context. The resulting context-specific representations can then be interpreted to infer regulatory roles of transcription regulators underlying these cell-type-specific regulatory outcomes without requiring additional ChIP-seq experiments. By overcoming the limitations of sparse transcription regulator data, ChromBERT significantly enhances our ability to model and interpret transcriptional regulation across a wide range of biological contexts.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"101130"},"PeriodicalIF":11.1,"publicationDate":"2026-01-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146069142","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Mapping disease loci to biological processes via joint pleiotropic and epigenomic partitioning. 通过联合多效性和表观基因组分配绘制疾病位点的生物学过程。
IF 11.1 Q1 CELL BIOLOGY Pub Date : 2026-01-26 DOI: 10.1016/j.xgen.2025.101138
Gaspard Kerner, Nolan Kamitaki, Benjamin Strober, Alkes L Price

Genome-wide association studies have identified thousands of disease-associated loci, yet their biological interpretation remains limited. We propose joint pleiotropic and epigenomic partitioning (J-PEP), a clustering framework that integrates pleiotropic SNP effects on auxiliary traits and tissue-specific epigenomic data to partition disease-associated loci into biologically distinct clusters. We introduce a metric-pleiotropic and epigenomic prediction accuracy (PEPA)-that evaluates how well the clusters predict SNP-to-trait and SNP-to-tissue associations in off-chromosome data. Analyzing summary statistics for 165 diseases/traits (average N = 290,000), J-PEP attained 16%-30% higher PEPA than pleiotropic or epigenomic partitioning approaches, with larger improvements for well-powered traits, consistent with simulations; these gains arise from J-PEP's tendency to upweight signals present in both auxiliary trait and tissue data, emphasizing shared components. Notably, integrating single-cell chromatin accessibility data refined bulk-based clusters, enhancing cell-type resolution and specificity. For type 2 diabetes, hypertension, and other diseases/traits, J-PEP clusters recapitulated known pathways while revealing underexplored biological processes.

全基因组关联研究已经确定了数千个与疾病相关的基因座,但它们的生物学解释仍然有限。我们提出联合多效性和表观基因组划分(J-PEP),这是一个整合多效性SNP对辅助性状的影响和组织特异性表观基因组数据的聚类框架,将疾病相关位点划分为生物学上不同的聚类。我们引入了一个度量-多效性和表观基因组预测精度(PEPA)-评估如何很好地预测snp -性状和snp -组织外染色体数据的关联。通过对165种疾病/性状(平均N = 290,000)的汇总统计分析,J-PEP方法的PEPA比多效性或表观基因组分配方法高16%-30%,对强效性状的改善更大,与模拟结果一致;这些增益来自于J-PEP倾向于提高辅助性状和组织数据中存在的信号的权重,强调共享成分。值得注意的是,整合单细胞染色质可及性数据改进了基于体积的簇,提高了细胞类型的分辨率和特异性。对于2型糖尿病、高血压和其他疾病/特征,J-PEP集群概括了已知的途径,同时揭示了未被探索的生物过程。
{"title":"Mapping disease loci to biological processes via joint pleiotropic and epigenomic partitioning.","authors":"Gaspard Kerner, Nolan Kamitaki, Benjamin Strober, Alkes L Price","doi":"10.1016/j.xgen.2025.101138","DOIUrl":"10.1016/j.xgen.2025.101138","url":null,"abstract":"<p><p>Genome-wide association studies have identified thousands of disease-associated loci, yet their biological interpretation remains limited. We propose joint pleiotropic and epigenomic partitioning (J-PEP), a clustering framework that integrates pleiotropic SNP effects on auxiliary traits and tissue-specific epigenomic data to partition disease-associated loci into biologically distinct clusters. We introduce a metric-pleiotropic and epigenomic prediction accuracy (PEPA)-that evaluates how well the clusters predict SNP-to-trait and SNP-to-tissue associations in off-chromosome data. Analyzing summary statistics for 165 diseases/traits (average N = 290,000), J-PEP attained 16%-30% higher PEPA than pleiotropic or epigenomic partitioning approaches, with larger improvements for well-powered traits, consistent with simulations; these gains arise from J-PEP's tendency to upweight signals present in both auxiliary trait and tissue data, emphasizing shared components. Notably, integrating single-cell chromatin accessibility data refined bulk-based clusters, enhancing cell-type resolution and specificity. For type 2 diabetes, hypertension, and other diseases/traits, J-PEP clusters recapitulated known pathways while revealing underexplored biological processes.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"101138"},"PeriodicalIF":11.1,"publicationDate":"2026-01-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146069170","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hist2Cell: Deciphering fine-grained cellular architectures from histology images. Hist2Cell:从组织学图像中破译细粒度的细胞结构。
IF 11.1 Q1 CELL BIOLOGY Pub Date : 2026-01-26 DOI: 10.1016/j.xgen.2025.101137
Weiqin Zhao, Zhuo Liang, Xianjie Huang, Yuanhua Huang, Lequan Yu

Histology images offer a cost-effective approach to predicting cellular phenotypes using spatial transcriptomics. However, existing methods struggle with individual gene expression accuracy and lack the capability to predict fine-grained transcriptional cell types. We present Hist2Cell, a vision graph-transformer framework to accurately resolve fine-grained cell types directly from histology images. Trained on human lung and breast cancer datasets, Hist2Cell predicts cell-type abundance with high accuracy (Pearson correlation over 0.80) and captures cellular colocalization. Moreover, it generalizes to large-scale The Cancer Genome Atlas (TCGA) cohorts without re-training, facilitating survival prediction by revealing distinct tissue microenvironments and cell type-patient mortality relationships. Thus, Hist2Cell enables cost-efficient analysis for large-scale spatial biology studies and precise cancer prognosis.

组织学图像提供了一个成本效益的方法来预测细胞表型使用空间转录组学。然而,现有的方法与个体基因表达的准确性相斗争,并且缺乏预测细粒度转录细胞类型的能力。我们提出了Hist2Cell,这是一个视觉图形转换框架,可以直接从组织学图像中准确地解析细粒度细胞类型。在人类肺癌和乳腺癌数据集上训练,Hist2Cell以高精度预测细胞类型丰度(Pearson相关性超过0.80)并捕获细胞共定位。此外,它可以推广到大规模的癌症基因组图谱(TCGA)队列,无需重新训练,通过揭示不同的组织微环境和细胞类型与患者死亡率的关系,促进生存预测。因此,Hist2Cell使大规模空间生物学研究和精确癌症预后的成本效益分析成为可能。
{"title":"Hist2Cell: Deciphering fine-grained cellular architectures from histology images.","authors":"Weiqin Zhao, Zhuo Liang, Xianjie Huang, Yuanhua Huang, Lequan Yu","doi":"10.1016/j.xgen.2025.101137","DOIUrl":"10.1016/j.xgen.2025.101137","url":null,"abstract":"<p><p>Histology images offer a cost-effective approach to predicting cellular phenotypes using spatial transcriptomics. However, existing methods struggle with individual gene expression accuracy and lack the capability to predict fine-grained transcriptional cell types. We present Hist2Cell, a vision graph-transformer framework to accurately resolve fine-grained cell types directly from histology images. Trained on human lung and breast cancer datasets, Hist2Cell predicts cell-type abundance with high accuracy (Pearson correlation over 0.80) and captures cellular colocalization. Moreover, it generalizes to large-scale The Cancer Genome Atlas (TCGA) cohorts without re-training, facilitating survival prediction by revealing distinct tissue microenvironments and cell type-patient mortality relationships. Thus, Hist2Cell enables cost-efficient analysis for large-scale spatial biology studies and precise cancer prognosis.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"101137"},"PeriodicalIF":11.1,"publicationDate":"2026-01-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146069192","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Spatial transcriptomics reveals altered communities and drivers of aberrant epithelia and pro-fibrotic fibroblasts in interstitial lung diseases. 空间转录组学揭示了间质性肺疾病中异常上皮细胞和前纤维化成纤维细胞群落的改变和驱动因素。
IF 11.1 Q1 CELL BIOLOGY Pub Date : 2026-01-22 DOI: 10.1016/j.xgen.2025.101066
Alok Jaiswal, Tristan Kooistra, Vladislav Pokatayev, Hélder N Bastos, Rita F Santos, Tresa R Sarraf, Åsa Segerstolpe, Crystal Lin, Liat Amir-Zilberstein, Shaina Twardus, Kevin Shannon, Shane P Murphy, Rachel Knipe, Ingo K Ganzleben, Katharine E Black, Toni M Delorey, Daniel B Graham, Yin P Hung, Lida P Hariri, Jacques Deguine, Agostinho Carvalho, Benjamin D Medoff, Ramnik J Xavier

Interstitial lung diseases (ILD) are characterized by fibrotic scarring of the lung parenchyma with remarkably unfavorable prognosis. Using single-nucleus RNA sequencing and spatial transcriptomics, we generated a comprehensive cellular network of the distal lung and its alterations in fibrosis. Integration with histopathology revealed that the transformation of normal parenchyma into fibrotic tissue is accompanied by ectopic bronchiolization and decellularization. Areas of active fibrosis were characterized by co-localization of pro-fibrotic CTHRC1-hi fibroblasts and aberrant transitional epithelial cells. We modeled this maladaptive differentiation of alveolar epithelial cells using organoids, demonstrating that all three pro-inflammatory ligands present in this pathogenic niche, TGF-β, IL-1β, and TNF-α, are jointly required for their induction. Additionally, we identified a requirement for the transcription factor NFATC4 during myofibroblast differentiation driven by soluble factors or mechanosensing. Collectively, this work identifies essential molecular drivers of the cellular interactions underlying lung fibrosis.

间质性肺病(ILD)以肺实质纤维化瘢痕为特征,预后不良。使用单核RNA测序和空间转录组学,我们生成了远端肺及其纤维化变化的全面细胞网络。结合组织病理学发现正常实质向纤维化组织的转变伴随着异位细支气管细支气管化和脱细胞。活跃纤维化区域的特征是促纤维化CTHRC1-hi成纤维细胞和异常移行上皮细胞的共定位。我们使用类器官模拟了肺泡上皮细胞的这种不适应分化,证明了在这种致病性生态位中存在的所有三种促炎配体TGF-β、IL-1β和TNF-α都是诱导其分化所必需的。此外,我们确定了在可溶性因子或机械感应驱动的肌成纤维细胞分化过程中对转录因子NFATC4的需求。总的来说,这项工作确定了肺纤维化细胞相互作用的基本分子驱动因素。
{"title":"Spatial transcriptomics reveals altered communities and drivers of aberrant epithelia and pro-fibrotic fibroblasts in interstitial lung diseases.","authors":"Alok Jaiswal, Tristan Kooistra, Vladislav Pokatayev, Hélder N Bastos, Rita F Santos, Tresa R Sarraf, Åsa Segerstolpe, Crystal Lin, Liat Amir-Zilberstein, Shaina Twardus, Kevin Shannon, Shane P Murphy, Rachel Knipe, Ingo K Ganzleben, Katharine E Black, Toni M Delorey, Daniel B Graham, Yin P Hung, Lida P Hariri, Jacques Deguine, Agostinho Carvalho, Benjamin D Medoff, Ramnik J Xavier","doi":"10.1016/j.xgen.2025.101066","DOIUrl":"10.1016/j.xgen.2025.101066","url":null,"abstract":"<p><p>Interstitial lung diseases (ILD) are characterized by fibrotic scarring of the lung parenchyma with remarkably unfavorable prognosis. Using single-nucleus RNA sequencing and spatial transcriptomics, we generated a comprehensive cellular network of the distal lung and its alterations in fibrosis. Integration with histopathology revealed that the transformation of normal parenchyma into fibrotic tissue is accompanied by ectopic bronchiolization and decellularization. Areas of active fibrosis were characterized by co-localization of pro-fibrotic CTHRC1-hi fibroblasts and aberrant transitional epithelial cells. We modeled this maladaptive differentiation of alveolar epithelial cells using organoids, demonstrating that all three pro-inflammatory ligands present in this pathogenic niche, TGF-β, IL-1β, and TNF-α, are jointly required for their induction. Additionally, we identified a requirement for the transcription factor NFATC4 during myofibroblast differentiation driven by soluble factors or mechanosensing. Collectively, this work identifies essential molecular drivers of the cellular interactions underlying lung fibrosis.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"101066"},"PeriodicalIF":11.1,"publicationDate":"2026-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146042223","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Microbial single-cell omics in situ. 原位微生物单细胞组学。
IF 11.1 Q1 CELL BIOLOGY Pub Date : 2026-01-22 DOI: 10.1016/j.xgen.2025.101128
Xihong Lan, Qiaoxing Liang, Jinhua He, Jiayi Wu, Xiaoying Zhang, Fei Li, Lili Li, Guoping Zhao, Ruidong Guo, Huijue Jia

Metagenomics has enabled the understanding of the microbial composition and functional potential in various environments. Using laser-induced forward transfer (LIFT) technology, we report high-quality microbial single-cell genomes or transcriptomes in complex samples such as mouse gut, human saliva, and tumor sections. Bacterial cells in close proximity to each other or to host cells could be directly analyzed using this single-cell approach. Bacterial cells in mice or human samples could be fluorescently labeled for single-cell visualization before collection. The high-quality single-cell transcriptome results allow us to delineate cell-fate commitment in Bacillus sporulation and preliminarily characterize gene expression from Bacteroides in a colorectal cancer sample. The method is scalable and precise and empowers insights about microbial populations and single-cell interactions with the host.

宏基因组学使人们能够了解微生物的组成和在各种环境中的功能潜力。利用激光诱导前转移(LIFT)技术,我们报道了复杂样品(如小鼠肠道、人类唾液和肿瘤切片)中高质量的微生物单细胞基因组或转录组。细菌细胞彼此接近或与宿主细胞接近,可以使用这种单细胞方法直接分析。在收集之前,可以对小鼠或人样品中的细菌细胞进行荧光标记,以便单细胞可视化。高质量的单细胞转录组结果使我们能够描绘芽孢杆菌孢子的细胞命运承诺,并初步表征结直肠癌样本中拟杆菌的基因表达。该方法是可扩展的和精确的,并授权洞察微生物种群和单细胞与宿主的相互作用。
{"title":"Microbial single-cell omics in situ.","authors":"Xihong Lan, Qiaoxing Liang, Jinhua He, Jiayi Wu, Xiaoying Zhang, Fei Li, Lili Li, Guoping Zhao, Ruidong Guo, Huijue Jia","doi":"10.1016/j.xgen.2025.101128","DOIUrl":"10.1016/j.xgen.2025.101128","url":null,"abstract":"<p><p>Metagenomics has enabled the understanding of the microbial composition and functional potential in various environments. Using laser-induced forward transfer (LIFT) technology, we report high-quality microbial single-cell genomes or transcriptomes in complex samples such as mouse gut, human saliva, and tumor sections. Bacterial cells in close proximity to each other or to host cells could be directly analyzed using this single-cell approach. Bacterial cells in mice or human samples could be fluorescently labeled for single-cell visualization before collection. The high-quality single-cell transcriptome results allow us to delineate cell-fate commitment in Bacillus sporulation and preliminarily characterize gene expression from Bacteroides in a colorectal cancer sample. The method is scalable and precise and empowers insights about microbial populations and single-cell interactions with the host.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"101128"},"PeriodicalIF":11.1,"publicationDate":"2026-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146042260","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Variant-resolved prediction of context-specific isoform variation with a graph-based attention model. 基于图的注意模型的情境特定异构体变异的变异分辨预测。
IF 11.1 Q1 CELL BIOLOGY Pub Date : 2026-01-16 DOI: 10.1016/j.xgen.2025.101126
Aviya Litman, Zhicheng Pan, Ksenia Sokolova, Joyce Fang, Tess Marvin, Natalie Sauerwald, Christopher Y Park, Chandra L Theesfeld, Olga G Troyanskaya

In eukaryotes, most genes produce multiple transcript isoforms that diversify the transcriptome and proteome, serving as a key mechanism of functional regulation. Genetic variation can disrupt the RNA processing signals that shape isoform structure and abundance, yet modeling these effects at full-length isoform resolution remains challenging due to the complexity of transcript regulation. Here, we introduce Otari, an attention-based graph neural network framework trained on the human genomic sequence and long-read transcriptomes across 30 tissue types and brain regions. Otari predicts tissue-specific differential isoform abundance by integrating sequence-derived epigenetic and post-transcriptional signals, enabling isoform-resolved variant effect interpretation. Applied to large-scale variant datasets, including an autism cohort, Otari uncovers patterns of isoform dysregulation undetectable at the gene level, such as variant-driven perturbations in isoform abundance and microexon usage implicated in autism pathophysiology. We provide Otari as a resource for powering isoform-level analyses across tissues at scale.

在真核生物中,大多数基因产生多种转录异构体,使转录组和蛋白质组多样化,这是功能调控的关键机制。遗传变异可以破坏影响异构体结构和丰度的RNA加工信号,但由于转录调控的复杂性,在全长异构体分辨率上建模这些影响仍然具有挑战性。在这里,我们介绍了Otari,一个基于注意力的图神经网络框架,它训练了人类基因组序列和30种组织类型和大脑区域的长读转录组。Otari通过整合序列衍生的表观遗传和转录后信号来预测组织特异性差异异构体丰度,从而实现异构体解决的变异效应解释。应用于大规模变异数据集,包括自闭症队列,Otari揭示了在基因水平上无法检测到的异构体失调模式,例如变异驱动的异构体丰度扰动和与自闭症病理生理相关的微外显子使用。我们提供Otari作为一种资源,为跨组织的异构体水平分析提供动力。
{"title":"Variant-resolved prediction of context-specific isoform variation with a graph-based attention model.","authors":"Aviya Litman, Zhicheng Pan, Ksenia Sokolova, Joyce Fang, Tess Marvin, Natalie Sauerwald, Christopher Y Park, Chandra L Theesfeld, Olga G Troyanskaya","doi":"10.1016/j.xgen.2025.101126","DOIUrl":"10.1016/j.xgen.2025.101126","url":null,"abstract":"<p><p>In eukaryotes, most genes produce multiple transcript isoforms that diversify the transcriptome and proteome, serving as a key mechanism of functional regulation. Genetic variation can disrupt the RNA processing signals that shape isoform structure and abundance, yet modeling these effects at full-length isoform resolution remains challenging due to the complexity of transcript regulation. Here, we introduce Otari, an attention-based graph neural network framework trained on the human genomic sequence and long-read transcriptomes across 30 tissue types and brain regions. Otari predicts tissue-specific differential isoform abundance by integrating sequence-derived epigenetic and post-transcriptional signals, enabling isoform-resolved variant effect interpretation. Applied to large-scale variant datasets, including an autism cohort, Otari uncovers patterns of isoform dysregulation undetectable at the gene level, such as variant-driven perturbations in isoform abundance and microexon usage implicated in autism pathophysiology. We provide Otari as a resource for powering isoform-level analyses across tissues at scale.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"101126"},"PeriodicalIF":11.1,"publicationDate":"2026-01-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145994566","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Uncovering diversity in the immunoglobulin heavy chain locus. 揭示免疫球蛋白重链位点的多样性。
IF 11.1 Q1 CELL BIOLOGY Pub Date : 2026-01-14 DOI: 10.1016/j.xgen.2025.101134
Jill A Hollenbach

The immunoglobulin heavy chain constant (IGHC) locus houses genetic determinates of antibody function and specificity. In this issue of Cell Genomics, Jana et al. use long-read sequencing to characterize extensive inter-individual diversity in the IGHC region across ancestrally diverse populations, highlighting potential functional consequences.

免疫球蛋白重链常数(IGHC)位点包含抗体功能和特异性的遗传决定因素。在这一期的《细胞基因组学》中,Jana等人使用长读测序来表征不同祖先人群中IGHC区域广泛的个体间多样性,强调了潜在的功能后果。
{"title":"Uncovering diversity in the immunoglobulin heavy chain locus.","authors":"Jill A Hollenbach","doi":"10.1016/j.xgen.2025.101134","DOIUrl":"https://doi.org/10.1016/j.xgen.2025.101134","url":null,"abstract":"<p><p>The immunoglobulin heavy chain constant (IGHC) locus houses genetic determinates of antibody function and specificity. In this issue of Cell Genomics, Jana et al. use long-read sequencing to characterize extensive inter-individual diversity in the IGHC region across ancestrally diverse populations, highlighting potential functional consequences.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":"6 1","pages":"101134"},"PeriodicalIF":11.1,"publicationDate":"2026-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145992024","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Co-mapping clonal and transcriptional heterogeneity in somatic evolution via GoT-Multi. 通过GoT-Multi研究体细胞进化的克隆和转录异质性。
IF 11.1 Q1 CELL BIOLOGY Pub Date : 2026-01-14 Epub Date: 2025-10-10 DOI: 10.1016/j.xgen.2025.101036
Minwoo Pak, Mirca S Saurty-Seerunghen, Kellie Wise, Tsega-Ab Abera, Chhiring Lama, Neelang Parghi, Ted Kang, Xiaotian Sun, Qi Gao, Liming Bao, Mikhail Roshal, John N Allan, Richard R Furman, Luciano G Martelotto, Anna S Nam

Somatic evolution leads to clonal heterogeneity, which fuels cancer progression and therapy resistance. To decipher the consequences of clonal heterogeneity, we require a method that deconvolutes complex clonal architectures and their downstream transcriptional states. We developed Genotyping of Transcriptomes for multiple targets and sample types (GoT-Multi), a high-throughput, formalin-fixed paraffin-embedded (FFPE) tissue-compatible single-cell multi-omics for co-detection of multiple somatic genotypes and whole transcriptomes. We developed an ensemble-based machine learning pipeline to optimize genotyping. We applied GoT-Multi to frozen or FFPE samples of Richter transformation, a progression of chronic lymphocytic leukemia to therapy-resistant large B cell lymphoma. GoT-Multi detected heterogeneous cancer cell states with genotypic data of 27 mutations, enabling clonal architecture reconstruction linked with their transcriptional programs. Distinct subclonal genotypes, including therapy-resistant mutations, converged on an inflammatory state. Other subclones displayed enhanced proliferation and/or MYC program. Thus, GoT-Multi revealed that distinct genotypic identities may converge on similar transcriptional states to mediate therapy resistance.

体细胞进化导致克隆异质性,这加剧了癌症的进展和治疗耐药性。为了破译克隆异质性的后果,我们需要一种方法来解卷积复杂的克隆结构及其下游转录状态。我们开发了针对多种靶点和样品类型的转录组基因分型(GoT-Multi),这是一种高通量、福尔马林固定石蜡包埋(FFPE)组织兼容的单细胞多组学,用于共同检测多种体细胞基因型和整个转录组。我们开发了一个基于集成的机器学习管道来优化基因分型。我们将GoT-Multi应用于Richter转化的冷冻或FFPE样本,Richter转化是慢性淋巴细胞白血病向治疗抵抗性大B细胞淋巴瘤的进展。GoT-Multi利用27个突变的基因型数据检测异质癌细胞状态,实现了与转录程序相关的克隆结构重建。不同的亚克隆基因型,包括治疗耐药突变,聚集在炎症状态。其他亚克隆表现出增强的增殖和/或MYC程序。因此,GoT-Multi揭示了不同的基因型身份可能会聚在相似的转录状态上,从而介导治疗耐药性。
{"title":"Co-mapping clonal and transcriptional heterogeneity in somatic evolution via GoT-Multi.","authors":"Minwoo Pak, Mirca S Saurty-Seerunghen, Kellie Wise, Tsega-Ab Abera, Chhiring Lama, Neelang Parghi, Ted Kang, Xiaotian Sun, Qi Gao, Liming Bao, Mikhail Roshal, John N Allan, Richard R Furman, Luciano G Martelotto, Anna S Nam","doi":"10.1016/j.xgen.2025.101036","DOIUrl":"10.1016/j.xgen.2025.101036","url":null,"abstract":"<p><p>Somatic evolution leads to clonal heterogeneity, which fuels cancer progression and therapy resistance. To decipher the consequences of clonal heterogeneity, we require a method that deconvolutes complex clonal architectures and their downstream transcriptional states. We developed Genotyping of Transcriptomes for multiple targets and sample types (GoT-Multi), a high-throughput, formalin-fixed paraffin-embedded (FFPE) tissue-compatible single-cell multi-omics for co-detection of multiple somatic genotypes and whole transcriptomes. We developed an ensemble-based machine learning pipeline to optimize genotyping. We applied GoT-Multi to frozen or FFPE samples of Richter transformation, a progression of chronic lymphocytic leukemia to therapy-resistant large B cell lymphoma. GoT-Multi detected heterogeneous cancer cell states with genotypic data of 27 mutations, enabling clonal architecture reconstruction linked with their transcriptional programs. Distinct subclonal genotypes, including therapy-resistant mutations, converged on an inflammatory state. Other subclones displayed enhanced proliferation and/or MYC program. Thus, GoT-Multi revealed that distinct genotypic identities may converge on similar transcriptional states to mediate therapy resistance.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"101036"},"PeriodicalIF":11.1,"publicationDate":"2026-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145276872","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Incomplete lineage sorting of segmental duplications defines the human chromosome 2 fusion site early during African great ape speciation. 在非洲类人猿物种形成的早期,人类染色体2的融合位点是由不完全的片段重复谱系分类确定的。
IF 11.1 Q1 CELL BIOLOGY Pub Date : 2026-01-14 Epub Date: 2025-12-02 DOI: 10.1016/j.xgen.2025.101079
Zikun Yang, Lu Zhang, Xinrui Jiang, Xiangyu Yang, Kaiyue Ma, DongAhn Yoo, Yong Lu, Shilong Zhang, Jieyi Chen, Yanhong Nie, Xinyan Bian, Junmin Han, Lianting Fu, Juan Zhang, Mario Ventura, Guojie Zhang, Qiang Sun, Evan E Eichler, Yafei Mao

All great apes differ karyotypically from humans due to the fusion of chromosomes 2a and 2b, resulting in human chromosome 2. Here, we show that the fusion was associated with multiple pericentric inversions, segmental duplications (SDs), and the turnover of subterminal repetitive DNA. We characterized the fusion site at the single-base-pair resolution and identified three distinct SDs that originated more than 5 million years ago. These three distinct SDs were differentially distributed among African great apes as a result of incomplete lineage sorting (ILS) and lineage-specific duplication. One of these SDs shares homology to a hypomethylated SD spacer sequence present in the subterminal heterochromatin of Pan but is completely absent subtelomerically in both humans and orangutans. CRISPR-Cas9-mediated depletion of the fusion site in human neural progenitor cells alters the expression of genes, indicating a potential regulatory consequence to this human-specific karyotypic change. Overall, this study offers insights into how complex regions subject to ILS may contribute to speciation.

所有类人猿的核型都与人类不同,这是由于2a和2b染色体的融合,形成了人类的2号染色体。在这里,我们发现融合与多个中心倒位,片段复制(SDs)和亚末端重复DNA的周转有关。我们以单碱基对分辨率对融合位点进行了表征,并确定了三个不同的SDs,它们起源于500多万年前。这三种不同的SDs在非洲类人猿中存在差异,这是由于谱系分类不完全和谱系特异性重复造成的。其中一种SDs与Pan亚端异染色质中存在的低甲基化SD间隔序列具有同源性,但在人类和猩猩的亚端中完全不存在。crispr - cas9介导的人类神经祖细胞融合位点的缺失改变了基因的表达,表明这种人类特异性核型变化具有潜在的调节后果。总的来说,这项研究提供了关于受ILS影响的复杂区域如何有助于物种形成的见解。
{"title":"Incomplete lineage sorting of segmental duplications defines the human chromosome 2 fusion site early during African great ape speciation.","authors":"Zikun Yang, Lu Zhang, Xinrui Jiang, Xiangyu Yang, Kaiyue Ma, DongAhn Yoo, Yong Lu, Shilong Zhang, Jieyi Chen, Yanhong Nie, Xinyan Bian, Junmin Han, Lianting Fu, Juan Zhang, Mario Ventura, Guojie Zhang, Qiang Sun, Evan E Eichler, Yafei Mao","doi":"10.1016/j.xgen.2025.101079","DOIUrl":"10.1016/j.xgen.2025.101079","url":null,"abstract":"<p><p>All great apes differ karyotypically from humans due to the fusion of chromosomes 2a and 2b, resulting in human chromosome 2. Here, we show that the fusion was associated with multiple pericentric inversions, segmental duplications (SDs), and the turnover of subterminal repetitive DNA. We characterized the fusion site at the single-base-pair resolution and identified three distinct SDs that originated more than 5 million years ago. These three distinct SDs were differentially distributed among African great apes as a result of incomplete lineage sorting (ILS) and lineage-specific duplication. One of these SDs shares homology to a hypomethylated SD spacer sequence present in the subterminal heterochromatin of Pan but is completely absent subtelomerically in both humans and orangutans. CRISPR-Cas9-mediated depletion of the fusion site in human neural progenitor cells alters the expression of genes, indicating a potential regulatory consequence to this human-specific karyotypic change. Overall, this study offers insights into how complex regions subject to ILS may contribute to speciation.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"101079"},"PeriodicalIF":11.1,"publicationDate":"2026-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145672800","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Epigenome and interactome profiling uncovers principles of distal regulation in the barley genome. 表观基因组和相互作用组分析揭示了大麦基因组远端调控的原理。
IF 11.1 Q1 CELL BIOLOGY Pub Date : 2026-01-14 Epub Date: 2025-10-10 DOI: 10.1016/j.xgen.2025.101037
Pavla Navratilova, Simon Pavlu, Zihao Zhu, Zuzana Tulpova, Ondrej Kopecky, Petr Novak, Nils Stein, Hana Simkova

Regulation of transcription initiation is the ground level of modulating gene expression during plant development. This process relies on interactions between transcription factors and cis-regulatory elements (CREs), which become promising targets for crop bioengineering. To annotate CREs in the barley genome and understand mechanisms of distal regulation, we profiled several epigenetic features across three stages of barley embryo and leaves and performed HiChIP to identify activating and repressive genomic interactions. Using machine learning, we integrated the data into seven chromatin states, predicting ∼77,000 CRE candidates, collectively representing 1.43% of the barley genome. Identified genomic interactions, often spanning multiple genes, linked thousands of predicted CREs with their putative targets and revealed notably frequent promoter-promoter contacts. Using the LEA gene family as an example, we discuss possible roles of these interactions in transcription regulation. On the Vrn3 gene, we demonstrate the potential of our datasets to predict CREs for other developmental stages.

转录起始调控是调控植物发育过程中基因表达的基础。这一过程依赖于转录因子和顺式调控元件(cre)之间的相互作用,这是作物生物工程中很有前景的靶点。为了注释大麦基因组中的cre并了解远端调控机制,我们分析了大麦胚胎和叶片三个阶段的几个表观遗传特征,并使用HiChIP来识别激活和抑制基因组相互作用。利用机器学习,我们将数据整合到7种染色质状态中,预测了约77,000个CRE候选者,总共代表了大麦基因组的1.43%。已确定的基因组相互作用,通常跨越多个基因,将数千个预测的cre与其假定的靶标联系起来,并揭示了显著频繁的启动子-启动子接触。以LEA基因家族为例,我们讨论了这些相互作用在转录调控中的可能作用。在Vrn3基因上,我们证明了我们的数据集在预测其他发育阶段的cre方面的潜力。
{"title":"Epigenome and interactome profiling uncovers principles of distal regulation in the barley genome.","authors":"Pavla Navratilova, Simon Pavlu, Zihao Zhu, Zuzana Tulpova, Ondrej Kopecky, Petr Novak, Nils Stein, Hana Simkova","doi":"10.1016/j.xgen.2025.101037","DOIUrl":"10.1016/j.xgen.2025.101037","url":null,"abstract":"<p><p>Regulation of transcription initiation is the ground level of modulating gene expression during plant development. This process relies on interactions between transcription factors and cis-regulatory elements (CREs), which become promising targets for crop bioengineering. To annotate CREs in the barley genome and understand mechanisms of distal regulation, we profiled several epigenetic features across three stages of barley embryo and leaves and performed HiChIP to identify activating and repressive genomic interactions. Using machine learning, we integrated the data into seven chromatin states, predicting ∼77,000 CRE candidates, collectively representing 1.43% of the barley genome. Identified genomic interactions, often spanning multiple genes, linked thousands of predicted CREs with their putative targets and revealed notably frequent promoter-promoter contacts. Using the LEA gene family as an example, we discuss possible roles of these interactions in transcription regulation. On the Vrn3 gene, we demonstrate the potential of our datasets to predict CREs for other developmental stages.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"101037"},"PeriodicalIF":11.1,"publicationDate":"2026-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145276854","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Cell genomics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1