首页 > 最新文献

Genome Biology最新文献

英文 中文
DNA methylation patterns of transcription factor binding regions characterize their functional and evolutionary contexts 转录因子结合区的 DNA 甲基化模式揭示其功能和进化背景
IF 12.3 1区 生物学 Q1 Agricultural and Biological Sciences Pub Date : 2024-06-06 DOI: 10.1186/s13059-024-03218-6
Martina Rimoldi, Ning Wang, Jilin Zhang, Diego Villar, Duncan T. Odom, Jussi Taipale, Paul Flicek, Maša Roller
DNA methylation is an important epigenetic modification which has numerous roles in modulating genome function. Its levels are spatially correlated across the genome, typically high in repressed regions but low in transcription factor (TF) binding sites and active regulatory regions. However, the mechanisms establishing genome-wide and TF binding site methylation patterns are still unclear. Here we use a comparative approach to investigate the association of DNA methylation to TF binding evolution in mammals. Specifically, we experimentally profile DNA methylation and combine this with published occupancy profiles of five distinct TFs (CTCF, CEBPA, HNF4A, ONECUT1, FOXA1) in the liver of five mammalian species (human, macaque, mouse, rat, dog). TF binding sites are lowly methylated, but they often also have intermediate methylation levels. Furthermore, biding sites are influenced by the methylation status of CpGs in their wider binding regions even when CpGs are absent from the core binding motif. Employing a classification and clustering approach, we extract distinct and species-conserved patterns of DNA methylation levels at TF binding regions. CEBPA, HNF4A, ONECUT1, and FOXA1 share the same methylation patterns, while CTCF's differ. These patterns characterize alternative functions and chromatin landscapes of TF-bound regions. Leveraging our phylogenetic framework, we find DNA methylation gain upon evolutionary loss of TF occupancy, indicating coordinated evolution. Furthermore, each methylation pattern has its own evolutionary trajectory reflecting its genomic contexts. Our epigenomic analyses indicate a role for DNA methylation in TF binding changes across species including that specific DNA methylation profiles characterize TF binding and are associated with their regulatory activity, chromatin contexts, and evolutionary trajectories.
DNA 甲基化是一种重要的表观遗传修饰,在调节基因组功能方面具有多种作用。其水平在整个基因组中具有空间相关性,通常在受抑制区域较高,而在转录因子(TF)结合位点和活性调控区域较低。然而,建立全基因组和 TF 结合位点甲基化模式的机制仍不清楚。在这里,我们使用一种比较方法来研究哺乳动物 DNA 甲基化与 TF 结合进化的关联。具体来说,我们通过实验分析了 DNA 甲基化情况,并将其与已发表的五种不同 TF(CTCF、CEBPA、HNF4A、ONECUT1、FOXA1)在五种哺乳动物(人、猕猴、小鼠、大鼠、狗)肝脏中的占据情况相结合。TF 结合位点的甲基化水平较低,但也常常具有中等甲基化水平。此外,即使核心结合图案中没有 CpGs,结合位点也会受到其更宽结合区域中 CpGs 甲基化状态的影响。通过分类和聚类方法,我们提取了 TF 结合区 DNA 甲基化水平的不同物种保守模式。CEBPA、HNF4A、ONECUT1 和 FOXA1 具有相同的甲基化模式,而 CTCF 的甲基化模式则有所不同。这些模式表征了 TF 结合区域的不同功能和染色质景观。利用我们的系统发育框架,我们发现在进化过程中 TF 占位丧失后 DNA 甲基化增殖,这表明进化是协调进行的。此外,每种甲基化模式都有自己的进化轨迹,反映了其基因组背景。我们的表观基因组分析表明了 DNA 甲基化在跨物种 TF 结合变化中的作用,包括特定的 DNA 甲基化图谱描述了 TF 结合的特征,并与其调控活性、染色质环境和进化轨迹相关联。
{"title":"DNA methylation patterns of transcription factor binding regions characterize their functional and evolutionary contexts","authors":"Martina Rimoldi, Ning Wang, Jilin Zhang, Diego Villar, Duncan T. Odom, Jussi Taipale, Paul Flicek, Maša Roller","doi":"10.1186/s13059-024-03218-6","DOIUrl":"https://doi.org/10.1186/s13059-024-03218-6","url":null,"abstract":"DNA methylation is an important epigenetic modification which has numerous roles in modulating genome function. Its levels are spatially correlated across the genome, typically high in repressed regions but low in transcription factor (TF) binding sites and active regulatory regions. However, the mechanisms establishing genome-wide and TF binding site methylation patterns are still unclear. Here we use a comparative approach to investigate the association of DNA methylation to TF binding evolution in mammals. Specifically, we experimentally profile DNA methylation and combine this with published occupancy profiles of five distinct TFs (CTCF, CEBPA, HNF4A, ONECUT1, FOXA1) in the liver of five mammalian species (human, macaque, mouse, rat, dog). TF binding sites are lowly methylated, but they often also have intermediate methylation levels. Furthermore, biding sites are influenced by the methylation status of CpGs in their wider binding regions even when CpGs are absent from the core binding motif. Employing a classification and clustering approach, we extract distinct and species-conserved patterns of DNA methylation levels at TF binding regions. CEBPA, HNF4A, ONECUT1, and FOXA1 share the same methylation patterns, while CTCF's differ. These patterns characterize alternative functions and chromatin landscapes of TF-bound regions. Leveraging our phylogenetic framework, we find DNA methylation gain upon evolutionary loss of TF occupancy, indicating coordinated evolution. Furthermore, each methylation pattern has its own evolutionary trajectory reflecting its genomic contexts. Our epigenomic analyses indicate a role for DNA methylation in TF binding changes across species including that specific DNA methylation profiles characterize TF binding and are associated with their regulatory activity, chromatin contexts, and evolutionary trajectories.","PeriodicalId":12611,"journal":{"name":"Genome Biology","volume":null,"pages":null},"PeriodicalIF":12.3,"publicationDate":"2024-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141264912","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
TMO-Net: an explainable pretrained multi-omics model for multi-task learning in oncology TMO-Net:用于肿瘤学多任务学习的可解释预训练多组学模型
IF 12.3 1区 生物学 Q1 Agricultural and Biological Sciences Pub Date : 2024-06-06 DOI: 10.1186/s13059-024-03293-9
Feng-ao Wang, Zhenfeng Zhuang, Feng Gao, Ruikun He, Shaoting Zhang, Liansheng Wang, Junwei Liu, Yixue Li
Cancer is a complex disease composing systemic alterations in multiple scales. In this study, we develop the Tumor Multi-Omics pre-trained Network (TMO-Net) that integrates multi-omics pan-cancer datasets for model pre-training, facilitating cross-omics interactions and enabling joint representation learning and incomplete omics inference. This model enhances multi-omics sample representation and empowers various downstream oncology tasks with incomplete multi-omics datasets. By employing interpretable learning, we characterize the contributions of distinct omics features to clinical outcomes. The TMO-Net model serves as a versatile framework for cross-modal multi-omics learning in oncology, paving the way for tumor omics-specific foundation models.
癌症是一种复杂的疾病,由多个尺度的系统性改变组成。在本研究中,我们开发了肿瘤多组学预训练网络(TMO-Net),该网络整合了多组学泛癌症数据集进行模型预训练,促进了跨组学交互,实现了联合表征学习和不完整组学推断。该模型增强了多组学样本的表征能力,并为不完整多组学数据集的各种下游肿瘤学任务提供了支持。通过采用可解释学习,我们确定了不同组学特征对临床结果的贡献。TMO-Net 模型是肿瘤学跨模态多组学学习的通用框架,为肿瘤组学特异性基础模型铺平了道路。
{"title":"TMO-Net: an explainable pretrained multi-omics model for multi-task learning in oncology","authors":"Feng-ao Wang, Zhenfeng Zhuang, Feng Gao, Ruikun He, Shaoting Zhang, Liansheng Wang, Junwei Liu, Yixue Li","doi":"10.1186/s13059-024-03293-9","DOIUrl":"https://doi.org/10.1186/s13059-024-03293-9","url":null,"abstract":"Cancer is a complex disease composing systemic alterations in multiple scales. In this study, we develop the Tumor Multi-Omics pre-trained Network (TMO-Net) that integrates multi-omics pan-cancer datasets for model pre-training, facilitating cross-omics interactions and enabling joint representation learning and incomplete omics inference. This model enhances multi-omics sample representation and empowers various downstream oncology tasks with incomplete multi-omics datasets. By employing interpretable learning, we characterize the contributions of distinct omics features to clinical outcomes. The TMO-Net model serves as a versatile framework for cross-modal multi-omics learning in oncology, paving the way for tumor omics-specific foundation models.","PeriodicalId":12611,"journal":{"name":"Genome Biology","volume":null,"pages":null},"PeriodicalIF":12.3,"publicationDate":"2024-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141264960","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Systematic evaluation with practical guidelines for single-cell and spatially resolved transcriptomics data simulation under multiple scenarios 系统评估单细胞和空间分辨转录组学数据在多种情况下的模拟,并提供实用指南
IF 12.3 1区 生物学 Q1 Agricultural and Biological Sciences Pub Date : 2024-06-03 DOI: 10.1186/s13059-024-03290-y
Hongrui Duo, Yinghong Li, Yang Lan, Jingxin Tao, Qingxia Yang, Yingxue Xiao, Jing Sun, Lei Li, Xiner Nie, Xiaoxi Zhang, Guizhao Liang, Mingwei Liu, Youjin Hao, Bo Li
Single-cell RNA sequencing (scRNA-seq) and spatially resolved transcriptomics (SRT) have led to groundbreaking advancements in life sciences. To develop bioinformatics tools for scRNA-seq and SRT data and perform unbiased benchmarks, data simulation has been widely adopted by providing explicit ground truth and generating customized datasets. However, the performance of simulation methods under multiple scenarios has not been comprehensively assessed, making it challenging to choose suitable methods without practical guidelines. We systematically evaluated 49 simulation methods developed for scRNA-seq and/or SRT data in terms of accuracy, functionality, scalability, and usability using 152 reference datasets derived from 24 platforms. SRTsim, scDesign3, ZINB-WaVE, and scDesign2 have the best accuracy performance across various platforms. Unexpectedly, some methods tailored to scRNA-seq data have potential compatibility for simulating SRT data. Lun, SPARSim, and scDesign3-tree outperform other methods under corresponding simulation scenarios. Phenopath, Lun, Simple, and MFA yield high scalability scores but they cannot generate realistic simulated data. Users should consider the trade-offs between method accuracy and scalability (or functionality) when making decisions. Additionally, execution errors are mainly caused by failed parameter estimations and appearance of missing or infinite values in calculations. We provide practical guidelines for method selection, a standard pipeline Simpipe ( https://github.com/duohongrui/simpipe ; https://doi.org/10.5281/zenodo.11178409 ), and an online tool Simsite ( https://www.ciblab.net/software/simshiny/ ) for data simulation. No method performs best on all criteria, thus a good-yet-not-the-best method is recommended if it solves problems effectively and reasonably. Our comprehensive work provides crucial insights for developers on modeling gene expression data and fosters the simulation process for users.
单细胞 RNA 测序(scRNA-seq)和空间解析转录组学(SRT)在生命科学领域取得了突破性进展。为了开发用于 scRNA-seq 和 SRT 数据的生物信息学工具并执行无偏基准,数据模拟已被广泛采用,提供明确的基本事实并生成定制数据集。然而,模拟方法在多种情况下的性能尚未得到全面评估,因此在没有实用指南的情况下选择合适的方法具有挑战性。我们利用来自 24 个平台的 152 个参考数据集,从准确性、功能性、可扩展性和可用性等方面系统地评估了针对 scRNA-seq 和/或 SRT 数据开发的 49 种模拟方法。在各种平台上,SRTsim、scDesign3、ZINB-WaVE 和 scDesign2 的准确性表现最好。出乎意料的是,一些针对 scRNA-seq 数据定制的方法具有模拟 SRT 数据的潜在兼容性。在相应的模拟场景下,Lun、SPARSim 和 scDesign3-tree 优于其他方法。Phenopath、Lun、Simple 和 MFA 的可扩展性得分较高,但它们无法生成真实的模拟数据。用户在做决定时应考虑方法的准确性和可扩展性(或功能性)之间的权衡。此外,执行错误主要是由参数估计失败和计算中出现缺失值或无限值造成的。我们提供了实用的方法选择指南、标准管道 Simpipe ( https://github.com/duohongrui/simpipe ; https://doi.org/10.5281/zenodo.11178409 ) 和用于数据模拟的在线工具 Simsite ( https://www.ciblab.net/software/simshiny/ )。没有哪种方法在所有标准上都能达到最佳效果,因此,如果某种方法能有效、合理地解决问题,那么我们就会推荐使用这种 "虽好但非最佳 "的方法。我们的综合工作为基因表达数据建模开发人员提供了重要的见解,并促进了用户的模拟过程。
{"title":"Systematic evaluation with practical guidelines for single-cell and spatially resolved transcriptomics data simulation under multiple scenarios","authors":"Hongrui Duo, Yinghong Li, Yang Lan, Jingxin Tao, Qingxia Yang, Yingxue Xiao, Jing Sun, Lei Li, Xiner Nie, Xiaoxi Zhang, Guizhao Liang, Mingwei Liu, Youjin Hao, Bo Li","doi":"10.1186/s13059-024-03290-y","DOIUrl":"https://doi.org/10.1186/s13059-024-03290-y","url":null,"abstract":"Single-cell RNA sequencing (scRNA-seq) and spatially resolved transcriptomics (SRT) have led to groundbreaking advancements in life sciences. To develop bioinformatics tools for scRNA-seq and SRT data and perform unbiased benchmarks, data simulation has been widely adopted by providing explicit ground truth and generating customized datasets. However, the performance of simulation methods under multiple scenarios has not been comprehensively assessed, making it challenging to choose suitable methods without practical guidelines. We systematically evaluated 49 simulation methods developed for scRNA-seq and/or SRT data in terms of accuracy, functionality, scalability, and usability using 152 reference datasets derived from 24 platforms. SRTsim, scDesign3, ZINB-WaVE, and scDesign2 have the best accuracy performance across various platforms. Unexpectedly, some methods tailored to scRNA-seq data have potential compatibility for simulating SRT data. Lun, SPARSim, and scDesign3-tree outperform other methods under corresponding simulation scenarios. Phenopath, Lun, Simple, and MFA yield high scalability scores but they cannot generate realistic simulated data. Users should consider the trade-offs between method accuracy and scalability (or functionality) when making decisions. Additionally, execution errors are mainly caused by failed parameter estimations and appearance of missing or infinite values in calculations. We provide practical guidelines for method selection, a standard pipeline Simpipe ( https://github.com/duohongrui/simpipe ; https://doi.org/10.5281/zenodo.11178409 ), and an online tool Simsite ( https://www.ciblab.net/software/simshiny/ ) for data simulation. No method performs best on all criteria, thus a good-yet-not-the-best method is recommended if it solves problems effectively and reasonably. Our comprehensive work provides crucial insights for developers on modeling gene expression data and fosters the simulation process for users.","PeriodicalId":12611,"journal":{"name":"Genome Biology","volume":null,"pages":null},"PeriodicalIF":12.3,"publicationDate":"2024-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141235891","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Predicting gene expression state and prioritizing putative enhancers using 5hmC signal. 利用 5hmC 信号预测基因表达状态并优先选择推定增强子
IF 10.1 1区 生物学 Q1 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Pub Date : 2024-06-03 DOI: 10.1186/s13059-024-03273-z
Edahi Gonzalez-Avalos, Atsushi Onodera, Daniela Samaniego-Castruita, Anjana Rao, Ferhat Ay

Background: Like its parent base 5-methylcytosine (5mC), 5-hydroxymethylcytosine (5hmC) is a direct epigenetic modification of cytosines in the context of CpG dinucleotides. 5hmC is the most abundant oxidized form of 5mC, generated through the action of TET dioxygenases at gene bodies of actively-transcribed genes and at active or lineage-specific enhancers. Although such enrichments are reported for 5hmC, to date, predictive models of gene expression state or putative regulatory regions for genes using 5hmC have not been developed.

Results: Here, by using only 5hmC enrichment in genic regions and their vicinity, we develop neural network models that predict gene expression state across 49 cell types. We show that our deep neural network models distinguish high vs low expression state utilizing only 5hmC levels and these predictive models generalize to unseen cell types. Further, in order to leverage 5hmC signal in distal enhancers for expression prediction, we employ an Activity-by-Contact model and also develop a graph convolutional neural network model with both utilizing Hi-C data and 5hmC enrichment to prioritize enhancer-promoter links. These approaches identify known and novel putative enhancers for key genes in multiple immune cell subsets.

Conclusions: Our work highlights the importance of 5hmC in gene regulation through proximal and distal mechanisms and provides a framework to link it to genome function. With the recent advances in 6-letter DNA sequencing by short and long-read techniques, profiling of 5mC and 5hmC may be done routinely in the near future, hence, providing a broad range of applications for the methods developed here.

背景:与母基 5-甲基胞嘧啶(5mC)一样,5-羟甲基胞嘧啶(5hmC)也是 CpG 二核苷酸中胞嘧啶的一种直接表观遗传修饰。5hmC 是 5mC 最丰富的氧化形式,通过 TET 二氧酶的作用在活跃转录基因的基因体和活跃的或特定谱系的增强子上生成。尽管有报告称 5hmC 具有这种富集作用,但迄今为止,尚未开发出使用 5hmC 预测基因表达状态或基因假定调控区域的模型:结果:在此,我们仅利用基因区域及其附近的 5hmC 富集,开发了神经网络模型,可预测 49 种细胞类型的基因表达状态。我们的研究表明,我们的深度神经网络模型仅利用 5hmC 水平就能区分高表达状态和低表达状态,而且这些预测模型还能推广到未见过的细胞类型。此外,为了利用远端增强子中的 5hmC 信号进行表达预测,我们采用了 "按接触活动"(Activity-by-Contact)模型,还开发了图卷积神经网络模型,利用 Hi-C 数据和 5hmC 富集来优先考虑增强子-启动子链接。这些方法为多个免疫细胞亚群中的关键基因确定了已知的和新的推定增强子:我们的工作强调了 5hmC 通过近端和远端机制在基因调控中的重要性,并提供了将其与基因组功能联系起来的框架。随着近来短读和长读技术在 6 字母 DNA 测序方面的进步,5mC 和 5hmC 的分析在不久的将来可能会成为常规方法,从而为本文开发的方法提供了广泛的应用前景。
{"title":"Predicting gene expression state and prioritizing putative enhancers using 5hmC signal.","authors":"Edahi Gonzalez-Avalos, Atsushi Onodera, Daniela Samaniego-Castruita, Anjana Rao, Ferhat Ay","doi":"10.1186/s13059-024-03273-z","DOIUrl":"10.1186/s13059-024-03273-z","url":null,"abstract":"<p><strong>Background: </strong>Like its parent base 5-methylcytosine (5mC), 5-hydroxymethylcytosine (5hmC) is a direct epigenetic modification of cytosines in the context of CpG dinucleotides. 5hmC is the most abundant oxidized form of 5mC, generated through the action of TET dioxygenases at gene bodies of actively-transcribed genes and at active or lineage-specific enhancers. Although such enrichments are reported for 5hmC, to date, predictive models of gene expression state or putative regulatory regions for genes using 5hmC have not been developed.</p><p><strong>Results: </strong>Here, by using only 5hmC enrichment in genic regions and their vicinity, we develop neural network models that predict gene expression state across 49 cell types. We show that our deep neural network models distinguish high vs low expression state utilizing only 5hmC levels and these predictive models generalize to unseen cell types. Further, in order to leverage 5hmC signal in distal enhancers for expression prediction, we employ an Activity-by-Contact model and also develop a graph convolutional neural network model with both utilizing Hi-C data and 5hmC enrichment to prioritize enhancer-promoter links. These approaches identify known and novel putative enhancers for key genes in multiple immune cell subsets.</p><p><strong>Conclusions: </strong>Our work highlights the importance of 5hmC in gene regulation through proximal and distal mechanisms and provides a framework to link it to genome function. With the recent advances in 6-letter DNA sequencing by short and long-read techniques, profiling of 5mC and 5hmC may be done routinely in the near future, hence, providing a broad range of applications for the methods developed here.</p>","PeriodicalId":12611,"journal":{"name":"Genome Biology","volume":null,"pages":null},"PeriodicalIF":10.1,"publicationDate":"2024-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11145787/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141199651","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sequential drug treatment targeting cell cycle and cell fate regulatory programs blocks non-genetic cancer evolution in acute lymphoblastic leukemia 针对细胞周期和细胞命运调控程序的序贯药物治疗可阻止急性淋巴细胞白血病的非遗传性癌症进化
IF 12.3 1区 生物学 Q1 Agricultural and Biological Sciences Pub Date : 2024-05-31 DOI: 10.1186/s13059-024-03260-4
Alena Malyukova, Mari Lahnalampi, Ton Falqués-Costa, Petri Pölönen, Mikko Sipola, Juha Mehtonen, Susanna Teppo, Karen Akopyan, Johanna Viiliainen, Olli Lohi, Anna K. Hagström-Andersson, Merja Heinäniemi, Olle Sangfelt
Targeted therapies exploiting vulnerabilities of cancer cells hold promise for improving patient outcome and reducing side-effects of chemotherapy. However, efficacy of precision therapies is limited in part because of tumor cell heterogeneity. A better mechanistic understanding of how drug effect is linked to cancer cell state diversity is crucial for identifying effective combination therapies that can prevent disease recurrence. Here, we characterize the effect of G2/M checkpoint inhibition in acute lymphoblastic leukemia (ALL) and demonstrate that WEE1 targeted therapy impinges on cell fate decision regulatory circuits. We find the highest inhibition of recovery of proliferation in ALL cells with KMT2A-rearrangements. Single-cell RNA-seq and ATAC-seq of RS4;11 cells harboring KMT2A::AFF1, treated with the WEE1 inhibitor AZD1775, reveal diversification of cell states, with a fraction of cells exhibiting strong activation of p53-driven processes linked to apoptosis and senescence, and disruption of a core KMT2A-RUNX1-MYC regulatory network. In this cell state diversification induced by WEE1 inhibition, a subpopulation transitions to a drug tolerant cell state characterized by activation of transcription factors regulating pre-B cell fate, lipid metabolism, and pre-BCR signaling in a reversible manner. Sequential treatment with BCR-signaling inhibitors dasatinib, ibrutinib, or perturbing metabolism by fatostatin or AZD2014 effectively counteracts drug tolerance by inducing cell death and repressing stemness markers. Collectively, our findings provide new insights into the tight connectivity of gene regulatory programs associated with cell cycle and cell fate regulation, and a rationale for sequential administration of WEE1 inhibitors with low toxicity inhibitors of pre-BCR signaling or metabolism.
利用癌细胞弱点的靶向疗法有望改善患者的预后并减少化疗的副作用。然而,精准疗法的疗效有限,部分原因在于肿瘤细胞的异质性。更好地从机理上理解药物作用如何与癌细胞状态多样性相关联,对于确定能防止疾病复发的有效组合疗法至关重要。在这里,我们描述了急性淋巴细胞白血病(ALL)中 G2/M 检查点抑制的效果,并证明 WEE1 靶向疗法会影响细胞命运决定调控回路。我们发现KMT2A重排的ALL细胞恢复增殖的抑制率最高。用WEE1抑制剂AZD1775治疗携带KMT2A::AFF1的RS4;11细胞的单细胞RNA-seq和ATAC-seq显示了细胞状态的多样化,一部分细胞表现出与凋亡和衰老相关的p53驱动过程的强激活,以及核心KMT2A-RUNX1-MYC调控网络的破坏。在这种由 WEE1 抑制诱导的细胞状态多样化中,一个亚群过渡到耐药细胞状态,其特点是以可逆方式激活调节前 B 细胞命运、脂质代谢和前 BCR 信号转导的转录因子。用BCR信号抑制剂达沙替尼、伊布替尼,或用法托司汀或AZD2014扰乱新陈代谢,通过诱导细胞死亡和抑制干性标志物,有效抵消药物耐受性。总之,我们的研究结果为了解与细胞周期和细胞命运调控相关的基因调控程序的紧密联系提供了新的视角,也为WEE1抑制剂与低毒性前BCR信号转导抑制剂或新陈代谢抑制剂的序贯用药提供了理论依据。
{"title":"Sequential drug treatment targeting cell cycle and cell fate regulatory programs blocks non-genetic cancer evolution in acute lymphoblastic leukemia","authors":"Alena Malyukova, Mari Lahnalampi, Ton Falqués-Costa, Petri Pölönen, Mikko Sipola, Juha Mehtonen, Susanna Teppo, Karen Akopyan, Johanna Viiliainen, Olli Lohi, Anna K. Hagström-Andersson, Merja Heinäniemi, Olle Sangfelt","doi":"10.1186/s13059-024-03260-4","DOIUrl":"https://doi.org/10.1186/s13059-024-03260-4","url":null,"abstract":"Targeted therapies exploiting vulnerabilities of cancer cells hold promise for improving patient outcome and reducing side-effects of chemotherapy. However, efficacy of precision therapies is limited in part because of tumor cell heterogeneity. A better mechanistic understanding of how drug effect is linked to cancer cell state diversity is crucial for identifying effective combination therapies that can prevent disease recurrence. Here, we characterize the effect of G2/M checkpoint inhibition in acute lymphoblastic leukemia (ALL) and demonstrate that WEE1 targeted therapy impinges on cell fate decision regulatory circuits. We find the highest inhibition of recovery of proliferation in ALL cells with KMT2A-rearrangements. Single-cell RNA-seq and ATAC-seq of RS4;11 cells harboring KMT2A::AFF1, treated with the WEE1 inhibitor AZD1775, reveal diversification of cell states, with a fraction of cells exhibiting strong activation of p53-driven processes linked to apoptosis and senescence, and disruption of a core KMT2A-RUNX1-MYC regulatory network. In this cell state diversification induced by WEE1 inhibition, a subpopulation transitions to a drug tolerant cell state characterized by activation of transcription factors regulating pre-B cell fate, lipid metabolism, and pre-BCR signaling in a reversible manner. Sequential treatment with BCR-signaling inhibitors dasatinib, ibrutinib, or perturbing metabolism by fatostatin or AZD2014 effectively counteracts drug tolerance by inducing cell death and repressing stemness markers. Collectively, our findings provide new insights into the tight connectivity of gene regulatory programs associated with cell cycle and cell fate regulation, and a rationale for sequential administration of WEE1 inhibitors with low toxicity inhibitors of pre-BCR signaling or metabolism.","PeriodicalId":12611,"journal":{"name":"Genome Biology","volume":null,"pages":null},"PeriodicalIF":12.3,"publicationDate":"2024-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141182456","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
X-chromosome inactivation in human iPSCs provides insight into X-regulated gene expression in autosomes 人类 iPSC 中 X 染色体的失活有助于深入了解常染色体中 X 调控基因的表达情况
IF 12.3 1区 生物学 Q1 Agricultural and Biological Sciences Pub Date : 2024-05-31 DOI: 10.1186/s13059-024-03286-8
Hande Topa, Clara Benoit-Pilven, Taru Tukiainen, Olli Pietiläinen
Variation in X chromosome inactivation (XCI) in human-induced pluripotent stem cells (hiPSCs) can impact their ability to model biological sex biases. The gene-wise landscape of X chromosome gene dosage remains unresolved in female hiPSCs. To characterize patterns of de-repression and escape from inactivation, we performed a systematic survey of allele specific expression in 165 female hiPSC lines. XCI erosion is non-random and primarily affects genes that escape XCI in human tissues. Individual genes and cell lines vary in the frequency and degree of de-repression. Bi-allelic expression increases gradually after modest decrease of XIST in cultures, whose loss is commonly used to mark lines with eroded XCI. We identify three clusters of female lines at different stages of XCI. Increased XCI erosion amplifies female-biased expression at hypomethylated sites and regions normally occupied by repressive histone marks, lowering male-biased differences in the X chromosome. In autosomes, erosion modifies sex differences in a dose-dependent way. Male-biased genes are enriched for hypermethylated regions, and de-repression of XIST-bound autosomal genes in female lines attenuates normal male-biased gene expression in eroded lines. XCI erosion can compensate for a dominant loss of function effect in several disease genes. We present a comprehensive view of X chromosome gene dosage in hiPSCs and implicate a direct mechanism for XCI erosion in regulating autosomal gene expression in trans. The uncommon and variable reactivation of X chromosome genes in female hiPSCs can provide insight into X chromosome’s role in regulating gene expression and sex differences in humans.
人类诱导多能干细胞(hiPSCs)中X染色体失活(XCI)的变异会影响其模拟生物性别偏差的能力。在雌性 hiPSCs 中,X 染色体基因剂量的基因全貌仍悬而未决。为了描述去抑制和摆脱失活的模式,我们对 165 个雌性 hiPSC 株系中的等位基因特异性表达进行了系统调查。XCI 侵蚀是非随机的,主要影响人类组织中逃避 XCI 的基因。单个基因和细胞系的去抑制频率和程度各不相同。在培养物中 XIST 适度减少后,双等位基因表达逐渐增加,而 XIST 的丢失通常被用来标记 XCI 侵蚀的品系。我们发现了处于 XCI 不同阶段的三个雌性品系群。XCI 侵蚀的加剧放大了低甲基化位点和通常被抑制性组蛋白标记占据区域的雌性偏向表达,降低了 X 染色体中的雄性偏向差异。在常染色体中,侵蚀以剂量依赖的方式改变性别差异。具有雄性偏向的基因富集于高甲基化区域,雌性品系中 XIST 结合的常染色体基因的去抑制作用削弱了侵蚀品系中正常的雄性偏向基因表达。XCI 侵蚀可补偿几个疾病基因的显性功能缺失效应。我们对 hiPSCs 中的 X 染色体基因剂量进行了全面分析,并提出了 XCI 侵蚀在反式调节常染色体基因表达中的直接机制。X 染色体基因在雌性 hiPSCs 中不常见且可变的重新激活,这有助于深入了解 X 染色体在调节基因表达和人类性别差异中的作用。
{"title":"X-chromosome inactivation in human iPSCs provides insight into X-regulated gene expression in autosomes","authors":"Hande Topa, Clara Benoit-Pilven, Taru Tukiainen, Olli Pietiläinen","doi":"10.1186/s13059-024-03286-8","DOIUrl":"https://doi.org/10.1186/s13059-024-03286-8","url":null,"abstract":"Variation in X chromosome inactivation (XCI) in human-induced pluripotent stem cells (hiPSCs) can impact their ability to model biological sex biases. The gene-wise landscape of X chromosome gene dosage remains unresolved in female hiPSCs. To characterize patterns of de-repression and escape from inactivation, we performed a systematic survey of allele specific expression in 165 female hiPSC lines. XCI erosion is non-random and primarily affects genes that escape XCI in human tissues. Individual genes and cell lines vary in the frequency and degree of de-repression. Bi-allelic expression increases gradually after modest decrease of XIST in cultures, whose loss is commonly used to mark lines with eroded XCI. We identify three clusters of female lines at different stages of XCI. Increased XCI erosion amplifies female-biased expression at hypomethylated sites and regions normally occupied by repressive histone marks, lowering male-biased differences in the X chromosome. In autosomes, erosion modifies sex differences in a dose-dependent way. Male-biased genes are enriched for hypermethylated regions, and de-repression of XIST-bound autosomal genes in female lines attenuates normal male-biased gene expression in eroded lines. XCI erosion can compensate for a dominant loss of function effect in several disease genes. We present a comprehensive view of X chromosome gene dosage in hiPSCs and implicate a direct mechanism for XCI erosion in regulating autosomal gene expression in trans. The uncommon and variable reactivation of X chromosome genes in female hiPSCs can provide insight into X chromosome’s role in regulating gene expression and sex differences in humans.","PeriodicalId":12611,"journal":{"name":"Genome Biology","volume":null,"pages":null},"PeriodicalIF":12.3,"publicationDate":"2024-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141182438","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Reproductive isolation arises during laboratory adaptation to a novel hot environment. 在实验室适应新的高温环境的过程中出现了生殖隔离。
IF 10.1 1区 生物学 Q1 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Pub Date : 2024-05-28 DOI: 10.1186/s13059-024-03285-9
Sheng-Kai Hsu, Wei-Yun Lai, Johannes Novak, Felix Lehner, Ana Marija Jakšić, Elisabetta Versace, Christian Schlötterer

Background: Reproductive isolation can result from adaptive processes (e.g., ecological speciation and mutation-order speciation) or stochastic processes such as "system drift" model. Ecological speciation predicts barriers to gene flow between populations from different environments, but not among replicate populations from the same environment. In contrast, reproductive isolation among populations independently adapted to the same/similar environment can arise from both mutation-order speciation or system drift.

Results: In experimentally evolved populations adapting to a hot environment for over 100 generations, we find evidence for pre- and postmating reproductive isolation. On one hand, an altered lipid metabolism and cuticular hydrocarbon composition pointed to possible premating barriers between the ancestral and replicate evolved populations. On the other hand, the pronounced gene expression differences in male reproductive genes may underlie the postmating isolation among replicate evolved populations adapting to the same environment with the same standing genetic variation.

Conclusion: Our study confirms that replicated evolution experiments provide valuable insights into the mechanisms of speciation. The rapid emergence of the premating reproductive isolation during temperature adaptation showcases incipient ecological speciation. The potential evidence of postmating reproductive isolation among replicates gave rise to two hypotheses: (1) mutation-order speciation through a common selection on early fecundity leading to an inherent inter-locus sexual conflict; (2) system drift with genetic drift along the neutral ridges.

背景:生殖隔离可由适应过程(如生态分化和变异阶梯分化)或随机过程(如 "系统漂移 "模型)引起。生态种群分化预示着来自不同环境的种群之间存在基因流动障碍,但来自同一环境的复制种群之间却不存在基因流动障碍。与此相反,独立适应相同/相似环境的种群之间的生殖隔离可能来自突变阶梯式的物种分化或系统漂移:结果:在适应炎热环境超过 100 代的实验进化种群中,我们发现了交配前和交配后生殖隔离的证据。一方面,脂质代谢和角质烃组成的改变表明,祖先种群和复制进化种群之间可能存在交配前障碍。另一方面,雄性生殖基因在基因表达上的明显差异,可能是适应相同环境、具有相同遗传变异的复制进化种群在交配后出现隔离的原因:我们的研究证实,复制进化实验为物种演化机制提供了宝贵的见解。在温度适应过程中迅速出现的交配前生殖隔离显示了生态物种分化的萌芽。复制样本间交配后生殖隔离的潜在证据提出了两个假设:(1)通过对早期繁殖力的共同选择导致病灶间固有的性冲突,从而产生突变阶梯式物种分化;(2)系统漂移与沿中性脊的遗传漂移。
{"title":"Reproductive isolation arises during laboratory adaptation to a novel hot environment.","authors":"Sheng-Kai Hsu, Wei-Yun Lai, Johannes Novak, Felix Lehner, Ana Marija Jakšić, Elisabetta Versace, Christian Schlötterer","doi":"10.1186/s13059-024-03285-9","DOIUrl":"10.1186/s13059-024-03285-9","url":null,"abstract":"<p><strong>Background: </strong>Reproductive isolation can result from adaptive processes (e.g., ecological speciation and mutation-order speciation) or stochastic processes such as \"system drift\" model. Ecological speciation predicts barriers to gene flow between populations from different environments, but not among replicate populations from the same environment. In contrast, reproductive isolation among populations independently adapted to the same/similar environment can arise from both mutation-order speciation or system drift.</p><p><strong>Results: </strong>In experimentally evolved populations adapting to a hot environment for over 100 generations, we find evidence for pre- and postmating reproductive isolation. On one hand, an altered lipid metabolism and cuticular hydrocarbon composition pointed to possible premating barriers between the ancestral and replicate evolved populations. On the other hand, the pronounced gene expression differences in male reproductive genes may underlie the postmating isolation among replicate evolved populations adapting to the same environment with the same standing genetic variation.</p><p><strong>Conclusion: </strong>Our study confirms that replicated evolution experiments provide valuable insights into the mechanisms of speciation. The rapid emergence of the premating reproductive isolation during temperature adaptation showcases incipient ecological speciation. The potential evidence of postmating reproductive isolation among replicates gave rise to two hypotheses: (1) mutation-order speciation through a common selection on early fecundity leading to an inherent inter-locus sexual conflict; (2) system drift with genetic drift along the neutral ridges.</p>","PeriodicalId":12611,"journal":{"name":"Genome Biology","volume":null,"pages":null},"PeriodicalIF":10.1,"publicationDate":"2024-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11134630/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141161773","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ePRINT: exonuclease assisted mapping of protein-RNA interactions. ePRINT:外切酶辅助绘制蛋白质-RNA相互作用图。
IF 10.1 1区 生物学 Q1 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Pub Date : 2024-05-28 DOI: 10.1186/s13059-024-03271-1
Sophie Hawkins, Alexandre Mondaini, Seema C Namboori, Grady G Nguyen, Gene W Yeo, Asif Javed, Akshay Bhinge

RNA-binding proteins (RBPs) regulate key aspects of RNA processing including alternative splicing, mRNA degradation and localization by physically binding RNA molecules. Current methods to map these interactions, such as CLIP, rely on purifying single proteins at a time. Our new method, ePRINT, maps RBP-RNA interaction networks on a global scale without purifying individual RBPs. ePRINT uses exoribonuclease XRN1 to precisely map the 5' end of the RBP binding site and uncovers direct and indirect targets of an RBP of interest. Importantly, ePRINT can also uncover RBPs that are differentially activated between cell fate transitions, including neural progenitor differentiation into neurons.

RNA 结合蛋白(RBPs)通过物理结合 RNA 分子来调节 RNA 处理的关键环节,包括替代剪接、mRNA 降解和定位。目前绘制这些相互作用图的方法(如 CLIP)依赖于一次纯化单个蛋白质。我们的新方法 ePRINT 无需纯化单个 RBP 即可绘制全球范围的 RBP-RNA 相互作用网络图。ePRINT 利用外核酸酶 XRN1 精确绘制 RBP 结合位点的 5' 端,并发现感兴趣的 RBP 的直接和间接靶标。重要的是,ePRINT 还能发现在细胞命运转换(包括神经祖细胞向神经元的分化)过程中被不同程度激活的 RBPs。
{"title":"ePRINT: exonuclease assisted mapping of protein-RNA interactions.","authors":"Sophie Hawkins, Alexandre Mondaini, Seema C Namboori, Grady G Nguyen, Gene W Yeo, Asif Javed, Akshay Bhinge","doi":"10.1186/s13059-024-03271-1","DOIUrl":"10.1186/s13059-024-03271-1","url":null,"abstract":"<p><p>RNA-binding proteins (RBPs) regulate key aspects of RNA processing including alternative splicing, mRNA degradation and localization by physically binding RNA molecules. Current methods to map these interactions, such as CLIP, rely on purifying single proteins at a time. Our new method, ePRINT, maps RBP-RNA interaction networks on a global scale without purifying individual RBPs. ePRINT uses exoribonuclease XRN1 to precisely map the 5' end of the RBP binding site and uncovers direct and indirect targets of an RBP of interest. Importantly, ePRINT can also uncover RBPs that are differentially activated between cell fate transitions, including neural progenitor differentiation into neurons.</p>","PeriodicalId":12611,"journal":{"name":"Genome Biology","volume":null,"pages":null},"PeriodicalIF":10.1,"publicationDate":"2024-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11134894/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141161761","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Current status of community resources and priorities for weed genomics research 杂草基因组学研究的社区资源现状和优先事项
IF 12.3 1区 生物学 Q1 Agricultural and Biological Sciences Pub Date : 2024-05-27 DOI: 10.1186/s13059-024-03274-y
Jacob Montgomery, Sarah Morran, Dana R. MacGregor, J. Scott McElroy, Paul Neve, Célia Neto, Martin M. Vila-Aiub, Maria Victoria Sandoval, Analia I. Menéndez, Julia M. Kreiner, Longjiang Fan, Ana L. Caicedo, Peter J. Maughan, Bianca Assis Barbosa Martins, Jagoda Mika, Alberto Collavo, Aldo Merotto, Nithya K. Subramanian, Muthukumar V. Bagavathiannan, Luan Cutti, Md. Mazharul Islam, Bikram S. Gill, Robert Cicchillo, Roger Gast, Neeta Soni, Terry R. Wright, Gina Zastrow-Hayes, Gregory May, Jenna M. Malone, Deepmala Sehgal, Shiv Shankhar Kaundun, Richard P. Dale, Barend Juan Vorster, Bodo Peters, Jens Lerchl, Patrick J. Tranel, Roland Beffa, Alexandre Fournier-Level, Mithila Jugulam, Kevin Fengler, Victor Llaca, Eric L. Patterson, Todd A. Gaines
Weeds are attractive models for basic and applied research due to their impacts on agricultural systems and capacity to swiftly adapt in response to anthropogenic selection pressures. Currently, a lack of genomic information precludes research to elucidate the genetic basis of rapid adaptation for important traits like herbicide resistance and stress tolerance and the effect of evolutionary mechanisms on wild populations. The International Weed Genomics Consortium is a collaborative group of scientists focused on developing genomic resources to impact research into sustainable, effective weed control methods and to provide insights about stress tolerance and adaptation to assist crop breeding.
由于杂草对农业系统的影响以及在人类选择压力下的快速适应能力,它们是基础研究和应用研究的极具吸引力的模型。目前,由于缺乏基因组信息,无法开展研究来阐明快速适应除草剂抗性和抗逆性等重要性状的遗传基础,以及进化机制对野生种群的影响。国际杂草基因组学联合会是一个由科学家组成的合作团体,致力于开发基因组资源,以影响可持续、有效的杂草控制方法的研究,并提供有关胁迫耐受性和适应性的见解,以协助作物育种。
{"title":"Current status of community resources and priorities for weed genomics research","authors":"Jacob Montgomery, Sarah Morran, Dana R. MacGregor, J. Scott McElroy, Paul Neve, Célia Neto, Martin M. Vila-Aiub, Maria Victoria Sandoval, Analia I. Menéndez, Julia M. Kreiner, Longjiang Fan, Ana L. Caicedo, Peter J. Maughan, Bianca Assis Barbosa Martins, Jagoda Mika, Alberto Collavo, Aldo Merotto, Nithya K. Subramanian, Muthukumar V. Bagavathiannan, Luan Cutti, Md. Mazharul Islam, Bikram S. Gill, Robert Cicchillo, Roger Gast, Neeta Soni, Terry R. Wright, Gina Zastrow-Hayes, Gregory May, Jenna M. Malone, Deepmala Sehgal, Shiv Shankhar Kaundun, Richard P. Dale, Barend Juan Vorster, Bodo Peters, Jens Lerchl, Patrick J. Tranel, Roland Beffa, Alexandre Fournier-Level, Mithila Jugulam, Kevin Fengler, Victor Llaca, Eric L. Patterson, Todd A. Gaines","doi":"10.1186/s13059-024-03274-y","DOIUrl":"https://doi.org/10.1186/s13059-024-03274-y","url":null,"abstract":"Weeds are attractive models for basic and applied research due to their impacts on agricultural systems and capacity to swiftly adapt in response to anthropogenic selection pressures. Currently, a lack of genomic information precludes research to elucidate the genetic basis of rapid adaptation for important traits like herbicide resistance and stress tolerance and the effect of evolutionary mechanisms on wild populations. The International Weed Genomics Consortium is a collaborative group of scientists focused on developing genomic resources to impact research into sustainable, effective weed control methods and to provide insights about stress tolerance and adaptation to assist crop breeding.","PeriodicalId":12611,"journal":{"name":"Genome Biology","volume":null,"pages":null},"PeriodicalIF":12.3,"publicationDate":"2024-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141156659","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Rosace: a robust deep mutational scanning analysis framework employing position and mean-variance shrinkage. Rosace:采用位置和均方差收缩的稳健深度突变扫描分析框架。
IF 10.1 1区 生物学 Q1 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Pub Date : 2024-05-24 DOI: 10.1186/s13059-024-03279-7
Jingyou Rao, Ruiqi Xin, Christian Macdonald, Matthew K Howard, Gabriella O Estevam, Sook Wah Yee, Mingsen Wang, James S Fraser, Willow Coyote-Maestas, Harold Pimentel

Deep mutational scanning (DMS) measures the effects of thousands of genetic variants in a protein simultaneously. The small sample size renders classical statistical methods ineffective. For example, p-values cannot be correctly calibrated when treating variants independently. We propose Rosace, a Bayesian framework for analyzing growth-based DMS data. Rosace leverages amino acid position information to increase power and control the false discovery rate by sharing information across parameters via shrinkage. We also developed Rosette for simulating the distributional properties of DMS. We show that Rosace is robust to the violation of model assumptions and is more powerful than existing tools.

深度突变扫描(DMS)可同时测量蛋白质中数千个遗传变异的影响。由于样本量小,传统的统计方法无法奏效。例如,在独立处理变异时,无法正确校准 p 值。我们提出了一种贝叶斯框架--Rosace,用于分析基于生长的 DMS 数据。Rosace 利用氨基酸位置信息,通过缩小参数来共享信息,从而提高功率并控制误发现率。我们还开发了 Rosette,用于模拟 DMS 的分布特性。我们的研究表明,Rosace 对模型假设的违反具有鲁棒性,而且比现有工具更强大。
{"title":"Rosace: a robust deep mutational scanning analysis framework employing position and mean-variance shrinkage.","authors":"Jingyou Rao, Ruiqi Xin, Christian Macdonald, Matthew K Howard, Gabriella O Estevam, Sook Wah Yee, Mingsen Wang, James S Fraser, Willow Coyote-Maestas, Harold Pimentel","doi":"10.1186/s13059-024-03279-7","DOIUrl":"10.1186/s13059-024-03279-7","url":null,"abstract":"<p><p>Deep mutational scanning (DMS) measures the effects of thousands of genetic variants in a protein simultaneously. The small sample size renders classical statistical methods ineffective. For example, p-values cannot be correctly calibrated when treating variants independently. We propose Rosace, a Bayesian framework for analyzing growth-based DMS data. Rosace leverages amino acid position information to increase power and control the false discovery rate by sharing information across parameters via shrinkage. We also developed Rosette for simulating the distributional properties of DMS. We show that Rosace is robust to the violation of model assumptions and is more powerful than existing tools.</p>","PeriodicalId":12611,"journal":{"name":"Genome Biology","volume":null,"pages":null},"PeriodicalIF":10.1,"publicationDate":"2024-05-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11127319/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141093421","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Genome Biology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1