Functional characteristics and computational model of abundant hyperactive loci in the human genome.

IF 6.4 1区 生物学 Q1 BIOLOGY eLife Pub Date : 2024-11-13 DOI:10.7554/eLife.95170
Sanjarbek Hudaiberdiev, Ivan Ovcharenko
{"title":"Functional characteristics and computational model of abundant hyperactive loci in the human genome.","authors":"Sanjarbek Hudaiberdiev, Ivan Ovcharenko","doi":"10.7554/eLife.95170","DOIUrl":null,"url":null,"abstract":"<p><p>Enhancers and promoters are classically considered to be bound by a small set of transcription factors (TFs) in a sequence-specific manner. This assumption has come under increasing skepticism as the datasets of ChIP-seq assays of TFs have expanded. In particular, high-occupancy target (HOT) loci attract hundreds of TFs with often no detectable correlation between ChIP-seq peaks and DNA-binding motif presence. Here, we used a set of 1003 TF ChIP-seq datasets (HepG2, K562, H1) to analyze the patterns of ChIP-seq peak co-occurrence in combination with functional genomics datasets. We identified 43,891 HOT loci forming at the promoter (53%) and enhancer (47%) regions. HOT promoters regulate housekeeping genes, whereas HOT enhancers are involved in tissue-specific process regulation. HOT loci form the foundation of human super-enhancers and evolve under strong negative selection, with some of these loci being located in ultraconserved regions. Sequence-based classification analysis of HOT loci suggested that their formation is driven by the sequence features, and the density of mapped ChIP-seq peaks across TF-bound loci correlates with sequence features and the expression level of flanking genes. Based on the affinities to bind to promoters and enhancers we detected five distinct clusters of TFs that form the core of the HOT loci. We report an abundance of HOT loci in the human genome and a commitment of 51% of all TF ChIP-seq binding events to HOT locus formation thus challenging the classical model of enhancer activity and propose a model of HOT locus formation based on the existence of large transcriptional condensates.</p>","PeriodicalId":11640,"journal":{"name":"eLife","volume":"13 ","pages":""},"PeriodicalIF":6.4000,"publicationDate":"2024-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11560132/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"eLife","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.7554/eLife.95170","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Enhancers and promoters are classically considered to be bound by a small set of transcription factors (TFs) in a sequence-specific manner. This assumption has come under increasing skepticism as the datasets of ChIP-seq assays of TFs have expanded. In particular, high-occupancy target (HOT) loci attract hundreds of TFs with often no detectable correlation between ChIP-seq peaks and DNA-binding motif presence. Here, we used a set of 1003 TF ChIP-seq datasets (HepG2, K562, H1) to analyze the patterns of ChIP-seq peak co-occurrence in combination with functional genomics datasets. We identified 43,891 HOT loci forming at the promoter (53%) and enhancer (47%) regions. HOT promoters regulate housekeeping genes, whereas HOT enhancers are involved in tissue-specific process regulation. HOT loci form the foundation of human super-enhancers and evolve under strong negative selection, with some of these loci being located in ultraconserved regions. Sequence-based classification analysis of HOT loci suggested that their formation is driven by the sequence features, and the density of mapped ChIP-seq peaks across TF-bound loci correlates with sequence features and the expression level of flanking genes. Based on the affinities to bind to promoters and enhancers we detected five distinct clusters of TFs that form the core of the HOT loci. We report an abundance of HOT loci in the human genome and a commitment of 51% of all TF ChIP-seq binding events to HOT locus formation thus challenging the classical model of enhancer activity and propose a model of HOT locus formation based on the existence of large transcriptional condensates.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
人类基因组中大量超活性位点的功能特征和计算模型。
人们通常认为,增强子和启动子会以序列特异性的方式与一小部分转录因子(TFs)结合。随着转录因子 ChIP-seq 检测数据集的扩大,这一假设受到越来越多的质疑。特别是,高占位目标(HOT)位点吸引了数百个 TFs,而 ChIP-seq 峰值与 DNA 结合基团的存在之间往往没有可检测到的相关性。在这里,我们使用了一组 1003 个 TF ChIP-seq 数据集(HepG2、K562、H1),结合功能基因组学数据集分析了 ChIP-seq 峰的共现模式。我们在启动子(53%)和增强子(47%)区域发现了 43,891 个 HOT 位点。HOT 启动子调控看家基因,而 HOT 增强子则参与组织特异性过程调控。HOT 基因座构成了人类超级增强子的基础,并在强烈的负选择下进化,其中一些基因座位于超保守区。基于序列的HOT基因座分类分析表明,HOT基因座的形成是由序列特征驱动的,TF结合基因座的ChIP-seq峰密度与序列特征和侧翼基因的表达水平相关。根据与启动子和增强子结合的亲和力,我们检测到了五个不同的 TF 簇,它们构成了 HOT 基因座的核心。我们报告了人类基因组中大量的 HOT 基因座,51% 的 TF ChIP-seq 结合事件都与 HOT 基因座的形成有关,从而挑战了增强子活性的经典模型,并提出了一种基于大型转录凝聚体存在的 HOT 基因座形成模型。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
eLife
eLife BIOLOGY-
CiteScore
12.90
自引率
3.90%
发文量
3122
审稿时长
17 weeks
期刊介绍: eLife is a distinguished, not-for-profit, peer-reviewed open access scientific journal that specializes in the fields of biomedical and life sciences. eLife is known for its selective publication process, which includes a variety of article types such as: Research Articles: Detailed reports of original research findings. Short Reports: Concise presentations of significant findings that do not warrant a full-length research article. Tools and Resources: Descriptions of new tools, technologies, or resources that facilitate scientific research. Research Advances: Brief reports on significant scientific advancements that have immediate implications for the field. Scientific Correspondence: Short communications that comment on or provide additional information related to published articles. Review Articles: Comprehensive overviews of a specific topic or field within the life sciences.
期刊最新文献
NE contribution to rebooting unconsciousness caused by midazolam. Neurotrophic factor Neuritin modulates T cell electrical and metabolic state for the balance of tolerance and immunity. The gamma rhythm as a guardian of brain health. Zika virus remodels and hijacks IGF2BP2 ribonucleoprotein complex to promote viral replication organelle biogenesis. Extramacrochaetae regulates Notch signaling in the Drosophila eye through non-apoptotic caspase activity.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1