T3E: a tool for characterising the epigenetic profile of transposable elements using ChIP-seq data.

IF 4.7 2区 生物学 Q1 GENETICS & HEREDITY Mobile DNA Pub Date : 2022-11-30 DOI:10.1186/s13100-022-00285-z
Michelle Almeida da Paz, Leila Taher
{"title":"T3E: a tool for characterising the epigenetic profile of transposable elements using ChIP-seq data.","authors":"Michelle Almeida da Paz, Leila Taher","doi":"10.1186/s13100-022-00285-z","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Despite the advent of Chromatin Immunoprecipitation Sequencing (ChIP-seq) having revolutionised our understanding of the mammalian genome's regulatory landscape, many challenges remain. In particular, because of their repetitive nature, the sequencing reads derived from transposable elements (TEs) pose a real bioinformatics challenge, to the point that standard analysis pipelines typically ignore reads whose genomic origin cannot be unambiguously ascertained.</p><p><strong>Results: </strong>We show that discarding ambiguously mapping reads may lead to a systematic underestimation of the number of reads associated with young TE families/subfamilies. We also provide evidence suggesting that the strategy of randomly permuting the location of the read mappings (or the TEs) that is often used to compute the background for enrichment calculations at TE families/subfamilies can result in both false positive and negative enrichments. To address these problems, we present the Transposable Element Enrichment Estimator (T3E), a tool that makes use of ChIP-seq data to characterise the epigenetic profile of associated TE families/subfamilies. T3E weights the number of read mappings assigned to the individual TE copies of a family/subfamily by the overall number of genomic loci to which the corresponding reads map, and this is done at the single nucleotide level. In addition, T3E computes ChIP-seq enrichment relative to a background estimated based on the distribution of the read mappings in the input control DNA. We demonstrated the capabilities of T3E on 23 different ChIP-seq libraries. T3E identified enrichments that were consistent with previous studies. Furthermore, T3E detected context-specific enrichments that are likely to pinpoint unexplored TE families/subfamilies with individual TE copies that have been frequently exapted as cis-regulatory elements during the evolution of mammalian regulatory networks.</p><p><strong>Conclusions: </strong>T3E is a novel open-source computational tool (available for use at: https://github.com/michelleapaz/T3E ) that overcomes some of the pitfalls associated with the analysis of ChIP-seq data arising from the repetitive mammalian genome and provides a framework to shed light on the epigenetics of entire TE families/subfamilies.</p>","PeriodicalId":18854,"journal":{"name":"Mobile DNA","volume":null,"pages":null},"PeriodicalIF":4.7000,"publicationDate":"2022-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9710123/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Mobile DNA","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1186/s13100-022-00285-z","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Despite the advent of Chromatin Immunoprecipitation Sequencing (ChIP-seq) having revolutionised our understanding of the mammalian genome's regulatory landscape, many challenges remain. In particular, because of their repetitive nature, the sequencing reads derived from transposable elements (TEs) pose a real bioinformatics challenge, to the point that standard analysis pipelines typically ignore reads whose genomic origin cannot be unambiguously ascertained.

Results: We show that discarding ambiguously mapping reads may lead to a systematic underestimation of the number of reads associated with young TE families/subfamilies. We also provide evidence suggesting that the strategy of randomly permuting the location of the read mappings (or the TEs) that is often used to compute the background for enrichment calculations at TE families/subfamilies can result in both false positive and negative enrichments. To address these problems, we present the Transposable Element Enrichment Estimator (T3E), a tool that makes use of ChIP-seq data to characterise the epigenetic profile of associated TE families/subfamilies. T3E weights the number of read mappings assigned to the individual TE copies of a family/subfamily by the overall number of genomic loci to which the corresponding reads map, and this is done at the single nucleotide level. In addition, T3E computes ChIP-seq enrichment relative to a background estimated based on the distribution of the read mappings in the input control DNA. We demonstrated the capabilities of T3E on 23 different ChIP-seq libraries. T3E identified enrichments that were consistent with previous studies. Furthermore, T3E detected context-specific enrichments that are likely to pinpoint unexplored TE families/subfamilies with individual TE copies that have been frequently exapted as cis-regulatory elements during the evolution of mammalian regulatory networks.

Conclusions: T3E is a novel open-source computational tool (available for use at: https://github.com/michelleapaz/T3E ) that overcomes some of the pitfalls associated with the analysis of ChIP-seq data arising from the repetitive mammalian genome and provides a framework to shed light on the epigenetics of entire TE families/subfamilies.

Abstract Image

Abstract Image

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
T3E:利用 ChIP-seq 数据描述转座元件表观遗传特征的工具。
背景:尽管染色质免疫沉淀测序(ChIP-seq)的出现彻底改变了我们对哺乳动物基因组调控格局的认识,但仍然存在许多挑战。特别是来自转座元件(TEs)的测序读数,由于其重复性,给生物信息学带来了真正的挑战,以至于标准分析管道通常会忽略其基因组来源无法明确确定的读数:结果:我们发现,忽略不明确的映射读数可能会导致系统性低估与年轻 TE 族/亚族相关的读数数量。我们还提供证据表明,通常用于计算TE家族/亚家族富集计算背景的随机排列读数映射(或TE)位置的策略可能会导致假阳性和假阴性富集。为了解决这些问题,我们提出了可转座元件富集估算器(T3E),这是一种利用 ChIP-seq 数据描述相关 TE 家系/亚家族表观遗传特征的工具。T3E 通过相应读数映射到的基因组位点总数,对分配给一个族/亚族的单个 TE 拷贝的读数映射数进行加权,这是在单核苷酸水平上完成的。此外,T3E 还能根据输入对照 DNA 中读数映射的分布情况,计算相对于背景估计值的 ChIP-seq 富集度。我们在 23 个不同的 ChIP-seq 文库上演示了 T3E 的功能。T3E 发现的富集与之前的研究一致。此外,T3E还检测到了上下文特异性富集,这些富集很可能是为了确定在哺乳动物调控网络进化过程中经常作为顺式调控元件外显的具有单个TE拷贝的未探索TE家族/亚家族:T3E 是一种新颖的开源计算工具(可在 https://github.com/michelleapaz/T3E 上使用),它克服了与分析重复哺乳动物基因组 ChIP-seq 数据相关的一些缺陷,并为揭示整个 TE 家族/亚家族的表观遗传学提供了一个框架。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Mobile DNA
Mobile DNA GENETICS & HEREDITY-
CiteScore
8.20
自引率
6.10%
发文量
26
审稿时长
11 weeks
期刊介绍: Mobile DNA is an online, peer-reviewed, open access journal that publishes articles providing novel insights into DNA rearrangements in all organisms, ranging from transposition and other types of recombination mechanisms to patterns and processes of mobile element and host genome evolution. In addition, the journal will consider articles on the utility of mobile genetic elements in biotechnological methods and protocols.
期刊最新文献
International congress on transposable elements (ICTE 2024) in Saint Malo: breaking down transposon waves and their impact. Accelerating de novo SINE annotation in plant and animal genomes. Association of hyperactivated transposon expression with exacerbated immune activation in systemic lupus erythematosus. Widespread HCD-tRNA derived SINEs in bivalves rely on multiple LINE partners and accumulate in genic regions. Correction: Transposon-derived introns as an element shaping the structure of eukaryotic genomes.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1