Positional motif analysis reveals the extent of specificity of protein-RNA interactions observed by CLIP.

IF 12.3 1区 生物学 Q1 Agricultural and Biological Sciences Genome Biology Pub Date : 2022-09-09 DOI:10.1186/s13059-022-02755-2
Klara Kuret, Aram Gustav Amalietti, D Marc Jones, Charlotte Capitanchik, Jernej Ule
{"title":"Positional motif analysis reveals the extent of specificity of protein-RNA interactions observed by CLIP.","authors":"Klara Kuret, Aram Gustav Amalietti, D Marc Jones, Charlotte Capitanchik, Jernej Ule","doi":"10.1186/s13059-022-02755-2","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Crosslinking and immunoprecipitation (CLIP) is a method used to identify in vivo RNA-protein binding sites on a transcriptome-wide scale. With the increasing amounts of available data for RNA-binding proteins (RBPs), it is important to understand to what degree the enriched motifs specify the RNA-binding profiles of RBPs in cells.</p><p><strong>Results: </strong>We develop positionally enriched k-mer analysis (PEKA), a computational tool for efficient analysis of enriched motifs from individual CLIP datasets, which minimizes the impact of technical and regional genomic biases by internal data normalization. We cross-validate PEKA with mCross and show that the use of input control for background correction is not required to yield high specificity of enriched motifs. We identify motif classes with common enrichment patterns across eCLIP datasets and across RNA regions, while also observing variations in the specificity and the extent of motif enrichment across eCLIP datasets, between variant CLIP protocols, and between CLIP and in vitro binding data. Thereby, we gain insights into the contributions of technical and regional genomic biases to the enriched motifs, and find how motif enrichment features relate to the domain composition and low-complexity regions of the studied proteins.</p><p><strong>Conclusions: </strong>Our study provides insights into the overall contributions of regional binding preferences, protein domains, and low-complexity regions to the specificity of protein-RNA interactions, and shows the value of cross-motif and cross-RBP comparison for data interpretation. Our results are presented for exploratory analysis via an online platform in an RBP-centric and motif-centric manner ( https://imaps.goodwright.com/apps/peka/ ).</p>","PeriodicalId":48922,"journal":{"name":"Genome Biology","volume":"23 1","pages":"191"},"PeriodicalIF":12.3000,"publicationDate":"2022-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9461102/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Genome Biology","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1186/s13059-022-02755-2","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Agricultural and Biological Sciences","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Crosslinking and immunoprecipitation (CLIP) is a method used to identify in vivo RNA-protein binding sites on a transcriptome-wide scale. With the increasing amounts of available data for RNA-binding proteins (RBPs), it is important to understand to what degree the enriched motifs specify the RNA-binding profiles of RBPs in cells.

Results: We develop positionally enriched k-mer analysis (PEKA), a computational tool for efficient analysis of enriched motifs from individual CLIP datasets, which minimizes the impact of technical and regional genomic biases by internal data normalization. We cross-validate PEKA with mCross and show that the use of input control for background correction is not required to yield high specificity of enriched motifs. We identify motif classes with common enrichment patterns across eCLIP datasets and across RNA regions, while also observing variations in the specificity and the extent of motif enrichment across eCLIP datasets, between variant CLIP protocols, and between CLIP and in vitro binding data. Thereby, we gain insights into the contributions of technical and regional genomic biases to the enriched motifs, and find how motif enrichment features relate to the domain composition and low-complexity regions of the studied proteins.

Conclusions: Our study provides insights into the overall contributions of regional binding preferences, protein domains, and low-complexity regions to the specificity of protein-RNA interactions, and shows the value of cross-motif and cross-RBP comparison for data interpretation. Our results are presented for exploratory analysis via an online platform in an RBP-centric and motif-centric manner ( https://imaps.goodwright.com/apps/peka/ ).

Abstract Image

Abstract Image

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
位置主题分析揭示了 CLIP 观察到的蛋白质-RNA 相互作用的特异性程度。
背景:交联和免疫沉淀(CLIP)是一种用于在整个转录组范围内鉴定体内 RNA 蛋白结合位点的方法。随着 RNA 结合蛋白(RBPs)可用数据量的不断增加,了解富集基序在多大程度上确定了细胞中 RBPs 的 RNA 结合特征就显得尤为重要:我们开发了位置富集 k-mer 分析(PEKA),这是一种用于高效分析来自单个 CLIP 数据集的富集基元的计算工具,它通过内部数据归一化将技术和区域基因组偏差的影响降至最低。我们用 mCross 对 PEKA 进行了交叉验证,结果表明不需要使用输入控制进行背景校正就能获得高特异性的富集主题。我们在 eCLIP 数据集和不同 RNA 区域之间发现了具有共同富集模式的主题词类别,同时也观察到了不同 eCLIP 数据集、不同 CLIP 协议以及不同 CLIP 和体外结合数据之间在特异性和主题词富集程度上的差异。因此,我们深入了解了技术和区域基因组偏差对主题词富集的贡献,并发现了主题词富集特征与所研究蛋白质的结构域组成和低复杂性区域之间的关系:我们的研究深入揭示了区域结合偏好、蛋白质结构域和低复杂性区域对蛋白质-RNA相互作用特异性的总体贡献,并显示了跨主题词和跨RBP比较对数据解读的价值。我们的研究结果通过在线平台以 RBP 为中心和以图案为中心的方式进行了探索性分析 ( https://imaps.goodwright.com/apps/peka/ )。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Genome Biology
Genome Biology BIOTECHNOLOGY & APPLIED MICROBIOLOGY-GENETICS & HEREDITY
CiteScore
25.50
自引率
3.30%
发文量
0
审稿时长
14 weeks
期刊介绍: Genome Biology is a leading research journal that focuses on the study of biology and biomedicine from a genomic and post-genomic standpoint. The journal consistently publishes outstanding research across various areas within these fields. With an impressive impact factor of 12.3 (2022), Genome Biology has earned its place as the 3rd highest-ranked research journal in the Genetics and Heredity category, according to Thomson Reuters. Additionally, it is ranked 2nd among research journals in the Biotechnology and Applied Microbiology category. It is important to note that Genome Biology is the top-ranking open access journal in this category. In summary, Genome Biology sets a high standard for scientific publications in the field, showcasing cutting-edge research and earning recognition among its peers.
期刊最新文献
Cohesin distribution alone predicts chromatin organization in yeast via conserved-current loop extrusion. DeepKINET: a deep generative model for estimating single-cell RNA splicing and degradation rates. Seqrutinator: scrutiny of large protein superfamily sequence datasets for the identification and elimination of non-functional homologues. Systemic interindividual DNA methylation variants in cattle share major hallmarks with those in humans. TaqTth-hpRNA: a novel compact RNA-targeting tool for specific silencing of pathogenic mRNA.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1