NMDtxDB:人类 NMD 目标转录本的数据驱动识别和注释。

IF 5 3区 生物学 Q2 BIOCHEMISTRY & MOLECULAR BIOLOGY RNA Pub Date : 2024-09-16 DOI:10.1261/rna.080066.124
Thiago Britto-Borges, Niels H Gehring, Volker Boehm, Christoph Dieterich
{"title":"NMDtxDB:人类 NMD 目标转录本的数据驱动识别和注释。","authors":"Thiago Britto-Borges, Niels H Gehring, Volker Boehm, Christoph Dieterich","doi":"10.1261/rna.080066.124","DOIUrl":null,"url":null,"abstract":"<p><p>The nonsense-mediated RNA decay (NMD) pathway is a crucial mechanism of mRNA quality control. Current annotations of NMD substrate RNAs are rarely data-driven, but use generally established rules. We present a data set with four cell lines and combinations for <i>SMG5</i>, <i>SMG6</i>, and <i>SMG7</i> knockdowns or <i>SMG7</i> knockout. Based on this data set, we implemented a workflow that combines Nanopore and Illumina sequencing to assemble a transcriptome, which is enriched for NMD target transcripts. Moreover, we use coding sequence information (CDS) from Ensembl, Gencode consensus Ribo-seq ORFs, and OpenProt to enhance the CDS annotation of novel transcript isoforms. In summary, 302,889 transcripts were obtained from the transcriptome assembly process, out of which 24% are absent from Ensembl database annotations, 48,213 contain a premature stop codon, and 6433 are significantly upregulated in three or more comparisons of NMD active versus deficient cell lines. We present an in-depth view of these results through the NMDtxDB database, which is available at https://shiny.dieterichlab.org/app/NMDtxDB, and supports the study of NMD-sensitive transcripts. We open sourced our implementation of the respective web-application and analysis workflow at https://github.com/dieterich-lab/NMDtxDB and https://github.com/dieterich-lab/nmd-wf.</p>","PeriodicalId":21401,"journal":{"name":"RNA","volume":" ","pages":"1277-1291"},"PeriodicalIF":5.0000,"publicationDate":"2024-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11404449/pdf/","citationCount":"0","resultStr":"{\"title\":\"NMDtxDB: data-driven identification and annotation of human NMD target transcripts.\",\"authors\":\"Thiago Britto-Borges, Niels H Gehring, Volker Boehm, Christoph Dieterich\",\"doi\":\"10.1261/rna.080066.124\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>The nonsense-mediated RNA decay (NMD) pathway is a crucial mechanism of mRNA quality control. Current annotations of NMD substrate RNAs are rarely data-driven, but use generally established rules. We present a data set with four cell lines and combinations for <i>SMG5</i>, <i>SMG6</i>, and <i>SMG7</i> knockdowns or <i>SMG7</i> knockout. Based on this data set, we implemented a workflow that combines Nanopore and Illumina sequencing to assemble a transcriptome, which is enriched for NMD target transcripts. Moreover, we use coding sequence information (CDS) from Ensembl, Gencode consensus Ribo-seq ORFs, and OpenProt to enhance the CDS annotation of novel transcript isoforms. In summary, 302,889 transcripts were obtained from the transcriptome assembly process, out of which 24% are absent from Ensembl database annotations, 48,213 contain a premature stop codon, and 6433 are significantly upregulated in three or more comparisons of NMD active versus deficient cell lines. We present an in-depth view of these results through the NMDtxDB database, which is available at https://shiny.dieterichlab.org/app/NMDtxDB, and supports the study of NMD-sensitive transcripts. We open sourced our implementation of the respective web-application and analysis workflow at https://github.com/dieterich-lab/NMDtxDB and https://github.com/dieterich-lab/nmd-wf.</p>\",\"PeriodicalId\":21401,\"journal\":{\"name\":\"RNA\",\"volume\":\" \",\"pages\":\"1277-1291\"},\"PeriodicalIF\":5.0000,\"publicationDate\":\"2024-09-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11404449/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"RNA\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1261/rna.080066.124\",\"RegionNum\":3,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"BIOCHEMISTRY & MOLECULAR BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"RNA","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1261/rna.080066.124","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

无义介导的 RNA 衰变(NMD)途径是 mRNA 质量控制的重要机制。目前对 NMD 底物 RNA 的注释很少由数据驱动,而是使用一般的既定规则。我们展示了一个包含 4 个细胞系和 SMG5、SMG6 和 SMG7 基因敲除或 SMG7 基因敲除组合的数据集。在此数据集的基础上,我们实施了一个结合 Nanopore 和 Illumina 测序的工作流程,以组装转录组,其中富含 NMD 目标转录本。此外,我们还利用来自 Ensembl、Gencode 共识 RiboSeq ORFs 和 OpenProt 的编码序列信息来加强对新型转录本异构体的 CDS 注释。总之,在转录组组装过程中获得了 302,889 个转录本,其中 24% 的转录本在 Ensembl 数据库注释中缺失,48,213 个转录本含有过早终止密码子,6,433 个转录本在 NMD 活性与缺陷细胞系的三次或三次以上比较中显著上调。我们通过 https://shiny.dieterichlab.org/app/NMDtxDB 上的 NMDtxDB 数据库对这些结果进行了深入分析,该数据库支持对 NMD 敏感转录本的研究。我们将各自的网络应用程序和分析工作流程的实施开源,网址是 https://github.com/dieterich-lab/NMDtxDB 和 https://github.com/dieterich-lab/nmd-wf。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

摘要图片

摘要图片

摘要图片

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
NMDtxDB: data-driven identification and annotation of human NMD target transcripts.

The nonsense-mediated RNA decay (NMD) pathway is a crucial mechanism of mRNA quality control. Current annotations of NMD substrate RNAs are rarely data-driven, but use generally established rules. We present a data set with four cell lines and combinations for SMG5, SMG6, and SMG7 knockdowns or SMG7 knockout. Based on this data set, we implemented a workflow that combines Nanopore and Illumina sequencing to assemble a transcriptome, which is enriched for NMD target transcripts. Moreover, we use coding sequence information (CDS) from Ensembl, Gencode consensus Ribo-seq ORFs, and OpenProt to enhance the CDS annotation of novel transcript isoforms. In summary, 302,889 transcripts were obtained from the transcriptome assembly process, out of which 24% are absent from Ensembl database annotations, 48,213 contain a premature stop codon, and 6433 are significantly upregulated in three or more comparisons of NMD active versus deficient cell lines. We present an in-depth view of these results through the NMDtxDB database, which is available at https://shiny.dieterichlab.org/app/NMDtxDB, and supports the study of NMD-sensitive transcripts. We open sourced our implementation of the respective web-application and analysis workflow at https://github.com/dieterich-lab/NMDtxDB and https://github.com/dieterich-lab/nmd-wf.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
RNA
RNA 生物-生化与分子生物学
CiteScore
8.30
自引率
2.20%
发文量
101
审稿时长
2.6 months
期刊介绍: RNA is a monthly journal which provides rapid publication of significant original research in all areas of RNA structure and function in eukaryotic, prokaryotic, and viral systems. It covers a broad range of subjects in RNA research, including: structural analysis by biochemical or biophysical means; mRNA structure, function and biogenesis; alternative processing: cis-acting elements and trans-acting factors; ribosome structure and function; translational control; RNA catalysis; tRNA structure, function, biogenesis and identity; RNA editing; rRNA structure, function and biogenesis; RNA transport and localization; regulatory RNAs; large and small RNP structure, function and biogenesis; viral RNA metabolism; RNA stability and turnover; in vitro evolution; and RNA chemistry.
期刊最新文献
Cancer-associated synonymous mutations reveal stress signal-dependent mRNA folding that selectively modulates protein function. Localized regulation of cell junction mRNAs is required for epithelial cell integrity. Uridine analogs prevent stress granule formation, not by blocking PKR recognition, but by inhibiting the synthesis of T7 RNA polymerase byproducts. Blocky proline/glutamine patterns in the SFPQ intrinsically disordered region dictate paraspeckle formation as a distinct membraneless organelle. Identification of 3D motifs in Rfam with JAR3D.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1