长线程 RNA 测序分析揭示了 CLN3 转录本的复杂性。

IF 2.1 4区 医学 Q3 GENETICS & HEREDITY BMC Medical Genomics Pub Date : 2024-10-04 DOI:10.1186/s12920-024-02017-z
Hao-Yu Zhang, Christopher Minnis, Emil Gustavsson, Mina Ryten, Sara E Mole
{"title":"长线程 RNA 测序分析揭示了 CLN3 转录本的复杂性。","authors":"Hao-Yu Zhang, Christopher Minnis, Emil Gustavsson, Mina Ryten, Sara E Mole","doi":"10.1186/s12920-024-02017-z","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Batten disease is a group of rare inherited neurodegenerative diseases. Juvenile CLN3 disease is the most prevalent type, and the most common pathogenic variant shared by most patients is the \"1-kb\" deletion which removes two internal coding exons (7 and 8) in CLN3. Previously, we identified two transcripts in patient fibroblasts homozygous for the 1-kb deletion: the 'major' and 'minor' transcripts. To understand the full variety of disease transcripts and their role in disease pathogenesis, it is necessary to first investigate CLN3 transcription in \"healthy\" samples without juvenile CLN3 disease.</p><p><strong>Methods: </strong>We leveraged PacBio long-read RNA sequencing datasets from ENCODE to investigate the full range of CLN3 transcripts across various tissues and cell types in human control samples. Then we sought to validate their existence using data from different sources.</p><p><strong>Results: </strong>We found that a readthrough gene affects the quantification and annotation of CLN3. After taking this into account, we detected over 100 novel CLN3 transcripts, with no dominantly expressed CLN3 transcript. The most abundant transcript has median usage of 42.9%. Surprisingly, the known disease-associated 'major' transcripts are detected. Together, they have median usage of 1.5% across 22 samples. Furthermore, we identified 48 CLN3 ORFs, of which 26 are novel. The predominant ORF that encodes the canonical CLN3 protein isoform has median usage of 66.7%, meaning around one-third of CLN3 transcripts encode protein isoforms with different stretches of amino acids. The same ORFs could be found with alternative UTRs. Moreover, we were able to validate the translational potential of certain transcripts using public mass spectrometry data.</p><p><strong>Conclusion: </strong>Overall, these findings provide valuable insights into the complexity of CLN3 transcription, highlighting the importance of studying both canonical and non-canonical CLN3 protein isoforms as well as the regulatory role of UTRs to fully comprehend the regulation and function(s) of CLN3. This knowledge is essential for investigating the impact of the 1-kb deletion and rare pathogenic variants on CLN3 transcription and disease pathogenesis.</p>","PeriodicalId":8915,"journal":{"name":"BMC Medical Genomics","volume":null,"pages":null},"PeriodicalIF":2.1000,"publicationDate":"2024-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11451007/pdf/","citationCount":"0","resultStr":"{\"title\":\"CLN3 transcript complexity revealed by long-read RNA sequencing analysis.\",\"authors\":\"Hao-Yu Zhang, Christopher Minnis, Emil Gustavsson, Mina Ryten, Sara E Mole\",\"doi\":\"10.1186/s12920-024-02017-z\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Batten disease is a group of rare inherited neurodegenerative diseases. Juvenile CLN3 disease is the most prevalent type, and the most common pathogenic variant shared by most patients is the \\\"1-kb\\\" deletion which removes two internal coding exons (7 and 8) in CLN3. Previously, we identified two transcripts in patient fibroblasts homozygous for the 1-kb deletion: the 'major' and 'minor' transcripts. To understand the full variety of disease transcripts and their role in disease pathogenesis, it is necessary to first investigate CLN3 transcription in \\\"healthy\\\" samples without juvenile CLN3 disease.</p><p><strong>Methods: </strong>We leveraged PacBio long-read RNA sequencing datasets from ENCODE to investigate the full range of CLN3 transcripts across various tissues and cell types in human control samples. Then we sought to validate their existence using data from different sources.</p><p><strong>Results: </strong>We found that a readthrough gene affects the quantification and annotation of CLN3. After taking this into account, we detected over 100 novel CLN3 transcripts, with no dominantly expressed CLN3 transcript. The most abundant transcript has median usage of 42.9%. Surprisingly, the known disease-associated 'major' transcripts are detected. Together, they have median usage of 1.5% across 22 samples. Furthermore, we identified 48 CLN3 ORFs, of which 26 are novel. The predominant ORF that encodes the canonical CLN3 protein isoform has median usage of 66.7%, meaning around one-third of CLN3 transcripts encode protein isoforms with different stretches of amino acids. The same ORFs could be found with alternative UTRs. Moreover, we were able to validate the translational potential of certain transcripts using public mass spectrometry data.</p><p><strong>Conclusion: </strong>Overall, these findings provide valuable insights into the complexity of CLN3 transcription, highlighting the importance of studying both canonical and non-canonical CLN3 protein isoforms as well as the regulatory role of UTRs to fully comprehend the regulation and function(s) of CLN3. This knowledge is essential for investigating the impact of the 1-kb deletion and rare pathogenic variants on CLN3 transcription and disease pathogenesis.</p>\",\"PeriodicalId\":8915,\"journal\":{\"name\":\"BMC Medical Genomics\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":2.1000,\"publicationDate\":\"2024-10-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11451007/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"BMC Medical Genomics\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1186/s12920-024-02017-z\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"GENETICS & HEREDITY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Medical Genomics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12920-024-02017-z","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0

摘要

背景:巴顿氏病是一组罕见的遗传性神经退行性疾病。大多数患者最常见的致病变异是 "1-kb "缺失,它缺失了 CLN3 的两个内部编码外显子(7 和 8)。此前,我们在同源 1-kb 缺失的患者成纤维细胞中发现了两种转录本:"主要 "和 "次要 "转录本。为了了解疾病转录本的全部种类及其在疾病发病机制中的作用,有必要首先研究无幼年CLN3疾病的 "健康 "样本中的CLN3转录本:我们利用 ENCODE 的 PacBio 长读程 RNA 测序数据集研究了人类对照样本中不同组织和细胞类型中的各种 CLN3 转录本。然后,我们试图利用不同来源的数据验证它们的存在:结果:我们发现,通读基因会影响 CLN3 的定量和注释。考虑到这一点,我们检测到了 100 多个新的 CLN3 转录本,其中没有显性表达的 CLN3 转录本。最丰富的转录本的中位使用率为 42.9%。令人惊讶的是,我们检测到了已知的与疾病相关的 "主要 "转录本。在 22 个样本中,这些转录本的中位使用率为 1.5%。此外,我们还发现了 48 个 CLN3 ORF,其中 26 个是新的。编码典型CLN3蛋白同工型的主要ORF的中位使用率为66.7%,这意味着约三分之一的CLN3转录本编码具有不同氨基酸段的蛋白同工型。同样的 ORF 也可以通过替代的 UTR 发现。此外,我们还利用公开的质谱数据验证了某些转录本的翻译潜力:总之,这些发现为了解 CLN3 转录的复杂性提供了有价值的见解,突出了研究规范和非规范 CLN3 蛋白异构体以及 UTR 的调控作用对全面理解 CLN3 的调控和功能的重要性。这些知识对于研究 1-kb 缺失和罕见致病变体对 CLN3 转录和疾病发病机制的影响至关重要。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
CLN3 transcript complexity revealed by long-read RNA sequencing analysis.

Background: Batten disease is a group of rare inherited neurodegenerative diseases. Juvenile CLN3 disease is the most prevalent type, and the most common pathogenic variant shared by most patients is the "1-kb" deletion which removes two internal coding exons (7 and 8) in CLN3. Previously, we identified two transcripts in patient fibroblasts homozygous for the 1-kb deletion: the 'major' and 'minor' transcripts. To understand the full variety of disease transcripts and their role in disease pathogenesis, it is necessary to first investigate CLN3 transcription in "healthy" samples without juvenile CLN3 disease.

Methods: We leveraged PacBio long-read RNA sequencing datasets from ENCODE to investigate the full range of CLN3 transcripts across various tissues and cell types in human control samples. Then we sought to validate their existence using data from different sources.

Results: We found that a readthrough gene affects the quantification and annotation of CLN3. After taking this into account, we detected over 100 novel CLN3 transcripts, with no dominantly expressed CLN3 transcript. The most abundant transcript has median usage of 42.9%. Surprisingly, the known disease-associated 'major' transcripts are detected. Together, they have median usage of 1.5% across 22 samples. Furthermore, we identified 48 CLN3 ORFs, of which 26 are novel. The predominant ORF that encodes the canonical CLN3 protein isoform has median usage of 66.7%, meaning around one-third of CLN3 transcripts encode protein isoforms with different stretches of amino acids. The same ORFs could be found with alternative UTRs. Moreover, we were able to validate the translational potential of certain transcripts using public mass spectrometry data.

Conclusion: Overall, these findings provide valuable insights into the complexity of CLN3 transcription, highlighting the importance of studying both canonical and non-canonical CLN3 protein isoforms as well as the regulatory role of UTRs to fully comprehend the regulation and function(s) of CLN3. This knowledge is essential for investigating the impact of the 1-kb deletion and rare pathogenic variants on CLN3 transcription and disease pathogenesis.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
BMC Medical Genomics
BMC Medical Genomics 医学-遗传学
CiteScore
3.90
自引率
0.00%
发文量
243
审稿时长
3.5 months
期刊介绍: BMC Medical Genomics is an open access journal publishing original peer-reviewed research articles in all aspects of functional genomics, genome structure, genome-scale population genetics, epigenomics, proteomics, systems analysis, and pharmacogenomics in relation to human health and disease.
期刊最新文献
The signature of SARS-CoV-2-related genes predicts the immune therapeutic response and prognosis in breast cancer. Association between venous thromboembolism and atrial fibrillation: a Mendelian randomization study. New insight into the development of synpolydactyly caused by expansion of HOXD13 polyalanine based on weighted gene co-expression network analysis. RNA-seq validation of microRNA expression signatures for precision melanoma diagnosis and prognostic stratification. The Toll-like receptor 4 antagonist TAK-242 in combination with sodium hyaluronate alleviates postoperative abdominal adhesion in a mouse model.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1