Deep learning analyses of splicing variants identify the link of PCP4 with amyotrophic lateral sclerosis

IF 11.7 1区 医学 Q1 CLINICAL NEUROLOGY Brain Pub Date : 2025-01-24 DOI:10.1093/brain/awaf025
Xuelin Tang, Yan Chen, Yongfei Ren, Wanli Yang, Wendi Yu, Yu Zhou, Jingyan Guo, Jiali Hu, Xi Chen, Yuqi Gu, Chuyi Wang, Yi Dong, Hong Yang, Christine Sato, Ji He, Dongsheng Fan, Linya You, Lorne Zinman, Ekaterina Rogaeva, Yelin Chen, Ming Zhang
{"title":"Deep learning analyses of splicing variants identify the link of PCP4 with amyotrophic lateral sclerosis","authors":"Xuelin Tang, Yan Chen, Yongfei Ren, Wanli Yang, Wendi Yu, Yu Zhou, Jingyan Guo, Jiali Hu, Xi Chen, Yuqi Gu, Chuyi Wang, Yi Dong, Hong Yang, Christine Sato, Ji He, Dongsheng Fan, Linya You, Lorne Zinman, Ekaterina Rogaeva, Yelin Chen, Ming Zhang","doi":"10.1093/brain/awaf025","DOIUrl":null,"url":null,"abstract":"Amyotrophic lateral sclerosis (ALS) is a severe motor neuron disease, with most sporadic cases lacking clear genetic causes. Abnormal pre-mRNA splicing is a fundamental mechanism in neurodegenerative diseases. For example, TAR DNA-binding protein 43 (TDP-43) loss-of-function (LOF) causes widespread RNA mis-splicing events in ALS. Additionally, splicing mutations are major contributors to neurological disorders. However, the role of intronic variants driving RNA mis-splicing in ALS remains poorly understood. To address this, we developed Spliformer to predict RNA splicing. Spliformer is a transformer-based deep learning model trained and tested on splicing events from the GENCODE database, as well as RNA-seq data from blood and central nervous system tissues. We benchmarked Spliformer against SpliceAI and Pangolin using testing datasets and paired whole-genome sequencing (WGS) with RNA-seq data. We further developed the Spliformer-motif model to identify splicing regulatory motifs. We analyzed Clinvar dataset to identify the link of splicing variants with disease pathogenicity. Additionally, we analyzed WGS data of ALS patients and controls to identify common intronic splicing variants linked to ALS risk or disease phenotypes. We also profiled rare intronic splicing variants in ALS patients to identify known or novel ALS-associated genes. Minigene assays were employed to validate candidate splicing variants. Finally, we measured spine density in neurons with a specific gene knockdown or those expressing a TDP-43 disease-causing mutant. Spliformer accurately predicts the possibilities of a nucleotide within a pre-mRNA sequence being a splice donor, acceptor, or neither. Spliformer outperformed SpliceAI and Pangolin in both speed and accuracy in tested splicing events and/or paired WGS/RNA-seq data. Spliformer-motif successfully identified canonical and novel splicing regulatory motifs. In Clinvar dataset, splicing variants are highly related to disease pathogenicity. Genome-wide analyses of common intronic splicing variants nominated one variant linked to ALS progression. Deep learning analyses of WGS data from 1,370 ALS patients revealed rare splicing variants in reported ALS genes (such as PTPRN2 and CFAP410, validated through minigene assays and RNA-seq), and TDP-43 LOF related RNA mis-splicing genes (such as PTPRD). Further genetic analysis and minigene assays nominated PCP4 and TMEM63A as ALS-associated genes. Functional assays demonstrated that PCP4 is critical for maintaining spine density and can rescue spine loss in neurons expressing a disease-causing TDP-43 mutant. In summary, we developed Spliformer and Spliformer-motif that accurately predict and interpret pre-mRNA splicing. Our findings highlight an intronic genetic mechanism driving RNA mis-splicing in ALS and nominate PCP4 as an ALS-associated gene.","PeriodicalId":9063,"journal":{"name":"Brain","volume":"58 1","pages":""},"PeriodicalIF":11.7000,"publicationDate":"2025-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Brain","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1093/brain/awaf025","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CLINICAL NEUROLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Amyotrophic lateral sclerosis (ALS) is a severe motor neuron disease, with most sporadic cases lacking clear genetic causes. Abnormal pre-mRNA splicing is a fundamental mechanism in neurodegenerative diseases. For example, TAR DNA-binding protein 43 (TDP-43) loss-of-function (LOF) causes widespread RNA mis-splicing events in ALS. Additionally, splicing mutations are major contributors to neurological disorders. However, the role of intronic variants driving RNA mis-splicing in ALS remains poorly understood. To address this, we developed Spliformer to predict RNA splicing. Spliformer is a transformer-based deep learning model trained and tested on splicing events from the GENCODE database, as well as RNA-seq data from blood and central nervous system tissues. We benchmarked Spliformer against SpliceAI and Pangolin using testing datasets and paired whole-genome sequencing (WGS) with RNA-seq data. We further developed the Spliformer-motif model to identify splicing regulatory motifs. We analyzed Clinvar dataset to identify the link of splicing variants with disease pathogenicity. Additionally, we analyzed WGS data of ALS patients and controls to identify common intronic splicing variants linked to ALS risk or disease phenotypes. We also profiled rare intronic splicing variants in ALS patients to identify known or novel ALS-associated genes. Minigene assays were employed to validate candidate splicing variants. Finally, we measured spine density in neurons with a specific gene knockdown or those expressing a TDP-43 disease-causing mutant. Spliformer accurately predicts the possibilities of a nucleotide within a pre-mRNA sequence being a splice donor, acceptor, or neither. Spliformer outperformed SpliceAI and Pangolin in both speed and accuracy in tested splicing events and/or paired WGS/RNA-seq data. Spliformer-motif successfully identified canonical and novel splicing regulatory motifs. In Clinvar dataset, splicing variants are highly related to disease pathogenicity. Genome-wide analyses of common intronic splicing variants nominated one variant linked to ALS progression. Deep learning analyses of WGS data from 1,370 ALS patients revealed rare splicing variants in reported ALS genes (such as PTPRN2 and CFAP410, validated through minigene assays and RNA-seq), and TDP-43 LOF related RNA mis-splicing genes (such as PTPRD). Further genetic analysis and minigene assays nominated PCP4 and TMEM63A as ALS-associated genes. Functional assays demonstrated that PCP4 is critical for maintaining spine density and can rescue spine loss in neurons expressing a disease-causing TDP-43 mutant. In summary, we developed Spliformer and Spliformer-motif that accurately predict and interpret pre-mRNA splicing. Our findings highlight an intronic genetic mechanism driving RNA mis-splicing in ALS and nominate PCP4 as an ALS-associated gene.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
剪接变异的深度学习分析确定了PCP4与肌萎缩侧索硬化症的联系
肌萎缩性侧索硬化症(ALS)是一种严重的运动神经元疾病,大多数散发病例缺乏明确的遗传原因。异常前mrna剪接是神经退行性疾病的基本机制。例如,TAR dna结合蛋白43 (TDP-43)功能丧失(LOF)导致ALS中广泛的RNA错剪接事件。此外,剪接突变是神经系统疾病的主要原因。然而,在ALS中,内含子变异体驱动RNA错误剪接的作用仍然知之甚少。为了解决这个问题,我们开发了Spliformer来预测RNA剪接。Spliformer是一种基于转换器的深度学习模型,对来自GENCODE数据库的剪接事件以及来自血液和中枢神经系统组织的RNA-seq数据进行训练和测试。我们使用测试数据集和配对全基因组测序(WGS)与RNA-seq数据对Spliformer与SpliceAI和穿山甲进行基准比对。我们进一步开发了Spliformer-motif模型来识别剪接调控基序。我们分析了Clinvar数据集,以确定剪接变异与疾病致病性的联系。此外,我们分析了ALS患者和对照组的WGS数据,以确定与ALS风险或疾病表型相关的常见内含子剪接变异。我们还分析了ALS患者中罕见的内含子剪接变异,以鉴定已知或新的ALS相关基因。采用迷你基因检测来验证候选剪接变异体。最后,我们测量了特定基因敲低或表达TDP-43致病突变的神经元的脊柱密度。Spliformer准确地预测了前mrna序列内核苷酸是剪接供体、受体还是两者都不是的可能性。Spliformer在测试剪接事件和/或配对WGS/RNA-seq数据的速度和准确性方面都优于SpliceAI和穿山甲。Spliformer-motif成功鉴定出典型和新颖的剪接调控基序。在Clinvar数据集中,剪接变异与疾病致病性高度相关。对常见内含子剪接变异体的全基因组分析表明,其中一种变异体与ALS进展有关。对1370例ALS患者WGS数据的深度学习分析显示,报告的ALS基因(如PTPRN2和CFAP410,通过minigene分析和RNA-seq验证)和TDP-43 LOF相关RNA错剪接基因(如PTPRD)中存在罕见的剪接变异。进一步的遗传分析和基因分析表明PCP4和TMEM63A是als相关基因。功能分析表明,PCP4对于维持脊柱密度至关重要,并且可以在表达致病性TDP-43突变体的神经元中挽救脊柱损失。总之,我们开发了Spliformer和Spliformer-motif,可以准确预测和解释pre-mRNA剪接。我们的研究结果强调了在ALS中驱动RNA错误剪接的内含子遗传机制,并提名PCP4为ALS相关基因。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Brain
Brain 医学-临床神经学
CiteScore
20.30
自引率
4.10%
发文量
458
审稿时长
3-6 weeks
期刊介绍: Brain, a journal focused on clinical neurology and translational neuroscience, has been publishing landmark papers since 1878. The journal aims to expand its scope by including studies that shed light on disease mechanisms and conducting innovative clinical trials for brain disorders. With a wide range of topics covered, the Editorial Board represents the international readership and diverse coverage of the journal. Accepted articles are promptly posted online, typically within a few weeks of acceptance. As of 2022, Brain holds an impressive impact factor of 14.5, according to the Journal Citation Reports.
期刊最新文献
Forget processing speed: cognitive changes in multiple sclerosis call for new measurement tools. What comes next for global neurology? From 'senile dementia' to non-vascular Alzheimer's synucleinopathy: the evolution of dementia classification. Retraction of: 'Accumulation of oligomer-prone α-synuclein exacerbates synaptic and neuronal degeneration in vivo' and 'Correction to: Accumulation of oligomer-prone α-synuclein exacerbates synaptic and neuronal degeneration in vivo'. HML-2 env knockdown by AAV9-mediated miRNAs attenuates amyotrophic lateral sclerosis-like manifestations in mice.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1