A validated heart-specific model for splice-disrupting variants in childhood heart disease.

IF 10.4 1区 生物学 Q1 GENETICS & HEREDITY Genome Medicine Pub Date : 2024-10-15 DOI:10.1186/s13073-024-01383-8
Robert Lesurf, Jeroen Breckpot, Jade Bouwmeester, Nour Hanafi, Anjali Jain, Yijing Liang, Tanya Papaz, Jane Lougheed, Tapas Mondal, Mahmoud Alsalehi, Luis Altamirano-Diaz, Erwin Oechslin, Enrique Audain, Gregor Dombrowsky, Alex V Postma, Odilia I Woudstra, Berto J Bouma, Marc-Phillip Hitz, Connie R Bezzina, Gillian M Blue, David S Winlaw, Seema Mital
{"title":"A validated heart-specific model for splice-disrupting variants in childhood heart disease.","authors":"Robert Lesurf, Jeroen Breckpot, Jade Bouwmeester, Nour Hanafi, Anjali Jain, Yijing Liang, Tanya Papaz, Jane Lougheed, Tapas Mondal, Mahmoud Alsalehi, Luis Altamirano-Diaz, Erwin Oechslin, Enrique Audain, Gregor Dombrowsky, Alex V Postma, Odilia I Woudstra, Berto J Bouma, Marc-Phillip Hitz, Connie R Bezzina, Gillian M Blue, David S Winlaw, Seema Mital","doi":"10.1186/s13073-024-01383-8","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Congenital heart disease (CHD) is the most common congenital anomaly. Almost 90% of isolated cases have an unexplained genetic etiology after clinical testing. Non-canonical splice variants that disrupt mRNA splicing through the loss or creation of exon boundaries are not routinely captured and/or evaluated by standard clinical genetic tests. Recent computational algorithms such as SpliceAI have shown an ability to predict such variants, but are not specific to cardiac-expressed genes and transcriptional isoforms.</p><p><strong>Methods: </strong>We used genome sequencing (GS) (n = 1101 CHD probands) and myocardial RNA-Sequencing (RNA-Seq) (n = 154 CHD and n = 43 cardiomyopathy probands) to identify and validate splice disrupting variants, and to develop a heart-specific model for canonical and non-canonical splice variants that can be applied to patients with CHD and cardiomyopathy. Two thousand five hundred seventy GS samples from the Medical Genome Reference Bank were analyzed as healthy controls.</p><p><strong>Results: </strong>Of 8583 rare DNA splice-disrupting variants initially identified using SpliceAI, 100 were associated with altered splice junctions in the corresponding patient myocardium affecting 95 genes. Using strength of myocardial gene expression and genome-wide DNA variant features that were confirmed to affect splicing in myocardial RNA, we trained a machine learning model for predicting cardiac-specific splice-disrupting variants (AUC 0.86 on internal validation). In a validation set of 48 CHD probands, the cardiac-specific model outperformed a SpliceAI model alone (AUC 0.94 vs 0.67 respectively). Application of this model to an additional 947 CHD probands with only GS data identified 1% patients with canonical and 11% patients with non-canonical splice-disrupting variants in CHD genes. Forty-nine percent of predicted splice-disrupting variants were intronic and > 10 bp from existing splice junctions. The burden of high-confidence splice-disrupting variants in CHD genes was 1.28-fold higher in CHD cases compared with healthy controls.</p><p><strong>Conclusions: </strong>A new cardiac-specific in silico model was developed using complementary GS and RNA-Seq data that improved genetic yield by identifying a significant burden of non-canonical splice variants associated with CHD that would not be detectable through panel or exome sequencing.</p>","PeriodicalId":12645,"journal":{"name":"Genome Medicine","volume":"16 1","pages":"119"},"PeriodicalIF":10.4000,"publicationDate":"2024-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11476204/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Genome Medicine","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1186/s13073-024-01383-8","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Congenital heart disease (CHD) is the most common congenital anomaly. Almost 90% of isolated cases have an unexplained genetic etiology after clinical testing. Non-canonical splice variants that disrupt mRNA splicing through the loss or creation of exon boundaries are not routinely captured and/or evaluated by standard clinical genetic tests. Recent computational algorithms such as SpliceAI have shown an ability to predict such variants, but are not specific to cardiac-expressed genes and transcriptional isoforms.

Methods: We used genome sequencing (GS) (n = 1101 CHD probands) and myocardial RNA-Sequencing (RNA-Seq) (n = 154 CHD and n = 43 cardiomyopathy probands) to identify and validate splice disrupting variants, and to develop a heart-specific model for canonical and non-canonical splice variants that can be applied to patients with CHD and cardiomyopathy. Two thousand five hundred seventy GS samples from the Medical Genome Reference Bank were analyzed as healthy controls.

Results: Of 8583 rare DNA splice-disrupting variants initially identified using SpliceAI, 100 were associated with altered splice junctions in the corresponding patient myocardium affecting 95 genes. Using strength of myocardial gene expression and genome-wide DNA variant features that were confirmed to affect splicing in myocardial RNA, we trained a machine learning model for predicting cardiac-specific splice-disrupting variants (AUC 0.86 on internal validation). In a validation set of 48 CHD probands, the cardiac-specific model outperformed a SpliceAI model alone (AUC 0.94 vs 0.67 respectively). Application of this model to an additional 947 CHD probands with only GS data identified 1% patients with canonical and 11% patients with non-canonical splice-disrupting variants in CHD genes. Forty-nine percent of predicted splice-disrupting variants were intronic and > 10 bp from existing splice junctions. The burden of high-confidence splice-disrupting variants in CHD genes was 1.28-fold higher in CHD cases compared with healthy controls.

Conclusions: A new cardiac-specific in silico model was developed using complementary GS and RNA-Seq data that improved genetic yield by identifying a significant burden of non-canonical splice variants associated with CHD that would not be detectable through panel or exome sequencing.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
儿童心脏病中剪接干扰变异的特异性心脏模型经过验证。
背景:先天性心脏病(CHD)是最常见的先天性畸形:先天性心脏病(CHD)是最常见的先天性畸形。近 90% 的孤立病例经临床检测后,遗传病因不明。非规范剪接变异通过丢失或创建外显子边界来破坏 mRNA 的剪接,但标准临床基因检测并未对其进行常规捕捉和/或评估。最近的计算算法(如 SpliceAI)已显示出预测此类变异的能力,但并不针对心脏表达基因和转录同工酶:我们使用基因组测序(GS)(n = 1101 个 CHD 病例)和心肌 RNA 序列测序(RNA-Seq)(n = 154 个 CHD 病例和 n = 43 个心肌病病例)来鉴定和验证剪接干扰变异,并开发出一个心脏特异性的规范和非规范剪接变异模型,该模型可应用于 CHD 和心肌病患者。作为健康对照,对医学基因组参考库中的 2,570 个 GS 样本进行了分析:结果:在使用 SpliceAI 初步鉴定出的 8583 个罕见 DNA 剪接干扰变异中,有 100 个与相应患者心肌中剪接接头的改变有关,影响 95 个基因。利用心肌基因表达强度和已证实会影响心肌 RNA 剪接的全基因组 DNA 变异特征,我们训练了一个机器学习模型来预测心脏特异性剪接干扰变异(内部验证的 AUC 为 0.86)。在由 48 名冠心病受试者组成的验证集中,心脏特异性模型的表现优于单独的 SpliceAI 模型(AUC 分别为 0.94 和 0.67)。将该模型应用于另外 947 个只有 GS 数据的冠心病受试者,发现 1%的患者存在冠心病基因中的典型剪接干扰变异,11% 的患者存在非典型剪接干扰变异。49%的预测剪接干扰变异为内含子变异,距离现有剪接接头> 10 bp。与健康对照组相比,冠心病病例中冠心病基因高置信度剪接干扰变异的负担要高出1.28倍:利用互补的GS和RNA-Seq数据开发了一种新的心脏特异性硅学模型,该模型通过识别与CHD相关的大量非经典剪接变异提高了遗传产量,而这些变异是无法通过基因组或外显子组测序检测到的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Genome Medicine
Genome Medicine GENETICS & HEREDITY-
CiteScore
20.80
自引率
0.80%
发文量
128
审稿时长
6-12 weeks
期刊介绍: Genome Medicine is an open access journal that publishes outstanding research applying genetics, genomics, and multi-omics to understand, diagnose, and treat disease. Bridging basic science and clinical research, it covers areas such as cancer genomics, immuno-oncology, immunogenomics, infectious disease, microbiome, neurogenomics, systems medicine, clinical genomics, gene therapies, precision medicine, and clinical trials. The journal publishes original research, methods, software, and reviews to serve authors and promote broad interest and importance in the field.
期刊最新文献
HGVS Nomenclature 2024: improvements to community engagement, usability, and computability. Tracing carriage, acquisition, and transmission of ESBL-producing Escherichia coli over two years in a tertiary care hospital. SARS-CoV-2 introductions to the island of Ireland: a phylogenetic and geospatiotemporal study of infection dynamics. Epigenome-wide association studies identify novel DNA methylation sites associated with PTSD: a meta-analysis of 23 military and civilian cohorts. Patterns of genomic instability in > 2000 patients with ovarian cancer across six clinical trials evaluating olaparib.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1