Structure-based network analysis predicts pathogenic variants in human proteins associated with inherited retinal disease.

IF 4.7 2区 医学 Q1 GENETICS & HEREDITY NPJ Genomic Medicine Pub Date : 2024-05-27 DOI:10.1038/s41525-024-00416-w
Blake M Hauser, Yuyang Luo, Anusha Nathan, Ahmad Al-Moujahed, Demetrios G Vavvas, Jason Comander, Eric A Pierce, Emily M Place, Kinga M Bujakowska, Gaurav D Gaiha, Elizabeth J Rossin
{"title":"Structure-based network analysis predicts pathogenic variants in human proteins associated with inherited retinal disease.","authors":"Blake M Hauser, Yuyang Luo, Anusha Nathan, Ahmad Al-Moujahed, Demetrios G Vavvas, Jason Comander, Eric A Pierce, Emily M Place, Kinga M Bujakowska, Gaurav D Gaiha, Elizabeth J Rossin","doi":"10.1038/s41525-024-00416-w","DOIUrl":null,"url":null,"abstract":"<p><p>Advances in gene sequencing technologies have accelerated the identification of genetic variants, but better tools are needed to understand which are causal of disease. This would be particularly useful in fields where gene therapy is a potential therapeutic modality for a disease-causing variant such as inherited retinal disease (IRD). Here, we apply structure-based network analysis (SBNA), which has been successfully utilized to identify variant-constrained amino acid residues in viral proteins, to identify residues that may cause IRD if subject to missense mutation. SBNA is based entirely on structural first principles and is not fit to specific outcome data, which makes it distinct from other contemporary missense prediction tools. In 4 well-studied human disease-associated proteins (BRCA1, HRAS, PTEN, and ERK2) with high-quality structural data, we find that SBNA scores correlate strongly with deep mutagenesis data. When applied to 47 IRD genes with available high-quality crystal structure data, SBNA scores reliably identified disease-causing variants according to phenotype definitions from the ClinVar database. Finally, we applied this approach to 63 patients at Massachusetts Eye and Ear (MEE) with IRD but for whom no genetic cause had been identified. Untrained models built using SBNA scores and BLOSUM62 scores for IRD-associated genes successfully predicted the pathogenicity of novel variants (AUC = 0.851), allowing us to identify likely causative disease variants in 40 IRD patients. Model performance was further augmented by incorporating orthogonal data from EVE scores (AUC = 0.927), which are based on evolutionary multiple sequence alignments. In conclusion, SBNA can used to successfully identify variants as causal of disease in human proteins and may help predict variants causative of IRD in an unbiased fashion.</p>","PeriodicalId":19273,"journal":{"name":"NPJ Genomic Medicine","volume":"9 1","pages":"31"},"PeriodicalIF":4.7000,"publicationDate":"2024-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11130145/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"NPJ Genomic Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1038/s41525-024-00416-w","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0

Abstract

Advances in gene sequencing technologies have accelerated the identification of genetic variants, but better tools are needed to understand which are causal of disease. This would be particularly useful in fields where gene therapy is a potential therapeutic modality for a disease-causing variant such as inherited retinal disease (IRD). Here, we apply structure-based network analysis (SBNA), which has been successfully utilized to identify variant-constrained amino acid residues in viral proteins, to identify residues that may cause IRD if subject to missense mutation. SBNA is based entirely on structural first principles and is not fit to specific outcome data, which makes it distinct from other contemporary missense prediction tools. In 4 well-studied human disease-associated proteins (BRCA1, HRAS, PTEN, and ERK2) with high-quality structural data, we find that SBNA scores correlate strongly with deep mutagenesis data. When applied to 47 IRD genes with available high-quality crystal structure data, SBNA scores reliably identified disease-causing variants according to phenotype definitions from the ClinVar database. Finally, we applied this approach to 63 patients at Massachusetts Eye and Ear (MEE) with IRD but for whom no genetic cause had been identified. Untrained models built using SBNA scores and BLOSUM62 scores for IRD-associated genes successfully predicted the pathogenicity of novel variants (AUC = 0.851), allowing us to identify likely causative disease variants in 40 IRD patients. Model performance was further augmented by incorporating orthogonal data from EVE scores (AUC = 0.927), which are based on evolutionary multiple sequence alignments. In conclusion, SBNA can used to successfully identify variants as causal of disease in human proteins and may help predict variants causative of IRD in an unbiased fashion.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于结构的网络分析预测了与遗传性视网膜疾病相关的人类蛋白质中的致病变体。
基因测序技术的进步加快了基因变异的鉴定速度,但还需要更好的工具来了解哪些基因变异是致病因素。这在基因治疗是一种潜在的致病变异治疗方法的领域尤其有用,如遗传性视网膜病(IRD)。在这里,我们应用基于结构的网络分析(SBNA)来识别病毒蛋白质中受变异约束的氨基酸残基,该方法已成功用于识别发生错义突变时可能导致 IRD 的残基。SBNA 完全基于结构第一性原理,并不与特定结果数据相匹配,这使其有别于其他当代的错义预测工具。在具有高质量结构数据的 4 个研究充分的人类疾病相关蛋白(BRCA1、HRAS、PTEN 和 ERK2)中,我们发现 SBNA 分数与深度诱变数据密切相关。当应用于 47 个具有高质量晶体结构数据的 IRD 基因时,根据 ClinVar 数据库中的表型定义,SBNA 评分可靠地鉴定出了致病变体。最后,我们将这种方法应用于马萨诸塞眼耳科(MEE)的 63 名 IRD 患者,这些患者的遗传病因尚未确定。使用 IRD 相关基因的 SBNA 评分和 BLOSUM62 评分建立的未训练模型成功预测了新型变异体的致病性(AUC = 0.851),使我们能够确定 40 例 IRD 患者中可能的致病变异体。基于进化多序列比对的 EVE 评分(AUC = 0.927)纳入了正交数据,进一步提高了模型性能。总之,SBNA 可用于成功识别人类蛋白质中的致病变异,并有助于以无偏见的方式预测 IRD 的致病变异。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
NPJ Genomic Medicine
NPJ Genomic Medicine Biochemistry, Genetics and Molecular Biology-Molecular Biology
CiteScore
9.40
自引率
1.90%
发文量
67
审稿时长
17 weeks
期刊介绍: npj Genomic Medicine is an international, peer-reviewed journal dedicated to publishing the most important scientific advances in all aspects of genomics and its application in the practice of medicine. The journal defines genomic medicine as "diagnosis, prognosis, prevention and/or treatment of disease and disorders of the mind and body, using approaches informed or enabled by knowledge of the genome and the molecules it encodes." Relevant and high-impact papers that encompass studies of individuals, families, or populations are considered for publication. An emphasis will include coupling detailed phenotype and genome sequencing information, both enabled by new technologies and informatics, to delineate the underlying aetiology of disease. Clinical recommendations and/or guidelines of how that data should be used in the clinical management of those patients in the study, and others, are also encouraged.
期刊最新文献
Implementing genomic medicine in clinical practice for adults with undiagnosed rare diseases. Germline sequence variation in cancer genes in Rwandan breast and prostate cancer cases. Common protein-altering variant in GFAP is associated with white matter lesions in the older Japanese population. Benchmarking nanopore sequencing and rapid genomics feasibility: validation at a quaternary hospital in New Zealand. Coding and non-coding variants in the ciliopathy gene CFAP410 cause early-onset non-syndromic retinal degeneration.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1