Bian Li, Jeffrey L Mendenhall, Brett M Kroncke, Keenan C Taylor, Hui Huang, Derek K Smith, Carlos G Vanoye, Jeffrey D Blume, Alfred L George, Charles R Sanders, Jens Meiler
{"title":"Predicting the Functional Impact of KCNQ1 Variants of Unknown Significance.","authors":"Bian Li, Jeffrey L Mendenhall, Brett M Kroncke, Keenan C Taylor, Hui Huang, Derek K Smith, Carlos G Vanoye, Jeffrey D Blume, Alfred L George, Charles R Sanders, Jens Meiler","doi":"10.1161/CIRCGENETICS.117.001754","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>An emerging standard-of-care for long-QT syndrome uses clinical genetic testing to identify genetic variants of the KCNQ1 potassium channel. However, interpreting results from genetic testing is confounded by the presence of variants of unknown significance for which there is inadequate evidence of pathogenicity.</p><p><strong>Methods and results: </strong>In this study, we curated from the literature a high-quality set of 107 functionally characterized KCNQ1 variants. Based on this data set, we completed a detailed quantitative analysis on the sequence conservation patterns of subdomains of KCNQ1 and the distribution of pathogenic variants therein. We found that conserved subdomains generally are critical for channel function and are enriched with dysfunctional variants. Using this experimentally validated data set, we trained a neural network, designated Q1VarPred, specifically for predicting the functional impact of KCNQ1 variants of unknown significance. The estimated predictive performance of Q1VarPred in terms of Matthew's correlation coefficient and area under the receiver operating characteristic curve were 0.581 and 0.884, respectively, superior to the performance of 8 previous methods tested in parallel. Q1VarPred is publicly available as a web server at http://meilerlab.org/q1varpred.</p><p><strong>Conclusions: </strong>Although a plethora of tools are available for making pathogenicity predictions over a genome-wide scale, previous tools fail to perform in a robust manner when applied to KCNQ1. The contrasting and favorable results for Q1VarPred suggest a promising approach, where a machine-learning algorithm is tailored to a specific protein target and trained with a functionally validated data set to calibrate informatics tools.</p>","PeriodicalId":10277,"journal":{"name":"Circulation: Cardiovascular Genetics","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1161/CIRCGENETICS.117.001754","citationCount":"34","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Circulation: Cardiovascular Genetics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1161/CIRCGENETICS.117.001754","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 34
Abstract
Background: An emerging standard-of-care for long-QT syndrome uses clinical genetic testing to identify genetic variants of the KCNQ1 potassium channel. However, interpreting results from genetic testing is confounded by the presence of variants of unknown significance for which there is inadequate evidence of pathogenicity.
Methods and results: In this study, we curated from the literature a high-quality set of 107 functionally characterized KCNQ1 variants. Based on this data set, we completed a detailed quantitative analysis on the sequence conservation patterns of subdomains of KCNQ1 and the distribution of pathogenic variants therein. We found that conserved subdomains generally are critical for channel function and are enriched with dysfunctional variants. Using this experimentally validated data set, we trained a neural network, designated Q1VarPred, specifically for predicting the functional impact of KCNQ1 variants of unknown significance. The estimated predictive performance of Q1VarPred in terms of Matthew's correlation coefficient and area under the receiver operating characteristic curve were 0.581 and 0.884, respectively, superior to the performance of 8 previous methods tested in parallel. Q1VarPred is publicly available as a web server at http://meilerlab.org/q1varpred.
Conclusions: Although a plethora of tools are available for making pathogenicity predictions over a genome-wide scale, previous tools fail to perform in a robust manner when applied to KCNQ1. The contrasting and favorable results for Q1VarPred suggest a promising approach, where a machine-learning algorithm is tailored to a specific protein target and trained with a functionally validated data set to calibrate informatics tools.
期刊介绍:
Circulation: Genomic and Precision Medicine considers all types of original research articles, including studies conducted in human subjects, laboratory animals, in vitro, and in silico. Articles may include investigations of: clinical genetics as applied to the diagnosis and management of monogenic or oligogenic cardiovascular disorders; the molecular basis of complex cardiovascular disorders, including genome-wide association studies, exome and genome sequencing-based association studies, coding variant association studies, genetic linkage studies, epigenomics, transcriptomics, proteomics, metabolomics, and metagenomics; integration of electronic health record data or patient-generated data with any of the aforementioned approaches, including phenome-wide association studies, or with environmental or lifestyle factors; pharmacogenomics; regulation of gene expression; gene therapy and therapeutic genomic editing; systems biology approaches to the diagnosis and management of cardiovascular disorders; novel methods to perform any of the aforementioned studies; and novel applications of precision medicine. Above all, we seek studies with relevance to human cardiovascular biology and disease.