Multivariate canonical correlation analysis identifies additional genetic variants for chronic kidney disease.
Amy J Osborne, Agnieszka Bierzynska, Elizabeth Colby, Uwe Andag, Philip A Kalra, Olivier Radresa, Philipp Skroblin, Maarten W Taal, Gavin I Welsh, Moin A Saleem, Colin Campbell
Journal: NPJ Systems Biology and Applications (Impact Factor 3.5; JCR Q1, Mathematical & Computational Biology)
Publication date: 2024-03-09
DOI: https://doi.org/10.1038/s41540-024-00350-8
Full text (PDF): https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10924093/pdf/
Citations: 0
Abstract
Chronic kidney diseases (CKD) have genetic associations with kidney function. Univariate genome-wide association studies (GWAS) have identified single nucleotide polymorphisms (SNPs) associated with estimated glomerular filtration rate (eGFR) and blood urea nitrogen (BUN), two complementary kidney function markers. However, it is unknown whether additional SNPs for kidney function can be identified by multivariate statistical analysis. To address this, we applied canonical correlation analysis (CCA), a multivariate method, to two individual-level CKD genotype datasets, and metaCCA to two published GWAS summary statistics datasets. With high replication rates, we identified SNPs previously associated with kidney function by published univariate GWASs, validating the metaCCA method. We then extended discovery and identified previously unreported lead SNPs for both kidney function markers, jointly. These showed expression quantitative trait loci (eQTL) colocalisation with genes having significant differential expression between CKD and healthy individuals. Several of these identified lead missense SNPs were predicted to have a functional impact, including in SLC14A2. We also identified previously unreported lead SNPs that showed significant correlation with both kidney function markers, jointly, in the European ancestry CKDGen, National Unified Renal Translational Research Enterprise (NURTuRE)-CKD and Salford Kidney Study (SKS) datasets. Of these, rs3094060 colocalised with FLOT1 gene expression and was significantly more common in CKD cases in both NURTuRE-CKD and SKS than in the general population. Overall, by using multivariate analysis by CCA, we identified additional SNPs and genes for both kidney function and CKD that can be prioritised for further CKD analyses.
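To illustrate why testing a SNP against eGFR and BUN jointly can reveal signal that a single-trait test understates, the following is a minimal sketch of CCA on synthetic data. It is not the authors' pipeline and does not use metaCCA: the function `canonical_correlations`, the simulated allele counts, and the effect sizes are all assumptions made for illustration. With a single SNP on one side, the first canonical correlation equals the multiple correlation between the genotype and the two traits, so it can never fall below either single-trait correlation.

```python
import numpy as np

def canonical_correlations(X, Y):
    """Canonical correlations between two column sets (rows = individuals)."""
    Xc = X - X.mean(axis=0)
    Yc = Y - Y.mean(axis=0)
    # Orthonormal bases for the centred column spaces of X and Y.
    Qx, _ = np.linalg.qr(Xc)
    Qy, _ = np.linalg.qr(Yc)
    # The singular values of Qx^T Qy are the canonical correlations.
    s = np.linalg.svd(Qx.T @ Qy, compute_uv=False)
    return np.clip(s, 0.0, 1.0)

# Synthetic data only: one SNP with small, opposite effects on two traits.
rng = np.random.default_rng(0)
n = 5000
genotype = rng.binomial(2, 0.3, size=(n, 1)).astype(float)  # allele counts 0/1/2
egfr = -0.10 * genotype[:, 0] + rng.normal(size=n)  # hypothetical effect size
bun = 0.10 * genotype[:, 0] + rng.normal(size=n)    # hypothetical effect size
traits = np.column_stack([egfr, bun])

r_joint = canonical_correlations(genotype, traits)[0]
r_egfr = abs(np.corrcoef(genotype[:, 0], egfr)[0, 1])
r_bun = abs(np.corrcoef(genotype[:, 0], bun)[0, 1])

print(f"joint canonical correlation: {r_joint:.4f}")
print(f"single-trait |r| (eGFR):     {r_egfr:.4f}")
print(f"single-trait |r| (BUN):      {r_bun:.4f}")
```

The QR-then-SVD route is chosen here because it avoids explicitly inverting covariance matrices. Methods that work from GWAS summary statistics, such as metaCCA, estimate the required covariance structure differently, and that step is not reproduced in this sketch.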
About the journal:
npj Systems Biology and Applications is an online Open Access journal dedicated to publishing the premier research that takes a systems-oriented approach. The journal aims to provide a forum for the presentation of articles that help define this nascent field, as well as those that apply the advances to wider fields. We encourage studies that integrate, or aid the integration of, data, analyses and insight from molecules to organisms and broader systems. Important areas of interest include not only fundamental biological systems and drug discovery, but also applications to health, medical practice and implementation, big data, biotechnology, food science, human behaviour, broader biological systems and industrial applications of systems biology.
We encourage all approaches, including network biology, application of control theory to biological systems, computational modelling and analysis, comprehensive and/or high-content measurements, theoretical, analytical and computational studies of system-level properties of biological systems and computational/software/data platforms enabling such studies.