Currently, the only effect size prior that is routinely implemented in a Bayesian fine-mapping multi-single-nucleotide polymorphism (SNP) analysis is the Gaussian prior. Here, we show how the Laplace prior can be deployed in Bayesian multi-SNP fine mapping studies. We compare the ranking performance of the posterior inclusion probability (PIP) using a Laplace prior with the ranking performance of the corresponding Gaussian prior and FINEMAP. Our results indicate that, for the simulation scenarios we consider here, the Laplace prior can lead to higher PIPs than either the Gaussian prior or FINEMAP, particularly for moderately sized fine-mapping studies. The Laplace prior also appears to have better worst-case scenario properties. We reanalyse the iCOGS case–control data from the CASP8 region on Chromosome 2. Even though this study has a total sample size of nearly 90,000 individuals, there are still some differences in the top few ranked SNPs if the Laplace prior is used rather than the Gaussian prior. R code to implement the Laplace (and Gaussian) prior is available at https://github.com/Kevin-walters/lapmapr.
{"title":"Bayesian multivariant fine mapping using the Laplace prior","authors":"Kevin Walters, Hannuun Yaacob","doi":"10.1002/gepi.22517","DOIUrl":"10.1002/gepi.22517","url":null,"abstract":"<p>Currently, the only effect size prior that is routinely implemented in a Bayesian fine-mapping multi-single-nucleotide polymorphism (SNP) analysis is the Gaussian prior. Here, we show how the Laplace prior can be deployed in Bayesian multi-SNP fine mapping studies. We compare the ranking performance of the posterior inclusion probability (PIP) using a Laplace prior with the ranking performance of the corresponding Gaussian prior and FINEMAP. Our results indicate that, for the simulation scenarios we consider here, the Laplace prior can lead to higher PIPs than either the Gaussian prior or FINEMAP, particularly for moderately sized fine-mapping studies. The Laplace prior also appears to have better worst-case scenario properties. We reanalyse the iCOGS case–control data from the CASP8 region on Chromosome 2. Even though this study has a total sample size of nearly 90,000 individuals, there are still some differences in the top few ranked SNPs if the Laplace prior is used rather than the Gaussian prior. R code to implement the Laplace (and Gaussian) prior is available at https://github.com/Kevin-walters/lapmapr.</p>","PeriodicalId":12710,"journal":{"name":"Genetic Epidemiology","volume":"47 3","pages":"249-260"},"PeriodicalIF":2.1,"publicationDate":"2023-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/gepi.22517","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9120129","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Genetic variants in drug targets can be used to predict the long-term, on-target effect of drugs. Here, we extend this principle to assess how sex and body mass index may modify the effect of genetically predicted lower CETP levels on biomarkers and cardiovascular outcomes. We found sex and body mass index (BMI) to be modifiers of the association between genetically predicted lower CETP and lipid biomarkers in UK Biobank participants. Female sex and lower BMI were associated with higher high-density lipoprotein cholesterol and lower low-density lipoprotein cholesterol for the same genetically predicted reduction in CETP concentration. We found that sex also modulated the effect of genetically lower CETP on cholesterol efflux capacity in samples from the Montreal Heart Institute Biobank. However, these modifying effects did not extend to sex differences in cardiovascular outcomes in our data. Our results provide insight into the clinical effects of CETP inhibitors in the presence of effect modification based on genetic data. The approach can support precision medicine applications and help assess the external validity of clinical trials.
{"title":"Study of effect modifiers of genetically predicted CETP reduction","authors":"Marc-André Legault, Amina Barhdadi, Isabel Gamache, Audrey Lemaçon, Louis-Philippe Lemieux Perreault, Jean-Christophe Grenier, Marie-Pierre Sylvestre, Julie G. Hussin, David Rhainds, Jean-Claude Tardif, Marie-Pierre Dubé","doi":"10.1002/gepi.22514","DOIUrl":"10.1002/gepi.22514","url":null,"abstract":"<p>Genetic variants in drug targets can be used to predict the long-term, on-target effect of drugs. Here, we extend this principle to assess how sex and body mass index may modify the effect of genetically predicted lower CETP levels on biomarkers and cardiovascular outcomes. We found sex and body mass index (BMI) to be modifiers of the association between genetically predicted lower CETP and lipid biomarkers in UK Biobank participants. Female sex and lower BMI were associated with higher high-density lipoprotein cholesterol and lower low-density lipoprotein cholesterol for the same genetically predicted reduction in CETP concentration. We found that sex also modulated the effect of genetically lower CETP on cholesterol efflux capacity in samples from the Montreal Heart Institute Biobank. However, these modifying effects did not extend to sex differences in cardiovascular outcomes in our data. Our results provide insight into the clinical effects of CETP inhibitors in the presence of effect modification based on genetic data. The approach can support precision medicine applications and help assess the external validity of clinical trials.</p>","PeriodicalId":12710,"journal":{"name":"Genetic Epidemiology","volume":"47 2","pages":"198-212"},"PeriodicalIF":2.1,"publicationDate":"2023-01-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/gepi.22514","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9406442","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
In genome-wide association studies (GWAS) for thousands of phenotypes in biobanks, most binary phenotypes have substantially fewer cases than controls. Many widely used approaches for joint analysis of multiple phenotypes produce inflated type I error rates for such extremely unbalanced case-control phenotypes. In this research, we develop a method to jointly analyze multiple unbalanced case-control phenotypes to circumvent this issue. We first group multiple phenotypes into different clusters based on a hierarchical clustering method, then we merge phenotypes in each cluster into a single phenotype. In each cluster, we use the saddlepoint approximation to estimate the p value of an association test between the merged phenotype and a single nucleotide polymorphism (SNP) which eliminates the issue of inflated type I error rate of the test for extremely unbalanced case-control phenotypes. Finally, we use the Cauchy combination method to obtain an integrated p value for all clusters to test the association between multiple phenotypes and a SNP. We use extensive simulation studies to evaluate the performance of the proposed approach. The results show that the proposed approach can control type I error rate very well and is more powerful than other available methods. We also apply the proposed approach to phenotypes in category IX (diseases of the circulatory system) in the UK Biobank. We find that the proposed approach can identify more significant SNPs than the other viable methods we compared with.
{"title":"Joint analysis of multiple phenotypes for extremely unbalanced case-control association studies","authors":"Hongjing Xie, Xuewei Cao, Shuanglin Zhang, Qiuying Sha","doi":"10.1002/gepi.22513","DOIUrl":"10.1002/gepi.22513","url":null,"abstract":"<p>In genome-wide association studies (GWAS) for thousands of phenotypes in biobanks, most binary phenotypes have substantially fewer cases than controls. Many widely used approaches for joint analysis of multiple phenotypes produce inflated type I error rates for such extremely unbalanced case-control phenotypes. In this research, we develop a method to jointly analyze multiple unbalanced case-control phenotypes to circumvent this issue. We first group multiple phenotypes into different clusters based on a hierarchical clustering method, then we merge phenotypes in each cluster into a single phenotype. In each cluster, we use the saddlepoint approximation to estimate the <i>p</i> value of an association test between the merged phenotype and a single nucleotide polymorphism (SNP) which eliminates the issue of inflated type I error rate of the test for extremely unbalanced case-control phenotypes. Finally, we use the Cauchy combination method to obtain an integrated <i>p</i> value for all clusters to test the association between multiple phenotypes and a SNP. We use extensive simulation studies to evaluate the performance of the proposed approach. The results show that the proposed approach can control type I error rate very well and is more powerful than other available methods. We also apply the proposed approach to phenotypes in category IX (diseases of the circulatory system) in the UK Biobank. We find that the proposed approach can identify more significant SNPs than the other viable methods we compared with.</p>","PeriodicalId":12710,"journal":{"name":"Genetic Epidemiology","volume":"47 2","pages":"185-197"},"PeriodicalIF":2.1,"publicationDate":"2023-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9906667","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Analysis of host genetic components provides insights into the susceptibility and response to viral infection such as severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), which causes coronavirus disease 2019 (COVID-19). To reveal genetic determinants of susceptibility to COVID-19 related mortality, we train a deep learning model to identify groups of genetic variants and their interactions that contribute to the COVID-19 related mortality risk using the UK Biobank data (28,097 affected cases and 1656 deaths). We refer to such groups of variants as super variants. We identify 15 super variants with various levels of significance as susceptibility loci for COVID-19 mortality. Specifically, we identify a super variant (odds ratio [OR] = 1.594, p = 5.47 × 10−9) on Chromosome 7 that consists of the minor allele of rs76398985, rs6943608, rs2052130, 7:150989011_CT_C, rs118033050, and rs12540488. We also discover a super variant (OR = 1.353, p = 2.87 × 10−8) on Chromosome 5 that contains rs12517344, rs72733036, rs190052994, rs34723029, rs72734818, 5:9305797_GTA_G, and rs180899355.
对宿主遗传成分的分析可以深入了解对病毒感染的易感性和反应,例如导致2019冠状病毒病(COVID-19)的严重急性呼吸综合征冠状病毒2 (SARS-CoV-2)。为了揭示COVID-19相关死亡率易感性的遗传决定因素,我们训练了一个深度学习模型,利用英国生物银行(UK Biobank)的数据(28,097例受影响病例和1656例死亡)来识别导致COVID-19相关死亡率风险的遗传变异组及其相互作用。我们把这样的变体组称为超级变体。我们确定了15个具有不同程度显著性的超级变异,作为COVID-19死亡率的易感性位点。具体来说,我们在7号染色体上发现了一个超级变异(优势比[OR] = 1.594, p = 5.47 × 10−9),由rs76398985、rs6943608、rs2052130、7:150989011_CT_C、rs118033050和rs12540488等小等位基因组成。我们还在5号染色体上发现了一个超级变异(OR = 1.353, p = 2.87 × 10−8),包含rs12517344、rs72733036、rs190052994、rs34723029、rs72734818、5:9305797_GTA_G和rs180899355。
{"title":"Deep learning identified genetic variants for COVID-19-related mortality among 28,097 affected cases in UK Biobank","authors":"Zihuan Liu, Wei Dai, Shiying Wang, Yisha Yao, Heping Zhang","doi":"10.1002/gepi.22515","DOIUrl":"10.1002/gepi.22515","url":null,"abstract":"<p>Analysis of host genetic components provides insights into the susceptibility and response to viral infection such as severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), which causes coronavirus disease 2019 (COVID-19). To reveal genetic determinants of susceptibility to COVID-19 related mortality, we train a deep learning model to identify groups of genetic variants and their interactions that contribute to the COVID-19 related mortality risk using the UK Biobank data (28,097 affected cases and 1656 deaths). We refer to such groups of variants as super variants. We identify 15 super variants with various levels of significance as susceptibility loci for COVID-19 mortality. Specifically, we identify a super variant (odds ratio [OR] = 1.594, <i>p</i> = 5.47 × 10<sup>−9</sup>) on Chromosome 7 that consists of the minor allele of rs76398985, rs6943608, rs2052130, 7:150989011_CT_C, rs118033050, and rs12540488. We also discover a super variant (OR = 1.353, <i>p</i> = 2.87 × 10<sup>−8</sup>) on Chromosome 5 that contains rs12517344, rs72733036, rs190052994, rs34723029, rs72734818, 5:9305797_GTA_G, and rs180899355.</p>","PeriodicalId":12710,"journal":{"name":"Genetic Epidemiology","volume":"47 3","pages":"215-230"},"PeriodicalIF":2.1,"publicationDate":"2023-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/gepi.22515","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9184940","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}