{"title":"一种基于主成分的自适应测试,用于使用GWAS汇总数据检测多种表型关联。","authors":"Qianran Wei, Lili Chen, Yajing Zhou, Huiyi Wang","doi":"10.1007/s10709-023-00179-9","DOIUrl":null,"url":null,"abstract":"<p><p>Extensive evidence from genome-wide association studies (GWAS) has shown that jointly analyzing multiple phenotypes can improve the power of the association test compared to the traditional single variant versus single trait approach. Here we propose an adaptive test based on principal components (ATPC) that is powerful and efficient for discovering the association between a single variant and multiple traits. Our method only needs GWAS summary statistics that are often available. We first estimate the trait correlation matrix by LD score regression. Then, based on the correlation matrix, we construct a series of test statistics that contain different numbers of principal components. The ultimate test statistic combines the P values of these principal component-based statistics by using the aggregated Cauchy association test. The analytical P-value of the test statistic can be computed quickly without the permutation process, which is the notable feature of our proposed method. The extensive simulation studies demonstrate that ATPC can control the type I error rates and have powerful and robust performance compared to several existing tests in a wide range of simulation settings. The analysis of the lipids GWAS summary data from the Global Lipids Genetics Consortium shows that ATPC identifies 230 new SNPs that are missed by the original single trait association analysis. By searching the GWAS Catalog, some SNPs and mapped genes identified by ATPC are reported to be associated with lipid traits. Through further analysis for GWAS results, we also find some Gene Ontology terms and biological pathways related to lipids.</p>","PeriodicalId":55121,"journal":{"name":"Genetica","volume":null,"pages":null},"PeriodicalIF":1.3000,"publicationDate":"2023-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An adaptive test based on principal components for detecting multiple phenotype associations using GWAS summary data.\",\"authors\":\"Qianran Wei, Lili Chen, Yajing Zhou, Huiyi Wang\",\"doi\":\"10.1007/s10709-023-00179-9\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Extensive evidence from genome-wide association studies (GWAS) has shown that jointly analyzing multiple phenotypes can improve the power of the association test compared to the traditional single variant versus single trait approach. Here we propose an adaptive test based on principal components (ATPC) that is powerful and efficient for discovering the association between a single variant and multiple traits. Our method only needs GWAS summary statistics that are often available. We first estimate the trait correlation matrix by LD score regression. Then, based on the correlation matrix, we construct a series of test statistics that contain different numbers of principal components. The ultimate test statistic combines the P values of these principal component-based statistics by using the aggregated Cauchy association test. The analytical P-value of the test statistic can be computed quickly without the permutation process, which is the notable feature of our proposed method. The extensive simulation studies demonstrate that ATPC can control the type I error rates and have powerful and robust performance compared to several existing tests in a wide range of simulation settings. The analysis of the lipids GWAS summary data from the Global Lipids Genetics Consortium shows that ATPC identifies 230 new SNPs that are missed by the original single trait association analysis. By searching the GWAS Catalog, some SNPs and mapped genes identified by ATPC are reported to be associated with lipid traits. Through further analysis for GWAS results, we also find some Gene Ontology terms and biological pathways related to lipids.</p>\",\"PeriodicalId\":55121,\"journal\":{\"name\":\"Genetica\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":1.3000,\"publicationDate\":\"2023-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Genetica\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1007/s10709-023-00179-9\",\"RegionNum\":4,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"GENETICS & HEREDITY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Genetica","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1007/s10709-023-00179-9","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
An adaptive test based on principal components for detecting multiple phenotype associations using GWAS summary data.
Extensive evidence from genome-wide association studies (GWAS) has shown that jointly analyzing multiple phenotypes can improve the power of the association test compared to the traditional single variant versus single trait approach. Here we propose an adaptive test based on principal components (ATPC) that is powerful and efficient for discovering the association between a single variant and multiple traits. Our method only needs GWAS summary statistics that are often available. We first estimate the trait correlation matrix by LD score regression. Then, based on the correlation matrix, we construct a series of test statistics that contain different numbers of principal components. The ultimate test statistic combines the P values of these principal component-based statistics by using the aggregated Cauchy association test. The analytical P-value of the test statistic can be computed quickly without the permutation process, which is the notable feature of our proposed method. The extensive simulation studies demonstrate that ATPC can control the type I error rates and have powerful and robust performance compared to several existing tests in a wide range of simulation settings. The analysis of the lipids GWAS summary data from the Global Lipids Genetics Consortium shows that ATPC identifies 230 new SNPs that are missed by the original single trait association analysis. By searching the GWAS Catalog, some SNPs and mapped genes identified by ATPC are reported to be associated with lipid traits. Through further analysis for GWAS results, we also find some Gene Ontology terms and biological pathways related to lipids.
期刊介绍:
Genetica publishes papers dealing with genetics, genomics, and evolution. Our journal covers novel advances in the fields of genomics, conservation genetics, genotype-phenotype interactions, evo-devo, population and quantitative genetics, and biodiversity. Genetica publishes original research articles addressing novel conceptual, experimental, and theoretical issues in these areas, whatever the taxon considered. Biomedical papers and papers on breeding animal and plant genetics are not within the scope of Genetica, unless framed in an evolutionary context. Recent advances in genetics, genomics and evolution are also published in thematic issues and synthesis papers published by experts in the field.