An adaptive test based on principal components for detecting multiple phenotype associations using GWAS summary data.

IF 1.3 4区 生物学 Q4 GENETICS & HEREDITY Genetica Pub Date : 2023-04-01 DOI:10.1007/s10709-023-00179-9
Qianran Wei, Lili Chen, Yajing Zhou, Huiyi Wang
{"title":"An adaptive test based on principal components for detecting multiple phenotype associations using GWAS summary data.","authors":"Qianran Wei,&nbsp;Lili Chen,&nbsp;Yajing Zhou,&nbsp;Huiyi Wang","doi":"10.1007/s10709-023-00179-9","DOIUrl":null,"url":null,"abstract":"<p><p>Extensive evidence from genome-wide association studies (GWAS) has shown that jointly analyzing multiple phenotypes can improve the power of the association test compared to the traditional single variant versus single trait approach. Here we propose an adaptive test based on principal components (ATPC) that is powerful and efficient for discovering the association between a single variant and multiple traits. Our method only needs GWAS summary statistics that are often available. We first estimate the trait correlation matrix by LD score regression. Then, based on the correlation matrix, we construct a series of test statistics that contain different numbers of principal components. The ultimate test statistic combines the P values of these principal component-based statistics by using the aggregated Cauchy association test. The analytical P-value of the test statistic can be computed quickly without the permutation process, which is the notable feature of our proposed method. The extensive simulation studies demonstrate that ATPC can control the type I error rates and have powerful and robust performance compared to several existing tests in a wide range of simulation settings. The analysis of the lipids GWAS summary data from the Global Lipids Genetics Consortium shows that ATPC identifies 230 new SNPs that are missed by the original single trait association analysis. By searching the GWAS Catalog, some SNPs and mapped genes identified by ATPC are reported to be associated with lipid traits. Through further analysis for GWAS results, we also find some Gene Ontology terms and biological pathways related to lipids.</p>","PeriodicalId":55121,"journal":{"name":"Genetica","volume":null,"pages":null},"PeriodicalIF":1.3000,"publicationDate":"2023-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Genetica","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1007/s10709-023-00179-9","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0

Abstract

Extensive evidence from genome-wide association studies (GWAS) has shown that jointly analyzing multiple phenotypes can improve the power of the association test compared to the traditional single variant versus single trait approach. Here we propose an adaptive test based on principal components (ATPC) that is powerful and efficient for discovering the association between a single variant and multiple traits. Our method only needs GWAS summary statistics that are often available. We first estimate the trait correlation matrix by LD score regression. Then, based on the correlation matrix, we construct a series of test statistics that contain different numbers of principal components. The ultimate test statistic combines the P values of these principal component-based statistics by using the aggregated Cauchy association test. The analytical P-value of the test statistic can be computed quickly without the permutation process, which is the notable feature of our proposed method. The extensive simulation studies demonstrate that ATPC can control the type I error rates and have powerful and robust performance compared to several existing tests in a wide range of simulation settings. The analysis of the lipids GWAS summary data from the Global Lipids Genetics Consortium shows that ATPC identifies 230 new SNPs that are missed by the original single trait association analysis. By searching the GWAS Catalog, some SNPs and mapped genes identified by ATPC are reported to be associated with lipid traits. Through further analysis for GWAS results, we also find some Gene Ontology terms and biological pathways related to lipids.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
一种基于主成分的自适应测试,用于使用GWAS汇总数据检测多种表型关联。
来自全基因组关联研究(GWAS)的大量证据表明,与传统的单变异或单性状方法相比,联合分析多种表型可以提高关联测试的有效性。在这里,我们提出了一种基于主成分(ATPC)的自适应测试,它对于发现单个变异和多个性状之间的关联是强大而有效的。我们的方法只需要经常可用的GWAS汇总统计数据。我们首先用LD分数回归估计性状相关矩阵。然后,在相关矩阵的基础上,构造一系列包含不同主成分个数的检验统计量。最终检验统计量通过使用聚合柯西关联检验将这些基于主成分的统计量的P值组合在一起。检验统计量的分析p值可以快速计算,而不需要置换过程,这是我们提出的方法的显著特点。大量的仿真研究表明,与现有的几种测试相比,ATPC可以控制I型错误率,并且在广泛的仿真设置中具有强大的鲁棒性。对来自全球脂质遗传联盟的脂质GWAS汇总数据的分析表明,ATPC鉴定出230个新的snp,这些snp是原始单性状关联分析所遗漏的。通过检索GWAS目录,一些由ATPC鉴定的snp和定位基因被报道与脂质性状相关。通过对GWAS结果的进一步分析,我们还发现了一些与脂质相关的基因本体术语和生物学途径。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Genetica
Genetica 生物-遗传学
CiteScore
2.70
自引率
0.00%
发文量
32
审稿时长
>12 weeks
期刊介绍: Genetica publishes papers dealing with genetics, genomics, and evolution. Our journal covers novel advances in the fields of genomics, conservation genetics, genotype-phenotype interactions, evo-devo, population and quantitative genetics, and biodiversity. Genetica publishes original research articles addressing novel conceptual, experimental, and theoretical issues in these areas, whatever the taxon considered. Biomedical papers and papers on breeding animal and plant genetics are not within the scope of Genetica, unless framed in an evolutionary context. Recent advances in genetics, genomics and evolution are also published in thematic issues and synthesis papers published by experts in the field.
期刊最新文献
Genome-wide identification and data mining reveals major-latex protein (MLP) from the PR-10 protein family played defense-related roles against phytopathogenic challenges in cassava (Manihot esculenta Crantz). Comparative genomic analysis reveals expansion of the DnaJ gene family in Lagerstroemia indica and its members response to salt stress. Identification and expression analysis of XIP gene family members in rice. Genome-wide identification and expression analysis of the universal stress protein (USP) gene family in Arabidopsis thaliana, Zea mays, and Oryza sativa. A multi-tissue de novo transcriptome assembly and relative gene expression of the vulnerable freshwater salmonid Thymallus ligericus.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1