GWASBrewer: An R Package for Simulating Realistic GWAS Summary Statistics

Genetic Epidemiology (IF 1.7; Zone 4, Medicine; JCR Q3, Genetics & Heredity). Pub Date: 2024-10-06. DOI: 10.1002/gepi.22594
Jean Morrison
Citations: 0

Abstract

Many statistical genetics analysis methods make use of GWAS summary statistics. Best statistical practice requires evaluating these methods in realistic simulation experiments. However, simulating summary statistics by first simulating individual genotype and phenotype data is extremely computationally demanding. This high cost may force researchers to conduct overly simplistic simulations that fail to accurately measure method performance. Alternatively, summary statistics can be simulated directly from their theoretical distribution. Although this is a common need among statistical genetics researchers, no software packages exist for comprehensive GWAS summary statistic simulation. We present GWASBrewer, an open source R package for direct simulation of GWAS summary statistics. We show that statistics simulated by GWASBrewer have the same distribution as statistics generated from individual level data, and can be produced at a fraction of the computational expense. Additionally, GWASBrewer can simulate standard error estimates, something that is typically not done when sampling summary statistics directly. GWASBrewer is highly flexible, allowing the user to simulate data for multiple traits connected by causal effects and with complex distributions of effect sizes. We demonstrate example uses of GWASBrewer for evaluating Mendelian randomization, polygenic risk score, and heritability estimation methods.
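The idea of sampling summary statistics "directly from their theoretical distribution" can be illustrated with a minimal sketch. This is not GWASBrewer's interface or its model; the function name `simulate_sumstats` and all of its parameters are hypothetical. It assumes independent variants (no linkage disequilibrium), a trait with unit variance, Hardy-Weinberg genotype frequencies, and small per-variant effects, so that each estimate is approximately normal around the true per-allele effect with standard error 1 / sqrt(2 N f (1 - f)).

```python
import math
import random


def simulate_sumstats(n_variants, n_samples, h2, prop_causal, seed=0):
    """Draw (beta_hat, se) pairs directly, without simulating genotypes.

    Illustrative assumptions only (not GWASBrewer's API or model):
    independent variants, unit trait variance, Hardy-Weinberg genotype
    frequencies, and small per-variant effects.
    """
    rng = random.Random(seed)
    # Allele frequencies drawn uniformly from a common-variant range.
    freqs = [rng.uniform(0.05, 0.5) for _ in range(n_variants)]
    # Mark causal variants; scale effects so the expected total variance
    # explained by causal variants equals h2.
    causal = [rng.random() < prop_causal for _ in range(n_variants)]
    n_causal = max(1, sum(causal))
    sd_std_effect = math.sqrt(h2 / n_causal)

    rows = []
    for f, is_causal in zip(freqs, causal):
        geno_sd = math.sqrt(2.0 * f * (1.0 - f))  # SD of a 0/1/2 genotype
        std_effect = rng.gauss(0.0, sd_std_effect) if is_causal else 0.0
        beta = std_effect / geno_sd  # true per-allele effect
        se = 1.0 / (math.sqrt(n_samples) * geno_sd)  # approximate standard error
        beta_hat = rng.gauss(beta, se)  # estimate = truth + sampling noise
        rows.append({"freq": f, "beta": beta, "beta_hat": beta_hat, "se": se})
    return rows


# Example: 1,000 variants, 50,000 samples, 10% causal, heritability 0.3.
stats = simulate_sumstats(n_variants=1000, n_samples=50_000, h2=0.3, prop_causal=0.1)
z_scores = [row["beta_hat"] / row["se"] for row in stats]
```

The cost of this sketch grows with the number of variants only, not with the number of variants times the sample size, which is the source of the computational saving the abstract describes. Correlated variants, multiple causally related traits, and realistic effect-size distributions are the parts that a dedicated package has to handle and that this sketch leaves out.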

Source journal
Genetic Epidemiology (Medicine: Public, Environmental & Occupational Health)
CiteScore: 4.40
Self-citation rate: 9.50%
Articles published: 49
Review time: 6-12 weeks
About the journal: Genetic Epidemiology is a peer-reviewed journal for discussion of research on the genetic causes of the distribution of human traits in families and populations. Emphasis is placed on the relative contribution of genetic and environmental factors to human disease as revealed by genetic, epidemiological, and biologic investigations. Genetic Epidemiology primarily publishes papers in statistical genetics, a research field concerned with the development of statistical, bioinformatical, and computational models for analyzing genetic data. Incorporation of underlying biology and population genetics into conceptual models is favored. The Journal seeks original articles comprising either applied research or innovative statistical, mathematical, computational, or genomic methodologies that advance studies in genetic epidemiology. Other types of reports are encouraged, such as letters to the editor, topic reviews, and perspectives from other fields of research that will likely enrich the field of genetic epidemiology.