基于汇总数据的多变量孟德尔随机化的两阶段线性混合模型(TS-LMM)

Ming Ding, Fei Zou
{"title":"基于汇总数据的多变量孟德尔随机化的两阶段线性混合模型(TS-LMM)","authors":"Ming Ding, Fei Zou","doi":"10.1101/2023.04.25.23289099","DOIUrl":null,"url":null,"abstract":"<p><p>Summary-data-based multivariable Mendelian randomization (MVMR) methods, such as MVMR-Egger, MVMR-IVW, MVMR median-based, and MVMR-PRESSO, assess the causal effects of multiple risk factors on disease. However, accounting for variances in summary statistics related to risk factors remains a challenge. We propose a linear mixed model with measurement error correction (LMM-MEC) that accounts for the variance of summary statistics for both disease outcomes and risk factors. In step I, a linear mixed model is applied to account for the variance in disease summary statistics. Specifically, if heterogeneity is present in disease summary statistics, we treat it as a random effect and adopt an iteratively re-weighted least squares algorithm to estimate causal effects. In step II, we treat the variance in the summary statistics of risk factors as multiple measurement errors and apply a regression calibration method for simultaneous multiple measurement error correction. In a simulation study, when using independent genetic variants as instrumental variables (IV), our method showed comparable performance to existing MVMR methods under conditions of no pleiotropy or balanced pleiotropy with the outcome, and it exhibited higher coverage rates and power under directional pleiotropy. Similar findings were observed when using genetic variants with low to moderate linkage disequilibrium (LD) (0 < <i>ρ</i> <sup>2</sup> ≤ 0.3) as IVs, although coverage rates reduced for all methods compared to using independent genetic variants as IVs. In the application study, we examined causal associations between correlated cholesterol biomarkers and longevity. By including 739 genetic variants selected based on P values <5×10 <sup>-5</sup> from GWAS and allowing for low LD ( <i>ρ</i> <sup>2</sup> ≤ 0.1), our method identified that large LDL-c were causally associated with lower likelihood of achieving longevity.</p>","PeriodicalId":18659,"journal":{"name":"medRxiv : the preprint server for health sciences","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2025-01-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10168515/pdf/","citationCount":"0","resultStr":"{\"title\":\"A Linear Mixed Model with Measurement Error Correction (LMM-MEC): A Method for Summary-data-based Multivariable Mendelian Randomization.\",\"authors\":\"Ming Ding, Fei Zou\",\"doi\":\"10.1101/2023.04.25.23289099\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Summary-data-based multivariable Mendelian randomization (MVMR) methods, such as MVMR-Egger, MVMR-IVW, MVMR median-based, and MVMR-PRESSO, assess the causal effects of multiple risk factors on disease. However, accounting for variances in summary statistics related to risk factors remains a challenge. We propose a linear mixed model with measurement error correction (LMM-MEC) that accounts for the variance of summary statistics for both disease outcomes and risk factors. In step I, a linear mixed model is applied to account for the variance in disease summary statistics. Specifically, if heterogeneity is present in disease summary statistics, we treat it as a random effect and adopt an iteratively re-weighted least squares algorithm to estimate causal effects. In step II, we treat the variance in the summary statistics of risk factors as multiple measurement errors and apply a regression calibration method for simultaneous multiple measurement error correction. In a simulation study, when using independent genetic variants as instrumental variables (IV), our method showed comparable performance to existing MVMR methods under conditions of no pleiotropy or balanced pleiotropy with the outcome, and it exhibited higher coverage rates and power under directional pleiotropy. Similar findings were observed when using genetic variants with low to moderate linkage disequilibrium (LD) (0 < <i>ρ</i> <sup>2</sup> ≤ 0.3) as IVs, although coverage rates reduced for all methods compared to using independent genetic variants as IVs. In the application study, we examined causal associations between correlated cholesterol biomarkers and longevity. By including 739 genetic variants selected based on P values <5×10 <sup>-5</sup> from GWAS and allowing for low LD ( <i>ρ</i> <sup>2</sup> ≤ 0.1), our method identified that large LDL-c were causally associated with lower likelihood of achieving longevity.</p>\",\"PeriodicalId\":18659,\"journal\":{\"name\":\"medRxiv : the preprint server for health sciences\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-01-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10168515/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"medRxiv : the preprint server for health sciences\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1101/2023.04.25.23289099\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"medRxiv : the preprint server for health sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2023.04.25.23289099","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

多变量孟德尔随机化(MVMR)方法提供了一种应用全基因组汇总统计来评估多种危险因素对疾病结果的同时因果效应的策略。单变量MR方法假设没有水平多效性(遗传变异只与一个风险因素相关),而MVMR方法允许遗传变异与多个风险因素相关,并通过将风险因素作为多个变量的汇总统计数据纳入回归模型来建模多效性。在这里,我们提出了一个两阶段线性混合模型(TS-LMM)的MVMR,它不仅考虑了结果,而且考虑了所有风险因素的汇总统计方差。在第一阶段,我们应用线性混合模型将疾病汇总统计中的方差视为固定/随机效应,同时考虑由于连锁不平衡(LD)导致的遗传变异之间的协方差。特别地,我们使用迭代重加权最小二乘算法来获得随机效应的估计。在第二阶段,我们采用测量误差校正方法,考虑了遗传变异之间的LD和危险因素汇总统计之间的相关性,同时考虑了多个危险因素汇总统计的方差。我们在模拟研究中将MVMR方法与其他方法进行了比较。当大多数工具变量(IVs)都很强时,我们的模型产生了最高的真实因果关联覆盖率,最高的检测显著因果关联的能力,以及在遗传变异(弱LD [γ 2≤0.1],中等LD [0.1< γ 2≤0.5])之间的风险因素汇总统计与LD之间的相关性(弱、强)变化的一系列场景中识别零因果效应的最低假阳性率。当减少强IVs的比例时,我们的模型显示出与MVMR-Egger和MVMR-IVW相当的性能。在风险因素之间存在相关性的情况下,我们的模型的更准确推断支持潜在的广泛应用于通常多维且相关的组学数据,如在寿命决定因素的应用中所示,我们的方法从10个脂蛋白胆固醇测量的面板中指定了一个特定的显著脂蛋白亚部分作为因果关系。我们的模型对相关结构的稳健性表明,在实践中,我们可以在iv的选择中允许适度的LD,从而有可能以更有效的方式利用全基因组汇总数据。我们的模型是在R中的'TS_LMM'宏中实现的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
A Linear Mixed Model with Measurement Error Correction (LMM-MEC): A Method for Summary-data-based Multivariable Mendelian Randomization.

Summary-data-based multivariable Mendelian randomization (MVMR) methods, such as MVMR-Egger, MVMR-IVW, MVMR median-based, and MVMR-PRESSO, assess the causal effects of multiple risk factors on disease. However, accounting for variances in summary statistics related to risk factors remains a challenge. We propose a linear mixed model with measurement error correction (LMM-MEC) that accounts for the variance of summary statistics for both disease outcomes and risk factors. In step I, a linear mixed model is applied to account for the variance in disease summary statistics. Specifically, if heterogeneity is present in disease summary statistics, we treat it as a random effect and adopt an iteratively re-weighted least squares algorithm to estimate causal effects. In step II, we treat the variance in the summary statistics of risk factors as multiple measurement errors and apply a regression calibration method for simultaneous multiple measurement error correction. In a simulation study, when using independent genetic variants as instrumental variables (IV), our method showed comparable performance to existing MVMR methods under conditions of no pleiotropy or balanced pleiotropy with the outcome, and it exhibited higher coverage rates and power under directional pleiotropy. Similar findings were observed when using genetic variants with low to moderate linkage disequilibrium (LD) (0 < ρ 2 ≤ 0.3) as IVs, although coverage rates reduced for all methods compared to using independent genetic variants as IVs. In the application study, we examined causal associations between correlated cholesterol biomarkers and longevity. By including 739 genetic variants selected based on P values <5×10 -5 from GWAS and allowing for low LD ( ρ 2 ≤ 0.1), our method identified that large LDL-c were causally associated with lower likelihood of achieving longevity.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Online Database of Clinical Algorithms with Race and Ethnicity. A Linear Mixed Model with Measurement Error Correction (LMM-MEC): A Method for Summary-data-based Multivariable Mendelian Randomization. After the Infection: A Survey of Pathogens and Non-communicable Human Disease. The Extra-Islet Pancreas Supports Autoimmunity in Human Type 1 Diabetes. Keyphrase Identification Using Minimal Labeled Data with Hierarchical Contexts and Transfer Learning.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1