A plasma proteomic signature for atherosclerotic cardiovascular disease risk prediction in the UK Biobank cohort

Trisha P. Gupte, Zahra Azizi, Pik Fang Kho, Jiayan Zhou, Ming-Li Chen, Daniel J. Panyard, Rodrigo Guarischi-Sousa, Austin T. Hilliard, Disha Sharma, Kathleen Watson, Fahim Abbasi, Shoa L. Clarke, Themistocles L. Assimes
{"title":"A plasma proteomic signature for atherosclerotic cardiovascular disease risk prediction in the UK Biobank cohort","authors":"Trisha P. Gupte, Zahra Azizi, Pik Fang Kho, Jiayan Zhou, Ming-Li Chen, Daniel J. Panyard, Rodrigo Guarischi-Sousa, Austin T. Hilliard, Disha Sharma, Kathleen Watson, Fahim Abbasi, Shoa L. Clarke, Themistocles L. Assimes","doi":"10.1101/2024.09.13.24313652","DOIUrl":null,"url":null,"abstract":"Background: While risk stratification for atherosclerotic cardiovascular disease (ASCVD) is essential for primary prevention, current clinical risk algorithms demonstrate variability and leave room for further improvement. The plasma proteome holds promise as a future diagnostic and prognostic tool that can accurately reflect complex human traits and disease processes. We assessed the ability of plasma proteins to predict ASCVD. Method: Clinical, genetic, and high-throughput plasma proteomic data were analyzed for association with ASCVD in a cohort of 41,650 UK Biobank participants. Selected features for analysis included clinical variables such as a UK-based cardiovascular clinical risk score (QRISK3) and lipid levels, 36 polygenic risk scores (PRSs), and Olink protein expression data of 2,920 proteins. We used least absolute shrinkage and selection operator (LASSO) regression to select features and compared area under the curve (AUC) statistics between data types. Randomized LASSO regression with a stability selection algorithm identified a smaller set of more robustly associated proteins. The benefit of plasma proteins over standard clinical variables, the QRISK3 score, and PRSs was evaluated through the derivation of Δ AUC values. We also assessed the incremental gain in model performance using proteomic datasets with varying numbers of proteins. To identify potential causal proteins for ASCVD, we conducted a two-sample Mendelian randomization (MR) analysis. Result: The mean age of our cohort was 54.3 years, 53.3% were female, and 9.9% developed incident ASCVD over a median follow-up of 6.9 years. A protein-only LASSO model selected 294 proteins and returned an AUC of 0.723 (95% CI 0.708-0.737). A clinical variable and PRS-only LASSO model selected 4 clinical variables and 20 PRSs and achieved an AUC of 0.726 (95% CI 0.712-0.741). The addition of the full proteomic dataset to clinical variables and PRSs resulted in a Δ AUC of 0.010 (95% CI 0.003-0.018). Fifteen proteins selected by a stability selection algorithm offered improvement in ASCVD prediction over the QRISK3 risk score [Δ AUC: 0.013 (95% CI 0.005-0.021)]. Filtered and clustered versions of the full proteomic dataset (consisting of 600-1,500 proteins) performed comparably to the full dataset for ASCVD prediction. Using MR, we identified 12 proteins as potentially causal for ASCVD. Conclusion: A plasma proteomic signature performs well for incident ASCVD prediction but only modestly improves prediction over clinical and genetic factors. Further studies are warranted to better elucidate the clinical utility of this signature in predicting the risk of ASCVD over the standard practice of using the QRISK3 score.","PeriodicalId":501375,"journal":{"name":"medRxiv - Genetic and Genomic Medicine","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"medRxiv - Genetic and Genomic Medicine","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2024.09.13.24313652","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Background: While risk stratification for atherosclerotic cardiovascular disease (ASCVD) is essential for primary prevention, current clinical risk algorithms demonstrate variability and leave room for further improvement. The plasma proteome holds promise as a future diagnostic and prognostic tool that can accurately reflect complex human traits and disease processes. We assessed the ability of plasma proteins to predict ASCVD. Method: Clinical, genetic, and high-throughput plasma proteomic data were analyzed for association with ASCVD in a cohort of 41,650 UK Biobank participants. Selected features for analysis included clinical variables such as a UK-based cardiovascular clinical risk score (QRISK3) and lipid levels, 36 polygenic risk scores (PRSs), and Olink protein expression data of 2,920 proteins. We used least absolute shrinkage and selection operator (LASSO) regression to select features and compared area under the curve (AUC) statistics between data types. Randomized LASSO regression with a stability selection algorithm identified a smaller set of more robustly associated proteins. The benefit of plasma proteins over standard clinical variables, the QRISK3 score, and PRSs was evaluated through the derivation of Δ AUC values. We also assessed the incremental gain in model performance using proteomic datasets with varying numbers of proteins. To identify potential causal proteins for ASCVD, we conducted a two-sample Mendelian randomization (MR) analysis. Result: The mean age of our cohort was 54.3 years, 53.3% were female, and 9.9% developed incident ASCVD over a median follow-up of 6.9 years. A protein-only LASSO model selected 294 proteins and returned an AUC of 0.723 (95% CI 0.708-0.737). A clinical variable and PRS-only LASSO model selected 4 clinical variables and 20 PRSs and achieved an AUC of 0.726 (95% CI 0.712-0.741). The addition of the full proteomic dataset to clinical variables and PRSs resulted in a Δ AUC of 0.010 (95% CI 0.003-0.018). Fifteen proteins selected by a stability selection algorithm offered improvement in ASCVD prediction over the QRISK3 risk score [Δ AUC: 0.013 (95% CI 0.005-0.021)]. Filtered and clustered versions of the full proteomic dataset (consisting of 600-1,500 proteins) performed comparably to the full dataset for ASCVD prediction. Using MR, we identified 12 proteins as potentially causal for ASCVD. Conclusion: A plasma proteomic signature performs well for incident ASCVD prediction but only modestly improves prediction over clinical and genetic factors. Further studies are warranted to better elucidate the clinical utility of this signature in predicting the risk of ASCVD over the standard practice of using the QRISK3 score.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
英国生物库队列中用于预测动脉粥样硬化性心血管疾病风险的血浆蛋白质组特征
背景:动脉粥样硬化性心血管疾病(ASCVD)的风险分层对一级预防至关重要,但目前的临床风险算法存在变异,有待进一步改进。血浆蛋白质组有望成为未来的诊断和预后工具,准确反映复杂的人体特征和疾病过程。我们评估了血浆蛋白质预测 ASCVD 的能力。方法:我们分析了英国生物库 41,650 名参与者的临床、遗传和高通量血浆蛋白质组数据与 ASCVD 的关联。所选的分析特征包括临床变量(如基于英国的心血管临床风险评分(QRISK3)和血脂水平)、36个多基因风险评分(PRS)以及2,920个蛋白的Olink蛋白表达数据。我们使用最小绝对收缩和选择算子(LASSO)回归来选择特征,并比较了不同数据类型的曲线下面积(AUC)统计量。采用稳定性选择算法的随机 LASSO 回归确定了一组较小的关联性更强的蛋白质。通过得出 Δ AUC 值,评估了血浆蛋白相对于标准临床变量、QRISK3 评分和 PRS 的优势。我们还利用蛋白质数量不同的蛋白质组数据集评估了模型性能的增益。为了确定潜在的 ASCVD 病因蛋白,我们进行了双样本孟德尔随机化 (MR) 分析。分析结果我们队列中的平均年龄为 54.3 岁,53.3% 为女性,9.9% 在中位 6.9 年的随访期间发生了 ASCVD。纯蛋白质 LASSO 模型选择了 294 种蛋白质,其 AUC 为 0.723(95% CI 0.708-0.737)。纯临床变量和 PRS LASSO 模型选择了 4 个临床变量和 20 个 PRS,得出的 AUC 为 0.726(95% CI 0.712-0.741)。将完整的蛋白质组数据集加入临床变量和 PRS 后,Δ AUC 为 0.010(95% CI 0.003-0.018)。与 QRISK3 风险评分相比,通过稳定性选择算法选出的 15 个蛋白质提高了 ASCVD 预测能力[Δ AUC:0.013 (95% CI 0.005-0.021)]。完整蛋白质组数据集(由 600-1,500 个蛋白质组成)的过滤和聚类版本在 ASCVD 预测方面的表现与完整数据集相当。利用MR,我们确定了12种蛋白质可能与ASCVD有因果关系。结论:血浆蛋白质组特征对急性心血管疾病的预测效果很好,但与临床和遗传因素相比,只能适度提高预测效果。为了更好地阐明该特征在预测 ASCVD 风险方面的临床实用性,而不是使用 QRISK3 评分的标准做法,有必要开展进一步的研究。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Identifying individuals at risk for surgical supravalvar aortic stenosis by polygenic risk score with graded phenotyping Exome wide association study for blood lipids in 1,158,017 individuals from diverse populations Allelic effects on KLHL17 expression likely mediated by JunB/D underlie a PDAC GWAS signal at chr1p36.33 Genetic associations between SGLT2 inhibition, DPP4 inhibition or GLP1R agonism and prostate cancer risk: a two-sample Mendelian randomisation study A Genome-wide Association Study Identifies Novel Genetic Variants Associated with Knee Pain in the UK Biobank (N = 441,757)
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1