开发用于疾病预测的网络引导转录组风险评分

IF 0.7 4区 数学 Q3 STATISTICS & PROBABILITY Stat Pub Date : 2024-01-16 DOI:10.1002/sta4.648
Xuan Cao, Liangliang Zhang, Kyoungjae Lee
{"title":"开发用于疾病预测的网络引导转录组风险评分","authors":"Xuan Cao, Liangliang Zhang, Kyoungjae Lee","doi":"10.1002/sta4.648","DOIUrl":null,"url":null,"abstract":"Omics data, routinely collected in various clinical settings, are of a complex and network-structured nature. Recent progress in RNA sequencing (RNA-seq) allows us to explore whole-genome gene expression profiles and to develop predictive model for disease risk. In this study, we propose a novel Bayesian approach to construct RNA-seq-based risk score leveraging gene expression network for disease risk prediction. Specifically, we consider a hierarchical model with spike and slab priors over regression coefficients as well as entries in the inverse covariance matrix for covariates to simultaneously perform variable selection and network estimation in high-dimensional logistic regression. Through theoretical investigation and simulation studies, our method is shown to both enjoy desirable consistency properties and achieve superior empirical performance compared with other state-of-the-art methods. We analyse RNA-seq gene expression data from 441 asthmatic and 254 non-asthmatic samples to form a weighted network-guided risk score and benchmark the proposed method against existing approaches for asthma risk stratification.","PeriodicalId":56159,"journal":{"name":"Stat","volume":"37 1","pages":""},"PeriodicalIF":0.7000,"publicationDate":"2024-01-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Development of network-guided transcriptomic risk score for disease prediction\",\"authors\":\"Xuan Cao, Liangliang Zhang, Kyoungjae Lee\",\"doi\":\"10.1002/sta4.648\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Omics data, routinely collected in various clinical settings, are of a complex and network-structured nature. Recent progress in RNA sequencing (RNA-seq) allows us to explore whole-genome gene expression profiles and to develop predictive model for disease risk. In this study, we propose a novel Bayesian approach to construct RNA-seq-based risk score leveraging gene expression network for disease risk prediction. Specifically, we consider a hierarchical model with spike and slab priors over regression coefficients as well as entries in the inverse covariance matrix for covariates to simultaneously perform variable selection and network estimation in high-dimensional logistic regression. Through theoretical investigation and simulation studies, our method is shown to both enjoy desirable consistency properties and achieve superior empirical performance compared with other state-of-the-art methods. We analyse RNA-seq gene expression data from 441 asthmatic and 254 non-asthmatic samples to form a weighted network-guided risk score and benchmark the proposed method against existing approaches for asthma risk stratification.\",\"PeriodicalId\":56159,\"journal\":{\"name\":\"Stat\",\"volume\":\"37 1\",\"pages\":\"\"},\"PeriodicalIF\":0.7000,\"publicationDate\":\"2024-01-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Stat\",\"FirstCategoryId\":\"100\",\"ListUrlMain\":\"https://doi.org/10.1002/sta4.648\",\"RegionNum\":4,\"RegionCategory\":\"数学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"STATISTICS & PROBABILITY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Stat","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1002/sta4.648","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}
引用次数: 0

摘要

在各种临床环境中常规收集的 Omics 数据具有复杂的网络结构性质。最近在 RNA 测序(RNA-seq)方面取得的进展使我们能够探索全基因组基因表达谱,并建立疾病风险预测模型。在本研究中,我们提出了一种新颖的贝叶斯方法,利用基因表达网络构建基于 RNA-seq 的风险评分,用于疾病风险预测。具体来说,我们考虑了一个分层模型,该模型对回归系数以及协变量的逆协方差矩阵中的条目具有尖峰和板块前验,可在高维逻辑回归中同时执行变量选择和网络估计。通过理论研究和模拟研究,我们的方法不仅具有理想的一致性,而且与其他最先进的方法相比具有更优越的经验性能。我们分析了 441 个哮喘样本和 254 个非哮喘样本的 RNA-seq 基因表达数据,形成了加权网络指导风险评分,并将所提出的方法与现有的哮喘风险分层方法进行了比较。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Development of network-guided transcriptomic risk score for disease prediction
Omics data, routinely collected in various clinical settings, are of a complex and network-structured nature. Recent progress in RNA sequencing (RNA-seq) allows us to explore whole-genome gene expression profiles and to develop predictive model for disease risk. In this study, we propose a novel Bayesian approach to construct RNA-seq-based risk score leveraging gene expression network for disease risk prediction. Specifically, we consider a hierarchical model with spike and slab priors over regression coefficients as well as entries in the inverse covariance matrix for covariates to simultaneously perform variable selection and network estimation in high-dimensional logistic regression. Through theoretical investigation and simulation studies, our method is shown to both enjoy desirable consistency properties and achieve superior empirical performance compared with other state-of-the-art methods. We analyse RNA-seq gene expression data from 441 asthmatic and 254 non-asthmatic samples to form a weighted network-guided risk score and benchmark the proposed method against existing approaches for asthma risk stratification.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Stat
Stat Decision Sciences-Statistics, Probability and Uncertainty
CiteScore
1.10
自引率
0.00%
发文量
85
期刊介绍: Stat is an innovative electronic journal for the rapid publication of novel and topical research results, publishing compact articles of the highest quality in all areas of statistical endeavour. Its purpose is to provide a means of rapid sharing of important new theoretical, methodological and applied research. Stat is a joint venture between the International Statistical Institute and Wiley-Blackwell. Stat is characterised by: • Speed - a high-quality review process that aims to reach a decision within 20 days of submission. • Concision - a maximum article length of 10 pages of text, not including references. • Supporting materials - inclusion of electronic supporting materials including graphs, video, software, data and images. • Scope - addresses all areas of statistics and interdisciplinary areas. Stat is a scientific journal for the international community of statisticians and researchers and practitioners in allied quantitative disciplines.
期刊最新文献
Communication‐Efficient Distributed Estimation of Causal Effects With High‐Dimensional Data A Joint Temporal Model for Hospitalizations and ICU Admissions Due to COVID‐19 in Quebec Bitcoin Price Prediction Using Deep Bayesian LSTM With Uncertainty Quantification: A Monte Carlo Dropout–Based Approach Exact interval estimation for three parameters subject to false positive misclassification Novel Closed‐Form Point Estimators for a Weighted Exponential Family Derived From Likelihood Equations
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1