Duy Nguyen, Ca Hoang, Phat K. Huynh, Tien Truong, Dang Nguyen, Abhay Sharma, Trung Q. Le
{"title":"Multi-level Phenotypic Models of Cardiovascular Disease and Obstructive Sleep Apnea Comorbidities: A Longitudinal Wisconsin Sleep Cohort Study","authors":"Duy Nguyen, Ca Hoang, Phat K. Huynh, Tien Truong, Dang Nguyen, Abhay Sharma, Trung Q. Le","doi":"arxiv-2406.18602","DOIUrl":null,"url":null,"abstract":"Cardiovascular diseases (CVDs) are notably prevalent among patients with\nobstructive sleep apnea (OSA), posing unique challenges in predicting CVD\nprogression due to the intricate interactions of comorbidities. Traditional\nmodels typically lack the necessary dynamic and longitudinal scope to\naccurately forecast CVD trajectories in OSA patients. This study introduces a\nnovel multi-level phenotypic model to analyze the progression and interplay of\nthese conditions over time, utilizing data from the Wisconsin Sleep Cohort,\nwhich includes 1,123 participants followed for decades. Our methodology\ncomprises three advanced steps: (1) Conducting feature importance analysis\nthrough tree-based models to underscore critical predictive variables like\ntotal cholesterol, low-density lipoprotein (LDL), and diabetes. (2) Developing\na logistic mixed-effects model (LGMM) to track longitudinal transitions and\npinpoint significant factors, which displayed a diagnostic accuracy of 0.9556.\n(3) Implementing t-distributed Stochastic Neighbor Embedding (t-SNE) alongside\nGaussian Mixture Models (GMM) to segment patient data into distinct phenotypic\nclusters that reflect varied risk profiles and disease progression pathways.\nThis phenotypic clustering revealed two main groups, with one showing a\nmarkedly increased risk of major adverse cardiovascular events (MACEs),\nunderscored by the significant predictive role of nocturnal hypoxia and\nsympathetic nervous system activity from sleep data. Analysis of transitions\nand trajectories with t-SNE and GMM highlighted different progression rates\nwithin the cohort, with one cluster progressing more slowly towards severe CVD\nstates than the other. This study offers a comprehensive understanding of the\ndynamic relationship between CVD and OSA, providing valuable tools for\npredicting disease onset and tailoring treatment approaches.","PeriodicalId":501215,"journal":{"name":"arXiv - STAT - Computation","volume":"47 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - STAT - Computation","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2406.18602","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Cardiovascular diseases (CVDs) are notably prevalent among patients with
obstructive sleep apnea (OSA), posing unique challenges in predicting CVD
progression due to the intricate interactions of comorbidities. Traditional
models typically lack the necessary dynamic and longitudinal scope to
accurately forecast CVD trajectories in OSA patients. This study introduces a
novel multi-level phenotypic model to analyze the progression and interplay of
these conditions over time, utilizing data from the Wisconsin Sleep Cohort,
which includes 1,123 participants followed for decades. Our methodology
comprises three advanced steps: (1) Conducting feature importance analysis
through tree-based models to underscore critical predictive variables like
total cholesterol, low-density lipoprotein (LDL), and diabetes. (2) Developing
a logistic mixed-effects model (LGMM) to track longitudinal transitions and
pinpoint significant factors, which displayed a diagnostic accuracy of 0.9556.
(3) Implementing t-distributed Stochastic Neighbor Embedding (t-SNE) alongside
Gaussian Mixture Models (GMM) to segment patient data into distinct phenotypic
clusters that reflect varied risk profiles and disease progression pathways.
This phenotypic clustering revealed two main groups, with one showing a
markedly increased risk of major adverse cardiovascular events (MACEs),
underscored by the significant predictive role of nocturnal hypoxia and
sympathetic nervous system activity from sleep data. Analysis of transitions
and trajectories with t-SNE and GMM highlighted different progression rates
within the cohort, with one cluster progressing more slowly towards severe CVD
states than the other. This study offers a comprehensive understanding of the
dynamic relationship between CVD and OSA, providing valuable tools for
predicting disease onset and tailoring treatment approaches.
心血管疾病(CVDs)在患有结构性睡眠呼吸暂停(OSA)的患者中非常普遍,由于合并症之间错综复杂的相互作用,给预测心血管疾病的进展带来了独特的挑战。传统模型通常缺乏必要的动态和纵向范围,无法准确预测 OSA 患者的心血管疾病发展轨迹。本研究利用威斯康星睡眠队列(Wisconsin Sleep Cohort)的数据,引入了一种新的多层次表型模型来分析这些疾病随时间的发展和相互作用。我们的方法包括三个先进步骤:(1)通过树状模型进行特征重要性分析,以强调关键的预测变量,如总胆固醇、低密度脂蛋白(LDL)和糖尿病。(2)开发逻辑混合效应模型(LGMM)来追踪纵向转变并指出重要因素,诊断准确率为 0.9556。3)实施 t 分布随机邻域嵌入(t-SNE)和高斯混合模型(GMM),将患者数据分割成不同的表型聚类,以反映不同的风险特征和疾病进展途径。这种表型聚类揭示了两个主要群体,其中一个群体发生主要不良心血管事件(MACE)的风险明显增加,而睡眠数据中的夜间缺氧和交感神经系统活动的重要预测作用则凸显了这一点。利用 t-SNE 和 GMM 对过渡和轨迹进行的分析突显了队列中不同的进展速度,其中一个群组比另一个群组在严重心血管疾病状态的进展速度更慢。这项研究全面揭示了心血管疾病与 OSA 之间的动态关系,为预测疾病的发生和定制治疗方法提供了宝贵的工具。