Machine learning analysis of emerging risk factors for early-onset hypertension in the Tlalpan 2020 cohort.

IF 2.8 3区 医学 Q2 CARDIAC & CARDIOVASCULAR SYSTEMS Frontiers in Cardiovascular Medicine Pub Date : 2025-01-17 eCollection Date: 2024-01-01 DOI:10.3389/fcvm.2024.1434418
Mireya Martínez-García, Guadalupe O Gutiérrez-Esparza, Manlio F Márquez, Luis M Amezcua-Guerra, Enrique Hernández-Lemus
{"title":"Machine learning analysis of emerging risk factors for early-onset hypertension in the Tlalpan 2020 cohort.","authors":"Mireya Martínez-García, Guadalupe O Gutiérrez-Esparza, Manlio F Márquez, Luis M Amezcua-Guerra, Enrique Hernández-Lemus","doi":"10.3389/fcvm.2024.1434418","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>Hypertension is a significant public health concern. Several relevant risk factors have been identified. However, since it is a complex condition with broad variability and strong dependence on environmental and lifestyle factors, current risk factors only account for a fraction of the observed prevalence. This study aims to investigate the emerging early-onset hypertension risk factors using a data-driven approach by implementing machine learning models within a well-established cohort in Mexico City, comprising initially 2,500 healthy adults aged 18 to 50 years.</p><p><strong>Methods: </strong>Hypertensive individuals were newly diagnosed during 6,000 person-years, and normotensive individuals were those who, during the same time, remained without exceeding 140 mm Hg in systolic blood pressure and/or diastolic blood pressure of 90 mm Hg. Data on sociodemographic, lifestyle, anthropometric, clinical, and biochemical variables were collected through standardized questionnaires as well as clinical and laboratory assessments. Extreme Gradient Boosting (XGBoost), Logistic Regression (LG) and Support Vector Machines (SVM) were employed to evaluate the relationship between these factors and hypertension risk.</p><p><strong>Results: </strong>The Random Forest (RF) Importance Percent was calculated to assess the structural relevance of each variable in the model, while Shapley Additive Explanations (SHAP) analysis quantified both the average impact and direction of each feature on individual predictions. Additionally, odds ratios were calculated to express the size and direction of influence for each variable, and a sex-stratified analysis was conducted to identify any gender-specific risk factors.</p><p><strong>Discussion: </strong>This nested study provides evidence that sleep disorders, a sedentary lifestyle, consumption of high-fat foods, and energy drinks are potentially modifiable risk factors for hypertension in a Mexico City cohort of young and relatively healthy adults. These findings underscore the importance of addressing these factors in hypertension prevention and management strategies.</p>","PeriodicalId":12414,"journal":{"name":"Frontiers in Cardiovascular Medicine","volume":"11 ","pages":"1434418"},"PeriodicalIF":2.8000,"publicationDate":"2025-01-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11782138/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Cardiovascular Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3389/fcvm.2024.1434418","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"CARDIAC & CARDIOVASCULAR SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

Introduction: Hypertension is a significant public health concern. Several relevant risk factors have been identified. However, since it is a complex condition with broad variability and strong dependence on environmental and lifestyle factors, current risk factors only account for a fraction of the observed prevalence. This study aims to investigate the emerging early-onset hypertension risk factors using a data-driven approach by implementing machine learning models within a well-established cohort in Mexico City, comprising initially 2,500 healthy adults aged 18 to 50 years.

Methods: Hypertensive individuals were newly diagnosed during 6,000 person-years, and normotensive individuals were those who, during the same time, remained without exceeding 140 mm Hg in systolic blood pressure and/or diastolic blood pressure of 90 mm Hg. Data on sociodemographic, lifestyle, anthropometric, clinical, and biochemical variables were collected through standardized questionnaires as well as clinical and laboratory assessments. Extreme Gradient Boosting (XGBoost), Logistic Regression (LG) and Support Vector Machines (SVM) were employed to evaluate the relationship between these factors and hypertension risk.

Results: The Random Forest (RF) Importance Percent was calculated to assess the structural relevance of each variable in the model, while Shapley Additive Explanations (SHAP) analysis quantified both the average impact and direction of each feature on individual predictions. Additionally, odds ratios were calculated to express the size and direction of influence for each variable, and a sex-stratified analysis was conducted to identify any gender-specific risk factors.

Discussion: This nested study provides evidence that sleep disorders, a sedentary lifestyle, consumption of high-fat foods, and energy drinks are potentially modifiable risk factors for hypertension in a Mexico City cohort of young and relatively healthy adults. These findings underscore the importance of addressing these factors in hypertension prevention and management strategies.

Abstract Image

Abstract Image

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Tlalpan 2020队列中出现的早发性高血压危险因素的机器学习分析。
高血压是一个重要的公共卫生问题。已经确定了几个相关的风险因素。然而,由于它是一种复杂的疾病,具有广泛的变异性和对环境和生活方式因素的强烈依赖,目前的危险因素只占观察到的患病率的一小部分。本研究旨在利用数据驱动的方法,通过在墨西哥城一个成熟的队列中实施机器学习模型,调查新出现的早发性高血压风险因素,该队列最初包括2500名年龄在18至50岁之间的健康成年人。方法:新诊断的高血压患者为6000人年,正常血压患者为同期收缩压不超过140 mm Hg和/或舒张压不超过90 mm Hg的患者。通过标准化问卷调查以及临床和实验室评估收集社会人口学、生活方式、人体测量学、临床和生化变量的数据。采用极端梯度增强(XGBoost)、Logistic回归(LG)和支持向量机(SVM)评估这些因素与高血压风险的关系。结果:计算随机森林(RF)重要性百分比来评估模型中每个变量的结构相关性,而Shapley加性解释(SHAP)分析量化了每个特征对个体预测的平均影响和方向。此外,计算优势比以表示每个变量的影响大小和方向,并进行性别分层分析以确定任何性别特定的风险因素。讨论:这项嵌套式研究提供了证据,表明睡眠障碍、久坐不动的生活方式、高脂肪食物和能量饮料的摄入是墨西哥城一组相对健康的年轻成年人高血压的潜在可改变的危险因素。这些发现强调了在高血压预防和管理策略中解决这些因素的重要性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Frontiers in Cardiovascular Medicine
Frontiers in Cardiovascular Medicine Medicine-Cardiology and Cardiovascular Medicine
CiteScore
3.80
自引率
11.10%
发文量
3529
审稿时长
14 weeks
期刊介绍: Frontiers? Which frontiers? Where exactly are the frontiers of cardiovascular medicine? And who should be defining these frontiers? At Frontiers in Cardiovascular Medicine we believe it is worth being curious to foresee and explore beyond the current frontiers. In other words, we would like, through the articles published by our community journal Frontiers in Cardiovascular Medicine, to anticipate the future of cardiovascular medicine, and thus better prevent cardiovascular disorders and improve therapeutic options and outcomes of our patients.
期刊最新文献
A systematic review and meta-analysis of the efficacy of addition and reduction of Shixiao powder in the treatment of coronary heart disease. Association between circulating levels of miR-29 and postoperative neurological complications in acute type A aortic dissection patients. Development and validation of a nomogram for predicting hospitalization-associated disability in older patients with acute heart failure. Usefulness of albumin-to-D-dimer ratio in predicting the long-term outcome after hospital discharge in patients with ST-elevation myocardial infarction: a retrospective cohort study. Establishment and validation of an interpretable machine learning-based predictive model for risk of post-PCI in-hospital heart failure in AIHD patients.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1