Mireya Martínez-García, Guadalupe O Gutiérrez-Esparza, Manlio F Márquez, Luis M Amezcua-Guerra, Enrique Hernández-Lemus
{"title":"Machine learning analysis of emerging risk factors for early-onset hypertension in the Tlalpan 2020 cohort.","authors":"Mireya Martínez-García, Guadalupe O Gutiérrez-Esparza, Manlio F Márquez, Luis M Amezcua-Guerra, Enrique Hernández-Lemus","doi":"10.3389/fcvm.2024.1434418","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>Hypertension is a significant public health concern. Several relevant risk factors have been identified. However, since it is a complex condition with broad variability and strong dependence on environmental and lifestyle factors, current risk factors only account for a fraction of the observed prevalence. This study aims to investigate the emerging early-onset hypertension risk factors using a data-driven approach by implementing machine learning models within a well-established cohort in Mexico City, comprising initially 2,500 healthy adults aged 18 to 50 years.</p><p><strong>Methods: </strong>Hypertensive individuals were newly diagnosed during 6,000 person-years, and normotensive individuals were those who, during the same time, remained without exceeding 140 mm Hg in systolic blood pressure and/or diastolic blood pressure of 90 mm Hg. Data on sociodemographic, lifestyle, anthropometric, clinical, and biochemical variables were collected through standardized questionnaires as well as clinical and laboratory assessments. Extreme Gradient Boosting (XGBoost), Logistic Regression (LG) and Support Vector Machines (SVM) were employed to evaluate the relationship between these factors and hypertension risk.</p><p><strong>Results: </strong>The Random Forest (RF) Importance Percent was calculated to assess the structural relevance of each variable in the model, while Shapley Additive Explanations (SHAP) analysis quantified both the average impact and direction of each feature on individual predictions. Additionally, odds ratios were calculated to express the size and direction of influence for each variable, and a sex-stratified analysis was conducted to identify any gender-specific risk factors.</p><p><strong>Discussion: </strong>This nested study provides evidence that sleep disorders, a sedentary lifestyle, consumption of high-fat foods, and energy drinks are potentially modifiable risk factors for hypertension in a Mexico City cohort of young and relatively healthy adults. These findings underscore the importance of addressing these factors in hypertension prevention and management strategies.</p>","PeriodicalId":12414,"journal":{"name":"Frontiers in Cardiovascular Medicine","volume":"11 ","pages":"1434418"},"PeriodicalIF":2.8000,"publicationDate":"2025-01-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11782138/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Cardiovascular Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3389/fcvm.2024.1434418","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"CARDIAC & CARDIOVASCULAR SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Introduction: Hypertension is a significant public health concern. Several relevant risk factors have been identified. However, since it is a complex condition with broad variability and strong dependence on environmental and lifestyle factors, current risk factors only account for a fraction of the observed prevalence. This study aims to investigate the emerging early-onset hypertension risk factors using a data-driven approach by implementing machine learning models within a well-established cohort in Mexico City, comprising initially 2,500 healthy adults aged 18 to 50 years.
Methods: Hypertensive individuals were newly diagnosed during 6,000 person-years, and normotensive individuals were those who, during the same time, remained without exceeding 140 mm Hg in systolic blood pressure and/or diastolic blood pressure of 90 mm Hg. Data on sociodemographic, lifestyle, anthropometric, clinical, and biochemical variables were collected through standardized questionnaires as well as clinical and laboratory assessments. Extreme Gradient Boosting (XGBoost), Logistic Regression (LG) and Support Vector Machines (SVM) were employed to evaluate the relationship between these factors and hypertension risk.
Results: The Random Forest (RF) Importance Percent was calculated to assess the structural relevance of each variable in the model, while Shapley Additive Explanations (SHAP) analysis quantified both the average impact and direction of each feature on individual predictions. Additionally, odds ratios were calculated to express the size and direction of influence for each variable, and a sex-stratified analysis was conducted to identify any gender-specific risk factors.
Discussion: This nested study provides evidence that sleep disorders, a sedentary lifestyle, consumption of high-fat foods, and energy drinks are potentially modifiable risk factors for hypertension in a Mexico City cohort of young and relatively healthy adults. These findings underscore the importance of addressing these factors in hypertension prevention and management strategies.
高血压是一个重要的公共卫生问题。已经确定了几个相关的风险因素。然而,由于它是一种复杂的疾病,具有广泛的变异性和对环境和生活方式因素的强烈依赖,目前的危险因素只占观察到的患病率的一小部分。本研究旨在利用数据驱动的方法,通过在墨西哥城一个成熟的队列中实施机器学习模型,调查新出现的早发性高血压风险因素,该队列最初包括2500名年龄在18至50岁之间的健康成年人。方法:新诊断的高血压患者为6000人年,正常血压患者为同期收缩压不超过140 mm Hg和/或舒张压不超过90 mm Hg的患者。通过标准化问卷调查以及临床和实验室评估收集社会人口学、生活方式、人体测量学、临床和生化变量的数据。采用极端梯度增强(XGBoost)、Logistic回归(LG)和支持向量机(SVM)评估这些因素与高血压风险的关系。结果:计算随机森林(RF)重要性百分比来评估模型中每个变量的结构相关性,而Shapley加性解释(SHAP)分析量化了每个特征对个体预测的平均影响和方向。此外,计算优势比以表示每个变量的影响大小和方向,并进行性别分层分析以确定任何性别特定的风险因素。讨论:这项嵌套式研究提供了证据,表明睡眠障碍、久坐不动的生活方式、高脂肪食物和能量饮料的摄入是墨西哥城一组相对健康的年轻成年人高血压的潜在可改变的危险因素。这些发现强调了在高血压预防和管理策略中解决这些因素的重要性。
期刊介绍:
Frontiers? Which frontiers? Where exactly are the frontiers of cardiovascular medicine? And who should be defining these frontiers?
At Frontiers in Cardiovascular Medicine we believe it is worth being curious to foresee and explore beyond the current frontiers. In other words, we would like, through the articles published by our community journal Frontiers in Cardiovascular Medicine, to anticipate the future of cardiovascular medicine, and thus better prevent cardiovascular disorders and improve therapeutic options and outcomes of our patients.