Mireya Martínez-García, Guadalupe O Gutiérrez-Esparza, Manlio F Márquez, Luis M Amezcua-Guerra, Enrique Hernández-Lemus
{"title":"Machine learning analysis of emerging risk factors for early-onset hypertension in the Tlalpan 2020 cohort.","authors":"Mireya Martínez-García, Guadalupe O Gutiérrez-Esparza, Manlio F Márquez, Luis M Amezcua-Guerra, Enrique Hernández-Lemus","doi":"10.3389/fcvm.2024.1434418","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>Hypertension is a significant public health concern. Several relevant risk factors have been identified. However, since it is a complex condition with broad variability and strong dependence on environmental and lifestyle factors, current risk factors only account for a fraction of the observed prevalence. This study aims to investigate the emerging early-onset hypertension risk factors using a data-driven approach by implementing machine learning models within a well-established cohort in Mexico City, comprising initially 2,500 healthy adults aged 18 to 50 years.</p><p><strong>Methods: </strong>Hypertensive individuals were newly diagnosed during 6,000 person-years, and normotensive individuals were those who, during the same time, remained without exceeding 140 mm Hg in systolic blood pressure and/or diastolic blood pressure of 90 mm Hg. Data on sociodemographic, lifestyle, anthropometric, clinical, and biochemical variables were collected through standardized questionnaires as well as clinical and laboratory assessments. Extreme Gradient Boosting (XGBoost), Logistic Regression (LG) and Support Vector Machines (SVM) were employed to evaluate the relationship between these factors and hypertension risk.</p><p><strong>Results: </strong>The Random Forest (RF) Importance Percent was calculated to assess the structural relevance of each variable in the model, while Shapley Additive Explanations (SHAP) analysis quantified both the average impact and direction of each feature on individual predictions. Additionally, odds ratios were calculated to express the size and direction of influence for each variable, and a sex-stratified analysis was conducted to identify any gender-specific risk factors.</p><p><strong>Discussion: </strong>This nested study provides evidence that sleep disorders, a sedentary lifestyle, consumption of high-fat foods, and energy drinks are potentially modifiable risk factors for hypertension in a Mexico City cohort of young and relatively healthy adults. These findings underscore the importance of addressing these factors in hypertension prevention and management strategies.</p>","PeriodicalId":12414,"journal":{"name":"Frontiers in Cardiovascular Medicine","volume":"11 ","pages":"1434418"},"PeriodicalIF":2.8000,"publicationDate":"2025-01-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11782138/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Cardiovascular Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3389/fcvm.2024.1434418","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"CARDIAC & CARDIOVASCULAR SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Introduction: Hypertension is a significant public health concern. Several relevant risk factors have been identified. However, since it is a complex condition with broad variability and strong dependence on environmental and lifestyle factors, current risk factors only account for a fraction of the observed prevalence. This study aims to investigate the emerging early-onset hypertension risk factors using a data-driven approach by implementing machine learning models within a well-established cohort in Mexico City, comprising initially 2,500 healthy adults aged 18 to 50 years.
Methods: Hypertensive individuals were newly diagnosed during 6,000 person-years, and normotensive individuals were those who, during the same time, remained without exceeding 140 mm Hg in systolic blood pressure and/or diastolic blood pressure of 90 mm Hg. Data on sociodemographic, lifestyle, anthropometric, clinical, and biochemical variables were collected through standardized questionnaires as well as clinical and laboratory assessments. Extreme Gradient Boosting (XGBoost), Logistic Regression (LG) and Support Vector Machines (SVM) were employed to evaluate the relationship between these factors and hypertension risk.
Results: The Random Forest (RF) Importance Percent was calculated to assess the structural relevance of each variable in the model, while Shapley Additive Explanations (SHAP) analysis quantified both the average impact and direction of each feature on individual predictions. Additionally, odds ratios were calculated to express the size and direction of influence for each variable, and a sex-stratified analysis was conducted to identify any gender-specific risk factors.
Discussion: This nested study provides evidence that sleep disorders, a sedentary lifestyle, consumption of high-fat foods, and energy drinks are potentially modifiable risk factors for hypertension in a Mexico City cohort of young and relatively healthy adults. These findings underscore the importance of addressing these factors in hypertension prevention and management strategies.
期刊介绍:
Frontiers? Which frontiers? Where exactly are the frontiers of cardiovascular medicine? And who should be defining these frontiers?
At Frontiers in Cardiovascular Medicine we believe it is worth being curious to foresee and explore beyond the current frontiers. In other words, we would like, through the articles published by our community journal Frontiers in Cardiovascular Medicine, to anticipate the future of cardiovascular medicine, and thus better prevent cardiovascular disorders and improve therapeutic options and outcomes of our patients.