Comparison of machine learning models for the prediction of hypertension in transgender patients undergoing gynecologic surgery

IF 5.4 Q1 MEDICINE, RESEARCH & EXPERIMENTAL Communications medicine Pub Date : 2024-09-30 DOI:10.1038/s43856-024-00603-x

Reetam Ganguli, Jordan Franklin, Xiaotian Yu, Alice Lin, Aditi Vichare, Stephen Wagner

{"title":"Comparison of machine learning models for the prediction of hypertension in transgender patients undergoing gynecologic surgery","authors":"Reetam Ganguli, Jordan Franklin, Xiaotian Yu, Alice Lin, Aditi Vichare, Stephen Wagner","doi":"10.1038/s43856-024-00603-x","DOIUrl":null,"url":null,"abstract":"Transgender patients face a higher burden of cardiovascular morbidity due to structural and biological stressors, particularly in low-resource settings. No studies exist comparing machine learning model development strategies for this unique patient cohort and limited literature exists comparing data/outcomes between transgender and cisgender populations. We compare machine learning models trained solely on transgender patients against models developed on a size-matched and ratio-matched cohort of cisgender patients and a 300-fold larger, ratio-matched cohort of cisgender patients undergoing obstetric/gynecologic procedures in the National Surgical Quality Improvement Program from January 1, 2005 through December 31, 2019. All models were developed to predict the outcome of hypertension. Statistical significance between models was calculated using 5-by-2 fold cross validation hypothesis testing. Among 626,102 patients having an obstetric/gynecologic surgery, there are 1959 transgender patients of which 85,405 (13.7%) have hypertension requiring medication. Saliently, the logistic regression machine learning models trained selectively on the transgender cohort have an AUC of 0.865 (95% CI: 0.83–0.90), with an accuracy of 85% (95% CI: 0.80–0.87) compared to (p < 0.05) the logistic regression model trained on the 300-fold larger combined cohort which has an AUC of 0.861 (95% CI: 0.82–0.90), with an accuracy of 83% (95% CI: 0.80–0.87). Machine learning models can be trained on smaller, selectively transgender populations and may perform similarly or better to predict cardiovascular outcomes in transgender patients, than models developed on predominantly cisgender patients; this can be useful in lower-resource settings with smaller-volume transgender patients. Transgender patients face a higher burden of cardiovascular disease. Statistical models that predict cardiovascular disease-related outcomes, such as high blood pressure (hypertension), may be useful to clinicians to guide treatment, but existing models are mainly developed in cisgender populations. Here, we developed models to predict hypertension in patients undergoing surgery, and compared models developed using data from cisgender patients, transgender patients, or mixed populations to see if this affected how well these models could predict hypertension in the transgender population. We ultimately found that one of our models trained on a much smaller cohort of solely transgender patients outperformed the same model trained on a 300-times larger population of mixed cisgender and transgender patients. These findings might help to guide future efforts to develop statistical approaches to accurately predict health outcomes in transgender patients. Ganguli et al. compare the performance of machine learning models to predict hypertension in transgender patients undergoing gynecologic surgery. Logistic regression models trained on data from a cohort of transgender patients perform better than those trained on a predominantly cisgender cohort.","PeriodicalId":72646,"journal":{"name":"Communications medicine","volume":null,"pages":null},"PeriodicalIF":5.4000,"publicationDate":"2024-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11442826/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Communications medicine","FirstCategoryId":"1085","ListUrlMain":"https://www.nature.com/articles/s43856-024-00603-x","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MEDICINE, RESEARCH & EXPERIMENTAL","Score":null,"Total":0}

引用次数: 0

Abstract

Transgender patients face a higher burden of cardiovascular morbidity due to structural and biological stressors, particularly in low-resource settings. No studies exist comparing machine learning model development strategies for this unique patient cohort and limited literature exists comparing data/outcomes between transgender and cisgender populations. We compare machine learning models trained solely on transgender patients against models developed on a size-matched and ratio-matched cohort of cisgender patients and a 300-fold larger, ratio-matched cohort of cisgender patients undergoing obstetric/gynecologic procedures in the National Surgical Quality Improvement Program from January 1, 2005 through December 31, 2019. All models were developed to predict the outcome of hypertension. Statistical significance between models was calculated using 5-by-2 fold cross validation hypothesis testing. Among 626,102 patients having an obstetric/gynecologic surgery, there are 1959 transgender patients of which 85,405 (13.7%) have hypertension requiring medication. Saliently, the logistic regression machine learning models trained selectively on the transgender cohort have an AUC of 0.865 (95% CI: 0.83–0.90), with an accuracy of 85% (95% CI: 0.80–0.87) compared to (p < 0.05) the logistic regression model trained on the 300-fold larger combined cohort which has an AUC of 0.861 (95% CI: 0.82–0.90), with an accuracy of 83% (95% CI: 0.80–0.87). Machine learning models can be trained on smaller, selectively transgender populations and may perform similarly or better to predict cardiovascular outcomes in transgender patients, than models developed on predominantly cisgender patients; this can be useful in lower-resource settings with smaller-volume transgender patients. Transgender patients face a higher burden of cardiovascular disease. Statistical models that predict cardiovascular disease-related outcomes, such as high blood pressure (hypertension), may be useful to clinicians to guide treatment, but existing models are mainly developed in cisgender populations. Here, we developed models to predict hypertension in patients undergoing surgery, and compared models developed using data from cisgender patients, transgender patients, or mixed populations to see if this affected how well these models could predict hypertension in the transgender population. We ultimately found that one of our models trained on a much smaller cohort of solely transgender patients outperformed the same model trained on a 300-times larger population of mixed cisgender and transgender patients. These findings might help to guide future efforts to develop statistical approaches to accurately predict health outcomes in transgender patients. Ganguli et al. compare the performance of machine learning models to predict hypertension in transgender patients undergoing gynecologic surgery. Logistic regression models trained on data from a cohort of transgender patients perform better than those trained on a predominantly cisgender cohort.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

比较用于预测接受妇科手术的变性患者高血压的机器学习模型。

背景：变性患者因结构性和生物性压力因素而面临更高的心血管发病率负担，尤其是在资源匮乏的环境中。目前还没有针对这一特殊患者群体的机器学习模型开发策略进行比较的研究，而比较变性人群和顺性人群的数据/结果的文献也很有限：我们将仅针对跨性别患者训练的机器学习模型与针对大小匹配和比例匹配的顺性别患者队列以及从 2005 年 1 月 1 日到 2019 年 12 月 31 日期间在国家外科手术质量改进计划中接受妇产科手术的比顺性患者队列大 300 倍的比例匹配患者队列开发的模型进行了比较。所有模型均用于预测高血压的结果。采用5乘2倍交叉验证假设检验法计算模型之间的统计显著性：在 626 102 名接受妇产科手术的患者中，有 1959 名变性患者，其中 85 405 人（13.7%）患有需要药物治疗的高血压。值得注意的是，有选择性地在变性人队列中训练的逻辑回归机器学习模型的AUC为0.865（95% CI：0.83-0.90），准确率为85%（95% CI：0.80-0.87），与（p 结论：机器学习模型可以在较小的变性人队列中训练，但需要一定的时间：机器学习模型可以在较小的、有选择性的变性人群中进行训练，在预测变性患者心血管预后方面的表现可能类似于或优于主要针对顺性患者开发的模型；这在资源较少、变性患者数量较少的环境中非常有用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Communications medicine

自引率

0.00%

发文量