Godsway Edem Kpene, Sylvester Yao Lokpo, Sandra A Darfour-Oduro
{"title":"加纳某三级医院T2DM患者死亡率的预测模型和决定因素:机器学习技术的表现如何?","authors":"Godsway Edem Kpene, Sylvester Yao Lokpo, Sandra A Darfour-Oduro","doi":"10.1186/s12902-025-01831-5","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>The increasing prevalence of type 2 diabetes mellitus (T2DM) in lower and middle - income countries call for preventive public health interventions. Studies from Africa including those from Ghana, consistently reveal high T2DM-related mortality rates. While previous research in the Ho municipality has primarily examined risk factors, comorbidity, and quality of life of T2DM patients, this study specifically investigated mortality predictors among these patients.</p><p><strong>Method: </strong>The study was retrospective involving medical records of T2DM patients. Data extracted included mortality outcome (dead or alive), sociodemographic characteristics (age, sex, marital status, educational level, occupation and location), family history of diseases (diabetes, cardiovascular disease (CVD), or asthma), lifestyle (smoking and alcohol intake), comorbidities (such as skin infections, sickle cell disease, urinary tract infections, and pneumonia) and complications of diabetes (CVD, nephropathy, neuropathy, foot ulcers, and diabetic ketoacidosis) were analyzed using Stata version 16.0 and Python 3.6.1 programming language. Both descriptive and inferential statistics were done to describe and build predictive models respectively. The performance of machine learning (ML) techniques such as support vector machine (SVM), decision tree, k nearest neighbor (kNN), eXtreme Gradient Boosting (XGBoost) and logistic regression were evaluated using the best-fitting predictive model for T2DM mortality.</p><p><strong>Results: </strong>Of the 328 participants, 183 (55.79%) were female, and the percentage of mortality was 11.28%. A 100% mortality was recorded among the T2DM patients with sepsis (p-value = 0.012). T2DM in-patients were 3.83 times as likely to die [AOR = 3.83; 95% CI: (1.53-9.61)] if they had nephropathy compared to T2DM in-patients without nephropathy (p-value = 0.004). The full model which included sociodemographic characteristics, family history, lifestyle variables and complications of T2DM had the best prediction of T2DM mortality outcome (ROC = 72.97%). The accuracy for (test and train datasets) were as follows: (90% and 90%), (100% and 100%), (90% and 90%), (90% and 88%) and (88% and 90%) respectively for the various ML classification techniques: logistic regression, Decision tree classifier, kNN classifier, SVM and XGBoost.</p><p><strong>Conclusion: </strong>This study found that all in-patients with sepsis died. Nephropathy was the identified significant predictor of T2DM mortality. Decision tree classifier provided the best classifying potential.</p>","PeriodicalId":9152,"journal":{"name":"BMC Endocrine Disorders","volume":"25 1","pages":"9"},"PeriodicalIF":2.8000,"publicationDate":"2025-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11720850/pdf/","citationCount":"0","resultStr":"{\"title\":\"Predictive models and determinants of mortality among T2DM patients in a tertiary hospital in Ghana, how do machine learning techniques perform?\",\"authors\":\"Godsway Edem Kpene, Sylvester Yao Lokpo, Sandra A Darfour-Oduro\",\"doi\":\"10.1186/s12902-025-01831-5\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>The increasing prevalence of type 2 diabetes mellitus (T2DM) in lower and middle - income countries call for preventive public health interventions. Studies from Africa including those from Ghana, consistently reveal high T2DM-related mortality rates. While previous research in the Ho municipality has primarily examined risk factors, comorbidity, and quality of life of T2DM patients, this study specifically investigated mortality predictors among these patients.</p><p><strong>Method: </strong>The study was retrospective involving medical records of T2DM patients. Data extracted included mortality outcome (dead or alive), sociodemographic characteristics (age, sex, marital status, educational level, occupation and location), family history of diseases (diabetes, cardiovascular disease (CVD), or asthma), lifestyle (smoking and alcohol intake), comorbidities (such as skin infections, sickle cell disease, urinary tract infections, and pneumonia) and complications of diabetes (CVD, nephropathy, neuropathy, foot ulcers, and diabetic ketoacidosis) were analyzed using Stata version 16.0 and Python 3.6.1 programming language. Both descriptive and inferential statistics were done to describe and build predictive models respectively. The performance of machine learning (ML) techniques such as support vector machine (SVM), decision tree, k nearest neighbor (kNN), eXtreme Gradient Boosting (XGBoost) and logistic regression were evaluated using the best-fitting predictive model for T2DM mortality.</p><p><strong>Results: </strong>Of the 328 participants, 183 (55.79%) were female, and the percentage of mortality was 11.28%. A 100% mortality was recorded among the T2DM patients with sepsis (p-value = 0.012). T2DM in-patients were 3.83 times as likely to die [AOR = 3.83; 95% CI: (1.53-9.61)] if they had nephropathy compared to T2DM in-patients without nephropathy (p-value = 0.004). The full model which included sociodemographic characteristics, family history, lifestyle variables and complications of T2DM had the best prediction of T2DM mortality outcome (ROC = 72.97%). The accuracy for (test and train datasets) were as follows: (90% and 90%), (100% and 100%), (90% and 90%), (90% and 88%) and (88% and 90%) respectively for the various ML classification techniques: logistic regression, Decision tree classifier, kNN classifier, SVM and XGBoost.</p><p><strong>Conclusion: </strong>This study found that all in-patients with sepsis died. Nephropathy was the identified significant predictor of T2DM mortality. Decision tree classifier provided the best classifying potential.</p>\",\"PeriodicalId\":9152,\"journal\":{\"name\":\"BMC Endocrine Disorders\",\"volume\":\"25 1\",\"pages\":\"9\"},\"PeriodicalIF\":2.8000,\"publicationDate\":\"2025-01-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11720850/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"BMC Endocrine Disorders\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1186/s12902-025-01831-5\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"ENDOCRINOLOGY & METABOLISM\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Endocrine Disorders","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12902-025-01831-5","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENDOCRINOLOGY & METABOLISM","Score":null,"Total":0}
Predictive models and determinants of mortality among T2DM patients in a tertiary hospital in Ghana, how do machine learning techniques perform?
Background: The increasing prevalence of type 2 diabetes mellitus (T2DM) in lower and middle - income countries call for preventive public health interventions. Studies from Africa including those from Ghana, consistently reveal high T2DM-related mortality rates. While previous research in the Ho municipality has primarily examined risk factors, comorbidity, and quality of life of T2DM patients, this study specifically investigated mortality predictors among these patients.
Method: The study was retrospective involving medical records of T2DM patients. Data extracted included mortality outcome (dead or alive), sociodemographic characteristics (age, sex, marital status, educational level, occupation and location), family history of diseases (diabetes, cardiovascular disease (CVD), or asthma), lifestyle (smoking and alcohol intake), comorbidities (such as skin infections, sickle cell disease, urinary tract infections, and pneumonia) and complications of diabetes (CVD, nephropathy, neuropathy, foot ulcers, and diabetic ketoacidosis) were analyzed using Stata version 16.0 and Python 3.6.1 programming language. Both descriptive and inferential statistics were done to describe and build predictive models respectively. The performance of machine learning (ML) techniques such as support vector machine (SVM), decision tree, k nearest neighbor (kNN), eXtreme Gradient Boosting (XGBoost) and logistic regression were evaluated using the best-fitting predictive model for T2DM mortality.
Results: Of the 328 participants, 183 (55.79%) were female, and the percentage of mortality was 11.28%. A 100% mortality was recorded among the T2DM patients with sepsis (p-value = 0.012). T2DM in-patients were 3.83 times as likely to die [AOR = 3.83; 95% CI: (1.53-9.61)] if they had nephropathy compared to T2DM in-patients without nephropathy (p-value = 0.004). The full model which included sociodemographic characteristics, family history, lifestyle variables and complications of T2DM had the best prediction of T2DM mortality outcome (ROC = 72.97%). The accuracy for (test and train datasets) were as follows: (90% and 90%), (100% and 100%), (90% and 90%), (90% and 88%) and (88% and 90%) respectively for the various ML classification techniques: logistic regression, Decision tree classifier, kNN classifier, SVM and XGBoost.
Conclusion: This study found that all in-patients with sepsis died. Nephropathy was the identified significant predictor of T2DM mortality. Decision tree classifier provided the best classifying potential.
期刊介绍:
BMC Endocrine Disorders is an open access, peer-reviewed journal that considers articles on all aspects of the prevention, diagnosis and management of endocrine disorders, as well as related molecular genetics, pathophysiology, and epidemiology.