Application of machine learning techniques for predicting child mortality and identifying associated risk factors

Elliot Mbunge, S. Fashoto, Benhildah Muchemwa, R. Millham, Garikayi B. Chemhaka, M. Sibiya, T. Dzinamarira, Jolly Buwerimwe
{"title":"Application of machine learning techniques for predicting child mortality and identifying associated risk factors","authors":"Elliot Mbunge, S. Fashoto, Benhildah Muchemwa, R. Millham, Garikayi B. Chemhaka, M. Sibiya, T. Dzinamarira, Jolly Buwerimwe","doi":"10.1109/ICTAS56421.2023.10082734","DOIUrl":null,"url":null,"abstract":"Despite continuous persistent efforts to enhance child health through, among other things, universal access to care, child mortality remains a significant public health concern on a global scale. Child mortality is attributed to several factors including birth asphyxia/trauma, demographic and socioeconomic factors, preterm birth and intrapartum-related complications, pneumonia, preventable and treatable diseases, congenital anomalies, poor access to quality healthcare, poor hygiene and nutrition, and sanitation among others. In many sub-Saharan African nations, including Zimbabwe, the use of machine learning techniques to predict child mortality is still in its infancy. Therefore, this study applied machine learning algorithms decision trees, random forest, logistic regression and XGBoost to develop child mortality predictive models that utilize nationally representative demographic and health survey data. The logistic regression classifier achieved an accuracy of 74%, random forest 72%, Decision tree 72%, and XGBoost a high accuracy of 81%. All under-five predictive models achieved a precision of 95 %. However, logistic regression achieved a recall of 76%, random forest 74%, Decision tree 74%, and XGBoost 84%. Logistic Regression achieved F1-score of 84%, random forest 83%, Decision tree 83% and 89% for XGBoost. The XGBoost outperformed other under-five predictive models. Integrating such models into health information systems can significantly assist policymakers and healthcare professionals to improve the health status of children, access to quality care and most importantly, improve preventive measures, immunization programmes, policies, and decision-making to improve child health. Understanding the risk factors can assist in designing intervention programmes aimed at improve child health while reducing child mortality.","PeriodicalId":158720,"journal":{"name":"2023 Conference on Information Communications Technology and Society (ICTAS)","volume":"2003 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 Conference on Information Communications Technology and Society (ICTAS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICTAS56421.2023.10082734","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

Despite continuous persistent efforts to enhance child health through, among other things, universal access to care, child mortality remains a significant public health concern on a global scale. Child mortality is attributed to several factors including birth asphyxia/trauma, demographic and socioeconomic factors, preterm birth and intrapartum-related complications, pneumonia, preventable and treatable diseases, congenital anomalies, poor access to quality healthcare, poor hygiene and nutrition, and sanitation among others. In many sub-Saharan African nations, including Zimbabwe, the use of machine learning techniques to predict child mortality is still in its infancy. Therefore, this study applied machine learning algorithms decision trees, random forest, logistic regression and XGBoost to develop child mortality predictive models that utilize nationally representative demographic and health survey data. The logistic regression classifier achieved an accuracy of 74%, random forest 72%, Decision tree 72%, and XGBoost a high accuracy of 81%. All under-five predictive models achieved a precision of 95 %. However, logistic regression achieved a recall of 76%, random forest 74%, Decision tree 74%, and XGBoost 84%. Logistic Regression achieved F1-score of 84%, random forest 83%, Decision tree 83% and 89% for XGBoost. The XGBoost outperformed other under-five predictive models. Integrating such models into health information systems can significantly assist policymakers and healthcare professionals to improve the health status of children, access to quality care and most importantly, improve preventive measures, immunization programmes, policies, and decision-making to improve child health. Understanding the risk factors can assist in designing intervention programmes aimed at improve child health while reducing child mortality.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
机器学习技术在预测儿童死亡率和识别相关风险因素中的应用
尽管通过普及保健等方式不断作出不懈努力,增进儿童健康,但儿童死亡率仍然是全球范围内令人关切的重大公共卫生问题。儿童死亡率可归因于几个因素,包括出生窒息/创伤、人口和社会经济因素、早产和与分娩有关的并发症、肺炎、可预防和可治疗的疾病、先天性异常、难以获得优质保健、卫生和营养不良以及环境卫生等。在包括津巴布韦在内的许多撒哈拉以南非洲国家,使用机器学习技术预测儿童死亡率仍处于起步阶段。因此,本研究应用机器学习算法决策树、随机森林、逻辑回归和XGBoost,利用具有全国代表性的人口和健康调查数据建立儿童死亡率预测模型。逻辑回归分类器的准确率为74%,随机森林为72%,决策树为72%,XGBoost的准确率高达81%。所有5岁以下的预测模型都达到了95%的精度。然而,逻辑回归的召回率为76%,随机森林74%,决策树74%,XGBoost 84%。XGBoost Logistic回归的f1得分为84%,随机森林83%,决策树83%,89%。XGBoost的表现优于其他5岁以下的预测模型。将这些模型整合到卫生信息系统中可以极大地帮助决策者和卫生保健专业人员改善儿童的健康状况,获得高质量的保健,最重要的是,改善预防措施、免疫规划、政策和决策,以改善儿童健康。了解风险因素有助于设计旨在改善儿童健康同时降低儿童死亡率的干预方案。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Impact of anxiety on students' behavioural intention to use business simulation games Biometric Recognition of Infants Using Fingerprints: Can the infant fingerprint be used for secure authentication? A study on farmers' perceptions about the scope of the Kisan Suvidha App in improving agricultural sustainability Enhancing Traffic Simulations Analysis Efficacy using Multiperspective Heterogeneous Toolset Implementation of ensemble machine learning classifiers to predict diarrhoea with SMOTEENN, SMOTE, and SMOTETomek class imbalance approaches
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1