尼日利亚5岁以下儿童死亡率的趋势分析和决定因素:机器学习方法

Asian Journal of Probability and Statistics Pub Date : 2023-08-29 DOI:10.9734/ajpas/2023/v24i2520

Solomon Ntukidem, A. Chukwu, O. Oyamakin, C. James, Ignace Habimana-Kabano

{"title":"尼日利亚5岁以下儿童死亡率的趋势分析和决定因素:机器学习方法","authors":"Solomon Ntukidem, A. Chukwu, O. Oyamakin, C. James, Ignace Habimana-Kabano","doi":"10.9734/ajpas/2023/v24i2520","DOIUrl":null,"url":null,"abstract":"The study aimed to examine the trend of the under-five mortality rate in Nigeria from 2003 to 2018 and the determinants of under-five mortality using the Nigeria Demographic and Health Survey (NDHS) data. The data for the study was the Nigeria Demographic and Health Survey data conducted in 2003, 2008, 2013, and 2018. These four surveys were used to study under-five mortality trends within the study period, while machine learning was applied only to the 2018 dataset being the latest in Nigeria. The data were partitioned into training and testing sets. 30% of the dataset was randomly selected for testing, while 70% was used in training the model. Before applying logistic regression and neural networks, the essential under-five mortality variables were first selected using a random forest classifier. \nThe trend showed that the mortality rates were 200.72, 156.86, 128.05, and 132.02 in 2003, 2008, 2013, and 2018 respectively, per 1,000 live births. This result means that one in every five children died before their fifth birthday in 2003, one in six in 2008, one in eight in 2013, and one in seven in 2018. The forecast result indicated that the under-five mortality rate would likely be 102.17 in 2023. The variable importance result of the random forest showed that breastfeeding (when the child was put to the breast after birth) had the highest contribution to under-five mortality. The breakdown of breastfeeding from the logistic regression result showed that delaying the breastfeeding of a child to 6-23 hours in comparison with 0-5 hours after birth increases by 1.4 fold the likelihood of child death. The accuracy of logistic regression (LR) on the test set was 60%, and that of deep neural network (DNN) was 74%, recall (sensitivity) for LR was 63%, and DNN was 75%), Precision (LR=97%, DNN=95), F1 score (LR=76%, DNN=84%) and area under the curve (AUC) (LR=79%, DNN=77%). \nBoth logistic regression and deep neural network models performed very well in discriminative ability and accuracy. The deep neural network had a better performance than the logistic regression.","PeriodicalId":8532,"journal":{"name":"Asian Journal of Probability and Statistics","volume":"1 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Trend Analysis and Determinants of under-5 Mortality in Nigeria: A Machine Learning Approach\",\"authors\":\"Solomon Ntukidem, A. Chukwu, O. Oyamakin, C. James, Ignace Habimana-Kabano\",\"doi\":\"10.9734/ajpas/2023/v24i2520\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The study aimed to examine the trend of the under-five mortality rate in Nigeria from 2003 to 2018 and the determinants of under-five mortality using the Nigeria Demographic and Health Survey (NDHS) data. The data for the study was the Nigeria Demographic and Health Survey data conducted in 2003, 2008, 2013, and 2018. These four surveys were used to study under-five mortality trends within the study period, while machine learning was applied only to the 2018 dataset being the latest in Nigeria. The data were partitioned into training and testing sets. 30% of the dataset was randomly selected for testing, while 70% was used in training the model. Before applying logistic regression and neural networks, the essential under-five mortality variables were first selected using a random forest classifier. \\nThe trend showed that the mortality rates were 200.72, 156.86, 128.05, and 132.02 in 2003, 2008, 2013, and 2018 respectively, per 1,000 live births. This result means that one in every five children died before their fifth birthday in 2003, one in six in 2008, one in eight in 2013, and one in seven in 2018. The forecast result indicated that the under-five mortality rate would likely be 102.17 in 2023. The variable importance result of the random forest showed that breastfeeding (when the child was put to the breast after birth) had the highest contribution to under-five mortality. The breakdown of breastfeeding from the logistic regression result showed that delaying the breastfeeding of a child to 6-23 hours in comparison with 0-5 hours after birth increases by 1.4 fold the likelihood of child death. The accuracy of logistic regression (LR) on the test set was 60%, and that of deep neural network (DNN) was 74%, recall (sensitivity) for LR was 63%, and DNN was 75%), Precision (LR=97%, DNN=95), F1 score (LR=76%, DNN=84%) and area under the curve (AUC) (LR=79%, DNN=77%). \\nBoth logistic regression and deep neural network models performed very well in discriminative ability and accuracy. The deep neural network had a better performance than the logistic regression.\",\"PeriodicalId\":8532,\"journal\":{\"name\":\"Asian Journal of Probability and Statistics\",\"volume\":\"1 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-08-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Asian Journal of Probability and Statistics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.9734/ajpas/2023/v24i2520\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Asian Journal of Probability and Statistics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.9734/ajpas/2023/v24i2520","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

该研究旨在利用尼日利亚人口与健康调查(NDHS)数据，研究2003年至2018年尼日利亚五岁以下儿童死亡率的趋势，以及五岁以下儿童死亡率的决定因素。该研究的数据是2003年、2008年、2013年和2018年进行的尼日利亚人口与健康调查数据。这四项调查用于研究研究期间五岁以下儿童死亡率的趋势，而机器学习仅应用于2018年的数据集，这是尼日利亚最新的数据集。数据被划分为训练集和测试集。随机选择30%的数据集进行测试，70%的数据集用于训练模型。在应用逻辑回归和神经网络之前，首先使用随机森林分类器选择重要的五岁以下死亡率变量。趋势显示，2003年、2008年、2013年和2018年，每千名活产婴儿的死亡率分别为200.72、156.86、128.05和132.02。这一结果意味着，2003年，五分之一的儿童在五岁生日前死亡，2008年为六分之一，2013年为八分之一，2018年为七分之一。预测结果表明，2023年5岁以下儿童死亡率可能为102.17。随机森林的可变重要性结果表明，母乳喂养(孩子出生后喂奶)对五岁以下儿童死亡率的贡献最大。逻辑回归结果对母乳喂养的细分显示，与出生后0-5小时相比，将儿童母乳喂养延迟至6-23小时，儿童死亡的可能性增加1.4倍。逻辑回归(LR)的准确率为60%，深度神经网络(DNN)的准确率为74%，LR的召回率(灵敏度)为63%，DNN为75%)，精度(LR=97%， DNN=95)， F1评分(LR=76%， DNN=84%)和曲线下面积(AUC) (LR=79%， DNN=77%)。逻辑回归模型和深度神经网络模型在判别能力和准确率上都有很好的表现。深度神经网络比逻辑回归具有更好的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Trend Analysis and Determinants of under-5 Mortality in Nigeria: A Machine Learning Approach

The study aimed to examine the trend of the under-five mortality rate in Nigeria from 2003 to 2018 and the determinants of under-five mortality using the Nigeria Demographic and Health Survey (NDHS) data. The data for the study was the Nigeria Demographic and Health Survey data conducted in 2003, 2008, 2013, and 2018. These four surveys were used to study under-five mortality trends within the study period, while machine learning was applied only to the 2018 dataset being the latest in Nigeria. The data were partitioned into training and testing sets. 30% of the dataset was randomly selected for testing, while 70% was used in training the model. Before applying logistic regression and neural networks, the essential under-five mortality variables were first selected using a random forest classifier. The trend showed that the mortality rates were 200.72, 156.86, 128.05, and 132.02 in 2003, 2008, 2013, and 2018 respectively, per 1,000 live births. This result means that one in every five children died before their fifth birthday in 2003, one in six in 2008, one in eight in 2013, and one in seven in 2018. The forecast result indicated that the under-five mortality rate would likely be 102.17 in 2023. The variable importance result of the random forest showed that breastfeeding (when the child was put to the breast after birth) had the highest contribution to under-five mortality. The breakdown of breastfeeding from the logistic regression result showed that delaying the breastfeeding of a child to 6-23 hours in comparison with 0-5 hours after birth increases by 1.4 fold the likelihood of child death. The accuracy of logistic regression (LR) on the test set was 60%, and that of deep neural network (DNN) was 74%, recall (sensitivity) for LR was 63%, and DNN was 75%), Precision (LR=97%, DNN=95), F1 score (LR=76%, DNN=84%) and area under the curve (AUC) (LR=79%, DNN=77%). Both logistic regression and deep neural network models performed very well in discriminative ability and accuracy. The deep neural network had a better performance than the logistic regression.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Asian Journal of Probability and Statistics

自引率

0.00%

发文量