使用随机森林预测低出生体重:与逻辑回归的比较

Parisa Ahmadi, H. Alavi-Majd, S. Khodakarim, Leili Tapak, N. Kariman, Payam Amini, Forough Pazhuheian
{"title":"使用随机森林预测低出生体重:与逻辑回归的比较","authors":"Parisa Ahmadi, H. Alavi-Majd, S. Khodakarim, Leili Tapak, N. Kariman, Payam Amini, Forough Pazhuheian","doi":"10.22037/JPS.V8I3.15412","DOIUrl":null,"url":null,"abstract":"Low birth weight (neonate weighing less than 2500 g) is associated with several maternal and fetal factors, all interrelated with each other [ 1 ]. This study is aimed to survey maternal risk factors associated with low birth weight neonates using data mining (Random Forest) to account for interactions between them. We also intended to compare Random Forest with traditional Logistic regression. The dataset used in the present study consisted of 600 volunteer pregnant women.  This cross-sectional study was carried out in Milad hospital, Tehran, during 2005-2009. Ten potential risk factors that are commonly associated with low birth weight were selected by using Random Forest technique. Several criteria such as the area under ROC curve were considered in comparing Random Forest with Logistic Regression.According to both criteria, four top rank variables identified by Random Forest were pregnancy age, body mass index during the third three months of pregnancy, mother’s age and body mass index during the first three months of pregnancy, respectively. In addition, in terms of different criteria the Random Forest technique outperformed the Logistic regression (area under ROC curve: 93% ; Total Accuracy:95% ; Kappa Coefficient: 66%).The results of the present study showed that using Random Forest improved the prediction of low birth weight compared with Logistic Regression. This is because of the fact that the former accounts for all interactions between covariates. Therefore, this approach is a promising classifier for predicting low birth weight .","PeriodicalId":16663,"journal":{"name":"Journal of paramedical sciences","volume":"73 1","pages":"36-43"},"PeriodicalIF":0.0000,"publicationDate":"2017-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"13","resultStr":"{\"title\":\"Prediction of low birth weight using Random Forest: A comparison with Logistic Regression\",\"authors\":\"Parisa Ahmadi, H. Alavi-Majd, S. Khodakarim, Leili Tapak, N. Kariman, Payam Amini, Forough Pazhuheian\",\"doi\":\"10.22037/JPS.V8I3.15412\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Low birth weight (neonate weighing less than 2500 g) is associated with several maternal and fetal factors, all interrelated with each other [ 1 ]. This study is aimed to survey maternal risk factors associated with low birth weight neonates using data mining (Random Forest) to account for interactions between them. We also intended to compare Random Forest with traditional Logistic regression. The dataset used in the present study consisted of 600 volunteer pregnant women.  This cross-sectional study was carried out in Milad hospital, Tehran, during 2005-2009. Ten potential risk factors that are commonly associated with low birth weight were selected by using Random Forest technique. Several criteria such as the area under ROC curve were considered in comparing Random Forest with Logistic Regression.According to both criteria, four top rank variables identified by Random Forest were pregnancy age, body mass index during the third three months of pregnancy, mother’s age and body mass index during the first three months of pregnancy, respectively. In addition, in terms of different criteria the Random Forest technique outperformed the Logistic regression (area under ROC curve: 93% ; Total Accuracy:95% ; Kappa Coefficient: 66%).The results of the present study showed that using Random Forest improved the prediction of low birth weight compared with Logistic Regression. This is because of the fact that the former accounts for all interactions between covariates. Therefore, this approach is a promising classifier for predicting low birth weight .\",\"PeriodicalId\":16663,\"journal\":{\"name\":\"Journal of paramedical sciences\",\"volume\":\"73 1\",\"pages\":\"36-43\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-07-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"13\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of paramedical sciences\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.22037/JPS.V8I3.15412\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of paramedical sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.22037/JPS.V8I3.15412","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 13

摘要

低出生体重(新生儿体重低于2500 g)与多种母胎因素有关,且相互关联[1]。本研究旨在利用数据挖掘(随机森林)来调查与低出生体重新生儿相关的孕产妇危险因素,以解释它们之间的相互作用。我们还打算将随机森林与传统逻辑回归进行比较。本研究使用的数据集包括600名自愿怀孕的妇女。这项横断面研究于2005-2009年在德黑兰Milad医院进行。采用随机森林技术选择10个与低出生体重相关的潜在危险因素。在比较随机森林和逻辑回归时,考虑了几个标准,如ROC曲线下面积。根据这两个标准,随机森林确定的四个最重要的变量分别是怀孕年龄、怀孕第三个月的体重指数、母亲年龄和怀孕前三个月的体重指数。此外,就不同的标准而言,随机森林技术优于Logistic回归(ROC曲线下面积:93%;总准确率:95%;Kappa系数:66%)。本研究结果表明,与Logistic回归相比,使用随机森林可以改善低出生体重的预测。这是因为前者解释了协变量之间的所有相互作用。因此,该方法是预测低出生体重的一种很有前途的分类器。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Prediction of low birth weight using Random Forest: A comparison with Logistic Regression
Low birth weight (neonate weighing less than 2500 g) is associated with several maternal and fetal factors, all interrelated with each other [ 1 ]. This study is aimed to survey maternal risk factors associated with low birth weight neonates using data mining (Random Forest) to account for interactions between them. We also intended to compare Random Forest with traditional Logistic regression. The dataset used in the present study consisted of 600 volunteer pregnant women.  This cross-sectional study was carried out in Milad hospital, Tehran, during 2005-2009. Ten potential risk factors that are commonly associated with low birth weight were selected by using Random Forest technique. Several criteria such as the area under ROC curve were considered in comparing Random Forest with Logistic Regression.According to both criteria, four top rank variables identified by Random Forest were pregnancy age, body mass index during the third three months of pregnancy, mother’s age and body mass index during the first three months of pregnancy, respectively. In addition, in terms of different criteria the Random Forest technique outperformed the Logistic regression (area under ROC curve: 93% ; Total Accuracy:95% ; Kappa Coefficient: 66%).The results of the present study showed that using Random Forest improved the prediction of low birth weight compared with Logistic Regression. This is because of the fact that the former accounts for all interactions between covariates. Therefore, this approach is a promising classifier for predicting low birth weight .
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
The Putative Role of Factor V Leiden and Prothrombin Mutations in Pregnancy Complications Information Governance Program: A Review of Applications in Healthcare Strain Selection and Statistical Optimization of Culture Conditions for 19F Polysaccharide Production from Pneumococcus Influence of Moderate and Severe Exercise on Memory Formation and Anxiety-like Behaviors in Male Wistar Rat Anxiety Effect: A Case of Text Modification and the Effect of High and Low Anxiety Levels on Medical Students’ Comprehension Performance
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1