Factors associated with underweight, overweight, and obesity in Chinese children aged 3-14 years using ensemble learning algorithms.

IF 4.5 3区 医学 Q1 PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH Journal of Global Health Pub Date : 2025-02-07 DOI:10.7189/jogh.15.04013
Kening Chen, Fangjieyi Zheng, Xiaoqian Zhang, Qiong Wang, Zhixin Zhang, Wenquan Niu
{"title":"Factors associated with underweight, overweight, and obesity in Chinese children aged 3-14 years using ensemble learning algorithms.","authors":"Kening Chen, Fangjieyi Zheng, Xiaoqian Zhang, Qiong Wang, Zhixin Zhang, Wenquan Niu","doi":"10.7189/jogh.15.04013","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Factors underlying the development of childhood underweight, overweight, and obesity are not fully understood. Traditional models have drawbacks in handling large-scale, high-dimensional, and nonlinear data. In this study, we aimed to identify factors responsible for underweight, overweight, and obesity using machine learning methods among Chinese children.</p><p><strong>Methods: </strong>Our study participants were children aged 3-14 from 30 kindergartens and 26 schools in Beijing and Tangshan. Weight status was defined per the World Health Organization criteria. We implemented three ensemble learning algorithms and compared their performance and ranked the contributing factors by importance and identified an optimal set. A user-friendly web application was developed to calculate the predicted probability of childhood underweight, overweight, and obesity.</p><p><strong>Results: </strong>We analysed data from 18 503 children aged 3-14, including 1798 underweight, 10 579 of normal weight, 3257 overweight, and 2869 with obesity. Of all algorithms, random forest performed the best, with the area under the receiver operating characteristic reaching 0.759 for underweight, 0.806 for overweight, and 0.849 for obesity, with other metrics also reinforcing this algorithm. Further cumulative analyses showed that, for underweight, the optimal set of six factors included maternal body mass index (BMI), age, paternal BMI, maternal reproductive age, paternal reproductive age, and birth weight. The optimal set for overweight comprised of five factors: age, fast food intake, maternal BMI, paternal BMI, and sedentary time. For obesity, the optimal set included six factors: age, fast food intake, maternal BMI, paternal BMI, sedentary time, and maternal reproductive age. Further logistic regression analyses confirmed the predictive capability of individual top factors.</p><p><strong>Conclusions: </strong>Our findings indicate that random forest is the best ensemble learning algorithm for predicting underweight, overweight, and obesity in children aged 3-14 years. We identified the optimal set of significant factors for each malnutrition status and incorporated them into a web application to support the application of this study's findings.</p>","PeriodicalId":48734,"journal":{"name":"Journal of Global Health","volume":"15 ","pages":"04013"},"PeriodicalIF":4.5000,"publicationDate":"2025-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11804908/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Global Health","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.7189/jogh.15.04013","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Factors underlying the development of childhood underweight, overweight, and obesity are not fully understood. Traditional models have drawbacks in handling large-scale, high-dimensional, and nonlinear data. In this study, we aimed to identify factors responsible for underweight, overweight, and obesity using machine learning methods among Chinese children.

Methods: Our study participants were children aged 3-14 from 30 kindergartens and 26 schools in Beijing and Tangshan. Weight status was defined per the World Health Organization criteria. We implemented three ensemble learning algorithms and compared their performance and ranked the contributing factors by importance and identified an optimal set. A user-friendly web application was developed to calculate the predicted probability of childhood underweight, overweight, and obesity.

Results: We analysed data from 18 503 children aged 3-14, including 1798 underweight, 10 579 of normal weight, 3257 overweight, and 2869 with obesity. Of all algorithms, random forest performed the best, with the area under the receiver operating characteristic reaching 0.759 for underweight, 0.806 for overweight, and 0.849 for obesity, with other metrics also reinforcing this algorithm. Further cumulative analyses showed that, for underweight, the optimal set of six factors included maternal body mass index (BMI), age, paternal BMI, maternal reproductive age, paternal reproductive age, and birth weight. The optimal set for overweight comprised of five factors: age, fast food intake, maternal BMI, paternal BMI, and sedentary time. For obesity, the optimal set included six factors: age, fast food intake, maternal BMI, paternal BMI, sedentary time, and maternal reproductive age. Further logistic regression analyses confirmed the predictive capability of individual top factors.

Conclusions: Our findings indicate that random forest is the best ensemble learning algorithm for predicting underweight, overweight, and obesity in children aged 3-14 years. We identified the optimal set of significant factors for each malnutrition status and incorporated them into a web application to support the application of this study's findings.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
求助全文
约1分钟内获得全文 去求助
来源期刊
Journal of Global Health
Journal of Global Health PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH -
CiteScore
6.10
自引率
2.80%
发文量
240
审稿时长
6 weeks
期刊介绍: Journal of Global Health is a peer-reviewed journal published by the Edinburgh University Global Health Society, a not-for-profit organization registered in the UK. We publish editorials, news, viewpoints, original research and review articles in two issues per year.
期刊最新文献
Paediatric healthcare in Manhiça district through a gender lens: a retrospective analysis of 17 years of morbidity and demographic surveillance data. Spatial patterns and temporal trends in stillbirth, neonatal, and infant mortality: an exploration of country-level data from 2000 to 2021. The treatment responses among different inhalation therapies for GOLD group E patients with chronic obstructive pulmonary disease. The efficiency and productivity-changing trend of PHCIs since the 2009 health reform in China based on a three-stage DEA and Malmquist Productivity Index. Using calculations from the Lives Saved Tool in other global health modelling tools.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1