Factors associated with underweight, overweight, and obesity in Chinese children aged 3-14 years using ensemble learning algorithms.

IF 4.3 3区 医学 Q1 PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH Journal of Global Health Pub Date : 2025-02-07 DOI:10.7189/jogh.15.04013
Kening Chen, Fangjieyi Zheng, Xiaoqian Zhang, Qiong Wang, Zhixin Zhang, Wenquan Niu
{"title":"Factors associated with underweight, overweight, and obesity in Chinese children aged 3-14 years using ensemble learning algorithms.","authors":"Kening Chen, Fangjieyi Zheng, Xiaoqian Zhang, Qiong Wang, Zhixin Zhang, Wenquan Niu","doi":"10.7189/jogh.15.04013","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Factors underlying the development of childhood underweight, overweight, and obesity are not fully understood. Traditional models have drawbacks in handling large-scale, high-dimensional, and nonlinear data. In this study, we aimed to identify factors responsible for underweight, overweight, and obesity using machine learning methods among Chinese children.</p><p><strong>Methods: </strong>Our study participants were children aged 3-14 from 30 kindergartens and 26 schools in Beijing and Tangshan. Weight status was defined per the World Health Organization criteria. We implemented three ensemble learning algorithms and compared their performance and ranked the contributing factors by importance and identified an optimal set. A user-friendly web application was developed to calculate the predicted probability of childhood underweight, overweight, and obesity.</p><p><strong>Results: </strong>We analysed data from 18 503 children aged 3-14, including 1798 underweight, 10 579 of normal weight, 3257 overweight, and 2869 with obesity. Of all algorithms, random forest performed the best, with the area under the receiver operating characteristic reaching 0.759 for underweight, 0.806 for overweight, and 0.849 for obesity, with other metrics also reinforcing this algorithm. Further cumulative analyses showed that, for underweight, the optimal set of six factors included maternal body mass index (BMI), age, paternal BMI, maternal reproductive age, paternal reproductive age, and birth weight. The optimal set for overweight comprised of five factors: age, fast food intake, maternal BMI, paternal BMI, and sedentary time. For obesity, the optimal set included six factors: age, fast food intake, maternal BMI, paternal BMI, sedentary time, and maternal reproductive age. Further logistic regression analyses confirmed the predictive capability of individual top factors.</p><p><strong>Conclusions: </strong>Our findings indicate that random forest is the best ensemble learning algorithm for predicting underweight, overweight, and obesity in children aged 3-14 years. We identified the optimal set of significant factors for each malnutrition status and incorporated them into a web application to support the application of this study's findings.</p>","PeriodicalId":48734,"journal":{"name":"Journal of Global Health","volume":"15 ","pages":"04013"},"PeriodicalIF":4.3000,"publicationDate":"2025-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11804908/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Global Health","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.7189/jogh.15.04013","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Factors underlying the development of childhood underweight, overweight, and obesity are not fully understood. Traditional models have drawbacks in handling large-scale, high-dimensional, and nonlinear data. In this study, we aimed to identify factors responsible for underweight, overweight, and obesity using machine learning methods among Chinese children.

Methods: Our study participants were children aged 3-14 from 30 kindergartens and 26 schools in Beijing and Tangshan. Weight status was defined per the World Health Organization criteria. We implemented three ensemble learning algorithms and compared their performance and ranked the contributing factors by importance and identified an optimal set. A user-friendly web application was developed to calculate the predicted probability of childhood underweight, overweight, and obesity.

Results: We analysed data from 18 503 children aged 3-14, including 1798 underweight, 10 579 of normal weight, 3257 overweight, and 2869 with obesity. Of all algorithms, random forest performed the best, with the area under the receiver operating characteristic reaching 0.759 for underweight, 0.806 for overweight, and 0.849 for obesity, with other metrics also reinforcing this algorithm. Further cumulative analyses showed that, for underweight, the optimal set of six factors included maternal body mass index (BMI), age, paternal BMI, maternal reproductive age, paternal reproductive age, and birth weight. The optimal set for overweight comprised of five factors: age, fast food intake, maternal BMI, paternal BMI, and sedentary time. For obesity, the optimal set included six factors: age, fast food intake, maternal BMI, paternal BMI, sedentary time, and maternal reproductive age. Further logistic regression analyses confirmed the predictive capability of individual top factors.

Conclusions: Our findings indicate that random forest is the best ensemble learning algorithm for predicting underweight, overweight, and obesity in children aged 3-14 years. We identified the optimal set of significant factors for each malnutrition status and incorporated them into a web application to support the application of this study's findings.

Abstract Image

Abstract Image

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
使用集成学习算法的中国3-14岁儿童体重不足、超重和肥胖相关因素
背景:儿童体重不足、超重和肥胖发展的潜在因素尚不完全清楚。传统模型在处理大规模、高维和非线性数据时存在缺陷。在这项研究中,我们旨在利用机器学习方法在中国儿童中确定导致体重不足、超重和肥胖的因素。方法:研究对象为北京市和唐山市30所幼儿园和26所学校的3 ~ 14岁儿童。体重状况是根据世界卫生组织的标准确定的。我们实现了三种集成学习算法,并比较了它们的性能,并根据重要性对影响因素进行了排名,并确定了一个最优集。开发了一个用户友好的web应用程序来计算儿童体重不足、超重和肥胖的预测概率。结果:我们分析了18 503名3-14岁儿童的数据,其中体重不足1798人,体重正常10 579人,超重3257人,肥胖2869人。在所有算法中,随机森林的表现最好,体重不足的接收者工作特征下的面积达到0.759,超重的接收者工作特征下的面积达到0.806,肥胖的接收者工作特征下的面积达到0.849,其他指标也强化了该算法。进一步的累积分析表明,对于体重不足,最优的六个因素包括母亲的身体质量指数(BMI)、年龄、父亲的身体质量指数、母亲的生育年龄、父亲的生育年龄和出生体重。超重的最佳组合包括五个因素:年龄、快餐摄入量、母亲的身体质量指数、父亲的身体质量指数和久坐时间。对于肥胖,最优的一组包括六个因素:年龄、快餐摄入量、母亲的身体质量指数、父亲的身体质量指数、久坐时间和母亲的生育年龄。进一步的logistic回归分析证实了个别top因子的预测能力。结论:我们的研究结果表明,随机森林是预测3-14岁儿童体重过轻、超重和肥胖的最佳集成学习算法。我们确定了每种营养不良状态的最佳重要因素集,并将其纳入web应用程序,以支持本研究结果的应用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Journal of Global Health
Journal of Global Health PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH -
CiteScore
6.10
自引率
2.80%
发文量
240
审稿时长
6 weeks
期刊介绍: Journal of Global Health is a peer-reviewed journal published by the Edinburgh University Global Health Society, a not-for-profit organization registered in the UK. We publish editorials, news, viewpoints, original research and review articles in two issues per year.
期刊最新文献
Prevalence of widowhood across states and union territories in India, 1993-2021: a repeated cross-sectional study. National, regional and provincial prevalence of carotid atherosclerosis and carotid plaque in Chinese adults: an updated systematic review and modelling analysis. The improvement effects and mechanisms of virtual reality training on depression: a systematic review from a neurotransmitter-plasticity dual-pathway perspective. Workplace violence and self-reported physical and mental health: a national cross-sectional study in Lebanon. Environmental determinants of cardiovascular-kidney-metabolic health: interactive roles of air pollution, heat waves, and green spaces.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1