Using classification and regression trees to model missingness in youth BMI, height and body mass data.

IF 2.2 4区 医学 Q2 PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH Health Promotion and Chronic Disease Prevention in Canada-Research Policy and Practice Pub Date : 2023-05-01 DOI:10.24095/hpcdp.43.5.03
Amanda Doggett, Ashok Chaurasia, Jean-Philippe Chaput, Scott T Leatherdale
{"title":"Using classification and regression trees to model missingness in youth BMI, height and body mass data.","authors":"Amanda Doggett,&nbsp;Ashok Chaurasia,&nbsp;Jean-Philippe Chaput,&nbsp;Scott T Leatherdale","doi":"10.24095/hpcdp.43.5.03","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>Research suggests that there is often a high degree of missingness in youth body mass index (BMI) data derived from self-reported measures, which may have a large effect on research findings. The first step in handling missing data is to examine the levels and patterns of missingness. However, previous studies examining youth BMI missingness used logistic regression, which is limited in its ability to discern subgroups or identify a hierarchy of importance for variables, aspects that may go a long way in helping understand missing data patterns.</p><p><strong>Methods: </strong>This study used sex-stratified classification and regression tree (CART) models to examine missingness in height, body mass and BMI data among 74 501 youth participating in the 2018/19 COMPASS study (a prospective cohort study examining health behaviours among Canadian youth), where 31% of BMI data were missing. Diet, movement, academic, mental health and substance use variables were examined for associations with missingness in height, body mass and BMI.</p><p><strong>Results: </strong>CART models indicated that the combination of being younger, having a selfperception of being overweight, being less physically active and having poorer mental health yielded female and male subgroups highly likely to be missing BMI values. Survey respondents who did not perceive themselves as overweight and who were older were unlikely to be missing BMI values.</p><p><strong>Conclusion: </strong>The subgroups identified by the CART models indicate that a sample that deletes cases with missing BMI would be biased towards physically, emotionally and mentally healthier youth. Given the ability of CART models to identify these subgroups and a hierarchy of variable importance, they are an invaluable tool for examining missing data patterns and appropriate handling of missing data.</p>","PeriodicalId":51316,"journal":{"name":"Health Promotion and Chronic Disease Prevention in Canada-Research Policy and Practice","volume":"43 5","pages":"231-242"},"PeriodicalIF":2.2000,"publicationDate":"2023-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10237263/pdf/43_5_3.pdf","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Health Promotion and Chronic Disease Prevention in Canada-Research Policy and Practice","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.24095/hpcdp.43.5.03","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH","Score":null,"Total":0}
引用次数: 1

Abstract

Introduction: Research suggests that there is often a high degree of missingness in youth body mass index (BMI) data derived from self-reported measures, which may have a large effect on research findings. The first step in handling missing data is to examine the levels and patterns of missingness. However, previous studies examining youth BMI missingness used logistic regression, which is limited in its ability to discern subgroups or identify a hierarchy of importance for variables, aspects that may go a long way in helping understand missing data patterns.

Methods: This study used sex-stratified classification and regression tree (CART) models to examine missingness in height, body mass and BMI data among 74 501 youth participating in the 2018/19 COMPASS study (a prospective cohort study examining health behaviours among Canadian youth), where 31% of BMI data were missing. Diet, movement, academic, mental health and substance use variables were examined for associations with missingness in height, body mass and BMI.

Results: CART models indicated that the combination of being younger, having a selfperception of being overweight, being less physically active and having poorer mental health yielded female and male subgroups highly likely to be missing BMI values. Survey respondents who did not perceive themselves as overweight and who were older were unlikely to be missing BMI values.

Conclusion: The subgroups identified by the CART models indicate that a sample that deletes cases with missing BMI would be biased towards physically, emotionally and mentally healthier youth. Given the ability of CART models to identify these subgroups and a hierarchy of variable importance, they are an invaluable tool for examining missing data patterns and appropriate handling of missing data.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
使用分类和回归树对青少年BMI、身高和体重数据的缺失进行建模。
导论:研究表明,青少年自我报告的体重指数(BMI)数据往往存在高度缺失,这可能对研究结果产生很大影响。处理缺失数据的第一步是检查缺失的程度和模式。然而,之前研究青少年BMI缺失的研究使用的是逻辑回归,这在辨别亚组或确定变量的重要性层次的能力上是有限的,这些方面可能有助于理解缺失的数据模式。方法:本研究使用性别分层分类和回归树(CART)模型来检查参加2018/19 COMPASS研究(一项研究加拿大青年健康行为的前瞻性队列研究)的74 501名青年的身高、体重和BMI数据缺失,其中31%的BMI数据缺失。研究人员检查了饮食、运动、学业、心理健康和物质使用等变量与身高、体重和体重指数缺失的关系。结果:CART模型表明,年轻、自我感觉超重、身体活动较少和心理健康状况较差的组合,导致女性和男性亚组极有可能失去BMI值。不认为自己超重且年龄较大的受访者不太可能错过BMI值。结论:CART模型确定的亚组表明,删除BMI缺失病例的样本将偏向于身体、情感和心理更健康的年轻人。由于CART模型能够识别这些子组和不同重要性的层次结构,因此它们是检查缺失数据模式和适当处理缺失数据的宝贵工具。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
4.10
自引率
0.00%
发文量
65
审稿时长
40 weeks
期刊介绍: Health Promotion and Chronic Disease Prevention in Canada: Research, Policy and Practice (the HPCDP Journal) is the monthly, online scientific journal of the Health Promotion and Chronic Disease Prevention Branch of the Public Health Agency of Canada. The journal publishes articles on disease prevention, health promotion and health equity in the areas of chronic diseases, injuries and life course health. Content includes research from fields such as public/community health, epidemiology, biostatistics, the behavioural and social sciences, and health services or economics.
期刊最新文献
Characteristics of Canadians who use vaping products, by smoking status: findings from the Canadian Community Health Survey, 2020. Perceptions of overdose response hotlines and applications among rural and remote individuals who use drugs in Canada: a qualitative study. Temporal trends and characteristics of fall-related deaths, hospitalizations and emergency department visits among older adults in Canada. The distribution of hunger in Canadian youth. Assessing the impact of the COVID-19 pandemic on the mental health-related hospitalization rate of youth in Canada: an interrupted time series analysis.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1