A demonstration of modeling count data with an application to physical activity.

Donald J Slymen, Guadalupe X Ayala, Elva M Arredondo, John P Elder
{"title":"A demonstration of modeling count data with an application to physical activity.","authors":"Donald J Slymen,&nbsp;Guadalupe X Ayala,&nbsp;Elva M Arredondo,&nbsp;John P Elder","doi":"10.1186/1742-5573-3-3","DOIUrl":null,"url":null,"abstract":"<p><p>Counting outcomes such as days of physical activity or servings of fruits and vegetables often have distributions that are highly skewed toward the right with a preponderance of zeros, posing analytical challenges. This paper demonstrates how such outcomes may be analyzed with several modifications to Poisson regression. Five regression models 1) Poisson, 2) overdispersed Poisson, 3) negative binomial, 4) zero-inflated Poisson (ZIP), and 5) zero-inflated negative binomial (ZINB) are fitted to data assessing predictors of vigorous physical activity (VPA) among Latina women. The models are described, and analytical and graphical approaches are discussed to aid in model selection. Poisson regression provided a poor fit where 82% of the subjects reported no days of VPA. The fit improved considerably with the negative binomial and ZIP models. There was little difference in fit between the ZIP and ZINB models. Overall, the ZIP model fit best. No days of VPA were associated with poorer self-reported health and less assimilation to Anglo culture, and marginally associated with increasing BMI. The intensity portion of the model suggested that increasing days of VPA were associated with more education, and marginally associated with increasing age. These underutilized models provide useful approaches for handling counting outcomes.</p>","PeriodicalId":87082,"journal":{"name":"Epidemiologic perspectives & innovations : EP+I","volume":"3 ","pages":"3"},"PeriodicalIF":0.0000,"publicationDate":"2006-03-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/1742-5573-3-3","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Epidemiologic perspectives & innovations : EP+I","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1186/1742-5573-3-3","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Counting outcomes such as days of physical activity or servings of fruits and vegetables often have distributions that are highly skewed toward the right with a preponderance of zeros, posing analytical challenges. This paper demonstrates how such outcomes may be analyzed with several modifications to Poisson regression. Five regression models 1) Poisson, 2) overdispersed Poisson, 3) negative binomial, 4) zero-inflated Poisson (ZIP), and 5) zero-inflated negative binomial (ZINB) are fitted to data assessing predictors of vigorous physical activity (VPA) among Latina women. The models are described, and analytical and graphical approaches are discussed to aid in model selection. Poisson regression provided a poor fit where 82% of the subjects reported no days of VPA. The fit improved considerably with the negative binomial and ZIP models. There was little difference in fit between the ZIP and ZINB models. Overall, the ZIP model fit best. No days of VPA were associated with poorer self-reported health and less assimilation to Anglo culture, and marginally associated with increasing BMI. The intensity portion of the model suggested that increasing days of VPA were associated with more education, and marginally associated with increasing age. These underutilized models provide useful approaches for handling counting outcomes.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
演示如何用一个应用程序对计数数据进行建模。
计算结果,如体力活动天数或水果和蔬菜的份数,其分布往往高度向右倾斜,零占多数,这给分析带来了挑战。本文演示了如何用泊松回归的几种修正来分析这些结果。本文拟合了1)泊松、2)过分散泊松、3)负二项、4)零膨胀泊松(ZIP)和5)零膨胀负二项(ZINB) 5种回归模型来评估拉丁裔女性剧烈运动(VPA)的预测因子。对模型进行了描述,并讨论了有助于模型选择的分析方法和图解方法。泊松回归提供了一个很差的拟合,82%的受试者报告没有VPA的天数。负二项模型和ZIP模型的拟合得到了显著改善。ZIP和ZINB模型之间的拟合差异不大。总的来说,ZIP模型最适合。没有VPA的日子与自我报告的健康状况较差和对盎格鲁文化的同化程度较低有关,并与BMI增加轻微相关。模型的强度部分表明,VPA天数的增加与受教育程度的增加有关,与年龄的增加略有相关。这些未充分利用的模型为处理计数结果提供了有用的方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Extending the sufficient component cause model to describe the Stable Unit Treatment Value Assumption (SUTVA). Use of the integrated health interview series: trends in medical provider utilization (1972-2008). Social network analysis and agent-based modeling in social epidemiology. The use of complete-case and multiple imputation-based analyses in molecular epidemiology studies that assess interaction effects. Attributing the burden of cancer at work: three areas of concern when examining the example of shift-work.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1