Modeling Zero-inflated Count Data Using Generalized Poisson and Ordinal Logistic Regression Models in Medical Research.

Q2 Medicine Oman Medical Journal Pub Date : 2024-01-31 eCollection Date: 2024-01-01 DOI:10.5001/omj.2024.41
Bijesh Yadav, Lakshmanan Jeyaseelan, Marimuthu Sappani, Thenmozhi Mani, Sebastian George, Shrikant I Bangdiwala
{"title":"Modeling Zero-inflated Count Data Using Generalized Poisson and Ordinal Logistic Regression Models in Medical Research.","authors":"Bijesh Yadav, Lakshmanan Jeyaseelan, Marimuthu Sappani, Thenmozhi Mani, Sebastian George, Shrikant I Bangdiwala","doi":"10.5001/omj.2024.41","DOIUrl":null,"url":null,"abstract":"<p><strong>Objectives: </strong>In medical research, the study's design and statistical methods are pivotal, as they guide interpretation and conclusion. Selecting appropriate statistical models hinges on the distribution of the outcome measure. Count data, frequently used in medical research, often exhibit over-dispersion or zero inflation. Occasionally, count data are considered ordinal (with a maximum outcome value of 5), and this calls for the application of ordinal regression models. Various models exist for analyzing over-dispersed data such as negative binomial, generalized Poisson (GP), and ordinal regression model. This study aims to examine whether the GP model is a superior alternative to the ordinal logistic regression (OLR) model, specifically in the context of zero-inflated Poisson models using both simulated and real-time data.</p><p><strong>Methods: </strong>Simulated data were generated with varied estimates of regression coefficients, sample sizes, and various proportions of zeros. The GP and OLR models were compared using fit statistics. Additionally, comparisons were made using real-time datasets.</p><p><strong>Results: </strong>The simulated results consistently revealed lower bias and mean squared error values in the GP model compared to the OLR model. The same trend was observed in real-time datasets, with the GP model consistently demonstrating lower standard errors. Except when the sample size was 1000 and the proportions of zeros were 30% and 40%, the Bayesian information criterion consistently favored the GP model over the OLR model.</p><p><strong>Conclusions: </strong>This study establishes that the proposed GP model offers a more advantageous alternative to the OLR model. Moreover, the GP model facilitates easier modeling and interpretation when compared to the OLR model.</p>","PeriodicalId":19667,"journal":{"name":"Oman Medical Journal","volume":"1 1","pages":"e586"},"PeriodicalIF":0.0000,"publicationDate":"2024-01-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11056604/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Oman Medical Journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5001/omj.2024.41","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 0

Abstract

Objectives: In medical research, the study's design and statistical methods are pivotal, as they guide interpretation and conclusion. Selecting appropriate statistical models hinges on the distribution of the outcome measure. Count data, frequently used in medical research, often exhibit over-dispersion or zero inflation. Occasionally, count data are considered ordinal (with a maximum outcome value of 5), and this calls for the application of ordinal regression models. Various models exist for analyzing over-dispersed data such as negative binomial, generalized Poisson (GP), and ordinal regression model. This study aims to examine whether the GP model is a superior alternative to the ordinal logistic regression (OLR) model, specifically in the context of zero-inflated Poisson models using both simulated and real-time data.

Methods: Simulated data were generated with varied estimates of regression coefficients, sample sizes, and various proportions of zeros. The GP and OLR models were compared using fit statistics. Additionally, comparisons were made using real-time datasets.

Results: The simulated results consistently revealed lower bias and mean squared error values in the GP model compared to the OLR model. The same trend was observed in real-time datasets, with the GP model consistently demonstrating lower standard errors. Except when the sample size was 1000 and the proportions of zeros were 30% and 40%, the Bayesian information criterion consistently favored the GP model over the OLR model.

Conclusions: This study establishes that the proposed GP model offers a more advantageous alternative to the OLR model. Moreover, the GP model facilitates easier modeling and interpretation when compared to the OLR model.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
在医学研究中使用广义泊松和序数逻辑回归模型建立零膨胀计数数据模型。
目的:在医学研究中,研究的设计和统计方法至关重要,因为它们指导着解释和结论。选择合适的统计模型取决于结果测量的分布情况。医学研究中经常使用的计数数据往往表现出过度分散或零膨胀。有时,计数数据被认为是序数数据(最大结果值为 5),这就需要应用序数回归模型。目前有多种用于分析过度分散数据的模型,如负二项、广义泊松(GP)和序数回归模型。本研究旨在利用模拟数据和实时数据,考察 GP 模型是否是一种优于序数逻辑回归(OLR)模型的替代模型,特别是在零膨胀泊松模型的情况下:方法:使用不同的回归系数估计值、样本大小和不同比例的零生成模拟数据。使用拟合统计量对 GP 模型和 OLR 模型进行比较。此外,还使用实时数据集进行了比较:模拟结果一致显示,GP 模型的偏差和均方误差值低于 OLR 模型。在实时数据集中也观察到了同样的趋势,GP 模式的标准误差一直较低。除了当样本量为 1000 个且零的比例为 30% 和 40% 时,贝叶斯信息标准一直倾向于 GP 模型而非 OLR 模型:本研究证实,拟议的 GP 模型比 OLR 模型更具优势。此外,与 OLR 模型相比,GP 模型更易于建模和解释。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Oman Medical Journal
Oman Medical Journal Medicine-Medicine (all)
CiteScore
3.10
自引率
0.00%
发文量
119
审稿时长
12 weeks
期刊最新文献
Central Diabetes Insipidus in Acute Myeloid Leukemia with Cytogenetic Abnormality of 9q34 Deletion. Bullous Pyoderma Gangrenosum Associated with Antiphospholipid Syndrome: A Case Report and Literature Review. Polyangiitis Overlap Syndrome: Polyarteritis Nodosa with Leukocytoclastic Vasculitis Associated with Left Ventricular Thrombus and Vocal Cord Palsy: A Case Report. Assessing Healthcare Providers' Preparedness for Managing COVID-19 Patients: A Mixed-methods Study. COVID-19 Pandemic and its Impact on Vaccine Hesitancy: A Review.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1