Grouped Random Parameters Negative Binomial-Lindley for accounting unobserved heterogeneity in crash data with preponderant zero observations

IF 12.5 1区 工程技术 Q1 PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH Analytic Methods in Accident Research Pub Date : 2023-03-01 DOI:10.1016/j.amar.2022.100255
A.S.M. Mohaiminul Islam , Mohammadali Shirazi , Dominique Lord
{"title":"Grouped Random Parameters Negative Binomial-Lindley for accounting unobserved heterogeneity in crash data with preponderant zero observations","authors":"A.S.M. Mohaiminul Islam ,&nbsp;Mohammadali Shirazi ,&nbsp;Dominique Lord","doi":"10.1016/j.amar.2022.100255","DOIUrl":null,"url":null,"abstract":"<div><p>Developing robust and reliable statistical models to estimate, analyze, and understand crash data is a key element in various highway safety evaluation tasks. Crash data have characteristics not found in other data, including but not limited to the excess number of zero responses. The Negative Binomial-Lindley (NB-L) model has been proposed as a method to analyze data with many zero observations. In addition, the differences in various temporal and spatial factors result in variations of model coefficients among different groups of observations. A grouped random parameters model is a strategy to account for such unobserved heterogeneity. In this paper, we proposed the derivations and applications of the grouped random parameters negative binomial-Lindley model (G-RPNB-L) to account for the unobserved heterogeneity in crash data with many zero observations. We first illustrated our proposed model by designing a simulation study. The simulation study showed the ability of the proposed model to correctly estimate the coefficients. Then, we used an empirical dataset in Maine to show the application of the proposed model. We showed that the impact of weather variables denoting “Days with precipitation greater than 1.0 in.”, and “Days with temperature less than 32°F” varies across Maine counties. We also compared the proposed model with the NB, NB-L, and grouped random-parameters NB (G-RPNB) models using different goodness-of-fit metrics. The proposed G-RPNB-L model showed a superior fit compared to the other models.</p></div>","PeriodicalId":47520,"journal":{"name":"Analytic Methods in Accident Research","volume":"37 ","pages":"Article 100255"},"PeriodicalIF":12.5000,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Analytic Methods in Accident Research","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2213665722000446","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH","Score":null,"Total":0}
引用次数: 5

Abstract

Developing robust and reliable statistical models to estimate, analyze, and understand crash data is a key element in various highway safety evaluation tasks. Crash data have characteristics not found in other data, including but not limited to the excess number of zero responses. The Negative Binomial-Lindley (NB-L) model has been proposed as a method to analyze data with many zero observations. In addition, the differences in various temporal and spatial factors result in variations of model coefficients among different groups of observations. A grouped random parameters model is a strategy to account for such unobserved heterogeneity. In this paper, we proposed the derivations and applications of the grouped random parameters negative binomial-Lindley model (G-RPNB-L) to account for the unobserved heterogeneity in crash data with many zero observations. We first illustrated our proposed model by designing a simulation study. The simulation study showed the ability of the proposed model to correctly estimate the coefficients. Then, we used an empirical dataset in Maine to show the application of the proposed model. We showed that the impact of weather variables denoting “Days with precipitation greater than 1.0 in.”, and “Days with temperature less than 32°F” varies across Maine counties. We also compared the proposed model with the NB, NB-L, and grouped random-parameters NB (G-RPNB) models using different goodness-of-fit metrics. The proposed G-RPNB-L model showed a superior fit compared to the other models.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
分组随机参数负二项Lindley用于解释具有优势零观测的碰撞数据中未观测到的异质性
开发稳健可靠的统计模型来估计、分析和理解碰撞数据是各种公路安全评估任务的关键要素。崩溃数据具有其他数据中没有的特征,包括但不限于零响应的过量数量。负二项林德利(NB-L)模型是一种分析具有多个零观测值的数据的方法。此外,各种时空因子的差异导致不同观测组间模式系数的变化。分组随机参数模型是解释这种未观察到的异质性的一种策略。在本文中,我们提出了分组随机参数负二项林德利模型(G-RPNB-L)的推导和应用,以解释具有许多零观测值的碰撞数据中未观测到的异质性。我们首先通过设计一个模拟研究来说明我们提出的模型。仿真研究表明,所提出的模型能够正确估计系数。然后,我们使用缅因州的经验数据集来展示所提出模型的应用。我们表明,天气变量表示“降水大于1.0英寸的天数”的影响。和“气温低于32华氏度的日子”在缅因州的各个县有所不同。我们还使用不同的拟合优度指标将所提出的模型与NB、NB- l和分组随机参数NB (G-RPNB)模型进行了比较。与其他模型相比,所提出的G-RPNB-L模型具有更好的拟合效果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
22.10
自引率
34.10%
发文量
35
审稿时长
24 days
期刊介绍: Analytic Methods in Accident Research is a journal that publishes articles related to the development and application of advanced statistical and econometric methods in studying vehicle crashes and other accidents. The journal aims to demonstrate how these innovative approaches can provide new insights into the factors influencing the occurrence and severity of accidents, thereby offering guidance for implementing appropriate preventive measures. While the journal primarily focuses on the analytic approach, it also accepts articles covering various aspects of transportation safety (such as road, pedestrian, air, rail, and water safety), construction safety, and other areas where human behavior, machine failures, or system failures lead to property damage or bodily harm.
期刊最新文献
Econometric approaches to examine the onset and duration of temporal variations in pedestrian and bicyclist injury severity analysis Determinants influencing alcohol-related two-vehicle crash severity: A multivariate Bayesian hierarchical random parameters correlated outcomes logit model Effects of sample size on pedestrian crash risk estimation from traffic conflicts using extreme value models Editorial Board A cross-comparison of different extreme value modeling techniques for traffic conflict-based crash risk estimation
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1