{"title":"Effects of sample size on pedestrian crash risk estimation from traffic conflicts using extreme value models","authors":"Faizan Nazir , Yasir Ali , Md Mazharul Haque","doi":"10.1016/j.amar.2024.100353","DOIUrl":null,"url":null,"abstract":"<div><div>Sample size plays a critical role in an Extreme Value Theory (EVT) model for estimating crash risks from traffic conflicts. Many studies have raised concerns regarding sample size and its consequent negative impact on the performance of EVT models. However, the effects of sample size on EVT models are not well-known, requiring an extensive investigation and a deeper understanding of the effects of sample size on model performance. Motivated by this research gap, this study proposes a systematic approach to examine the effects of sample size on EVT models aimed at estimating pedestrian crash risks from traffic conflicts. Ten smaller and homogeneous samples of traffic conflicts are derived from a total of 144 h of video data collected from three signalised intersections in Brisbane, Australia, whereby vehicle–pedestrian conflicts are measured by post encroachment time. To ensure that each subset contains equal data from three intersections, samples are formed using a uniform distribution, and their effects are tested using non-stationary Block Maxima and Peak Over Threshold models estimated in the Bayesian framework. Results show that the sample size influences the prediction of mean crash frequencies, confidence intervals, and relative errors. Although the effect of sample size is non-uniform, the model performance appears to improve with the increase in sample size, whereby the block maxima models show higher sensitivity towards sample size variation, and the peak over threshold models reveal relatively stable and better performance. Moreover, a comparison of sample size thresholds indicates that the peak over threshold approach is more cost-efficient than its counterpart. Overall, the findings of this study demonstrate that improper sample size can lead to poor predictability, low reliability, and large uncertainties.</div></div>","PeriodicalId":47520,"journal":{"name":"Analytic Methods in Accident Research","volume":"44 ","pages":"Article 100353"},"PeriodicalIF":12.5000,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Analytic Methods in Accident Research","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S221366572400037X","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH","Score":null,"Total":0}
引用次数: 0
Abstract
Sample size plays a critical role in an Extreme Value Theory (EVT) model for estimating crash risks from traffic conflicts. Many studies have raised concerns regarding sample size and its consequent negative impact on the performance of EVT models. However, the effects of sample size on EVT models are not well-known, requiring an extensive investigation and a deeper understanding of the effects of sample size on model performance. Motivated by this research gap, this study proposes a systematic approach to examine the effects of sample size on EVT models aimed at estimating pedestrian crash risks from traffic conflicts. Ten smaller and homogeneous samples of traffic conflicts are derived from a total of 144 h of video data collected from three signalised intersections in Brisbane, Australia, whereby vehicle–pedestrian conflicts are measured by post encroachment time. To ensure that each subset contains equal data from three intersections, samples are formed using a uniform distribution, and their effects are tested using non-stationary Block Maxima and Peak Over Threshold models estimated in the Bayesian framework. Results show that the sample size influences the prediction of mean crash frequencies, confidence intervals, and relative errors. Although the effect of sample size is non-uniform, the model performance appears to improve with the increase in sample size, whereby the block maxima models show higher sensitivity towards sample size variation, and the peak over threshold models reveal relatively stable and better performance. Moreover, a comparison of sample size thresholds indicates that the peak over threshold approach is more cost-efficient than its counterpart. Overall, the findings of this study demonstrate that improper sample size can lead to poor predictability, low reliability, and large uncertainties.
期刊介绍:
Analytic Methods in Accident Research is a journal that publishes articles related to the development and application of advanced statistical and econometric methods in studying vehicle crashes and other accidents. The journal aims to demonstrate how these innovative approaches can provide new insights into the factors influencing the occurrence and severity of accidents, thereby offering guidance for implementing appropriate preventive measures. While the journal primarily focuses on the analytic approach, it also accepts articles covering various aspects of transportation safety (such as road, pedestrian, air, rail, and water safety), construction safety, and other areas where human behavior, machine failures, or system failures lead to property damage or bodily harm.