Effect of within-sample choice distribution and sample size on the estimation accuracy of logit model

Minhui Zeng, M. Zhong, J. Hunt
2015 International Conference on Transportation Information and Safety (ICTIS), published 2015-06-25
DOI: 10.1109/ICTIS.2015.7232160 (https://doi.org/10.1109/ICTIS.2015.7232160)
Citations: 0

Abstract

Within-sample choice distribution and sample size are important considerations when estimating logit models, but their effects on estimation accuracy have not been systematically studied. This paper examines both empirically using a set of simulated choice datasets. The utility function coefficients and alternative-specific constants (ASCs) are specified a priori. Then, assuming that the alternative attributes and error components follow a normal distribution, synthetic revealed preference (RP) and stated preference (SP) choice datasets are simulated. From these datasets, the utility coefficients and ASCs are re-estimated and compared with the originally specified values. The utility coefficients can be re-estimated with reasonable accuracy, but the ASC estimates show much larger errors. The sums of squared errors (SSEs) between the original and estimated utility coefficients and ASCs are calculated and plotted for RP and SP datasets of varying sample size, and the corresponding points of diminishing marginal return are identified. Regarding within-sample choice distribution, the results show that the hit ratio decreases as the distribution becomes more balanced. It appears that, when alternatives are chosen with similar frequency, choosing one alternative over another makes little difference in the utility perceived by each decision-maker. It is therefore suggested that future studies create and use a population with varying socioeconomic characteristics.
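The simulate-then-re-estimate procedure described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. The coefficient and ASC values, the three-alternative setup, the sample size, and all function names are hypothetical. Two assumptions are made for simplicity: the error terms are drawn from a Gumbel distribution (swapped in for the normal errors mentioned in the abstract, so that the logit model is correctly specified and the estimates are directly comparable with the true values), and the model is fitted by plain gradient ascent on the log-likelihood rather than by a dedicated estimation package. The hit ratio is taken here as the share of observations whose chosen alternative has the highest predicted probability.

```python
import math
import random

def simulate(n, beta, ascs, rng):
    """Simulate n choices; returns a list of (attributes, chosen_index)."""
    data = []
    for _ in range(n):
        # One row of normally distributed attributes per alternative.
        x = [[rng.gauss(0.0, 1.0) for _ in beta] for _ in ascs]
        utils = []
        for asc, row in zip(ascs, x):
            # Gumbel error (assumption: swapped in for the normal error).
            err = -math.log(-math.log(max(rng.random(), 1e-300)))
            utils.append(asc + sum(b * v for b, v in zip(beta, row)) + err)
        # The decision-maker picks the alternative with the highest utility.
        data.append((x, utils.index(max(utils))))
    return data

def probabilities(x, theta, k):
    """Logit probabilities; theta = k slopes, then ASCs of alternatives 1..J-1."""
    v = []
    for j, row in enumerate(x):
        asc = theta[k + j - 1] if j > 0 else 0.0  # first ASC fixed at zero
        v.append(asc + sum(theta[i] * row[i] for i in range(k)))
    m = max(v)  # subtract the maximum for numerical stability
    e = [math.exp(vj - m) for vj in v]
    s = sum(e)
    return [ej / s for ej in e]

def estimate(data, k, n_alts, step=0.5, iters=300):
    """Maximise the mean log-likelihood by plain gradient ascent."""
    theta = [0.0] * (k + n_alts - 1)
    for _ in range(iters):
        grad = [0.0] * len(theta)
        for x, y in data:
            p = probabilities(x, theta, k)
            for j, row in enumerate(x):
                r = (1.0 if j == y else 0.0) - p[j]  # observed minus predicted
                for i in range(k):
                    grad[i] += r * row[i]
                if j > 0:
                    grad[k + j - 1] += r
        theta = [t + step * g / len(data) for t, g in zip(theta, grad)]
    return theta

def hit_ratio(data, theta, k):
    """Share of observations whose choice has the highest predicted probability."""
    hits = 0
    for x, y in data:
        p = probabilities(x, theta, k)
        hits += p.index(max(p)) == y
    return hits / len(data)

rng = random.Random(0)
true_beta = [-1.0, 0.5]        # hypothetical "original" coefficients
true_ascs = [0.0, 0.5, -0.5]   # hypothetical ASCs; the first is the reference
data = simulate(1000, true_beta, true_ascs, rng)
theta_hat = estimate(data, k=2, n_alts=3)
truth = true_beta + true_ascs[1:]
sse = sum((a - b) ** 2 for a, b in zip(theta_hat, truth))
hit = hit_ratio(data, theta_hat, k=2)
print("estimates:", [round(t, 3) for t in theta_hat])
print("SSE:", round(sse, 4), "hit ratio:", round(hit, 3))
```

Repeating the run over a range of sample sizes and plotting the SSE against sample size gives the kind of curve from which a point of diminishing marginal return can be read. Changing the ASCs shifts the within-sample choice shares, which allows the hit ratio to be compared between balanced and unbalanced samples.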