Efficiency of propensity score adjustment and calibration on the estimation from non-probabilistic online surveys

IF 0.7 4区 数学 Q4 OPERATIONS RESEARCH & MANAGEMENT SCIENCE Sort-Statistics and Operations Research Transactions Pub Date : 2018-12-21 DOI:10.2436/20.8080.02.73
R. Ferri-García, M. Rueda
{"title":"Efficiency of propensity score adjustment and calibration on the estimation from non-probabilistic online surveys","authors":"R. Ferri-García, M. Rueda","doi":"10.2436/20.8080.02.73","DOIUrl":null,"url":null,"abstract":"One of the main sources of inaccuracy in modern survey techniques, such as online and smartphone surveys, is the absence of an adequate sampling frame that could provide a probabilistic sampling. This kind of data collection leads to the presence of high amounts of bias in final estimates of the survey, specially if the estimated variables (also known as target variables) have some influence on the decision of the respondent to participate in the survey. Various correction techniques, such as calibration and propensity score adjustment or PSA, can be applied to remove the bias. This study attempts to analyse the efficiency of correction techniques in multiple situations, applying a combination of propensity score adjustment and calibration on both types of variables (correlated and not correlated with the missing data mechanism) and testing the use of a reference survey to get the population totals for calibration variables. The study was performed using a simulation of a fictitious population of potential voters and a real volunteer survey aimed to a population for which a complete census was available. Results showed that PSA combined with calibration results in a bias removal considerably larger when compared with calibration with no prior adjustment. Results also showed that using population totals from the estimates of a reference survey instead of the available population data does not make a difference in estimates accuracy, although it can contribute to slightly increment the variance of the estimator.","PeriodicalId":49497,"journal":{"name":"Sort-Statistics and Operations Research Transactions","volume":"EM-29 1","pages":"0159-182"},"PeriodicalIF":0.7000,"publicationDate":"2018-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"13","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Sort-Statistics and Operations Research Transactions","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.2436/20.8080.02.73","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"OPERATIONS RESEARCH & MANAGEMENT SCIENCE","Score":null,"Total":0}
引用次数: 13

Abstract

One of the main sources of inaccuracy in modern survey techniques, such as online and smartphone surveys, is the absence of an adequate sampling frame that could provide a probabilistic sampling. This kind of data collection leads to the presence of high amounts of bias in final estimates of the survey, specially if the estimated variables (also known as target variables) have some influence on the decision of the respondent to participate in the survey. Various correction techniques, such as calibration and propensity score adjustment or PSA, can be applied to remove the bias. This study attempts to analyse the efficiency of correction techniques in multiple situations, applying a combination of propensity score adjustment and calibration on both types of variables (correlated and not correlated with the missing data mechanism) and testing the use of a reference survey to get the population totals for calibration variables. The study was performed using a simulation of a fictitious population of potential voters and a real volunteer survey aimed to a population for which a complete census was available. Results showed that PSA combined with calibration results in a bias removal considerably larger when compared with calibration with no prior adjustment. Results also showed that using population totals from the estimates of a reference survey instead of the available population data does not make a difference in estimates accuracy, although it can contribute to slightly increment the variance of the estimator.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
倾向分数调整和校正对非概率在线调查估计的效率
现代调查技术(如在线和智能手机调查)不准确的主要来源之一是缺乏可以提供概率抽样的适当抽样框架。这种数据收集导致在调查的最终估计中存在大量偏差,特别是如果估计的变量(也称为目标变量)对受访者参与调查的决定有一定影响。各种校正技术,如校准和倾向评分调整或PSA,可以应用于消除偏差。本研究试图在多种情况下分析校正技术的效率,对两种类型的变量(与缺失数据机制相关和不相关)应用倾向得分调整和校准的组合,并测试使用参考调查来获得校准变量的总体总数。这项研究是通过模拟一个虚构的潜在选民群体和一个真实的志愿者调查来进行的,该调查针对的是一个完整的人口普查。结果表明,与没有事先调整的校准相比,PSA结合校准的偏差去除效果要大得多。结果还表明,使用参考调查估计的人口总数而不是现有的人口数据并不会对估计的准确性产生影响,尽管它可以略微增加估计器的方差。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Sort-Statistics and Operations Research Transactions
Sort-Statistics and Operations Research Transactions 管理科学-统计学与概率论
CiteScore
3.10
自引率
0.00%
发文量
0
审稿时长
>12 weeks
期刊介绍: SORT (Statistics and Operations Research Transactions) —formerly Qüestiió— is an international journal launched in 2003. It is published twice-yearly, in English, by the Statistical Institute of Catalonia (Idescat). The journal is co-edited by the Universitat Politècnica de Catalunya, Universitat de Barcelona, Universitat Autonòma de Barcelona, Universitat de Girona, Universitat Pompeu Fabra i Universitat de Lleida, with the co-operation of the Spanish Section of the International Biometric Society and the Catalan Statistical Society. SORT promotes the publication of original articles of a methodological or applied nature or motivated by an applied problem in statistics, operations research, official statistics or biometrics as well as book reviews. We encourage authors to include an example of a real data set in their manuscripts.
期刊最新文献
Green hybrid fleets using electric vehicles: solving the heterogeneous vehicle routing problem with multiple driving ranges and loading capacities Integer constraints for enhancing interpretability in linear regression On interpretations of tests and effect sizes in regression models with a compositional predictor Modelling count data using the logratio-normal-multinomial distribution Bayesian structured antedependence model proposals for longitudinal data
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1