Journal of Survey Statistics and Methodology最新文献

Real World Data Versus Probability Surveys for Estimating Health Conditions at the State Level. 真实世界数据与估计州一级健康状况的概率调查。

IF 1.6 4区数学 Q2 SOCIAL SCIENCES, MATHEMATICAL METHODS

Journal of Survey Statistics and Methodology

Pub Date : 2024-11-01 DOI: 10.1093/jssam/smae036

David A Marker, Charity Hilton, Jacob Zelko, Jon Duke, Deborah Rolka, Rachel Kaufmann, Richard Boyd

Government statistical offices worldwide are under pressure to produce statistics rapidly and for more detailed geographies, to compete with unofficial estimates available from web-based big data sources or from private companies. Commonly suggested sources of improved health information are electronic health records (EHRs) and medical claims data. These data sources are collectively known as real world data (RWD) because they are generated from routine health care processes, and they are available for millions of patients. It is clear that RWD can provide estimates that are more timely and less expensive to produce- but a key question is whether or not they are very accurate. To test this, we took advantage of a unique health data source that includes a full range of sociodemographic variables and compare estimates using all of those potential weighting variables, versus estimates derived when only age and sex are available for weighting (as is common with most RWD sources). We show that not accounting for other variables can produce misleading, and quite inaccurate, health estimates.

世界各地的政府统计部门都面临着压力，需要迅速编制更详细的地区统计数据，以与基于网络的大数据源或私营公司提供的非官方估计数据竞争。通常建议改进健康信息的来源是电子健康记录（EHRs）和医疗索赔数据。这些数据源统称为真实世界数据（RWD），因为它们是由常规医疗保健流程生成的，可供数百万患者使用。很明显，RWD可以提供更及时、成本更低的估算——但一个关键问题是它们是否非常准确。为了验证这一点，我们利用了一个独特的健康数据源，其中包括各种社会人口变量，并将使用所有这些潜在加权变量的估计值与仅使用年龄和性别进行加权的估计值进行比较（大多数RWD来源都是如此）。我们表明，不考虑其他变量可能会产生误导性的、相当不准确的健康估计。

{"title":"Real World Data Versus Probability Surveys for Estimating Health Conditions at the State Level.","authors":"David A Marker, Charity Hilton, Jacob Zelko, Jon Duke, Deborah Rolka, Rachel Kaufmann, Richard Boyd","doi":"10.1093/jssam/smae036","DOIUrl":"https://doi.org/10.1093/jssam/smae036","url":null,"abstract":"Government statistical offices worldwide are under pressure to produce statistics rapidly and for more detailed geographies, to compete with unofficial estimates available from web-based big data sources or from private companies. Commonly suggested sources of improved health information are electronic health records (EHRs) and medical claims data. These data sources are collectively known as real world data (RWD) because they are generated from routine health care processes, and they are available for millions of patients. It is clear that RWD can provide estimates that are more timely and less expensive to produce- but a key question is whether or not they are very accurate. To test this, we took advantage of a unique health data source that includes a full range of sociodemographic variables and compare estimates using all of those potential weighting variables, versus estimates derived when only age and sex are available for weighting (as is common with most RWD sources). We show that not accounting for other variables can produce misleading, and quite inaccurate, health estimates.","PeriodicalId":17146,"journal":{"name":"Journal of Survey Statistics and Methodology","volume":"12 5","pages":"1515-1530"},"PeriodicalIF":1.6,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11708384/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142950570","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Analyzing Potential Non-Ignorable Selection Bias in an Off-Wave Mail Survey Implemented in a Long-Standing Panel Study.

IF 1.6 4区数学 Q2 SOCIAL SCIENCES, MATHEMATICAL METHODS

Journal of Survey Statistics and Methodology

Pub Date : 2024-10-23 eCollection Date: 2025-02-01 DOI: 10.1093/jssam/smae039

Heather M Schroeder, Brady T West

Typical design-based methods for weighting probability samples rely on several assumptions, including the random selection of sampled units according to known probabilities of selection and ignorable unit nonresponse. If any of these assumptions are not met, weighting methods that account for the probabilities of selection, nonresponse, and calibration may not fully account for the potential selection bias in a given sample, which could produce misleading population estimates. This analysis investigates possible selection bias in the 2019 Health Survey Mailer (HSM), a sub-study of the longitudinal Health and Retirement Study (HRS). The primary HRS data collection has occurred in "even" years since 1992, but additional survey data collections take place in the "off-wave" odd years via mailed invitations sent to selected participants. While the HSM achieved a high response rate (83 percent), the assumption of ignorable probability-based selection of HRS panel members may not hold due to the eligibility criteria that were imposed. To investigate this possible non-ignorable selection bias, our analysis utilizes a novel analysis method for estimating measures of unadjusted bias for proportions (MUBP), introduced by Andridge et al. in 2019. This method incorporates aggregate information from the larger HRS target population, including means, variances, and covariances for key covariates related to the HSM variables, to inform estimates of proportions. We explore potential non-ignorable selection bias by comparing proportions calculated from the HSM under three conditions: ignoring HRS weights, weighting based on the usual design-based approach for HRS "off-wave" mail surveys, and using the MUBP adjustment. We find examples of differences between the weighted and MUBP-adjusted estimates in four out of ten outcomes we analyzed. However, these differences are modest, and while this result gives some evidence of non-ignorable selection bias, typical design-based weighting methods are sufficient for correcting for it and their use is appropriate in this case.

{"title":"Analyzing Potential Non-Ignorable Selection Bias in an Off-Wave Mail Survey Implemented in a Long-Standing Panel Study.","authors":"Heather M Schroeder, Brady T West","doi":"10.1093/jssam/smae039","DOIUrl":"10.1093/jssam/smae039","url":null,"abstract":"Typical design-based methods for weighting probability samples rely on several assumptions, including the random selection of sampled units according to known probabilities of selection and ignorable unit nonresponse. If any of these assumptions are not met, weighting methods that account for the probabilities of selection, nonresponse, and calibration may not fully account for the potential selection bias in a given sample, which could produce misleading population estimates. This analysis investigates possible selection bias in the 2019 Health Survey Mailer (HSM), a sub-study of the longitudinal Health and Retirement Study (HRS). The primary HRS data collection has occurred in \"even\" years since 1992, but additional survey data collections take place in the \"off-wave\" odd years via mailed invitations sent to selected participants. While the HSM achieved a high response rate (83 percent), the assumption of ignorable probability-based selection of HRS panel members may not hold due to the eligibility criteria that were imposed. To investigate this possible non-ignorable selection bias, our analysis utilizes a novel analysis method for estimating measures of unadjusted bias for proportions (MUBP), introduced by Andridge et al. in 2019. This method incorporates aggregate information from the larger HRS target population, including means, variances, and covariances for key covariates related to the HSM variables, to inform estimates of proportions. We explore potential non-ignorable selection bias by comparing proportions calculated from the HSM under three conditions: ignoring HRS weights, weighting based on the usual design-based approach for HRS \"off-wave\" mail surveys, and using the MUBP adjustment. We find examples of differences between the weighted and MUBP-adjusted estimates in four out of ten outcomes we analyzed. However, these differences are modest, and while this result gives some evidence of non-ignorable selection bias, typical design-based weighting methods are sufficient for correcting for it and their use is appropriate in this case.","PeriodicalId":17146,"journal":{"name":"Journal of Survey Statistics and Methodology","volume":"13 1","pages":"100-127"},"PeriodicalIF":1.6,"publicationDate":"2024-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11770253/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143059427","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Small Area Poverty Estimation under Heteroskedasticity 异方差下的小地区贫困估计

IF 2.1 4区数学 Q2 SOCIAL SCIENCES, MATHEMATICAL METHODS

Journal of Survey Statistics and Methodology

Pub Date : 2024-01-10 DOI: 10.1093/jssam/smad045

Sumonkanti Das Ray Chambers

Multilevel models with nested errors are widely used in poverty estimation. An important application in this context is estimating the distribution of poverty as defined by the distribution of income within a set of domains that cover the population of interest. Since unit-level values of income are usually heteroskedastic, the standard homoskedasticity assumptions implicit in popular multilevel models may not be appropriate and can lead to bias, particularly when used to estimate domain-specific income distributions. This article addresses this problem when the income values in the population of interest can be characterized by a two-level mixed linear model with independent and identically distributed domain effects and with independent but not identically distributed individual effects. Estimation of poverty indicators that are functionals of domain-level income distributions is also addressed, and a nonparametric bootstrap procedure is used to estimate mean squared errors and confidence intervals. The proposed methodology is compared with the well-known World Bank poverty mapping methodology for this situation, using model-based simulation experiments as well as an empirical study based on Bangladesh poverty data.

具有嵌套误差的多层次模型被广泛应用于贫困估算。在这种情况下，一个重要的应用就是估算贫困的分布情况，而贫困的分布情况是由覆盖相关人口的一系列领域内的收入分布情况来定义的。由于单位水平的收入值通常是异方差的，因此流行的多层次模型中隐含的标准同方差假设可能并不合适，并可能导致偏差，尤其是在用于估计特定领域的收入分布时。当相关人群的收入值可以用具有独立且同分布的领域效应和具有独立但非同分布的个体效应的两级混合线性模型来描述时，本文就可以解决这个问题。文章还讨论了作为领域级收入分布函数的贫困指标的估算问题，并使用了非参数自举程序来估算均方误差和置信区间。利用基于模型的模拟实验以及基于孟加拉国贫困数据的实证研究，将所提出的方法与著名的世界银行贫困绘图方法进行了比较。

{"title":"Small Area Poverty Estimation under Heteroskedasticity","authors":"Sumonkanti Das, Ray Chambers","doi":"10.1093/jssam/smad045","DOIUrl":"https://doi.org/10.1093/jssam/smad045","url":null,"abstract":"\u0000 Multilevel models with nested errors are widely used in poverty estimation. An important application in this context is estimating the distribution of poverty as defined by the distribution of income within a set of domains that cover the population of interest. Since unit-level values of income are usually heteroskedastic, the standard homoskedasticity assumptions implicit in popular multilevel models may not be appropriate and can lead to bias, particularly when used to estimate domain-specific income distributions. This article addresses this problem when the income values in the population of interest can be characterized by a two-level mixed linear model with independent and identically distributed domain effects and with independent but not identically distributed individual effects. Estimation of poverty indicators that are functionals of domain-level income distributions is also addressed, and a nonparametric bootstrap procedure is used to estimate mean squared errors and confidence intervals. The proposed methodology is compared with the well-known World Bank poverty mapping methodology for this situation, using model-based simulation experiments as well as an empirical study based on Bangladesh poverty data.","PeriodicalId":17146,"journal":{"name":"Journal of Survey Statistics and Methodology","volume":"50 9","pages":""},"PeriodicalIF":2.1,"publicationDate":"2024-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139441260","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Investigating Respondent Attention to Experimental Text Lengths 调查受访者对实验文本长度的关注度

IF 2.1 4区数学 Q2 SOCIAL SCIENCES, MATHEMATICAL METHODS

Journal of Survey Statistics and Methodology

Pub Date : 2024-01-04 DOI: 10.1093/jssam/smad044

Tobias Rettig, A. Blom

Whether respondents pay adequate attention to a questionnaire has long been of concern to survey researchers. In this study, we measure respondents’ attention with an instruction manipulation check. We investigate which respondents read question texts of experimentally varied lengths and which become inattentive in a probability-based online panel of the German population. We find that respondent attention is closely linked to text length. Individual response speed is strongly correlated with respondent attention, but a fixed cutoff time is unsuitable as a standalone attention indicator. Differing levels of attention are also associated with respondents’ age, gender, education, panel experience, and the device used to complete the survey. Removal of inattentive respondents is thus likely to result in a biased remaining sample. Instead, questions should be curtailed to encourage respondents of different backgrounds and abilities to read them attentively and provide optimized answers.

长期以来，调查研究人员一直关注受访者是否对问卷给予了足够的关注。在本研究中，我们通过指令操作检查来测量受访者的注意力。我们调查了在一个基于概率的德国人口在线小组中，哪些受访者会阅读实验性不同长度的问题文本，哪些受访者会变得注意力不集中。我们发现，受访者的注意力与文本长度密切相关。个人的反应速度与受访者的注意力密切相关，但固定的截止时间并不适合作为独立的注意力指标。不同的注意力水平还与受访者的年龄、性别、教育程度、小组经验以及完成调查所使用的设备有关。因此，剔除注意力不集中的受访者可能会导致剩余样本出现偏差。因此，应减少问题的数量，以鼓励不同背景和能力的受访者专心阅读问题并提供最佳答案。

引用次数: 0

A Catch-22—the Test–Retest Method of Reliability Estimation 自相矛盾--可靠性估计的测试-重测法

IF 2.1 4区数学 Q2 SOCIAL SCIENCES, MATHEMATICAL METHODS

Journal of Survey Statistics and Methodology

Pub Date : 2023-12-20 DOI: 10.1093/jssam/smad043

Paula A. Tufiș D. Alwin Daniel N Ramírez

This article addresses the problems with the traditional reinterview approach to estimating the reliability of survey measures. Using data from three reinterview (or panel) studies conducted by the General Social Survey, we investigate the differences between the two-wave correlational approach embodied by the traditional reinterview strategy, compared to estimates of reliability that take the stability of traits into account based on a three-wave model. Our results indicate that the problems identified with the two-wave correlational approach reflect a kind of “Catch-22” in the sense that the only solution to the problem is denied by the approach itself. Specifically, we show that the correctly specified two-wave model, which includes the potential for true change in the latent variable, is underidentified, and thus, unless one is willing to make some potentially risky assumptions, reliability parameters are not estimable. This article compares the two-wave correlational approach to an alternative model for estimating reliability, Heise’s estimates based on the three-wave simplex model. Using three waves of data from the GSS panels, which were separated by 2-year intervals between waves, this article examines the conditions under which the wave-1, wave-2 correlations which do not take stability into account approximate the reliability estimate obtained from three-wave simplex models that do take stability into account. The results lead to the conclusion that the differences between estimates depend on the stability and/or fixed nature of the underlying processes involved. Few if any differences are identified when traits are fixed or highly stable, but for traits involving changes in the underlying traits the differences can be quite large, and thus, we argue for the superiority of reinterview designs that involve more than 2 waves in the estimation of reliability parameters.

本文探讨了传统的重新访谈法在估算调查措施可靠性方面存在的问题。我们利用 "综合社会调查 "进行的三项再访谈（或小组）研究的数据，研究了传统再访谈策略所体现的两波相关法与基于三波模型考虑特质稳定性的可靠性估计法之间的差异。我们的研究结果表明，两波相关法发现的问题反映了一种 "Catch-22"，即解决问题的唯一方法被该方法本身所否定。具体来说，我们表明，正确指定的两波模型（包括潜在变量真实变化的可能性）识别不足，因此，除非人们愿意做出一些潜在的风险假设，否则可靠性参数是无法估计的。本文将两波相关法与另一种可靠性估计模型--海斯基于三波简单模型的估计法--进行了比较。本文使用了来自全球抽样调查面板的三波数据（波与波之间的间隔为两年），研究了在什么条件下，不考虑稳定性的第一波、第二波相关性与考虑稳定性的三波单纯模型所得到的可靠性估计值相近。结果得出的结论是，估计值之间的差异取决于所涉及的基本过程的稳定性和/或固定性。在特征固定或高度稳定的情况下，即使有差异也很小，但对于涉及基础特征变化的特征，差异可能相当大。

{"title":"A Catch-22—the Test–Retest Method of Reliability Estimation","authors":"Paula A. Tufiș, D. Alwin, Daniel N Ramírez","doi":"10.1093/jssam/smad043","DOIUrl":"https://doi.org/10.1093/jssam/smad043","url":null,"abstract":"\u0000 This article addresses the problems with the traditional reinterview approach to estimating the reliability of survey measures. Using data from three reinterview (or panel) studies conducted by the General Social Survey, we investigate the differences between the two-wave correlational approach embodied by the traditional reinterview strategy, compared to estimates of reliability that take the stability of traits into account based on a three-wave model. Our results indicate that the problems identified with the two-wave correlational approach reflect a kind of “Catch-22” in the sense that the only solution to the problem is denied by the approach itself. Specifically, we show that the correctly specified two-wave model, which includes the potential for true change in the latent variable, is underidentified, and thus, unless one is willing to make some potentially risky assumptions, reliability parameters are not estimable. This article compares the two-wave correlational approach to an alternative model for estimating reliability, Heise’s estimates based on the three-wave simplex model. Using three waves of data from the GSS panels, which were separated by 2-year intervals between waves, this article examines the conditions under which the wave-1, wave-2 correlations which do not take stability into account approximate the reliability estimate obtained from three-wave simplex models that do take stability into account. The results lead to the conclusion that the differences between estimates depend on the stability and/or fixed nature of the underlying processes involved. Few if any differences are identified when traits are fixed or highly stable, but for traits involving changes in the underlying traits the differences can be quite large, and thus, we argue for the superiority of reinterview designs that involve more than 2 waves in the estimation of reliability parameters.","PeriodicalId":17146,"journal":{"name":"Journal of Survey Statistics and Methodology","volume":"36 20","pages":""},"PeriodicalIF":2.1,"publicationDate":"2023-12-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138955719","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Poverty Mapping Under Area-Level Random Regression Coefficient Poisson Models 地区级随机回归系数泊松模型下的贫困绘图

IF 2.1 4区数学 Q2 SOCIAL SCIENCES, MATHEMATICAL METHODS

Journal of Survey Statistics and Methodology

Pub Date : 2023-11-29 DOI: 10.1093/jssam/smad036

Naomi Diz-Rosales M. Lombardía Domingo Morales

Under an area-level random regression coefficient Poisson model, this article derives small area predictors of counts and proportions and introduces bootstrap estimators of the mean squared errors (MSEs). The maximum likelihood estimators of the model parameters and the mode predictors of the random effects are calculated by a Laplace approximation algorithm. Simulation experiments are implemented to investigate the behavior of the fitting algorithm, the predictors, and the MSE estimators with and without bias correction. The new statistical methodology is applied to data from the Spanish Living Conditions Survey. The target is to estimate the proportions of women and men under the poverty line by province.

在区域级随机回归系数泊松模型下，本文推导了计数和比例的小区域预测因子，并引入了均方误差（MSE）的自引导估计值。模型参数的最大似然估计值和随机效应的模式预测值是通过拉普拉斯近似算法计算得出的。通过模拟实验，研究了拟合算法、预测器和有无偏差校正的 MSE 估计器的行为。新的统计方法适用于西班牙生活条件调查的数据。目标是估算各省处于贫困线以下的男女比例。

引用次数: 0

Peekaboo! The Effect of Different Visible Cash Display and Amount Options During Mail Contact When Recruiting to a Probability-Based Panel 躲躲猫!招聘到一个基于概率的小组时，不同的可见现金显示和金额选项在邮件联系期间的影响

4区数学 Q2 SOCIAL SCIENCES, MATHEMATICAL METHODS

Journal of Survey Statistics and Methodology

Pub Date : 2023-11-09 DOI: 10.1093/jssam/smad039

Ipek Bilgen, David Dutwin, Roopam Singh, Erlina Hendarwan

Abstract Recent studies consistently showed that making cash visible with a windowed envelope during mail contact increases response rates in surveys. The visible cash aims to pique interest and encourage sampled households to open the envelope. This article extends prior research by examining the effect of additional interventions implemented during mail recruitment to a survey panel on recruitment rates and costs. Specifically, we implemented randomized experiments to examine size (small, large) and location (none, front, back) of the window displaying cash, combined with what part of the cash is shown through the window envelope (numeric amount, face/image), and various prepaid incentive amounts (two $1, one $2, one $5). We used the recruitment effort for NORC’s AmeriSpeak Panel as the data source for this study. The probability-based AmeriSpeak Panel uses an address-based sample and multiple modes of respondent contact, including mail, phone, and in-person outreach during recruitment. Our results were consistent with prior research and showed significant improvement in recruitment rates when cash was displayed through a window during mail contact. We also found that placing the window on the front of the envelope, showing $5 through the envelope compared to $2 and $1, and showing the tender amount compared to the image on the cash through the window were more likely to improve the recruitment rates. Our cost analyses illustrated that the cost difference in printing window versus no window envelope is small. There is no difference in printing cost between front window and back window as they both require custom manufacturing. There is also no cost difference in printing envelopes with small windows versus large windows. Lastly, we found no evidence of mail theft based on our review of the United States Postal Service’s “track and trace” reports, seed mailings sent to staff, and undeliverable mailing rates.

最近的研究一致表明，在邮件联系过程中，用带窗口的信封显示现金可以增加调查的回复率。这些看得见的现金旨在激起人们的兴趣，并鼓励被抽样的家庭打开信封。本文通过检查邮件招聘期间实施的额外干预措施对招聘率和成本的影响，扩展了先前的研究。具体来说，我们实施了随机实验来检查展示现金的窗口的大小(小、大)和位置(没有、前面、后面)，并结合窗口信封显示的现金部分(数字金额、人脸/图像)和各种预付奖励金额(两个1美元、一个2美元、一个5美元)。我们使用NORC的AmeriSpeak Panel的招聘工作作为本研究的数据源。基于概率的AmeriSpeak Panel使用基于地址的样本和多种受访者联系模式，包括邮件，电话和招聘期间的亲自外展。我们的结果与之前的研究一致，表明在邮件联系时通过窗口显示现金，招聘率有了显著提高。我们还发现，在信封正面放置窗口，通过信封显示5美元，而不是2美元和1美元，通过窗口显示招标金额，而不是现金上的图像，更有可能提高招聘率。我们的成本分析表明，印刷窗口与无窗口信封的成本差异很小。前窗和后窗的打印成本没有区别，因为它们都需要定制制造。打印小窗口和大窗口的信封也没有成本差异。最后，根据我们对美国邮政服务的“跟踪和追踪”报告、发送给员工的种子邮件和无法投递的邮件率的审查，我们没有发现邮件被盗的证据。

{"title":"Peekaboo! The Effect of Different Visible Cash Display and Amount Options During Mail Contact When Recruiting to a Probability-Based Panel","authors":"Ipek Bilgen, David Dutwin, Roopam Singh, Erlina Hendarwan","doi":"10.1093/jssam/smad039","DOIUrl":"https://doi.org/10.1093/jssam/smad039","url":null,"abstract":"Abstract Recent studies consistently showed that making cash visible with a windowed envelope during mail contact increases response rates in surveys. The visible cash aims to pique interest and encourage sampled households to open the envelope. This article extends prior research by examining the effect of additional interventions implemented during mail recruitment to a survey panel on recruitment rates and costs. Specifically, we implemented randomized experiments to examine size (small, large) and location (none, front, back) of the window displaying cash, combined with what part of the cash is shown through the window envelope (numeric amount, face/image), and various prepaid incentive amounts (two $1, one $2, one $5). We used the recruitment effort for NORC’s AmeriSpeak Panel as the data source for this study. The probability-based AmeriSpeak Panel uses an address-based sample and multiple modes of respondent contact, including mail, phone, and in-person outreach during recruitment. Our results were consistent with prior research and showed significant improvement in recruitment rates when cash was displayed through a window during mail contact. We also found that placing the window on the front of the envelope, showing $5 through the envelope compared to $2 and $1, and showing the tender amount compared to the image on the cash through the window were more likely to improve the recruitment rates. Our cost analyses illustrated that the cost difference in printing window versus no window envelope is small. There is no difference in printing cost between front window and back window as they both require custom manufacturing. There is also no cost difference in printing envelopes with small windows versus large windows. Lastly, we found no evidence of mail theft based on our review of the United States Postal Service’s “track and trace” reports, seed mailings sent to staff, and undeliverable mailing rates.","PeriodicalId":17146,"journal":{"name":"Journal of Survey Statistics and Methodology","volume":" 24","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135292587","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Correction to: Correcting Selection Bias in Big Data by Pseudo-Weighting 修正:用伪加权法修正大数据中的选择偏差

4区数学 Q2 SOCIAL SCIENCES, MATHEMATICAL METHODS

Journal of Survey Statistics and Methodology

Pub Date : 2023-11-09 DOI: 10.1093/jssam/smad042

引用次数: 0

THE EFFECTS OF A TARGETED "EARLY BIRD" INCENTIVE STRATEGY ON RESPONSE RATES, FIELDWORK EFFORT, AND COSTS IN A NATIONAL PANEL STUDY. 在一项全国小组研究中，有针对性的 "早起鸟儿 "激励战略对回复率、实地工作和成本的影响。

IF 1.6 4区数学 Q2 SOCIAL SCIENCES, MATHEMATICAL METHODS

Journal of Survey Statistics and Methodology

Pub Date : 2023-11-01 Epub Date: 2022-02-01 DOI: 10.1093/jssam/smab042

Katherine A McGonagle Narayan Sastry Vicki A Freedman

Adaptive survey designs are increasingly used by survey practitioners to counteract ongoing declines in household survey response rates and manage rising fieldwork costs. This paper reports findings from an evaluation of an early-bird incentive (EBI) experiment targeting high-effort respondents who participate in the 2019 wave of the US Panel Study of Income Dynamics. We identified a subgroup of high-effort respondents at risk of nonresponse based on their prior wave fieldwork effort and randomized them to a treatment offering an extra time-delimited monetary incentive for completing their interview within the first month of data collection (treatment group; N = 800) or the standard study incentive (control group; N = 400). In recent waves, we have found that the costs of the protracted fieldwork needed to complete interviews with high-effort cases in the form of interviewer contact attempts plus an increased incentive near the close of data collection are extremely high. By incentivizing early participation and reducing the number of interviewer contact attempts and fieldwork days to complete the interview, our goal was to manage both nonresponse and survey costs. We found that the EBI treatment increased response rates and reduced fieldwork effort and costs compared to a control group. We review several key findings and limitations, discuss their implications, and identify the next steps for future research.

调查从业者越来越多地采用适应性调查设计来应对住户调查响应率的持续下降，并管理不断上升的实地调查成本。本文报告了针对参加 2019 年美国收入动态面板研究（US Panel Study of Income Dynamics）的高努力程度受访者的 "早鸟激励"（EBI）实验的评估结果。我们根据高努力受访者在前一轮实地调查中的努力程度，确定了一个有可能不作回应的高努力受访者子群，并将他们随机分配到一个提供额外限时货币奖励的处理组（处理组；N = 800）或标准研究奖励组（对照组；N = 400），以便在数据收集的第一个月内完成访谈。在最近的几次调查中，我们发现，为了完成对高难度个案的访谈，访谈者需要多次尝试与他们接触，再加上在数据收集工作接近尾声时增加奖励，这样的长期实地工作成本非常高。我们的目标是通过激励早期参与，减少访问员接触的次数和完成访谈所需的实地工作天数，来控制非响应率和调查成本。我们发现，与对照组相比，EBI 方法提高了回复率，减少了实地工作和成本。我们回顾了几个主要发现和局限性，讨论了它们的影响，并确定了未来研究的下一步。

{"title":"THE EFFECTS OF A TARGETED \"EARLY BIRD\" INCENTIVE STRATEGY ON RESPONSE RATES, FIELDWORK EFFORT, AND COSTS IN A NATIONAL PANEL STUDY.","authors":"Katherine A McGonagle, Narayan Sastry, Vicki A Freedman","doi":"10.1093/jssam/smab042","DOIUrl":"10.1093/jssam/smab042","url":null,"abstract":"Adaptive survey designs are increasingly used by survey practitioners to counteract ongoing declines in household survey response rates and manage rising fieldwork costs. This paper reports findings from an evaluation of an early-bird incentive (EBI) experiment targeting high-effort respondents who participate in the 2019 wave of the US Panel Study of Income Dynamics. We identified a subgroup of high-effort respondents at risk of nonresponse based on their prior wave fieldwork effort and randomized them to a treatment offering an extra time-delimited monetary incentive for completing their interview within the first month of data collection (treatment group; N = 800) or the standard study incentive (control group; N = 400). In recent waves, we have found that the costs of the protracted fieldwork needed to complete interviews with high-effort cases in the form of interviewer contact attempts plus an increased incentive near the close of data collection are extremely high. By incentivizing early participation and reducing the number of interviewer contact attempts and fieldwork days to complete the interview, our goal was to manage both nonresponse and survey costs. We found that the EBI treatment increased response rates and reduced fieldwork effort and costs compared to a control group. We review several key findings and limitations, discuss their implications, and identify the next steps for future research.","PeriodicalId":17146,"journal":{"name":"Journal of Survey Statistics and Methodology","volume":"11 5","pages":"1032-1053"},"PeriodicalIF":1.6,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10702785/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138801468","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Responsive and Adaptive Designs in Repeated Cross-National Surveys: A Simulation Study 重复跨国调查中的响应性和适应性设计:模拟研究

4区数学 Q2 SOCIAL SCIENCES, MATHEMATICAL METHODS

Journal of Survey Statistics and Methodology

Pub Date : 2023-10-27 DOI: 10.1093/jssam/smad038

Hafsteinn Einarsson Alexandru Cernat Natalie Shlomo

Abstract Cross-national surveys run the risk of differential survey errors, where data collected vary in quality from country to country. Responsive and adaptive survey designs (RASDs) have been proposed as a way to reduce survey errors, by leveraging auxiliary variables to inform fieldwork efforts, but have rarely been considered in the context of cross-national surveys. Using data from the European Social Survey, we simulate fieldwork in a repeated cross-national survey using RASD where fieldwork efforts are ended early for selected units in the final stage of data collection. Demographic variables, paradata (interviewer observations), and contact data are used to inform fieldwork efforts. Eight combinations of response propensity models and selection mechanisms are evaluated in terms of sample composition (as measured by the coefficient of variation of response propensities), response rates, number of contact attempts saved, and effects on estimates of target variables in the survey. We find that sample balance can be improved in many country-round combinations. Response rates can be increased marginally and targeting high propensity respondents could lead to significant cost savings associated with making fewer contact attempts. Estimates of target variables are not changed by the case prioritizations used in the simulations, indicating that they do not impact nonresponse bias. We conclude that RASDs should be considered in cross-national surveys, but that more work is needed to identify suitable covariates to inform fieldwork efforts.

跨国调查存在差异调查误差的风险，其中收集的数据质量因国而异。响应性和适应性调查设计(rasd)已经被提出作为一种减少调查误差的方法，通过利用辅助变量来通知实地工作，但很少在跨国调查的背景下被考虑。使用来自欧洲社会调查的数据，我们使用RASD在重复的跨国调查中模拟实地工作，在数据收集的最后阶段，对选定的单位提前结束实地工作。人口统计变量、para(采访者观察结果)和联系数据被用来为实地工作提供信息。根据样本组成(通过反应倾向的变异系数来衡量)、反应率、节省的接触次数以及对调查中目标变量估计的影响，评估了8种反应倾向模型和选择机制的组合。我们发现，在许多国家/地区的组合中，样本平衡可以得到改善。回复率可以略微提高，针对高倾向的受访者可以通过减少接触尝试来节省大量成本。在模拟中使用的情况优先级不会改变目标变量的估计，这表明它们不会影响非响应偏差。我们的结论是，rasd应该在跨国调查中考虑，但需要更多的工作来确定合适的协变量，以告知实地工作的努力。

{"title":"Responsive and Adaptive Designs in Repeated Cross-National Surveys: A Simulation Study","authors":"Hafsteinn Einarsson, Alexandru Cernat, Natalie Shlomo","doi":"10.1093/jssam/smad038","DOIUrl":"https://doi.org/10.1093/jssam/smad038","url":null,"abstract":"Abstract Cross-national surveys run the risk of differential survey errors, where data collected vary in quality from country to country. Responsive and adaptive survey designs (RASDs) have been proposed as a way to reduce survey errors, by leveraging auxiliary variables to inform fieldwork efforts, but have rarely been considered in the context of cross-national surveys. Using data from the European Social Survey, we simulate fieldwork in a repeated cross-national survey using RASD where fieldwork efforts are ended early for selected units in the final stage of data collection. Demographic variables, paradata (interviewer observations), and contact data are used to inform fieldwork efforts. Eight combinations of response propensity models and selection mechanisms are evaluated in terms of sample composition (as measured by the coefficient of variation of response propensities), response rates, number of contact attempts saved, and effects on estimates of target variables in the survey. We find that sample balance can be improved in many country-round combinations. Response rates can be increased marginally and targeting high propensity respondents could lead to significant cost savings associated with making fewer contact attempts. Estimates of target variables are not changed by the case prioritizations used in the simulations, indicating that they do not impact nonresponse bias. We conclude that RASDs should be considered in cross-national surveys, but that more work is needed to identify suitable covariates to inform fieldwork efforts.","PeriodicalId":17146,"journal":{"name":"Journal of Survey Statistics and Methodology","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136317165","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0