首页 > 最新文献

Journal of Survey Statistics and Methodology最新文献

英文 中文
Rank-Based Inference for Survey Sampling Data 基于秩的调查抽样数据推理
IF 2.1 4区 数学 Q2 SOCIAL SCIENCES, MATHEMATICAL METHODS Pub Date : 2021-08-09 DOI: 10.1093/jssam/smab019
A. Adekpedjou, H. Bindele
For regression models where data are obtained from sampling surveies, the statistical analysis is often based on approaches that are either non-robust or inefficient. The handling of survey data requires more appropriate techniques, as the classical methods usually result in biased and inefficient estimates of the underlying model parameters. This article is concerned with the development of a new approach of obtaining robust and efficient estimates of regression model parameters when dealing with survey sampling data. Asymptotic properties of such estimators are established under mild regularity conditions. To demonstrate the performance of the proposed method, Monte Carlo simulation experiments are carried out and show that the estimators obtained from the proposed methodology are robust and more efficient than many of those obtained from existing approaches, mainly if the survey data tend to result in residuals with heavy-tailed or skewed distributions and/or when there are few gross outliers. Finally, the proposed approach is illustrated with a real data example.
对于从抽样调查中获得数据的回归模型,统计分析通常基于非稳健或低效的方法。调查数据的处理需要更合适的技术,因为经典方法通常会导致对基本模型参数的估计有偏差且效率低下。本文致力于开发一种新方法,在处理调查抽样数据时,获得回归模型参数的稳健有效估计。在温和的正则性条件下,建立了这类估计量的渐近性质。为了证明所提出方法的性能,进行了蒙特卡洛模拟实验,并表明从所提出方法获得的估计量是稳健的,并且比从现有方法获得的许多估计量更有效,主要是如果调查数据倾向于产生具有重尾或偏斜分布的残差和/或当几乎没有总异常值时。最后,以实际数据为例说明了所提出的方法。
{"title":"Rank-Based Inference for Survey Sampling Data","authors":"A. Adekpedjou, H. Bindele","doi":"10.1093/jssam/smab019","DOIUrl":"https://doi.org/10.1093/jssam/smab019","url":null,"abstract":"\u0000 For regression models where data are obtained from sampling surveies, the statistical analysis is often based on approaches that are either non-robust or inefficient. The handling of survey data requires more appropriate techniques, as the classical methods usually result in biased and inefficient estimates of the underlying model parameters. This article is concerned with the development of a new approach of obtaining robust and efficient estimates of regression model parameters when dealing with survey sampling data. Asymptotic properties of such estimators are established under mild regularity conditions. To demonstrate the performance of the proposed method, Monte Carlo simulation experiments are carried out and show that the estimators obtained from the proposed methodology are robust and more efficient than many of those obtained from existing approaches, mainly if the survey data tend to result in residuals with heavy-tailed or skewed distributions and/or when there are few gross outliers. Finally, the proposed approach is illustrated with a real data example.","PeriodicalId":17146,"journal":{"name":"Journal of Survey Statistics and Methodology","volume":" ","pages":""},"PeriodicalIF":2.1,"publicationDate":"2021-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44965476","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Model-Assisted Approach for Finding Coding Errors in Manual Coding of Open-Ended Questions 开放式问题手工编码中编码错误的模型辅助发现方法
IF 2.1 4区 数学 Q2 SOCIAL SCIENCES, MATHEMATICAL METHODS Pub Date : 2021-08-03 DOI: 10.1093/jssam/smab022
Zhoushanyue He, Matthias Schonlau
Text answers to open-ended questions are typically manually coded into one of several codes. Usually, a random subset of text answers is double-coded to assess intercoder reliability, but most of the data remain single-coded. Any disagreement between the two coders points to an error by one of the coders. When the budget allows double coding additional text answers, we propose employing statistical learning models to predict which single-coded answers have a high risk of a coding error. Specifically, we train a model on the double-coded random subset and predict the probability that the single-coded codes are correct. Then, text answers with the highest risk are double-coded to verify. In experiments with three data sets, we found that this method identifies two to three times as many coding errors in the additional text answers as compared to random guessing, on average. We conclude that this method is preferred if the budget permits additional double-coding. When there are a lot of intercoder disagreements, the benefit can be substantial.
开放式问题的文本答案通常被手动编码为几个代码中的一个。通常,文本答案的随机子集是双重编码的,以评估代码间的可靠性,但大多数数据仍然是单一编码的。两个编码器之间的任何分歧都指向其中一个编码器的错误。当预算允许对额外的文本答案进行双重编码时,我们建议使用统计学习模型来预测哪些单一编码的答案具有较高的编码错误风险。具体来说,我们在双编码随机子集上训练一个模型,并预测单编码码正确的概率。然后,对风险最高的文本答案进行双重编码以进行验证。在使用三个数据集的实验中,我们发现,与随机猜测相比,这种方法在额外的文本答案中识别的编码错误平均是随机猜测的两到三倍。我们得出的结论是,如果预算允许额外的双重编码,这种方法是首选的。当有很多跨部门的分歧时,利益可能是巨大的。
{"title":"A Model-Assisted Approach for Finding Coding Errors in Manual Coding of Open-Ended Questions","authors":"Zhoushanyue He, Matthias Schonlau","doi":"10.1093/jssam/smab022","DOIUrl":"https://doi.org/10.1093/jssam/smab022","url":null,"abstract":"\u0000 Text answers to open-ended questions are typically manually coded into one of several codes. Usually, a random subset of text answers is double-coded to assess intercoder reliability, but most of the data remain single-coded. Any disagreement between the two coders points to an error by one of the coders. When the budget allows double coding additional text answers, we propose employing statistical learning models to predict which single-coded answers have a high risk of a coding error. Specifically, we train a model on the double-coded random subset and predict the probability that the single-coded codes are correct. Then, text answers with the highest risk are double-coded to verify. In experiments with three data sets, we found that this method identifies two to three times as many coding errors in the additional text answers as compared to random guessing, on average. We conclude that this method is preferred if the budget permits additional double-coding. When there are a lot of intercoder disagreements, the benefit can be substantial.","PeriodicalId":17146,"journal":{"name":"Journal of Survey Statistics and Methodology","volume":" ","pages":""},"PeriodicalIF":2.1,"publicationDate":"2021-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45517153","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
DETERMINED BY MODE? REPRESENTATION AND MEASUREMENT EFFECTS IN A DUAL-MODE STATEWIDE SURVEY 由模式决定?双模全州调查的表征与测量效应
IF 2.1 4区 数学 Q2 SOCIAL SCIENCES, MATHEMATICAL METHODS Pub Date : 2021-06-18 DOI: 10.1093/jssam/smab012
Enrijeta Shino, Michael D. Martinez, Michael Binder
With the increasing usage of dual-mode data collection, researchers of public opinion have shown considerable interest in understanding response differences across different interview modes. Are mode effects an outcome of representation or measurement differences across modes? We conducted a dual-mode survey (web and telephone) using Florida’s voter file as the sampling frame, randomly assigning registered voters into one mode versus the other. Having a priori information about the respondents allows us to gauge whether and how sample composition differences may be driven by mode effects, and whether mode affects estimated models of political behavior. Survey mode effects are still significant for issue voting even when sampling design is similar for both modes.
随着双模式数据收集的使用越来越多,民意研究人员对了解不同访谈模式下的反应差异表现出了极大的兴趣。模式效应是模式间表现或测量差异的结果吗?我们使用佛罗里达州的选民档案作为抽样框架,进行了一项双模式调查(网络和电话),随机将登记选民分配到一种模式与另一种模式。有了关于被调查者的先验信息,我们可以衡量样本组成差异是否以及如何由模式效应驱动,以及模式是否影响政治行为的估计模型。调查模式对议题投票的影响仍然显著,即使两种模式的抽样设计相似。
{"title":"DETERMINED BY MODE? REPRESENTATION AND MEASUREMENT EFFECTS IN A DUAL-MODE STATEWIDE SURVEY","authors":"Enrijeta Shino, Michael D. Martinez, Michael Binder","doi":"10.1093/jssam/smab012","DOIUrl":"https://doi.org/10.1093/jssam/smab012","url":null,"abstract":"\u0000 With the increasing usage of dual-mode data collection, researchers of public opinion have shown considerable interest in understanding response differences across different interview modes. Are mode effects an outcome of representation or measurement differences across modes? We conducted a dual-mode survey (web and telephone) using Florida’s voter file as the sampling frame, randomly assigning registered voters into one mode versus the other. Having a priori information about the respondents allows us to gauge whether and how sample composition differences may be driven by mode effects, and whether mode affects estimated models of political behavior. Survey mode effects are still significant for issue voting even when sampling design is similar for both modes.","PeriodicalId":17146,"journal":{"name":"Journal of Survey Statistics and Methodology","volume":" ","pages":""},"PeriodicalIF":2.1,"publicationDate":"2021-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45317436","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
QUESTIONNAIRE COMPLEXITY, REST PERIOD, AND RESPONSE LIKELIHOOD IN ESTABLISHMENT SURVEYS 企业调查问卷的复杂性、休息时间与回应可能性
IF 2.1 4区 数学 Q2 SOCIAL SCIENCES, MATHEMATICAL METHODS Pub Date : 2021-06-15 DOI: 10.1093/JSSAM/SMAB017
J. Rodhouse, T. Wilson, Heather E Ridolfo
Response burden has been a concern in survey research for some time. One area of concern is the negative impact that response burden can have on response rates. In an effort to mitigate negative impacts on response rates, survey research organizations try to minimize the burden respondents are exposed to and maximize the likelihood of response. Many organizations also try to be mindful of the role burden may play in respondents’ likelihood to participate in future surveys by implementing rest periods or survey holidays. Recently, new evidence from a study of cross-sectional household surveys provided an interesting lens to examine burden. The evidence demonstrated that those sampled in two independent surveys are more likely to respond to the second survey if the first survey was more difficult to complete, and that this effect was not significantly influenced by the rest period in between the two surveys. These findings are compelling, and since the mechanisms influencing response in household and establishment surveys differ in important ways, a similar examination in an establishment survey context is warranted. To accomplish this, data are used from the National Agricultural Statistics Service. Overall, our research finds that prior survey features such as questionnaire complexity (or burden), prior response disposition and rest period are significantly associated with response to subsequent surveys. We also find that sample units first receiving a more complex questionnaire have significantly higher probabilities of response to a subsequent survey than do those receiving a simpler questionnaire first. The findings in this paper have implications for nonresponse adjustments and identification of subgroups for adaptive design data collection.
一段时间以来,响应负担一直是调查研究中关注的问题。一个令人关切的领域是反应负担可能对反应率产生的负面影响。为了减轻对回复率的负面影响,调查研究组织试图将受访者面临的负担降至最低,并最大限度地提高回复的可能性。许多组织还试图通过实施休息期或调查假期来注意负担可能在受访者参与未来调查的可能性中发挥的作用。最近,一项横断面家庭调查研究的新证据为研究负担提供了一个有趣的视角。证据表明,如果第一次调查更难完成,那么在两次独立调查中抽样的人更有可能对第二次调查做出回应,而且这种影响没有受到两次调查之间休息期的显著影响。这些发现令人信服,而且由于影响家庭和机构调查反应的机制在重要方面有所不同,因此有必要在机构调查背景下进行类似的审查。为此,使用了国家农业统计局的数据。总体而言,我们的研究发现,先前的调查特征,如问卷复杂性(或负担)、先前的反应倾向和休息时间,与对后续调查的反应显著相关。我们还发现,首先接受更复杂问卷的样本单位对后续调查的回答概率明显高于首先接受更简单问卷的样本单元。本文的研究结果对无响应调整和识别自适应设计数据收集的亚组具有启示意义。
{"title":"QUESTIONNAIRE COMPLEXITY, REST PERIOD, AND RESPONSE LIKELIHOOD IN ESTABLISHMENT SURVEYS","authors":"J. Rodhouse, T. Wilson, Heather E Ridolfo","doi":"10.1093/JSSAM/SMAB017","DOIUrl":"https://doi.org/10.1093/JSSAM/SMAB017","url":null,"abstract":"\u0000 Response burden has been a concern in survey research for some time. One area of concern is the negative impact that response burden can have on response rates. In an effort to mitigate negative impacts on response rates, survey research organizations try to minimize the burden respondents are exposed to and maximize the likelihood of response. Many organizations also try to be mindful of the role burden may play in respondents’ likelihood to participate in future surveys by implementing rest periods or survey holidays. Recently, new evidence from a study of cross-sectional household surveys provided an interesting lens to examine burden. The evidence demonstrated that those sampled in two independent surveys are more likely to respond to the second survey if the first survey was more difficult to complete, and that this effect was not significantly influenced by the rest period in between the two surveys. These findings are compelling, and since the mechanisms influencing response in household and establishment surveys differ in important ways, a similar examination in an establishment survey context is warranted. To accomplish this, data are used from the National Agricultural Statistics Service. Overall, our research finds that prior survey features such as questionnaire complexity (or burden), prior response disposition and rest period are significantly associated with response to subsequent surveys. We also find that sample units first receiving a more complex questionnaire have significantly higher probabilities of response to a subsequent survey than do those receiving a simpler questionnaire first. The findings in this paper have implications for nonresponse adjustments and identification of subgroups for adaptive design data collection.","PeriodicalId":17146,"journal":{"name":"Journal of Survey Statistics and Methodology","volume":" ","pages":""},"PeriodicalIF":2.1,"publicationDate":"2021-06-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41839493","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
PERCEIVED BURDEN, FOCUS OF ATTENTION, AND THE URGE TO JUSTIFY: THE IMPACT OF THE NUMBER OF SCREENS AND PROBE ORDER ON THE RESPONSE BEHAVIOR OF PROBING QUESTIONS 感知负担、注意焦点和证明的冲动:屏幕数量和探究顺序对探究性问题反应行为的影响
IF 2.1 4区 数学 Q2 SOCIAL SCIENCES, MATHEMATICAL METHODS Pub Date : 2021-06-12 DOI: 10.1093/JSSAM/SMAA043
Katharina Meitinger, A. Toroslu, Klara Raiber, Michael Braun
Web probing is a valuable tool to assess the validity and comparability of survey items. It uses different probe types—such as category-selection probes and specific probes—to inquire about different aspects of an item. Previous web probing studies often asked one probe type per item, but research situations exist where it might be preferable to test potentially problematic items with multiple probes. However, the response behavior might be affected by two factors: question order and the visual presentation of probes on one screen versus multiple screens as well as their interaction. In this study, we report evidence from a web experiment that was conducted with 532 respondents from Germany in September 2013. Experimental groups varied by screen number (1 versus 2) and probe order (category-selection probe first versus specific probe first). We assessed the impact of these manipulations on several indicators of response quality, probe answer content, and the respondents’ motivation with logistic regressions and two-way ANOVAs. We reveal that multiple mechanisms push response behavior in this context: perceived response burden, the focus of attention, the need for justification, and verbal context effects. We find that response behavior in the condition with two screens and category-selection probe first outperforms all other experimental conditions. We recommend this implementation in all but one scenario: if the goal is to test an item that includes a key term with a potentially too large lexical scope, we recommend starting with a specific probe but on the same screen as the category-selection probe.
网络探测是评估调查项目有效性和可比性的重要工具。它使用不同的探测类型(例如类别选择探测和特定探测)来查询项目的不同方面。以前的网络探测研究通常要求每个项目使用一种探测类型,但研究情况可能更可取的是使用多个探测来测试潜在的问题项目。然而,反应行为可能受到两个因素的影响:问题顺序和探针在一个屏幕上与多个屏幕上的视觉呈现以及它们之间的相互作用。在这项研究中,我们报告了2013年9月对德国532名受访者进行的网络实验的证据。实验组根据筛选数量(1对2)和探针顺序(类别选择探针优先还是特定探针优先)而变化。我们用逻辑回归和双向方差分析评估了这些操作对回答质量、探针回答内容和受访者动机等几个指标的影响。我们发现在这种情况下,多种机制推动了反应行为:知觉反应负担、注意焦点、辩护需要和言语语境效应。我们发现在两个屏幕和类别选择探针条件下的反应行为首先优于所有其他实验条件。除了一种情况外,我们建议在所有场景中都使用这种实现:如果目标是测试包含一个词法范围可能太大的关键术语的项,我们建议从一个特定的探测开始,但与类别选择探测在同一屏幕上。
{"title":"PERCEIVED BURDEN, FOCUS OF ATTENTION, AND THE URGE TO JUSTIFY: THE IMPACT OF THE NUMBER OF SCREENS AND PROBE ORDER ON THE RESPONSE BEHAVIOR OF PROBING QUESTIONS","authors":"Katharina Meitinger, A. Toroslu, Klara Raiber, Michael Braun","doi":"10.1093/JSSAM/SMAA043","DOIUrl":"https://doi.org/10.1093/JSSAM/SMAA043","url":null,"abstract":"\u0000 Web probing is a valuable tool to assess the validity and comparability of survey items. It uses different probe types—such as category-selection probes and specific probes—to inquire about different aspects of an item. Previous web probing studies often asked one probe type per item, but research situations exist where it might be preferable to test potentially problematic items with multiple probes. However, the response behavior might be affected by two factors: question order and the visual presentation of probes on one screen versus multiple screens as well as their interaction. In this study, we report evidence from a web experiment that was conducted with 532 respondents from Germany in September 2013. Experimental groups varied by screen number (1 versus 2) and probe order (category-selection probe first versus specific probe first). We assessed the impact of these manipulations on several indicators of response quality, probe answer content, and the respondents’ motivation with logistic regressions and two-way ANOVAs. We reveal that multiple mechanisms push response behavior in this context: perceived response burden, the focus of attention, the need for justification, and verbal context effects. We find that response behavior in the condition with two screens and category-selection probe first outperforms all other experimental conditions. We recommend this implementation in all but one scenario: if the goal is to test an item that includes a key term with a potentially too large lexical scope, we recommend starting with a specific probe but on the same screen as the category-selection probe.","PeriodicalId":17146,"journal":{"name":"Journal of Survey Statistics and Methodology","volume":" ","pages":""},"PeriodicalIF":2.1,"publicationDate":"2021-06-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45424242","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A DYNAMIC SURVIVAL MODELING APPROACH TO THE PREDICTION OF WEB SURVEY BREAKOFF 网络调查断裂预测的动态生存建模方法
IF 2.1 4区 数学 Q2 SOCIAL SCIENCES, MATHEMATICAL METHODS Pub Date : 2021-06-12 DOI: 10.1093/JSSAM/SMAB015
Felicitas Mittereder, B. West
Respondents who break off from a web survey prior to completing it are a prevalent problem in data collection. To prevent breakoff bias, it is crucial to keep as many diverse respondents in a web survey as possible. As a first step of preventing breakoffs, this study aims to understand breakoff and the associated response behavior. We analyze data from an annual online survey using dynamic survival models and ROC analyses. We find that breakoff risks between respondents using mobile devices versus PCs do not differ at the beginning of the questionnaire, but the risk for mobile device users increases as the survey progresses. Very fast respondents as well as respondents with changing response times both have a higher risk of quitting the questionnaire, compared to respondents with slower and steady response times. We conclude with a discussion of the implications of these findings for future practice and research in web survey methodology.
受访者在完成网络调查之前中断调查是数据收集中的一个普遍问题。为了防止脱离偏见,在网络调查中保持尽可能多的不同受访者是至关重要的。作为预防突发事件的第一步,本研究旨在了解突发事件及其相关的反应行为。我们使用动态生存模型和ROC分析来分析年度在线调查的数据。我们发现,在问卷调查开始时,使用移动设备和个人电脑的受访者之间的脱离风险没有差异,但随着调查的进行,移动设备用户的风险会增加。与反应时间较慢和稳定的受访者相比,反应速度较快的受访者和反应时间不断变化的受访者退出问卷的风险都更高。最后,我们讨论了这些发现对未来网络调查方法的实践和研究的影响。
{"title":"A DYNAMIC SURVIVAL MODELING APPROACH TO THE PREDICTION OF WEB SURVEY BREAKOFF","authors":"Felicitas Mittereder, B. West","doi":"10.1093/JSSAM/SMAB015","DOIUrl":"https://doi.org/10.1093/JSSAM/SMAB015","url":null,"abstract":"\u0000 Respondents who break off from a web survey prior to completing it are a prevalent problem in data collection. To prevent breakoff bias, it is crucial to keep as many diverse respondents in a web survey as possible. As a first step of preventing breakoffs, this study aims to understand breakoff and the associated response behavior. We analyze data from an annual online survey using dynamic survival models and ROC analyses. We find that breakoff risks between respondents using mobile devices versus PCs do not differ at the beginning of the questionnaire, but the risk for mobile device users increases as the survey progresses. Very fast respondents as well as respondents with changing response times both have a higher risk of quitting the questionnaire, compared to respondents with slower and steady response times. We conclude with a discussion of the implications of these findings for future practice and research in web survey methodology.","PeriodicalId":17146,"journal":{"name":"Journal of Survey Statistics and Methodology","volume":" ","pages":""},"PeriodicalIF":2.1,"publicationDate":"2021-06-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46746139","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
FIT FOR PURPOSE IN ACTION: DESIGN, IMPLEMENTATION, AND EVALUATION OF THE NATIONAL INTERNET FLU SURVEY. 符合行动目的:全国互联网流感调查的设计、实施和评估。
IF 1.6 4区 数学 Q2 SOCIAL SCIENCES, MATHEMATICAL METHODS Pub Date : 2021-06-01 DOI: 10.1093/jssam/smz050
Jill A Dever, Ashley Amaya, Anup Srivastav, Peng-Jun Lu, Jessica Roycroft, Marshica Stanley, M Christopher Stringer, Michael G Bostwick, Stacie M Greby, Tammy A Santibanez, Walter W Williams

Researchers strive to design and implement high-quality surveys to maximize the utility of the data collected. The definitions of quality and usefulness, however, vary from survey to survey and depend on the analytic needs. Survey teams must evaluate the trade-offs of various decisions, such as when results are needed and their required level of precision, in addition to practical constraints like budget, before finalizing the design. Characteristics within the concept of fit for purpose (FfP) can provide the framework for considering the trade-offs. Furthermore, this tool can enable an evaluation of quality for the resulting estimates. Implementation of a FfP framework in this context, however, is not straightforward. In this article, we provide the reader with a glimpse of a FfP framework in action for obtaining estimates on early season influenza vaccination coverage estimates and on knowledge, attitudes, behaviors, and barriers related to influenza and influenza prevention among civilian noninstitutionalized adults aged 18 years and older in the United States. The result is the National Internet Flu Survey (NIFS), an annual, two-week internet survey sponsored by the US Centers for Disease Control and Prevention. In addition to critical design decisions, we use the established NIFS FfP framework to discuss the quality of the NIFS in meeting the intended objectives. We highlight aspects that work well and other survey traits requiring further evaluation. Differences found in comparing the NIFS to the National Flu Survey, the National Health Interview Survey, and Behavioral Risk Factor Surveillance System are discussed via their respective FfP characteristics. The findings presented here highlight the importance of the FfP framework for designing surveys, defining data quality, and providing a set a metrics used to advertise the intended use of the survey data and results.

研究人员努力设计和实施高质量的调查,以最大限度地利用所收集的数据。然而,质量和有用性的定义因调查而异,并取决于分析需要。在完成设计之前,调查团队必须评估各种决策的权衡,例如何时需要结果及其所需的精度水平,以及预算等实际约束。适合目的(FfP)概念中的特征可以提供考虑权衡的框架。此外,该工具能够对结果估计的质量进行评估。然而,在这种情况下实现FfP框架并不是直截了当的。在这篇文章中,我们为读者提供了一个FfP框架的一瞥,以获得对早期流感疫苗接种覆盖率的估计,以及在美国18岁及以上的非机构成年人中与流感和流感预防相关的知识、态度、行为和障碍的估计。该调查是由美国疾病控制与预防中心发起的一项为期两周的年度网络调查。除了关键的设计决策外,我们还使用已建立的NIFS FfP框架来讨论NIFS在满足预期目标方面的质量。我们强调工作良好的方面和其他需要进一步评估的调查特征。比较NIFS与全国流感调查、全国健康访谈调查和行为风险因素监测系统所发现的差异,通过各自的FfP特征进行讨论。本文的研究结果强调了FfP框架在设计调查、定义数据质量以及提供一套用于宣传调查数据和结果预期用途的指标方面的重要性。
{"title":"FIT FOR PURPOSE IN ACTION: DESIGN, IMPLEMENTATION, AND EVALUATION OF THE NATIONAL INTERNET FLU SURVEY.","authors":"Jill A Dever, Ashley Amaya, Anup Srivastav, Peng-Jun Lu, Jessica Roycroft, Marshica Stanley, M Christopher Stringer, Michael G Bostwick, Stacie M Greby, Tammy A Santibanez, Walter W Williams","doi":"10.1093/jssam/smz050","DOIUrl":"10.1093/jssam/smz050","url":null,"abstract":"<p><p>Researchers strive to design and implement high-quality surveys to maximize the utility of the data collected. The definitions of quality and usefulness, however, vary from survey to survey and depend on the analytic needs. Survey teams must evaluate the trade-offs of various decisions, such as when results are needed and their required level of precision, in addition to practical constraints like budget, before finalizing the design. Characteristics within the concept of fit for purpose (FfP) can provide the framework for considering the trade-offs. Furthermore, this tool can enable an evaluation of quality for the resulting estimates. Implementation of a FfP framework in this context, however, is not straightforward. In this article, we provide the reader with a glimpse of a FfP framework in action for obtaining estimates on early season influenza vaccination coverage estimates and on knowledge, attitudes, behaviors, and barriers related to influenza and influenza prevention among civilian noninstitutionalized adults aged 18 years and older in the United States. The result is the National Internet Flu Survey (NIFS), an annual, two-week internet survey sponsored by the US Centers for Disease Control and Prevention. In addition to critical design decisions, we use the established NIFS FfP framework to discuss the quality of the NIFS in meeting the intended objectives. We highlight aspects that work well and other survey traits requiring further evaluation. Differences found in comparing the NIFS to the National Flu Survey, the National Health Interview Survey, and Behavioral Risk Factor Surveillance System are discussed via their respective FfP characteristics. The findings presented here highlight the importance of the FfP framework for designing surveys, defining data quality, and providing a set a metrics used to advertise the intended use of the survey data and results.</p>","PeriodicalId":17146,"journal":{"name":"Journal of Survey Statistics and Methodology","volume":" ","pages":"449-476"},"PeriodicalIF":1.6,"publicationDate":"2021-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9434706/pdf/nihms-1800712.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"40349807","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimality of the Recursive Neyman Allocation 递归内曼分配的最优性
IF 2.1 4区 数学 Q2 SOCIAL SCIENCES, MATHEMATICAL METHODS Pub Date : 2021-05-30 DOI: 10.1093/jssam/smab018
J. Wesołowski, R. Wieczorkowski, Wojciech W'ojciak
We derive a formula for the optimal sample allocation in a general stratified scheme under upper bounds on the sample strata sizes. Such a general scheme includes SRSWOR within strata as a special case. The solution is given in terms of V allocation with V being the set of take-all strata. We use V allocation to give a formal proof of optimality of the popular recursive Neyman algorithm, rNa. We also propose a quick proof of optimality of the algorithm of Stenger and Gabler, SGa, as well as of our proposed modification, coma. Finally, we compare running times of rNa, SGa, and coma. Ready-to-use R-implementations of these algorithms are available on CRAN repository at https://cran.r-project.org/web/packages/stratallo.
在样本层大小的上限下,我们导出了一般分层方案中最优样本分配的公式。这种一般方案包括地层内的SRSWOR作为特例。该解是根据V分配给出的,其中V是所有阶层的集合。我们使用V分配给出了流行的递归Neyman算法rNa的最优性的形式证明。我们还提出了Stenger和Gabler算法的最优性的快速证明,SGa,以及我们提出的修改,coma。最后,我们比较了rNa、SGa和昏迷的运行时间。这些算法的现成R实现可在CRAN存储库中获得,网址为https://cran.r-project.org/web/packages/stratallo.
{"title":"Optimality of the Recursive Neyman Allocation","authors":"J. Wesołowski, R. Wieczorkowski, Wojciech W'ojciak","doi":"10.1093/jssam/smab018","DOIUrl":"https://doi.org/10.1093/jssam/smab018","url":null,"abstract":"\u0000 We derive a formula for the optimal sample allocation in a general stratified scheme under upper bounds on the sample strata sizes. Such a general scheme includes SRSWOR within strata as a special case. The solution is given in terms of V allocation with V being the set of take-all strata. We use V allocation to give a formal proof of optimality of the popular recursive Neyman algorithm, rNa. We also propose a quick proof of optimality of the algorithm of Stenger and Gabler, SGa, as well as of our proposed modification, coma. Finally, we compare running times of rNa, SGa, and coma. Ready-to-use R-implementations of these algorithms are available on CRAN repository at https://cran.r-project.org/web/packages/stratallo.","PeriodicalId":17146,"journal":{"name":"Journal of Survey Statistics and Methodology","volume":" ","pages":""},"PeriodicalIF":2.1,"publicationDate":"2021-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43743867","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Modeling public opinion over time: A simulation study of latent trend models 一段时间内的民意建模:潜在趋势模型的模拟研究
IF 2.1 4区 数学 Q2 SOCIAL SCIENCES, MATHEMATICAL METHODS Pub Date : 2021-05-25 DOI: 10.31235/osf.io/gauvx
M. Kołczyńska, P. Bürkner
Analyzing trends in public opinion is important for monitoring social change and for testing theories aimed at explaining this change. With growing availability of multi-wave surveys, social scientists are increasingly turning to latent trend models applied to survey data for examining changes in social and political attitudes. With the aim of facilitating this research, our study compares different approaches to modeling latent trends of aggregate public opinion: splines, Gaussian processes, and discrete autoregressive models. We examine the ability of these models to recover latent trends with simulated data that vary with regard to the frequency and magnitude of changes in the true trend, model complexity and data availability. Overall, we find that all three latent trend models perform well in all scenarios, even the most difficult ones with frequent and weak changes of the latent trend and sparse data. The two main performance differences we find include the relatively higher squared errors of autoregressive models compared to the other models, and the under-coverage of posterior intervals in high-frequency low-amplitude trends with splines. For all models and across all scenarios performance improves with increased data availability, which emphasizes the need of supplying sufficient data for accurate estimation of latent trends.
分析公众舆论的趋势对于监测社会变化和检验旨在解释这种变化的理论非常重要。随着多波调查的日益普及,社会科学家越来越多地转向应用于调查数据的潜在趋势模型,以研究社会和政治态度的变化。为了促进这项研究,我们的研究比较了建模总体民意潜在趋势的不同方法:样条曲线、高斯过程和离散自回归模型。我们研究了这些模型利用模拟数据恢复潜在趋势的能力,模拟数据随着真实趋势、模型复杂性和数据可用性的变化频率和幅度而变化。总体而言,我们发现所有三个潜在趋势模型在所有情况下都表现良好,即使是最困难的潜在趋势变化频繁和微弱且数据稀疏的模型。我们发现的两个主要性能差异包括与其他模型相比,自回归模型的平方误差相对较高,以及在具有样条的高频低振幅趋势中后验区间覆盖不足。对于所有模型和所有场景,性能随着数据可用性的增加而提高,这强调了提供足够数据以准确估计潜在趋势的必要性。
{"title":"Modeling public opinion over time: A simulation study of latent trend models","authors":"M. Kołczyńska, P. Bürkner","doi":"10.31235/osf.io/gauvx","DOIUrl":"https://doi.org/10.31235/osf.io/gauvx","url":null,"abstract":"Analyzing trends in public opinion is important for monitoring social change and for testing theories aimed at explaining this change. With growing availability of multi-wave surveys, social scientists are increasingly turning to latent trend models applied to survey data for examining changes in social and political attitudes. With the aim of facilitating this research, our study compares different approaches to modeling latent trends of aggregate public opinion: splines, Gaussian processes, and discrete autoregressive models. We examine the ability of these models to recover latent trends with simulated data that vary with regard to the frequency and magnitude of changes in the true trend, model complexity and data availability. Overall, we find that all three latent trend models perform well in all scenarios, even the most difficult ones with frequent and weak changes of the latent trend and sparse data. The two main performance differences we find include the relatively higher squared errors of autoregressive models compared to the other models, and the under-coverage of posterior intervals in high-frequency low-amplitude trends with splines. For all models and across all scenarios performance improves with increased data availability, which emphasizes the need of supplying sufficient data for accurate estimation of latent trends.","PeriodicalId":17146,"journal":{"name":"Journal of Survey Statistics and Methodology","volume":" ","pages":""},"PeriodicalIF":2.1,"publicationDate":"2021-05-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49120772","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Risk of Nonresponse Bias and the Length of the Field Period in a Mixed-Mode General Population Panel 混合模式一般人群面板中无反应偏倚的风险和现场周期的长度
IF 2.1 4区 数学 Q2 SOCIAL SCIENCES, MATHEMATICAL METHODS Pub Date : 2021-04-19 DOI: 10.1093/JSSAM/SMAB011
Bella Struminskaya, Tobias Gummer
Survey researchers are often confronted with the question of how long to set the length of the field period. Longer fielding time might lead to greater participation yet requires survey managers to devote more of their time to data collection efforts. With the aim of facilitating the decision about the length of the field period, we investigated whether a longer fielding time reduces the risk of nonresponse bias to judge whether field periods can be ended earlier without endangering the performance of the survey. By using data from six waves of a probability-based mixed-mode (online and mail) panel of the German population, we analyzed whether the risk of nonresponse bias decreases over the field period by investigating how day-by-day coefficients of variation develop during the field period. We then determined the optimal cut-off points for each mode after which data collection can be terminated without increasing the risk of nonresponse bias and found that the optimal cut-off points differ by mode. Our study complements prior research by shifting the perspective in the investigation of the risk of nonresponse bias to panel data as well as to mixed-mode surveys, in particular. Our proposed method of using coefficients of variation to assess whether the risk of nonresponse bias decreases significantly with each additional day of fieldwork can aid survey practitioners in finding the optimal field period for their mixed-mode surveys.
调查研究人员经常面临的问题是设置多长时间的现场周期。较长的实地时间可能会导致更多的参与,但需要调查经理投入更多的时间来收集数据。为了便于决定实地考察时间的长短,我们研究了较长的实地考察时间是否会降低无反应偏差的风险,从而判断实地考察是否可以在不影响调查效果的情况下提前结束。通过使用德国人口基于概率的混合模式(在线和邮件)小组的六波数据,我们通过调查在现场期间每天的变异系数如何发展,分析了无反应偏差的风险是否在现场期间降低。然后,我们确定了每种模式的最佳截止点,在此之后,可以在不增加非响应偏差风险的情况下终止数据收集,并发现最佳截止点因模式而异。我们的研究补充了先前的研究,将调查无反应偏倚风险的视角转移到面板数据以及混合模式调查中,特别是。我们提出的方法是使用变异系数来评估无反应偏差的风险是否随着每增加一天的实地工作而显著降低,这可以帮助调查从业者找到进行混合模式调查的最佳实地时间段。
{"title":"Risk of Nonresponse Bias and the Length of the Field Period in a Mixed-Mode General Population Panel","authors":"Bella Struminskaya, Tobias Gummer","doi":"10.1093/JSSAM/SMAB011","DOIUrl":"https://doi.org/10.1093/JSSAM/SMAB011","url":null,"abstract":"\u0000 Survey researchers are often confronted with the question of how long to set the length of the field period. Longer fielding time might lead to greater participation yet requires survey managers to devote more of their time to data collection efforts. With the aim of facilitating the decision about the length of the field period, we investigated whether a longer fielding time reduces the risk of nonresponse bias to judge whether field periods can be ended earlier without endangering the performance of the survey. By using data from six waves of a probability-based mixed-mode (online and mail) panel of the German population, we analyzed whether the risk of nonresponse bias decreases over the field period by investigating how day-by-day coefficients of variation develop during the field period. We then determined the optimal cut-off points for each mode after which data collection can be terminated without increasing the risk of nonresponse bias and found that the optimal cut-off points differ by mode. Our study complements prior research by shifting the perspective in the investigation of the risk of nonresponse bias to panel data as well as to mixed-mode surveys, in particular. Our proposed method of using coefficients of variation to assess whether the risk of nonresponse bias decreases significantly with each additional day of fieldwork can aid survey practitioners in finding the optimal field period for their mixed-mode surveys.","PeriodicalId":17146,"journal":{"name":"Journal of Survey Statistics and Methodology","volume":" ","pages":""},"PeriodicalIF":2.1,"publicationDate":"2021-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1093/JSSAM/SMAB011","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45924091","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
期刊
Journal of Survey Statistics and Methodology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1