Journal of Survey Statistics and Methodology最新文献

英文中文

A SEMIPARAMETRIC MULTIPLE IMPUTATION APPROACH TO FULLY SYNTHETIC DATA FOR COMPLEX SURVEYS. 针对复杂调查的全合成数据的半参数多重估算方法。

IF 2.1 4区数学 Q2 SOCIAL SCIENCES, MATHEMATICAL METHODS

Journal of Survey Statistics and Methodology

Pub Date : 2022-06-01 Epub Date: 2022-05-25 DOI: 10.1093/jssam/smac016

Mandi Yu, Yulei He, Trivellore E Raghunathan

Data synthesis is an effective statistical approach for reducing data disclosure risk. Generating fully synthetic data might minimize such risk, but its modeling and application can be difficult for data from large, complex surveys. This article extended the two-stage imputation to simultaneously impute item missing values and generate fully synthetic data. A new combining rule for making inferences using data generated in this manner was developed. Two semiparametric missing data imputation models were adapted to generate fully synthetic data for skewed continuous variable and sparse binary variable, respectively. The proposed approach was evaluated using simulated data and real longitudinal data from the Health and Retirement Study. The proposed approach was also compared with two existing synthesis approaches: (1) parametric regressions models as implemented in IVEware; and (2) nonparametric Classification and Regression Trees as implemented in synthpop package for R using real data. The results show that high data utility is maintained for a wide variety of descriptive and model-based statistics using the proposed strategy. The proposed strategy also performs better than existing methods for sophisticated analyses such as factor analysis.

数据合成是降低数据披露风险的有效统计方法。生成全合成数据可以最大限度地降低这种风险，但其建模和应用对于来自大型复杂调查的数据来说可能比较困难。本文对两阶段估算进行了扩展，以同时估算项目缺失值和生成全合成数据。文章开发了一种新的组合规则，用于使用以这种方式生成的数据进行推断。对两个半参数缺失数据估算模型进行了调整，以分别生成偏斜连续变量和稀疏二元变量的全合成数据。使用模拟数据和健康与退休研究的真实纵向数据对所提出的方法进行了评估。此外，还将提出的方法与现有的两种合成方法进行了比较：(1) 在 IVEware 中实现的参数回归模型；(2) 使用真实数据在 R 的 synthpop 软件包中实现的非参数分类和回归树。结果表明，使用所提出的策略，各种描述性和基于模型的统计数据都能保持较高的数据效用。在进行因子分析等复杂分析时，拟议策略的表现也优于现有方法。

{"title":"A SEMIPARAMETRIC MULTIPLE IMPUTATION APPROACH TO FULLY SYNTHETIC DATA FOR COMPLEX SURVEYS.","authors":"Mandi Yu, Yulei He, Trivellore E Raghunathan","doi":"10.1093/jssam/smac016","DOIUrl":"10.1093/jssam/smac016","url":null,"abstract":"Data synthesis is an effective statistical approach for reducing data disclosure risk. Generating fully synthetic data might minimize such risk, but its modeling and application can be difficult for data from large, complex surveys. This article extended the two-stage imputation to simultaneously impute item missing values and generate fully synthetic data. A new combining rule for making inferences using data generated in this manner was developed. Two semiparametric missing data imputation models were adapted to generate fully synthetic data for skewed continuous variable and sparse binary variable, respectively. The proposed approach was evaluated using simulated data and real longitudinal data from the Health and Retirement Study. The proposed approach was also compared with two existing synthesis approaches: (1) parametric regressions models as implemented in IVEware; and (2) nonparametric Classification and Regression Trees as implemented in synthpop package for R using real data. The results show that high data utility is maintained for a wide variety of descriptive and model-based statistics using the proposed strategy. The proposed strategy also performs better than existing methods for sophisticated analyses such as factor analysis.","PeriodicalId":17146,"journal":{"name":"Journal of Survey Statistics and Methodology","volume":"1 1","pages":"618-641"},"PeriodicalIF":2.1,"publicationDate":"2022-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11044899/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"61006847","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

INTERVIEWER EFFECTS IN LIVE VIDEO AND PRERECORDED VIDEO INTERVIEWING. 面试官在现场视频和预先录制的视频面试中的效果。

IF 2.1 4区数学 Q2 SOCIAL SCIENCES, MATHEMATICAL METHODS

Journal of Survey Statistics and Methodology

Pub Date : 2022-04-01 DOI: 10.1093/jssam/smab040

Brady T West, Ai Rene Ong, Frederick G Conrad, Michael F Schober, Kallan M Larsen, Andrew L Hupp

Live video (LV) communication tools (e.g., Zoom) have the potential to provide survey researchers with many of the benefits of in-person interviewing, while also greatly reducing data collection costs, given that interviewers do not need to travel and make in-person visits to sampled households. The COVID-19 pandemic has exposed the vulnerability of in-person data collection to public health crises, forcing survey researchers to explore remote data collection modes-such as LV interviewing-that seem likely to yield high-quality data without in-person interaction. Given the potential benefits of these technologies, the operational and methodological aspects of video interviewing have started to receive research attention from survey methodologists. Although it is remote, video interviewing still involves respondent-interviewer interaction that introduces the possibility of interviewer effects. No research to date has evaluated this potential threat to the quality of the data collected in video interviews. This research note presents an evaluation of interviewer effects in a recent experimental study of alternative approaches to video interviewing including both LV interviewing and the use of prerecorded videos of the same interviewers asking questions embedded in a web survey ("prerecorded video" interviewing). We find little evidence of significant interviewer effects when using these two approaches, which is a promising result. We also find that when interviewer effects were present, they tended to be slightly larger in the LV approach as would be expected in light of its being an interactive approach. We conclude with a discussion of the implications of these findings for future research using video interviewing.

实时视频(LV)通信工具(如Zoom)有可能为调查研究人员提供面对面访谈的许多好处，同时也大大降低了数据收集成本，因为采访者不需要旅行和亲自访问抽样家庭。COVID-19大流行暴露了面对面数据收集在公共卫生危机中的脆弱性，迫使调查研究人员探索远程数据收集模式，如LV访谈，这种模式似乎可以在没有面对面互动的情况下获得高质量的数据。鉴于这些技术的潜在好处，录像访谈的操作和方法方面已开始受到调查方法学家的研究注意。虽然是远程的，但视频访谈仍然涉及到受访者与采访者的互动，这就引入了采访者效应的可能性。到目前为止，还没有研究评估过这种对视频采访中收集的数据质量的潜在威胁。本研究报告在最近的一项实验研究中对访谈者的效果进行了评估，该实验研究采用了视频访谈的替代方法，包括LV访谈和使用预先录制的同一访谈者在网络调查中提问的视频(“预先录制的视频”访谈)。在使用这两种方法时，我们发现很少有证据表明访谈者效应显著，这是一个有希望的结果。我们还发现，当面试官效应存在时，他们倾向于在LV方法中略大，因为它是一种互动方法。最后，我们讨论了这些发现对未来视频访谈研究的影响。

{"title":"INTERVIEWER EFFECTS IN LIVE VIDEO AND PRERECORDED VIDEO INTERVIEWING.","authors":"Brady T West, Ai Rene Ong, Frederick G Conrad, Michael F Schober, Kallan M Larsen, Andrew L Hupp","doi":"10.1093/jssam/smab040","DOIUrl":"https://doi.org/10.1093/jssam/smab040","url":null,"abstract":"Live video (LV) communication tools (e.g., Zoom) have the potential to provide survey researchers with many of the benefits of in-person interviewing, while also greatly reducing data collection costs, given that interviewers do not need to travel and make in-person visits to sampled households. The COVID-19 pandemic has exposed the vulnerability of in-person data collection to public health crises, forcing survey researchers to explore remote data collection modes-such as LV interviewing-that seem likely to yield high-quality data without in-person interaction. Given the potential benefits of these technologies, the operational and methodological aspects of video interviewing have started to receive research attention from survey methodologists. Although it is remote, video interviewing still involves respondent-interviewer interaction that introduces the possibility of interviewer effects. No research to date has evaluated this potential threat to the quality of the data collected in video interviews. This research note presents an evaluation of interviewer effects in a recent experimental study of alternative approaches to video interviewing including both LV interviewing and the use of prerecorded videos of the same interviewers asking questions embedded in a web survey (\"prerecorded video\" interviewing). We find little evidence of significant interviewer effects when using these two approaches, which is a promising result. We also find that when interviewer effects were present, they tended to be slightly larger in the LV approach as would be expected in light of its being an interactive approach. We conclude with a discussion of the implications of these findings for future research using video interviewing.","PeriodicalId":17146,"journal":{"name":"Journal of Survey Statistics and Methodology","volume":"10 2","pages":"317-336"},"PeriodicalIF":2.1,"publicationDate":"2022-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8690284/pdf/smab040.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9793318","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 6

On The Robustness Of Respondent-Driven Sampling Estimators To Measurement Error. 调查对象驱动抽样估计器对测量误差的鲁棒性研究。

IF 2.1 4区数学 Q2 SOCIAL SCIENCES, MATHEMATICAL METHODS

Journal of Survey Statistics and Methodology

Pub Date : 2022-04-01 DOI: 10.1093/jssam/smab056

Ian E Fellows

Respondent-driven sampling (RDS) is a popular method of conducting surveys in hard to reach populations where strong assumptions are required in order to make valid statistical inferences. In this paper we investigate the assumption that network degrees are measured accurately by the RDS survey and find that there is likely significant measurement error present in typical studies. We prove that most RDS estimators remain consistent under an imperfect measurement model with little to no added bias, though the variance of the estimators does increase.

被调查者驱动抽样(RDS)是在难以接触到的人群中进行调查的一种流行方法，这些人群需要强有力的假设才能做出有效的统计推断。在本文中，我们研究了RDS调查准确测量网络度的假设，并发现典型研究可能存在显着的测量误差。我们证明了大多数RDS估计量在不完善的测量模型下保持一致，几乎没有额外的偏差，尽管估计量的方差确实增加了。

引用次数: 2

On The Robustness Of Respondent-Driven Sampling Estimators To Measurement Error. 论被调查者驱动的抽样估计对测量误差的稳健性。

IF 2.1 4区数学 Q2 SOCIAL SCIENCES, MATHEMATICAL METHODS

Journal of Survey Statistics and Methodology

Pub Date : 2022-03-11 DOI: 10.1093/jssam/smac004

Ian E. Fellows

受访者驱动抽样（RDS）是一种在难以接触的人群中进行调查的流行方法，在这种人群中，需要强有力的假设才能做出有效的统计推断。在本文中，我们研究了RDS调查准确测量网络度的假设，并发现在典型研究中可能存在显著的测量误差。我们证明了大多数RDS估计量在不完美的测量模型下保持一致，几乎没有增加的偏差，尽管估计量的方差确实增加了。

引用次数: 2

Panel Conditioning in a German Probability-Based Longitudinal Study: A Comparison of Respondents with Different Levels of Survey Experience 德国基于概率的纵向研究中的面板调节:不同调查经验水平的受访者的比较

IF 2.1 4区数学 Q2 SOCIAL SCIENCES, MATHEMATICAL METHODS

Journal of Survey Statistics and Methodology

Pub Date : 2022-02-22 DOI: 10.31235/osf.io/vd5xp

Fabienne Kraemer, Henning Silber, Bella Struminskaya, M. Bošnjak, J. Kossmann, Bernd Weiss

Learning effects due to repeated interviewing, which are referred to as panel conditioning, are a major threat to response quality in later waves of a panel study. Up to date, research has not provided a clear picture regarding the circumstances, mechanisms, and dimensions of potential panel conditioning effects. Especially the effects of conditioning frequency, that is, different levels of experience within a panel, on response quality are underexplored. Against this background, we investigated the effects of panel conditioning by using data from the GESIS Panel, a German mixed-mode probability-based panel study. Using two refreshment samples, we compared three panel cohorts with differing levels of experience with respect to several response quality indicators related to the mechanisms of reflection, satisficing, and social desirability. Overall, we find evidence for both negative (i.e., disadvantageous for response quality) as well as positive (i.e., advantageous for response quality) panel conditioning. Highly experienced respondents were more likely to satisfice by selecting mid-point responses or by speeding through the questionnaire. They also had a higher probability of refusing to answer sensitive questions than less experienced panel members. However, more experienced respondents were also more likely to optimize the response processes by needing less time compared to panelists with lower experience levels (when controlling for speeding). In contrast, we did not find significant differences with respect to the number of “don’t know” responses, non-differentiation, the selection of first response categories, and the number of non-triggered filter questions. Of the observed differences, speeding showed the highest magnitude with an average increase of 5.9 percentage points for highly experienced panel members compared to low experienced panelists.

重复访谈产生的学习效应被称为小组条件反射，是对小组研究后期反应质量的主要威胁。到目前为止，研究还没有提供一个关于潜在面板条件作用的环境、机制和维度的清晰画面。特别是调节频率，即一个面板内不同水平的经验，对响应质量的影响还没有得到充分的研究。在这种背景下，我们使用GESIS面板的数据研究了面板条件的影响，GESIS面板是一项基于德国混合模式概率的面板研究。使用两个刷新样本，我们比较了三个具有不同经验水平的小组队列的几个反应质量指标，这些指标与反思、满足和社会期望的机制有关。总的来说，我们发现了负面（即对响应质量不利）和正面（即对反应质量有利）面板条件反射的证据。经验丰富的受访者更有可能通过选择中点回答或快速完成问卷来获得满意。与经验不足的小组成员相比，他们拒绝回答敏感问题的概率也更高。然而，与经验水平较低的小组成员（在控制超速时）相比，经验丰富的受访者也更有可能通过更少的时间来优化响应过程。相比之下，我们在“不知道”回答的数量、非差异化、第一回答类别的选择和非触发过滤问题的数量方面没有发现显著差异。在观察到的差异中，经验丰富的小组成员与经验不足的小组成员相比，超速表现出最高的幅度，平均增加5.9个百分点。

{"title":"Panel Conditioning in a German Probability-Based Longitudinal Study: A Comparison of Respondents with Different Levels of Survey Experience","authors":"Fabienne Kraemer, Henning Silber, Bella Struminskaya, M. Bošnjak, J. Kossmann, Bernd Weiss","doi":"10.31235/osf.io/vd5xp","DOIUrl":"https://doi.org/10.31235/osf.io/vd5xp","url":null,"abstract":"Learning effects due to repeated interviewing, which are referred to as panel conditioning, are a major threat to response quality in later waves of a panel study. Up to date, research has not provided a clear picture regarding the circumstances, mechanisms, and dimensions of potential panel conditioning effects. Especially the effects of conditioning frequency, that is, different levels of experience within a panel, on response quality are underexplored. Against this background, we investigated the effects of panel conditioning by using data from the GESIS Panel, a German mixed-mode probability-based panel study. Using two refreshment samples, we compared three panel cohorts with differing levels of experience with respect to several response quality indicators related to the mechanisms of reflection, satisficing, and social desirability. Overall, we find evidence for both negative (i.e., disadvantageous for response quality) as well as positive (i.e., advantageous for response quality) panel conditioning. Highly experienced respondents were more likely to satisfice by selecting mid-point responses or by speeding through the questionnaire. They also had a higher probability of refusing to answer sensitive questions than less experienced panel members. However, more experienced respondents were also more likely to optimize the response processes by needing less time compared to panelists with lower experience levels (when controlling for speeding). In contrast, we did not find significant differences with respect to the number of “don’t know” responses, non-differentiation, the selection of first response categories, and the number of non-triggered filter questions. Of the observed differences, speeding showed the highest magnitude with an average increase of 5.9 percentage points for highly experienced panel members compared to low experienced panelists.","PeriodicalId":17146,"journal":{"name":"Journal of Survey Statistics and Methodology","volume":" ","pages":""},"PeriodicalIF":2.1,"publicationDate":"2022-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44830407","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Inference from Nonrandom Samples Using Bayesian Machine Learning. 使用贝叶斯机器学习从非随机样本推断。

IF 2.1 4区数学 Q2 SOCIAL SCIENCES, MATHEMATICAL METHODS

Journal of Survey Statistics and Methodology

Pub Date : 2022-01-20 eCollection Date: 2023-04-01 DOI: 10.1093/jssam/smab049

Yutao Liu, Andrew Gelman, Qixuan Chen

We consider inference from nonrandom samples in data-rich settings where high-dimensional auxiliary information is available both in the sample and the target population, with survey inference being a special case. We propose a regularized prediction approach that predicts the outcomes in the population using a large number of auxiliary variables such that the ignorability assumption is reasonable and the Bayesian framework is straightforward for quantification of uncertainty. Besides the auxiliary variables, we also extend the approach by estimating the propensity score for a unit to be included in the sample and also including it as a predictor in the machine learning models. We find in simulation studies that the regularized predictions using soft Bayesian additive regression trees yield valid inference for the population means and coverage rates close to the nominal levels. We demonstrate the application of the proposed methods using two different real data applications, one in a survey and one in an epidemiologic study.

我们考虑在数据丰富的环境中从非随机样本进行推断，其中样本和目标人群中都有高维辅助信息，调查推断是一种特殊情况。我们提出了一种正则化预测方法，该方法使用大量辅助变量来预测人群中的结果，使得可忽略性假设是合理的，并且贝叶斯框架对于不确定性的量化是直接的。除了辅助变量之外，我们还通过估计样本中包含的单元的倾向得分来扩展该方法，并将其作为机器学习模型中的预测器。我们在模拟研究中发现，使用软贝叶斯加性回归树的正则化预测对接近标称水平的总体均值和覆盖率产生了有效的推断。我们使用两种不同的真实数据应用，一种在调查中，另一种在流行病学研究中，展示了所提出方法的应用。

引用次数: 7

OUP accepted manuscript OUP接受稿件

IF 2.1 4区数学 Q2 SOCIAL SCIENCES, MATHEMATICAL METHODS

Journal of Survey Statistics and Methodology

Pub Date : 2022-01-01 DOI: 10.1093/jssam/smac009

引用次数: 0

OUP accepted manuscript OUP接受稿件

IF 2.1 4区数学 Q2 SOCIAL SCIENCES, MATHEMATICAL METHODS

Journal of Survey Statistics and Methodology

Pub Date : 2022-01-01 DOI: 10.1093/jssam/smac002

引用次数: 1

OUP accepted manuscript OUP接受稿件

IF 2.1 4区数学 Q2 SOCIAL SCIENCES, MATHEMATICAL METHODS

Journal of Survey Statistics and Methodology

Pub Date : 2022-01-01 DOI: 10.1093/jssam/smac003

引用次数: 0

OUP accepted manuscript OUP接受稿件

IF 2.1 4区数学 Q2 SOCIAL SCIENCES, MATHEMATICAL METHODS

Journal of Survey Statistics and Methodology

Pub Date : 2022-01-01 DOI: 10.1093/jssam/smac015

引用次数: 0

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Journal of Survey Statistics and Methodology

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀