首页 > 最新文献

Methodology: European Journal of Research Methods for The Behavioral and Social Sciences最新文献

英文 中文
Looking for a Consensus in the Discussion About the Concept of Validity: A Delphi Study 在效度概念讨论中寻求共识:德尔菲研究
IF 3.1 3区 心理学 Q2 PSYCHOLOGY, MATHEMATICAL Pub Date : 2018-10-01 DOI: 10.1027/1614-2241/a000157
Sandra Liliana Camargo, A. Herrera, A. Traynor
The purpose of this work is to identify issues regarding the concept of validity in educational and psychological testing on which there is, and is not, consensus among experts, using an online Delphi study. Although many theorists have expressed their views about the proper characterization of validity, it is important to systematically collect ideas about each aspect of validity. Study participants were recognized academic experts who have led the discussion on the concept of validity in publications and academic meetings in Europe and the United States during recent decades. The Delphi study’s results identify some aspects of the concept of validity, the Standards (2014), and validation about which experts are at an impasse, and others about which consensus can be reached. Based on our findings, some recommendations to advance the conceptualization of validity are offered.
这项工作的目的是确定关于教育和心理测试的有效性概念的问题,其中有,并不是,专家之间的共识,使用在线德尔菲研究。尽管许多理论家都对有效性的正确表征表达了自己的观点,但系统地收集关于有效性各个方面的观点是很重要的。研究参与者是公认的学术专家,他们在近几十年来在欧洲和美国的出版物和学术会议上领导了对效度概念的讨论。德尔菲研究的结果确定了效度概念、标准(2014)和验证的某些方面,即哪些专家处于僵局,哪些可以达成共识。在此基础上,提出了一些建议,以促进效度概念的概念化。
{"title":"Looking for a Consensus in the Discussion About the Concept of Validity: A Delphi Study","authors":"Sandra Liliana Camargo, A. Herrera, A. Traynor","doi":"10.1027/1614-2241/a000157","DOIUrl":"https://doi.org/10.1027/1614-2241/a000157","url":null,"abstract":"The purpose of this work is to identify issues regarding the concept of validity in educational and psychological testing on which there is, and is not, consensus among experts, using an online Delphi study. Although many theorists have expressed their views about the proper characterization of validity, it is important to systematically collect ideas about each aspect of validity. Study participants were recognized academic experts who have led the discussion on the concept of validity in publications and academic meetings in Europe and the United States during recent decades. The Delphi study’s results identify some aspects of the concept of validity, the Standards (2014), and validation about which experts are at an impasse, and others about which consensus can be reached. Based on our findings, some recommendations to advance the conceptualization of validity are offered.","PeriodicalId":18476,"journal":{"name":"Methodology: European Journal of Research Methods for The Behavioral and Social Sciences","volume":"14 1","pages":"146–155"},"PeriodicalIF":3.1,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44361970","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Comparing the Performance of Agree/Disagree and Item-Specific Questions Across PCs and Smartphones 比较PC和智能手机中同意/不同意和项目特定问题的表现
IF 3.1 3区 心理学 Q2 PSYCHOLOGY, MATHEMATICAL Pub Date : 2018-07-01 DOI: 10.1027/1614-2241/a000151
Jan Karem Höhne, M. Revilla, Timo Lenzner
The use of agree/disagree (A/D) questions is a common technique to measure attitudes. For instance, this question format is employed frequently in the Eurobarometer and International Social Survey Programme (ISSP). Theoretical considerations, however, suggest that A/D questions require a complex processing. Therefore, many survey researchers have recommended the use of item-specific (IS) questions, since they seem to be less burdensome. Parallel to this methodological discussion is the discussion around the use of mobile devices for responding to surveys. However, until now, evidence has been lacking as to whether the use of mobile devices for survey response affects the performance of established question formats. In this study, implemented in the Netquest panel in Spain (N = 1,476), we investigated the cognitive effort and response quality associated with A/D and IS questions across PCs and smartphones. For this purpose, we applied a split-ballot design defined by device type and question format. Our analyses revealed longer response times for IS questions than A/D questions, irrespective of the device type and scale length. Also, the IS questions produced better response quality than their A/D counterparts. All in all, the findings indicate a more conscientious response to IS questions compared to A/D questions.
使用同意/不同意(A/D)问题是衡量态度的常用技巧。例如,欧洲晴雨表和国际社会调查方案经常采用这种问题格式。然而,理论上的考虑表明,A/D问题需要复杂的处理。因此,许多调查研究人员建议使用项目特定(IS)问题,因为它们似乎不那么麻烦。与这一方法论讨论平行的是关于使用移动设备回应调查的讨论。然而,到目前为止,还缺乏证据表明使用移动设备进行调查回复是否会影响既定问题格式的性能。在这项由西班牙Netquest小组实施的研究中(N=1476),我们调查了PC和智能手机中与A/D和IS问题相关的认知努力和反应质量。为此,我们采用了由设备类型和问题格式定义的拆分投票设计。我们的分析显示,无论设备类型和量表长度如何,IS问题的回答时间都比A/D问题长。此外,IS问题比A/D问题的回答质量更好。总之,调查结果表明,与a/D问题相比,对IS问题的回答更加认真。
{"title":"Comparing the Performance of Agree/Disagree and Item-Specific Questions Across PCs and Smartphones","authors":"Jan Karem Höhne, M. Revilla, Timo Lenzner","doi":"10.1027/1614-2241/a000151","DOIUrl":"https://doi.org/10.1027/1614-2241/a000151","url":null,"abstract":"The use of agree/disagree (A/D) questions is a common technique to measure attitudes. For instance, this question format is employed frequently in the Eurobarometer and International Social Survey Programme (ISSP). Theoretical considerations, however, suggest that A/D questions require a complex processing. Therefore, many survey researchers have recommended the use of item-specific (IS) questions, since they seem to be less burdensome. Parallel to this methodological discussion is the discussion around the use of mobile devices for responding to surveys. However, until now, evidence has been lacking as to whether the use of mobile devices for survey response affects the performance of established question formats. In this study, implemented in the Netquest panel in Spain (N = 1,476), we investigated the cognitive effort and response quality associated with A/D and IS questions across PCs and smartphones. For this purpose, we applied a split-ballot design defined by device type and question format. Our analyses revealed longer response times for IS questions than A/D questions, irrespective of the device type and scale length. Also, the IS questions produced better response quality than their A/D counterparts. All in all, the findings indicate a more conscientious response to IS questions compared to A/D questions.","PeriodicalId":18476,"journal":{"name":"Methodology: European Journal of Research Methods for The Behavioral and Social Sciences","volume":"14 1","pages":"109–118"},"PeriodicalIF":3.1,"publicationDate":"2018-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48970377","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
Modeling Intraindividual Variability in Three-Level Multilevel Models 三层次多层次模型中个体内变异性的建模
IF 3.1 3区 心理学 Q2 PSYCHOLOGY, MATHEMATICAL Pub Date : 2018-07-01 DOI: 10.1027/1614-2241/a000150
S. Nestler, K. Geukes, M. Back
The mixed-effects location scale model is an extension of a multilevel model for longitudinal data. It allows covariates to affect both the within-subject variance and the between-subject variance (i.e., the intercept variance) beyond their influence on the means. Typically, the model is applied to two-level data (e.g., the repeated measurements of persons), although researchers are often faced with three-level data (e.g., the repeated measurements of persons within specific situations). Here, we describe an extension of the two-level mixed-effects location scale model to such three-level data. Furthermore, we show how the suggested model can be estimated with Bayesian software, and we present the results of a small simulation study that was conducted to investigate the statistical properties of the suggested approach. Finally, we illustrate the approach by presenting an example from a psychological study that employed ecological momentary assessment.
混合效应位置尺度模型是纵向数据多层次模型的扩展。它允许协变量影响主体内方差和主体间方差(即截距方差)超出其对均值的影响。通常,该模型适用于两层数据(例如,对人的重复测量),尽管研究人员经常面临三层数据(例如,特定情况下对人的重复测量)。在这里,我们描述了一个扩展的两级混合效应的位置尺度模型到这样的三级数据。此外,我们展示了如何使用贝叶斯软件来估计建议的模型,并提出了一项小型模拟研究的结果,该研究是为了调查建议方法的统计特性而进行的。最后,我们通过提出一个采用生态瞬时评估的心理学研究的例子来说明这种方法。
{"title":"Modeling Intraindividual Variability in Three-Level Multilevel Models","authors":"S. Nestler, K. Geukes, M. Back","doi":"10.1027/1614-2241/a000150","DOIUrl":"https://doi.org/10.1027/1614-2241/a000150","url":null,"abstract":"The mixed-effects location scale model is an extension of a multilevel model for longitudinal data. It allows covariates to affect both the within-subject variance and the between-subject variance (i.e., the intercept variance) beyond their influence on the means. Typically, the model is applied to two-level data (e.g., the repeated measurements of persons), although researchers are often faced with three-level data (e.g., the repeated measurements of persons within specific situations). Here, we describe an extension of the two-level mixed-effects location scale model to such three-level data. Furthermore, we show how the suggested model can be estimated with Bayesian software, and we present the results of a small simulation study that was conducted to investigate the statistical properties of the suggested approach. Finally, we illustrate the approach by presenting an example from a psychological study that employed ecological momentary assessment.","PeriodicalId":18476,"journal":{"name":"Methodology: European Journal of Research Methods for The Behavioral and Social Sciences","volume":"34 8","pages":"95–108"},"PeriodicalIF":3.1,"publicationDate":"2018-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41266862","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
The Consequences of Varying Measurement Occasions in Discrete-Time Survival Analysis 离散时间生存分析中测量场合变化的后果
IF 3.1 3区 心理学 Q2 PSYCHOLOGY, MATHEMATICAL Pub Date : 2018-06-21 DOI: 10.1027/1614-2241/a000145
M. Moerbeek, L. Hesen
In a discrete-time survival model the occurrence of some event is measured by the end of each time interval. In practice it is not always possible to measure all subjects at the same point in time. In this study the consequences of varying measurement occasions are investigated by means of a simulation study and the analysis of data from an empirical study. The results of the simulation study suggest that the effects of varying measurement occasions are negligible, at least for the scenarios that were covered in the simulation. The empirical example shows varying measurement occasions have minor effects on parameter estimates, standard errors, and significance levels.
在离散时间生存模型中,某些事件的发生是通过每个时间间隔的结束来测量的。在实践中,不可能总是在同一时间点测量所有科目。在这项研究中,通过模拟研究和实证研究数据分析,研究了不同测量场合的后果。模拟研究的结果表明,不同测量时机的影响可以忽略不计,至少对于模拟中涵盖的场景来说是如此。实证例子表明,不同的测量场合对参数估计、标准误差和显著性水平的影响很小。
{"title":"The Consequences of Varying Measurement Occasions in Discrete-Time Survival Analysis","authors":"M. Moerbeek, L. Hesen","doi":"10.1027/1614-2241/a000145","DOIUrl":"https://doi.org/10.1027/1614-2241/a000145","url":null,"abstract":"In a discrete-time survival model the occurrence of some event is measured by the end of each time interval. In practice it is not always possible to measure all subjects at the same point in time. In this study the consequences of varying measurement occasions are investigated by means of a simulation study and the analysis of data from an empirical study. The results of the simulation study suggest that the effects of varying measurement occasions are negligible, at least for the scenarios that were covered in the simulation. The empirical example shows varying measurement occasions have minor effects on parameter estimates, standard errors, and significance levels.","PeriodicalId":18476,"journal":{"name":"Methodology: European Journal of Research Methods for The Behavioral and Social Sciences","volume":"14 1","pages":"45–55"},"PeriodicalIF":3.1,"publicationDate":"2018-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47659661","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Agreement on the Classification of Latent Class Membership Between Different Identification Constraint Approaches in the Mixture Rasch Model 混合Rasch模型中不同识别约束方法之间潜在类隶属度分类的一致性
IF 3.1 3区 心理学 Q2 PSYCHOLOGY, MATHEMATICAL Pub Date : 2018-06-21 DOI: 10.1027/1614-2241/a000148
Yi-Jhen Wu, Insu Paek
When using the mixture Rasch model, the model identification constraints are either to set the equal means for all classes in the assumed normal ability distributions (equal ability mean constraint in short), or to set the sum of item difficulties to be zero for each class. In real data analysis, however, both constraints are not always sufficient to establish a common scale across latent classes unless some items are specified as anchor items in the estimation. If these two conventional constraint approaches recover the class membership as good as the anchor item constraint approach, the conventional constraint approaches may be considered useful for the purpose of class membership classification. This study investigated agreement on class membership between one conventional constraint (the equal ability mean) and the anchor item constraint approaches. Results showed high agreement between these two constraint approaches, indicating that the conventional constraint of the equal mean ability approach may be used to recover the latent class membership although item profiles are not correctly estimated across latent classes.
当使用混合Rasch模型时,模型识别约束要么为假设的正态能力分布中所有类别设置相等的均值(简称为等均值约束),要么将每个类别的项目难度之和设置为零。然而,在实际的数据分析中,这两个约束并不总是足以在潜在类别之间建立一个共同的尺度,除非在估计中指定一些项目作为锚定项目。如果这两种传统约束方法能像锚项约束方法那样很好地恢复类隶属度,那么传统约束方法对于类隶属度分类是有用的。本研究考察了一种传统约束(能力均值)与锚项目约束方法对班级成员的一致性。结果表明,这两种约束方法具有较高的一致性,表明传统的等平均能力约束方法可以用于恢复潜在类别的隶属度,尽管项目特征在潜在类别之间不能正确估计。
{"title":"Agreement on the Classification of Latent Class Membership Between Different Identification Constraint Approaches in the Mixture Rasch Model","authors":"Yi-Jhen Wu, Insu Paek","doi":"10.1027/1614-2241/a000148","DOIUrl":"https://doi.org/10.1027/1614-2241/a000148","url":null,"abstract":"When using the mixture Rasch model, the model identification constraints are either to set the equal means for all classes in the assumed normal ability distributions (equal ability mean constraint in short), or to set the sum of item difficulties to be zero for each class. In real data analysis, however, both constraints are not always sufficient to establish a common scale across latent classes unless some items are specified as anchor items in the estimation. If these two conventional constraint approaches recover the class membership as good as the anchor item constraint approach, the conventional constraint approaches may be considered useful for the purpose of class membership classification. This study investigated agreement on class membership between one conventional constraint (the equal ability mean) and the anchor item constraint approaches. Results showed high agreement between these two constraint approaches, indicating that the conventional constraint of the equal mean ability approach may be used to recover the latent class membership although item profiles are not correctly estimated across latent classes.","PeriodicalId":18476,"journal":{"name":"Methodology: European Journal of Research Methods for The Behavioral and Social Sciences","volume":"14 1","pages":"82–93"},"PeriodicalIF":3.1,"publicationDate":"2018-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43688635","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Assessing a Bayesian Embedding Approach to Circular Regression Models 评估循环回归模型的贝叶斯嵌入方法
IF 3.1 3区 心理学 Q2 PSYCHOLOGY, MATHEMATICAL Pub Date : 2018-06-21 DOI: 10.1027/1614-2241/a000147
J. Cremers, T. Mainhard, I. Klugkist
Circular data is different from linear data and its analysis also requires methods different from conventional methods. In this study a Bayesian embedding approach to estimating circular regression models is investigated, by means of simulation studies, in terms of performance, efficiency, and flexibility. A new Markov chain Monte Carlo (MCMC) sampling method is proposed and contrasted to an existing method. An empirical example of a regression model predicting teachers’ scores on the interpersonal circumplex will be used throughout. Performance and efficiency are better for the newly proposed sampler and reasonable to good in most situations. Furthermore, the method in general is deemed very flexible. Additional research should be done that provides an overview of what circular data looks like in practice, investigates the interpretation of the circular effects and examines how we might conduct a way of hypothesis testing or model checking for the embedding approach.
圆形数据不同于线性数据,其分析也需要不同于传统方法的方法。在这项研究中,通过模拟研究,从性能、效率和灵活性方面研究了一种估计循环回归模型的贝叶斯嵌入方法。提出了一种新的马尔可夫链蒙特卡罗(MCMC)采样方法,并与现有方法进行了对比。我们将使用一个回归模型的实证例子来预测教师在人际交往中的得分。新提出的采样器的性能和效率更好,在大多数情况下都是合理的。此外,一般认为这种方法非常灵活。应该进行更多的研究,概述循环数据在实践中的样子,调查循环效应的解释,并研究我们如何对嵌入方法进行假设测试或模型检查。
{"title":"Assessing a Bayesian Embedding Approach to Circular Regression Models","authors":"J. Cremers, T. Mainhard, I. Klugkist","doi":"10.1027/1614-2241/a000147","DOIUrl":"https://doi.org/10.1027/1614-2241/a000147","url":null,"abstract":"Circular data is different from linear data and its analysis also requires methods different from conventional methods. In this study a Bayesian embedding approach to estimating circular regression models is investigated, by means of simulation studies, in terms of performance, efficiency, and flexibility. A new Markov chain Monte Carlo (MCMC) sampling method is proposed and contrasted to an existing method. An empirical example of a regression model predicting teachers’ scores on the interpersonal circumplex will be used throughout. Performance and efficiency are better for the newly proposed sampler and reasonable to good in most situations. Furthermore, the method in general is deemed very flexible. Additional research should be done that provides an overview of what circular data looks like in practice, investigates the interpretation of the circular effects and examines how we might conduct a way of hypothesis testing or model checking for the embedding approach.","PeriodicalId":18476,"journal":{"name":"Methodology: European Journal of Research Methods for The Behavioral and Social Sciences","volume":"14 1","pages":"69–81"},"PeriodicalIF":3.1,"publicationDate":"2018-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44890967","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Bayesian Latent Class Models for the Multiple Imputation of Categorical Data 范畴数据多重输入的贝叶斯潜在类模型
IF 3.1 3区 心理学 Q2 PSYCHOLOGY, MATHEMATICAL Pub Date : 2018-06-21 DOI: 10.1027/1614-2241/a000146
D. Vidotto, J. Vermunt, K. Van Deun
Latent class analysis has been recently proposed for the multiple imputation (MI) of missing categorical data, using either a standard frequentist approach or a nonparametric Bayesian model called Dirichlet process mixture of multinomial distributions (DPMM). The main advantage of using a latent class model for multiple imputation is that it is very flexible in the sense that it can capture complex relationships in the data given that the number of latent classes is large enough. However, the two existing approaches also have certain disadvantages. The frequentist approach is computationally demanding because it requires estimating many LC models: first models with different number of classes should be estimated to determine the required number of classes and subsequently the selected model is reestimated for multiple bootstrap samples to take into account parameter uncertainty during the imputation stage. Whereas the Bayesian Dirichlet process models perform the model selection and the handling of the parameter uncertainty automatically, the disadvantage of this method is that it tends to use a too small number of clusters during the Gibbs sampling, leading to an underfitting model yielding invalid imputations. In this paper, we propose an alternative approach which combined the strengths of the two existing approaches; that is, we use the Bayesian standard latent class model as an imputation model. We show how model selection can be performed prior to the imputation step using a single run of the Gibbs sampler and, moreover, show how underfitting is prevented by using large values for the hyperparameters of the mixture weights. The results of two simulation studies and one real-data study indicate that with a proper setting of the prior distributions, the Bayesian latent class model yields valid imputations and outperforms competing methods.
最近,有人提出了潜在类分析,用于缺失分类数据的多重插补(MI),使用标准的频率论方法或称为多项分布的狄利克雷过程混合(DPMM)的非参数贝叶斯模型。使用潜在类别模型进行多重插补的主要优点是,它非常灵活,因为它可以在潜在类别数量足够大的情况下捕捉数据中的复杂关系。然而,现有的两种方法也有一定的缺点。频率论方法在计算上要求很高,因为它需要估计许多LC模型:首先应该估计具有不同类别数量的模型,以确定所需的类别数量,然后为多个bootstrap样本重新估计所选模型,以考虑插补阶段的参数不确定性。尽管贝叶斯狄利克雷过程模型自动执行模型选择和参数不确定性的处理,但该方法的缺点是,在吉布斯采样过程中,它倾向于使用太少的聚类,导致模型拟合不足,产生无效的输入。在本文中,我们提出了一种替代方法,它结合了两种现有方法的优点;也就是说,我们使用贝叶斯标准潜在类模型作为插补模型。我们展示了如何在插补步骤之前使用单次吉布斯采样器进行模型选择,此外,还展示了如何通过使用混合物权重的超参数的大值来防止拟合不足。两项模拟研究和一项真实数据研究的结果表明,在适当设置先验分布的情况下,贝叶斯潜在类模型产生了有效的推断,并优于竞争方法。
{"title":"Bayesian Latent Class Models for the Multiple Imputation of Categorical Data","authors":"D. Vidotto, J. Vermunt, K. Van Deun","doi":"10.1027/1614-2241/a000146","DOIUrl":"https://doi.org/10.1027/1614-2241/a000146","url":null,"abstract":"Latent class analysis has been recently proposed for the multiple imputation (MI) of missing categorical data, using either a standard frequentist approach or a nonparametric Bayesian model called Dirichlet process mixture of multinomial distributions (DPMM). The main advantage of using a latent class model for multiple imputation is that it is very flexible in the sense that it can capture complex relationships in the data given that the number of latent classes is large enough. However, the two existing approaches also have certain disadvantages. The frequentist approach is computationally demanding because it requires estimating many LC models: first models with different number of classes should be estimated to determine the required number of classes and subsequently the selected model is reestimated for multiple bootstrap samples to take into account parameter uncertainty during the imputation stage. Whereas the Bayesian Dirichlet process models perform the model selection and the handling of the parameter uncertainty automatically, the disadvantage of this method is that it tends to use a too small number of clusters during the Gibbs sampling, leading to an underfitting model yielding invalid imputations. In this paper, we propose an alternative approach which combined the strengths of the two existing approaches; that is, we use the Bayesian standard latent class model as an imputation model. We show how model selection can be performed prior to the imputation step using a single run of the Gibbs sampler and, moreover, show how underfitting is prevented by using large values for the hyperparameters of the mixture weights. The results of two simulation studies and one real-data study indicate that with a proper setting of the prior distributions, the Bayesian latent class model yields valid imputations and outperforms competing methods.","PeriodicalId":18476,"journal":{"name":"Methodology: European Journal of Research Methods for The Behavioral and Social Sciences","volume":"14 1","pages":"56–68"},"PeriodicalIF":3.1,"publicationDate":"2018-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41370291","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Multiple Imputation by Predictive Mean Matching When Sample Size Is Small 样本量小时预测均值匹配的多重脉冲
IF 3.1 3区 心理学 Q2 PSYCHOLOGY, MATHEMATICAL Pub Date : 2018-04-23 DOI: 10.1027/1614-2241/a000141
Kristian Kleinke
Predictive mean matching (PMM) is a state-of-the-art hot deck multiple imputation (MI) procedure. The quality of its results depends, inter alia, on the availability of suitable donor cases. Applying PMM in small sample scenarios often found in psychological or medical research could be problematic, as there might not be many (or any) suitable donor cases in the data set. So far, there has not been any systematic research that examined the performance of PMM, when sample size is small. The present study evaluated PMM in various multiple regression scenarios, where sample size, missing data percentages, the size of the regression coefficients, and PMM’s donor selection strategy were systematically varied. Results show that PMM could be used in most scenarios, however results depended on the donor selection strategy: overall, PMM using either automatic distance-aided selection of donors (Gaffert, Meinfelder, & Bosch, 2016) or using the nearest neighbor produced the best results.
预测均值匹配(PMM)是一种最先进的热甲板多重插值(MI)方法。其结果的质量,除其他外,取决于是否有合适的捐赠病例。在心理学或医学研究中经常发现的小样本场景中应用PMM可能会有问题,因为数据集中可能没有很多(或任何)合适的供体病例。到目前为止,还没有系统的研究在样本量较小的情况下检验PMM的性能。本研究在不同的多元回归情景下评估了PMM,其中样本量、缺失数据百分比、回归系数大小和PMM的供体选择策略是系统变化的。结果表明,PMM可以在大多数情况下使用,但结果取决于供体选择策略:总体而言,PMM使用自动距离辅助选择供体(Gaffert, Meinfelder, & Bosch, 2016)或使用最近的邻居产生最佳结果。
{"title":"Multiple Imputation by Predictive Mean Matching When Sample Size Is Small","authors":"Kristian Kleinke","doi":"10.1027/1614-2241/a000141","DOIUrl":"https://doi.org/10.1027/1614-2241/a000141","url":null,"abstract":"Predictive mean matching (PMM) is a state-of-the-art hot deck multiple imputation (MI) procedure. The quality of its results depends, inter alia, on the availability of suitable donor cases. Applying PMM in small sample scenarios often found in psychological or medical research could be problematic, as there might not be many (or any) suitable donor cases in the data set. So far, there has not been any systematic research that examined the performance of PMM, when sample size is small. The present study evaluated PMM in various multiple regression scenarios, where sample size, missing data percentages, the size of the regression coefficients, and PMM’s donor selection strategy were systematically varied. Results show that PMM could be used in most scenarios, however results depended on the donor selection strategy: overall, PMM using either automatic distance-aided selection of donors (Gaffert, Meinfelder, & Bosch, 2016) or using the nearest neighbor produced the best results.","PeriodicalId":18476,"journal":{"name":"Methodology: European Journal of Research Methods for The Behavioral and Social Sciences","volume":"14 1","pages":"3–15"},"PeriodicalIF":3.1,"publicationDate":"2018-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48312465","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 34
Estimating a Three-Level Latent Variable Regression Model With Cross-Classified Multiple Membership Data 交叉分类多隶属度数据的三水平潜变量回归模型估计
IF 3.1 3区 心理学 Q2 PSYCHOLOGY, MATHEMATICAL Pub Date : 2018-04-23 DOI: 10.1027/1614-2241/a000143
Audrey J. Leroux, S. Beretvas
The current study proposed a new model, termed the cross-classified multiple membership latent variable regression (CCMM-LVR) model that provides an extension to the three-level latent variable regression (HM3-LVR) model that can be used with cross-classified multiple membership data, for example, in the presence of student mobility across schools. The HM3-LVR model is beneficial for testing more flexible hypotheses about growth trajectory parameters and handles pure clustering of participants within higher-level (level-3) units. However, the HM3-LVR model involves the assumption that students remain in the same cluster (school) throughout the duration of the time period of interest. The CCMM-LVR model appropriately models the participants’ changing clusters over time. The impact of ignoring mobility in the real data was investigated by comparing parameter estimates, standard error estimates, and model fit indices for the model (CCMM-LVR) that appropriately modeled the cross-classified multiple membership structure with results when this structure was ignored (HM3-LVR).
目前的研究提出了一种新的模型,称为交叉分类多成员潜在变量回归(CCMM-LVR)模型,该模型为三级潜在变量回归模型(HM3-LVR)提供了扩展,该模型可用于交叉分类的多成员数据,例如,在学生跨学校流动的情况下。HM3-LVR模型有利于测试关于增长轨迹参数的更灵活的假设,并处理更高级别(3级)单元中参与者的纯聚类。然而,HM3-LVR模型涉及学生在感兴趣的整个时间段内都留在同一集群(学校)的假设。CCMM-LVR模型适当地对参与者随时间变化的集群进行建模。通过比较模型(CCMM-LVR)的参数估计、标准误差估计和模型拟合指数,研究了忽略真实数据中移动性的影响,该模型适当地对交叉分类的多成员结构进行了建模,并与忽略该结构时的结果(HM3-LVR)进行了比较。
{"title":"Estimating a Three-Level Latent Variable Regression Model With Cross-Classified Multiple Membership Data","authors":"Audrey J. Leroux, S. Beretvas","doi":"10.1027/1614-2241/a000143","DOIUrl":"https://doi.org/10.1027/1614-2241/a000143","url":null,"abstract":"The current study proposed a new model, termed the cross-classified multiple membership latent variable regression (CCMM-LVR) model that provides an extension to the three-level latent variable regression (HM3-LVR) model that can be used with cross-classified multiple membership data, for example, in the presence of student mobility across schools. The HM3-LVR model is beneficial for testing more flexible hypotheses about growth trajectory parameters and handles pure clustering of participants within higher-level (level-3) units. However, the HM3-LVR model involves the assumption that students remain in the same cluster (school) throughout the duration of the time period of interest. The CCMM-LVR model appropriately models the participants’ changing clusters over time. The impact of ignoring mobility in the real data was investigated by comparing parameter estimates, standard error estimates, and model fit indices for the model (CCMM-LVR) that appropriately modeled the cross-classified multiple membership structure with results when this structure was ignored (HM3-LVR).","PeriodicalId":18476,"journal":{"name":"Methodology: European Journal of Research Methods for The Behavioral and Social Sciences","volume":"14 1","pages":"30–44"},"PeriodicalIF":3.1,"publicationDate":"2018-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43905996","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Comparison of Uni- and Multidimensional Models Applied in Testlet-Based Tests 基于测试集的测试中单模型与多维模型的比较
IF 3.1 3区 心理学 Q2 PSYCHOLOGY, MATHEMATICAL Pub Date : 2017-11-24 DOI: 10.1027/1614-2241/a000137
Alejandro Hernández‐Camacho, J. Olea, F. J. Abad
The bifactor model (BM) and the testlet response model (TRM) are the most common multidimensional models applied to testlet-based tests. The common procedure is to estimate these models using different estimation methods (see, e.g., DeMars, 2006). A possible consequence of this is that previous findings about the implications of fitting a wrong model to the data may be confounded with the estimation procedures they employed. With this in mind, the present study uses the same method (maximum marginal likelihood [MML] using dimensional reduction) to compare uni- and multidimensional strategies to testlet-based tests, and assess the performance of various relative fit indices. Data were simulated under three different models, namely BM, TRM, and the unidimensional model. Recovery of item parameters, reliability estimates, and selection rates of the relative fit indices were documented. The results were essentially consistent with those obtained through different methods (DeMars, 2006), indicating that the effect of the estimation method is negligible. Regarding the fit indices, Akaike Information Criterion (AIC) showed the best selection rates, whereas Bayes Information Criterion (BIC) tended to select a model which is simpler than the true one. The work concludes with recommendations for practitioners and proposals for future research.
双因子模型(BM)和小测试响应模型(TRM)是应用于基于小测试的最常见的多维模型。常见的程序是使用不同的估计方法来估计这些模型(例如,见DeMars,2006)。这样做的一个可能后果是,以前关于将错误模型拟合到数据中的影响的发现可能与他们使用的估计程序相混淆。考虑到这一点,本研究使用相同的方法(使用降维的最大边际似然[MML])将单一和多维策略与基于测试集的测试进行比较,并评估各种相对拟合指数的性能。数据在三个不同的模型下进行了模拟,即BM、TRM和一维模型。记录了项目参数的恢复、可靠性估计和相对拟合指数的选择率。结果与通过不同方法获得的结果基本一致(DeMars,2006),表明估计方法的影响可以忽略不计。关于拟合指数,Akaike信息准则(AIC)显示出最佳的选择率,而Bayes信息准则(BIC)倾向于选择比真实模型更简单的模型。这项工作最后提出了对从业者的建议和对未来研究的建议。
{"title":"Comparison of Uni- and Multidimensional Models Applied in Testlet-Based Tests","authors":"Alejandro Hernández‐Camacho, J. Olea, F. J. Abad","doi":"10.1027/1614-2241/a000137","DOIUrl":"https://doi.org/10.1027/1614-2241/a000137","url":null,"abstract":"The bifactor model (BM) and the testlet response model (TRM) are the most common multidimensional models applied to testlet-based tests. The common procedure is to estimate these models using different estimation methods (see, e.g., DeMars, 2006). A possible consequence of this is that previous findings about the implications of fitting a wrong model to the data may be confounded with the estimation procedures they employed. With this in mind, the present study uses the same method (maximum marginal likelihood [MML] using dimensional reduction) to compare uni- and multidimensional strategies to testlet-based tests, and assess the performance of various relative fit indices. Data were simulated under three different models, namely BM, TRM, and the unidimensional model. Recovery of item parameters, reliability estimates, and selection rates of the relative fit indices were documented. The results were essentially consistent with those obtained through different methods (DeMars, 2006), indicating that the effect of the estimation method is negligible. Regarding the fit indices, Akaike Information Criterion (AIC) showed the best selection rates, whereas Bayes Information Criterion (BIC) tended to select a model which is simpler than the true one. The work concludes with recommendations for practitioners and proposals for future research.","PeriodicalId":18476,"journal":{"name":"Methodology: European Journal of Research Methods for The Behavioral and Social Sciences","volume":"13 1","pages":"135–143"},"PeriodicalIF":3.1,"publicationDate":"2017-11-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42189419","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
期刊
Methodology: European Journal of Research Methods for The Behavioral and Social Sciences
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1