首页 > 最新文献

British Journal of Mathematical & Statistical Psychology最新文献

英文 中文
Average treatment effects on binary outcomes with stochastic covariates. 随机协变量对二元结果的平均治疗效果。
IF 1.5 3区 心理学 Q3 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-07-24 DOI: 10.1111/bmsp.12355
Christoph Kiefer, Marcella L Woud, Simon E Blackwell, Axel Mayer

When evaluating the effect of psychological treatments on a dichotomous outcome variable in a randomized controlled trial (RCT), covariate adjustment using logistic regression models is often applied. In the presence of covariates, average marginal effects (AMEs) are often preferred over odds ratios, as AMEs yield a clearer substantive and causal interpretation. However, standard error computation of AMEs neglects sampling-based uncertainty (i.e., covariate values are assumed to be fixed over repeated sampling), which leads to underestimation of AME standard errors in other generalized linear models (e.g., Poisson regression). In this paper, we present and compare approaches allowing for stochastic (i.e., randomly sampled) covariates in models for binary outcomes. In a simulation study, we investigated the quality of the AME and stochastic-covariate approaches focusing on statistical inference in finite samples. Our results indicate that the fixed-covariate approach provides reliable results only if there is no heterogeneity in interindividual treatment effects (i.e., presence of treatment-covariate interactions), while the stochastic-covariate approaches are preferable in all other simulated conditions. We provide an illustrative example from clinical psychology investigating the effect of a cognitive bias modification training on post-traumatic stress disorder while accounting for patients' anxiety using an RCT.

在随机对照试验(RCT)中评估心理治疗对二分法结果变量的影响时,通常会使用逻辑回归模型进行协变量调整。在存在协变量的情况下,平均边际效应(AMEs)往往比几率比较大,因为平均边际效应能产生更清晰的实质和因果解释。然而,平均边际效应的标准误差计算忽略了基于抽样的不确定性(即假设协变量值在重复抽样中是固定的),这导致在其他广义线性模型(如泊松回归)中平均边际效应标准误差被低估。在本文中,我们介绍并比较了二元结果模型中允许随机(即随机抽样)协变量的方法。在一项模拟研究中,我们以有限样本的统计推断为重点,调查了 AME 和随机协变量方法的质量。我们的结果表明,只有在个体间治疗效果不存在异质性(即存在治疗-变量交互作用)的情况下,固定-变量方法才能提供可靠的结果,而在所有其他模拟条件下,随机-变量方法更为可取。我们提供了一个临床心理学的示例,研究认知偏差修正训练对创伤后应激障碍的影响,同时使用 RCT 考虑患者的焦虑。
{"title":"Average treatment effects on binary outcomes with stochastic covariates.","authors":"Christoph Kiefer, Marcella L Woud, Simon E Blackwell, Axel Mayer","doi":"10.1111/bmsp.12355","DOIUrl":"https://doi.org/10.1111/bmsp.12355","url":null,"abstract":"<p><p>When evaluating the effect of psychological treatments on a dichotomous outcome variable in a randomized controlled trial (RCT), covariate adjustment using logistic regression models is often applied. In the presence of covariates, average marginal effects (AMEs) are often preferred over odds ratios, as AMEs yield a clearer substantive and causal interpretation. However, standard error computation of AMEs neglects sampling-based uncertainty (i.e., covariate values are assumed to be fixed over repeated sampling), which leads to underestimation of AME standard errors in other generalized linear models (e.g., Poisson regression). In this paper, we present and compare approaches allowing for stochastic (i.e., randomly sampled) covariates in models for binary outcomes. In a simulation study, we investigated the quality of the AME and stochastic-covariate approaches focusing on statistical inference in finite samples. Our results indicate that the fixed-covariate approach provides reliable results only if there is no heterogeneity in interindividual treatment effects (i.e., presence of treatment-covariate interactions), while the stochastic-covariate approaches are preferable in all other simulated conditions. We provide an illustrative example from clinical psychology investigating the effect of a cognitive bias modification training on post-traumatic stress disorder while accounting for patients' anxiety using an RCT.</p>","PeriodicalId":55322,"journal":{"name":"British Journal of Mathematical & Statistical Psychology","volume":null,"pages":null},"PeriodicalIF":1.5,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141753440","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Are alternative variables in a set differently associated with a target variable? Statistical tests and practical advice for dealing with dependent correlations. 一组变量中的其他变量与目标变量的相关性是否不同?处理从属相关性的统计检验和实用建议。
IF 1.5 3区 心理学 Q3 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-06-24 DOI: 10.1111/bmsp.12354
Miguel A García-Pérez

The analysis of multiple bivariate correlations is often carried out by conducting simple tests to check whether each of them is significantly different from zero. In addition, pairwise differences are often judged by eye or by comparing the p-values of the individual tests of significance despite the existence of statistical tests for differences between correlations. This paper uses simulation methods to assess the accuracy (empirical Type I error rate), power, and robustness of 10 tests designed to check the significance of the difference between two dependent correlations with overlapping variables (i.e., the correlation between X1 and Y and the correlation between X2 and Y). Five of the tests turned out to be inadvisable because their empirical Type I error rates under normality differ greatly from the nominal alpha level of .05 either across the board or within certain sub-ranges of the parameter space. The remaining five tests were acceptable and their merits were similar in terms of all comparison criteria, although none of them was robust across all forms of non-normality explored in the study. Practical recommendations are given for the choice of a statistical test to compare dependent correlations with overlapping variables.

在分析多个二元相关性时,通常会进行简单的检验,检查每个相关性是否与零有显著 差异。此外,尽管存在相关性之间差异的统计检验,但通常通过眼睛或比较单个显著性检验的 p 值来判断成对差异。本文使用模拟方法评估了 10 个检验的准确性(经验 I 类错误率)、有效性和稳健性,这些检验旨在检查两个变量重叠的因变量相关性(即 X1 和 Y 之间的相关性以及 X2 和 Y 之间的相关性)之间差异的显著性。其中五个测试结果是不可取的,因为它们在正态性下的经验 I 类误差率与 0.05 的名义α水平相差很大,要么是全面相差,要么是在参数空间的某些子范围内相差很大。其余五种检验是可以接受的,它们在所有比较标准方面的优点相似,但没有一种检验在本研究探讨的所有非正态性形式中都是稳健的。本文就如何选择统计检验来比较具有重叠变量的因果相关性提出了实用建议。
{"title":"Are alternative variables in a set differently associated with a target variable? Statistical tests and practical advice for dealing with dependent correlations.","authors":"Miguel A García-Pérez","doi":"10.1111/bmsp.12354","DOIUrl":"https://doi.org/10.1111/bmsp.12354","url":null,"abstract":"<p><p>The analysis of multiple bivariate correlations is often carried out by conducting simple tests to check whether each of them is significantly different from zero. In addition, pairwise differences are often judged by eye or by comparing the p-values of the individual tests of significance despite the existence of statistical tests for differences between correlations. This paper uses simulation methods to assess the accuracy (empirical Type I error rate), power, and robustness of 10 tests designed to check the significance of the difference between two dependent correlations with overlapping variables (i.e., the correlation between X<sub>1</sub> and Y and the correlation between X<sub>2</sub> and Y). Five of the tests turned out to be inadvisable because their empirical Type I error rates under normality differ greatly from the nominal alpha level of .05 either across the board or within certain sub-ranges of the parameter space. The remaining five tests were acceptable and their merits were similar in terms of all comparison criteria, although none of them was robust across all forms of non-normality explored in the study. Practical recommendations are given for the choice of a statistical test to compare dependent correlations with overlapping variables.</p>","PeriodicalId":55322,"journal":{"name":"British Journal of Mathematical & Statistical Psychology","volume":null,"pages":null},"PeriodicalIF":1.5,"publicationDate":"2024-06-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141460882","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Determining the number of attributes in the GDINA model. 确定 GDINA 模型的属性数量。
IF 2.6 3区 心理学 Q1 Mathematics Pub Date : 2024-06-18 DOI: 10.1111/bmsp.12349
Juntao Wang, Jiangtao Duan

Exploratory cognitive diagnosis models have been widely used in psychology, education and other fields. This paper focuses on determining the number of attributes in a widely used cognitive diagnosis model, the GDINA model. Under some conditions of cognitive diagnosis models, we prove that there exists a special structure for the covariance matrix of observed data. Due to the special structure of the covariance matrix, an estimator based on eigen-decomposition is proposed for the number of attributes for the GDINA model. The performance of the proposed estimator is verified by simulation studies. Finally, the proposed estimator is applied to two real data sets Examination for the Certificate of Proficiency in English (ECPE) and Big Five Personality (BFP).

探索性认知诊断模型已广泛应用于心理学、教育学和其他领域。本文主要研究如何确定一种广泛使用的认知诊断模型--GDINA 模型--中的属性数量。在认知诊断模型的某些条件下,我们证明了观察数据的协方差矩阵存在一种特殊结构。基于协方差矩阵的特殊结构,我们提出了一种基于特征分解的 GDINA 模型属性数估计器。模拟研究验证了所提估计器的性能。最后,将所提出的估计器应用于两个真实数据集:英语水平证书考试(ECPE)和大五人格(BFP)。
{"title":"Determining the number of attributes in the GDINA model.","authors":"Juntao Wang, Jiangtao Duan","doi":"10.1111/bmsp.12349","DOIUrl":"https://doi.org/10.1111/bmsp.12349","url":null,"abstract":"<p><p>Exploratory cognitive diagnosis models have been widely used in psychology, education and other fields. This paper focuses on determining the number of attributes in a widely used cognitive diagnosis model, the GDINA model. Under some conditions of cognitive diagnosis models, we prove that there exists a special structure for the covariance matrix of observed data. Due to the special structure of the covariance matrix, an estimator based on eigen-decomposition is proposed for the number of attributes for the GDINA model. The performance of the proposed estimator is verified by simulation studies. Finally, the proposed estimator is applied to two real data sets Examination for the Certificate of Proficiency in English (ECPE) and Big Five Personality (BFP).</p>","PeriodicalId":55322,"journal":{"name":"British Journal of Mathematical & Statistical Psychology","volume":null,"pages":null},"PeriodicalIF":2.6,"publicationDate":"2024-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141421977","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Nonparametric CD-CAT for multiple-choice items: Item selection method and Q-optimality. 多选题的非参数 CD-CAT:项目选择方法和 Q-最优性。
IF 2.6 3区 心理学 Q1 Mathematics Pub Date : 2024-05-25 DOI: 10.1111/bmsp.12350
Yu Wang, Chia-Yi Chiu, Hans Friedrich Köhn

Computerized adaptive testing for cognitive diagnosis (CD-CAT) achieves remarkable estimation efficiency and accuracy by adaptively selecting and then administering items tailored to each examinee. The process of item selection stands as a pivotal component of a CD-CAT algorithm, with various methods having been developed for binary responses. However, multiple-choice (MC) items, an important item type that allows for the extraction of richer diagnostic information from incorrect answers, have been underemphasized. Currently, the Jensen-Shannon divergence (JSD) index introduced by Yigit et al. (Applied Psychological Measurement, 2019, 43, 388) is the only item selection method exclusively designed for MC items. However, the JSD index requires a large sample to calibrate item parameters, which may be infeasible when there is only a small or no calibration sample. To bridge this gap, the study first proposes a nonparametric item selection method for MC items (MC-NPS) by implementing novel discrimination power that measures an item's ability to effectively distinguish among different attribute profiles. A Q-optimal procedure for MC items is also developed to improve the classification during the initial phase of a CD-CAT algorithm. The effectiveness and efficiency of the two proposed algorithms were confirmed by simulation studies.

用于认知诊断的计算机化自适应测试(CD-CAT)通过自适应地选择和实施适合每个受试者的项目,实现了显著的估计效率和准确性。项目选择过程是 CD-CAT 算法的关键组成部分,针对二元应答开发了各种方法。然而,多选题(MC)作为一种重要的题目类型,可以从错误答案中提取更丰富的诊断信息,却一直未得到足够重视。目前,Yigit 等人提出的詹森-香农分歧(JSD)指数(《应用心理测量》,2019 年,43 期,388)是唯一一种专为 MC 题项设计的题项选择方法。然而,JSD 指数需要大量样本来校准项目参数,这在只有少量校准样本或没有校准样本的情况下可能是不可行的。为了弥补这一差距,本研究首先提出了一种适用于 MC 项目的非参数项目选择方法(MC-NPS),它采用了新颖的区分度来衡量项目有效区分不同属性特征的能力。此外,还为 MC 项目开发了 Q 最佳程序,以改进 CD-CAT 算法初始阶段的分类。模拟研究证实了这两种拟议算法的有效性和效率。
{"title":"Nonparametric CD-CAT for multiple-choice items: Item selection method and Q-optimality.","authors":"Yu Wang, Chia-Yi Chiu, Hans Friedrich Köhn","doi":"10.1111/bmsp.12350","DOIUrl":"https://doi.org/10.1111/bmsp.12350","url":null,"abstract":"<p><p>Computerized adaptive testing for cognitive diagnosis (CD-CAT) achieves remarkable estimation efficiency and accuracy by adaptively selecting and then administering items tailored to each examinee. The process of item selection stands as a pivotal component of a CD-CAT algorithm, with various methods having been developed for binary responses. However, multiple-choice (MC) items, an important item type that allows for the extraction of richer diagnostic information from incorrect answers, have been underemphasized. Currently, the Jensen-Shannon divergence (JSD) index introduced by Yigit et al. (Applied Psychological Measurement, 2019, 43, 388) is the only item selection method exclusively designed for MC items. However, the JSD index requires a large sample to calibrate item parameters, which may be infeasible when there is only a small or no calibration sample. To bridge this gap, the study first proposes a nonparametric item selection method for MC items (MC-NPS) by implementing novel discrimination power that measures an item's ability to effectively distinguish among different attribute profiles. A Q-optimal procedure for MC items is also developed to improve the classification during the initial phase of a CD-CAT algorithm. The effectiveness and efficiency of the two proposed algorithms were confirmed by simulation studies.</p>","PeriodicalId":55322,"journal":{"name":"British Journal of Mathematical & Statistical Psychology","volume":null,"pages":null},"PeriodicalIF":2.6,"publicationDate":"2024-05-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141096972","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Incorporating calibration errors in oral reading fluency scoring. 将校准误差纳入口语阅读流利度评分。
IF 2.6 3区 心理学 Q1 Mathematics Pub Date : 2024-05-10 DOI: 10.1111/bmsp.12348
Xin Qiao, Akihito Kamata, Cornelis Potgieter

Oral reading fluency (ORF) assessments are commonly used to screen at-risk readers and evaluate interventions' effectiveness as curriculum-based measurements. Similar to the standard practice in item response theory (IRT), calibrated passage parameter estimates are currently used as if they were population values in model-based ORF scoring. However, calibration errors that are unaccounted for may bias ORF score estimates and, in particular, lead to underestimated standard errors (SEs) of ORF scores. Therefore, we consider an approach that incorporates the calibration errors in latent variable scores. We further derive the SEs of ORF scores based on the delta method to incorporate the calibration uncertainty. We conduct a simulation study to evaluate the recovery of point estimates and SEs of latent variable scores and ORF scores in various simulated conditions. Results suggest that ignoring calibration errors leads to underestimated latent variable score SEs and ORF score SEs, especially when the calibration sample is small.

口语阅读流利度(ORF)评估通常用于筛选高危读者和评估干预措施的有效性,是以课程为基础的测量方法。与项目反应理论(IRT)中的标准做法类似,目前在基于模型的口语阅读流利度评分中,校准过的段落参数估计值被当作人口值使用。然而,未考虑的校准误差可能会使 ORF 分数估计值出现偏差,特别是会导致 ORF 分数的标准误差(SE)被低估。因此,我们考虑了一种将校准误差纳入潜在变量得分的方法。我们根据德尔塔法进一步推导 ORF 分数的 SE,以纳入校准的不确定性。我们进行了一项模拟研究,以评估在各种模拟条件下潜在变量得分和 ORF 分数的点估计值和 SE 的恢复情况。结果表明,忽略校准误差会导致低估潜变量得分 SE 和 ORF 分数 SE,尤其是当校准样本较小时。
{"title":"Incorporating calibration errors in oral reading fluency scoring.","authors":"Xin Qiao, Akihito Kamata, Cornelis Potgieter","doi":"10.1111/bmsp.12348","DOIUrl":"https://doi.org/10.1111/bmsp.12348","url":null,"abstract":"<p><p>Oral reading fluency (ORF) assessments are commonly used to screen at-risk readers and evaluate interventions' effectiveness as curriculum-based measurements. Similar to the standard practice in item response theory (IRT), calibrated passage parameter estimates are currently used as if they were population values in model-based ORF scoring. However, calibration errors that are unaccounted for may bias ORF score estimates and, in particular, lead to underestimated standard errors (SEs) of ORF scores. Therefore, we consider an approach that incorporates the calibration errors in latent variable scores. We further derive the SEs of ORF scores based on the delta method to incorporate the calibration uncertainty. We conduct a simulation study to evaluate the recovery of point estimates and SEs of latent variable scores and ORF scores in various simulated conditions. Results suggest that ignoring calibration errors leads to underestimated latent variable score SEs and ORF score SEs, especially when the calibration sample is small.</p>","PeriodicalId":55322,"journal":{"name":"British Journal of Mathematical & Statistical Psychology","volume":null,"pages":null},"PeriodicalIF":2.6,"publicationDate":"2024-05-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140900014","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sample size determination for interval estimation of the prevalence of a sensitive attribute under non-randomized response models. 在非随机反应模型下,确定敏感属性流行率区间估计的样本量。
IF 2.6 3区 心理学 Q1 Mathematics Pub Date : 2024-02-26 DOI: 10.1111/bmsp.12338
Shi-Fang Qiu, Jie Lei, Wai-Yin Poon, Man-Lai Tang, Ricky S Wong, Ji-Ran Tao

A sufficient number of participants should be included to adequately address the research interest in the surveys with sensitive questions. In this paper, sample size formulas/iterative algorithms are developed from the perspective of controlling the confidence interval width of the prevalence of a sensitive attribute under four non-randomized response models: the crosswise model, parallel model, Poisson item count technique model and negative binomial item count technique model. In contrast to the conventional approach for sample size determination, our sample size formulas/algorithms explicitly incorporate an assurance probability of controlling the width of a confidence interval within the pre-specified range. The performance of the proposed methods is evaluated with respect to the empirical coverage probability, empirical assurance probability and confidence width. Simulation results show that all formulas/algorithms are effective and hence are recommended for practical applications. A real example is used to illustrate the proposed methods.

在涉及敏感问题的调查中,应纳入足够数量的参与者,以充分满足研究兴趣。本文从控制四种非随机响应模型(交叉模型、平行模型、泊松项目计数技术模型和负二项项目计数技术模型)下敏感属性流行率置信区间宽度的角度出发,建立了样本量计算公式/迭代算法。与确定样本容量的传统方法不同,我们的样本容量公式/算法明确包含了将置信区间宽度控制在预先指定范围内的保证概率。我们根据经验覆盖概率、经验保证概率和置信区间宽度对所提方法的性能进行了评估。仿真结果表明,所有公式/算法都是有效的,因此建议实际应用。一个真实的例子用于说明所提出的方法。
{"title":"Sample size determination for interval estimation of the prevalence of a sensitive attribute under non-randomized response models.","authors":"Shi-Fang Qiu, Jie Lei, Wai-Yin Poon, Man-Lai Tang, Ricky S Wong, Ji-Ran Tao","doi":"10.1111/bmsp.12338","DOIUrl":"https://doi.org/10.1111/bmsp.12338","url":null,"abstract":"<p><p>A sufficient number of participants should be included to adequately address the research interest in the surveys with sensitive questions. In this paper, sample size formulas/iterative algorithms are developed from the perspective of controlling the confidence interval width of the prevalence of a sensitive attribute under four non-randomized response models: the crosswise model, parallel model, Poisson item count technique model and negative binomial item count technique model. In contrast to the conventional approach for sample size determination, our sample size formulas/algorithms explicitly incorporate an assurance probability of controlling the width of a confidence interval within the pre-specified range. The performance of the proposed methods is evaluated with respect to the empirical coverage probability, empirical assurance probability and confidence width. Simulation results show that all formulas/algorithms are effective and hence are recommended for practical applications. A real example is used to illustrate the proposed methods.</p>","PeriodicalId":55322,"journal":{"name":"British Journal of Mathematical & Statistical Psychology","volume":null,"pages":null},"PeriodicalIF":2.6,"publicationDate":"2024-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139974734","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Assessment of fit of the time-varying dynamic partial credit model using the posterior predictive model checking method. 使用后验预测模型检查法评估时变动态部分信贷模型的拟合度。
IF 2.6 3区 心理学 Q1 Mathematics Pub Date : 2024-02-21 DOI: 10.1111/bmsp.12339
Sebastian Castro-Alvarez, Sandip Sinharay, Laura F Bringmann, Rob R Meijer, Jorge N Tendeiro

Several new models based on item response theory have recently been suggested to analyse intensive longitudinal data. One of these new models is the time-varying dynamic partial credit model (TV-DPCM; Castro-Alvarez et al., Multivariate Behavioral Research, 2023, 1), which is a combination of the partial credit model and the time-varying autoregressive model. The model allows the study of the psychometric properties of the items and the modelling of nonlinear trends at the latent state level. However, there is a severe lack of tools to assess the fit of the TV-DPCM. In this paper, we propose and develop several test statistics and discrepancy measures based on the posterior predictive model checking (PPMC) method (PPMC; Rubin, The Annals of Statistics, 1984, 12, 1151) to assess the fit of the TV-DPCM. Simulated and empirical data are used to study the performance of and illustrate the effectiveness of the PPMC method.

最近,有人提出了几种基于项目反应理论的新模型来分析密集的纵向数据。其中一个新模型是时变动态部分学分模型(TV-DPCM;Castro-Alvarez 等人,《多变量行为研究》,2023 年第 1 期),它是部分学分模型和时变自回归模型的结合。该模型可以研究项目的心理测量特性,并在潜态水平上建立非线性趋势模型。然而,目前严重缺乏评估 TV-DPCM 拟合度的工具。在本文中,我们基于后验预测模型检查(PPMC)方法(PPMC; Rubin, The Annals of Statistics, 1984, 12, 1151)提出并开发了几种测试统计量和差异测量方法,用于评估 TV-DPCM 的拟合度。模拟数据和经验数据用于研究 PPMC 方法的性能并说明其有效性。
{"title":"Assessment of fit of the time-varying dynamic partial credit model using the posterior predictive model checking method.","authors":"Sebastian Castro-Alvarez, Sandip Sinharay, Laura F Bringmann, Rob R Meijer, Jorge N Tendeiro","doi":"10.1111/bmsp.12339","DOIUrl":"https://doi.org/10.1111/bmsp.12339","url":null,"abstract":"<p><p>Several new models based on item response theory have recently been suggested to analyse intensive longitudinal data. One of these new models is the time-varying dynamic partial credit model (TV-DPCM; Castro-Alvarez et al., Multivariate Behavioral Research, 2023, 1), which is a combination of the partial credit model and the time-varying autoregressive model. The model allows the study of the psychometric properties of the items and the modelling of nonlinear trends at the latent state level. However, there is a severe lack of tools to assess the fit of the TV-DPCM. In this paper, we propose and develop several test statistics and discrepancy measures based on the posterior predictive model checking (PPMC) method (PPMC; Rubin, The Annals of Statistics, 1984, 12, 1151) to assess the fit of the TV-DPCM. Simulated and empirical data are used to study the performance of and illustrate the effectiveness of the PPMC method.</p>","PeriodicalId":55322,"journal":{"name":"British Journal of Mathematical & Statistical Psychology","volume":null,"pages":null},"PeriodicalIF":2.6,"publicationDate":"2024-02-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139914100","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
When and how to use set-exploratory structural equation modelling to test structural models: A tutorial using the R package lavaan. 何时以及如何使用集合探索式结构方程模型来检验结构模型:使用 R 软件包 lavaan 的教程。
IF 2.6 3区 心理学 Q1 Mathematics Pub Date : 2024-02-15 DOI: 10.1111/bmsp.12336
Herb Marsh, Abdullah Alamer

Exploratory structural equation modelling (ESEM) is an alternative to the well-known method of confirmatory factor analysis (CFA). ESEM is mainly used to assess the quality of measurement models of common factors but can be efficiently extended to test structural models. However, ESEM may not be the best option in some model specifications, especially when structural models are involved, because the full flexibility of ESEM could result in technical difficulties in model estimation. Thus, set-ESEM was developed to accommodate the balance between full-ESEM and CFA. In the present paper, we show examples where set-ESEM should be used rather than full-ESEM. Rather than relying on a simulation study, we provide two applied examples using real data that are included in the OSF repository. Additionally, we provide the code needed to run set-ESEM in the free R package lavaan to make the paper practical. Set-ESEM structural models outperform their CFA-based counterparts in terms of goodness of fit and realistic factor correlation, and hence path coefficients in the two empirical examples. In several instances, effects that were non-significant (i.e., attenuated) in the CFA-based structural model become larger and significant in the set-ESEM structural model, suggesting that set-ESEM models may generate more accurate model parameters and, hence, lower Type II error rate.

探索性结构方程模型(ESEM)是著名的确证因素分析(CFA)方法的替代方法。ESEM 主要用于评估常见因子测量模型的质量,但也可以有效地扩展到测试结构模型。然而,ESEM 在某些模型规格中可能不是最佳选择,尤其是涉及结构模型时,因为 ESEM 的充分灵活性可能会导致模型估计中的技术困难。因此,为了兼顾完全 ESEM 和 CFA,我们开发了集合 ESEM。在本文中,我们将举例说明在哪些情况下应使用集合-ESEM,而不是完全-ESEM。我们没有依赖模拟研究,而是使用 OSF 存储库中的真实数据提供了两个应用实例。此外,我们还在免费的 R 软件包 lavaan 中提供了运行 Set-ESEM 所需的代码,从而使本文更加实用。在拟合优度和现实因子相关性方面,集合-ESEM 结构模型优于基于 CFA 的结构模型,因此在两个实证例子中的路径系数也优于基于 CFA 的结构模型。有几次,在基于 CFA 的结构模型中不显著(即衰减)的效应在集合-ESEM 结构模型中变得更大和显著,这表明集合-ESEM 模型可能会生成更准确的模型参数,从而降低 II 类错误率。
{"title":"When and how to use set-exploratory structural equation modelling to test structural models: A tutorial using the R package lavaan.","authors":"Herb Marsh, Abdullah Alamer","doi":"10.1111/bmsp.12336","DOIUrl":"https://doi.org/10.1111/bmsp.12336","url":null,"abstract":"<p><p>Exploratory structural equation modelling (ESEM) is an alternative to the well-known method of confirmatory factor analysis (CFA). ESEM is mainly used to assess the quality of measurement models of common factors but can be efficiently extended to test structural models. However, ESEM may not be the best option in some model specifications, especially when structural models are involved, because the full flexibility of ESEM could result in technical difficulties in model estimation. Thus, set-ESEM was developed to accommodate the balance between full-ESEM and CFA. In the present paper, we show examples where set-ESEM should be used rather than full-ESEM. Rather than relying on a simulation study, we provide two applied examples using real data that are included in the OSF repository. Additionally, we provide the code needed to run set-ESEM in the free R package lavaan to make the paper practical. Set-ESEM structural models outperform their CFA-based counterparts in terms of goodness of fit and realistic factor correlation, and hence path coefficients in the two empirical examples. In several instances, effects that were non-significant (i.e., attenuated) in the CFA-based structural model become larger and significant in the set-ESEM structural model, suggesting that set-ESEM models may generate more accurate model parameters and, hence, lower Type II error rate.</p>","PeriodicalId":55322,"journal":{"name":"British Journal of Mathematical & Statistical Psychology","volume":null,"pages":null},"PeriodicalIF":2.6,"publicationDate":"2024-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139742778","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fast estimation of generalized linear latent variable models for performance and process data with ordinal, continuous, and count observed variables. 快速估计具有顺序、连续和计数观测变量的性能和过程数据的广义线性潜变量模型。
IF 2.6 3区 心理学 Q1 Mathematics Pub Date : 2024-02-12 DOI: 10.1111/bmsp.12337
Maoxin Zhang, Björn Andersson, Shaobo Jin

Different data types often occur in psychological and educational measurement such as computer-based assessments that record performance and process data (e.g., response times and the number of actions). Modelling such data requires specific models for each data type and accommodating complex dependencies between multiple variables. Generalized linear latent variable models are suitable for modelling mixed data simultaneously, but estimation can be computationally demanding. A fast solution is to use Laplace approximations, but existing implementations of joint modelling of mixed data types are limited to ordinal and continuous data. To address this limitation, we derive an efficient estimation method that uses first- or second-order Laplace approximations to simultaneously model ordinal data, continuous data, and count data. We illustrate the approach with an example and conduct simulations to evaluate the performance of the method in terms of estimation efficiency, convergence, and parameter recovery. The results suggest that the second-order Laplace approximation achieves a higher convergence rate and produces accurate yet fast parameter estimates compared to the first-order Laplace approximation, while the time cost increases with higher model complexity. Additionally, models that consider the dependence of variables from the same stimulus fit the empirical data substantially better than models that disregarded the dependence.

在心理和教育测量中经常会出现不同的数据类型,如记录表现和过程数据(如反应时间和操作次数)的基于计算机的评估。对这类数据建模需要针对每种数据类型建立特定的模型,并适应多个变量之间复杂的依赖关系。广义线性潜变量模型适用于同时对混合数据建模,但估算需要大量计算。快速的解决方案是使用拉普拉斯近似,但现有的混合数据类型联合建模方法仅限于序数和连续数据。为了解决这一局限性,我们推导出一种高效的估计方法,利用一阶或二阶拉普拉斯近似同时对序数数据、连续数据和计数数据建模。我们以实例说明了该方法,并进行了模拟,以评估该方法在估计效率、收敛性和参数恢复方面的性能。结果表明,与一阶拉普拉斯近似法相比,二阶拉普拉斯近似法能达到更高的收敛速度,并能产生准确而快速的参数估计,而时间成本会随着模型复杂度的提高而增加。此外,考虑同一刺激变量依赖性的模型比忽略依赖性的模型更符合经验数据。
{"title":"Fast estimation of generalized linear latent variable models for performance and process data with ordinal, continuous, and count observed variables.","authors":"Maoxin Zhang, Björn Andersson, Shaobo Jin","doi":"10.1111/bmsp.12337","DOIUrl":"https://doi.org/10.1111/bmsp.12337","url":null,"abstract":"<p><p>Different data types often occur in psychological and educational measurement such as computer-based assessments that record performance and process data (e.g., response times and the number of actions). Modelling such data requires specific models for each data type and accommodating complex dependencies between multiple variables. Generalized linear latent variable models are suitable for modelling mixed data simultaneously, but estimation can be computationally demanding. A fast solution is to use Laplace approximations, but existing implementations of joint modelling of mixed data types are limited to ordinal and continuous data. To address this limitation, we derive an efficient estimation method that uses first- or second-order Laplace approximations to simultaneously model ordinal data, continuous data, and count data. We illustrate the approach with an example and conduct simulations to evaluate the performance of the method in terms of estimation efficiency, convergence, and parameter recovery. The results suggest that the second-order Laplace approximation achieves a higher convergence rate and produces accurate yet fast parameter estimates compared to the first-order Laplace approximation, while the time cost increases with higher model complexity. Additionally, models that consider the dependence of variables from the same stimulus fit the empirical data substantially better than models that disregarded the dependence.</p>","PeriodicalId":55322,"journal":{"name":"British Journal of Mathematical & Statistical Psychology","volume":null,"pages":null},"PeriodicalIF":2.6,"publicationDate":"2024-02-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139725087","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Identifiability and estimability of Bayesian linear and nonlinear crossed random effects models 贝叶斯线性和非线性交叉随机效应模型的可识别性和可估算性。
IF 2.6 3区 心理学 Q1 Mathematics Pub Date : 2024-01-24 DOI: 10.1111/bmsp.12334
Corissa T. Rohloff, Nidhi Kohli, Eric F. Lock

Crossed random effects models (CREMs) are particularly useful in longitudinal data applications because they allow researchers to account for the impact of dynamic group membership on individual outcomes. However, no research has determined what data conditions need to be met to sufficiently identify these models, especially the group effects, in a longitudinal context. This is a significant gap in the current literature as future applications to real data may need to consider these conditions to yield accurate and precise model parameter estimates, specifically for the group effects on individual outcomes. Furthermore, there are no existing CREMs that can model intrinsically nonlinear growth. The goals of this study are to develop a Bayesian piecewise CREM to model intrinsically nonlinear growth and evaluate what data conditions are necessary to empirically identify both intrinsically linear and nonlinear longitudinal CREMs. This study includes an applied example that utilizes the piecewise CREM with real data and three simulation studies to assess the data conditions necessary to estimate linear, quadratic, and piecewise CREMs. Results show that the number of repeated measurements collected on groups impacts the ability to recover the group effects. Additionally, functional form complexity impacted data collection requirements for estimating longitudinal CREMs.

交叉随机效应模型(CREMs)在纵向数据应用中特别有用,因为它们允许研究人员考虑动态群体成员身份对个体结果的影响。然而,目前还没有研究确定在纵向背景下需要满足哪些数据条件才能充分识别这些模型,尤其是群体效应。这是目前文献中的一个重要空白,因为未来在真实数据中的应用可能需要考虑这些条件,以获得准确和精确的模型参数估计,特别是群体效应对个体结果的影响。此外,还没有现有的 CREM 可以对内在非线性增长进行建模。本研究的目标是开发一种贝叶斯片断式 CREM,以模拟内在非线性增长,并评估哪些数据条件是通过经验识别内在线性和非线性纵向 CREM 所必需的。本研究包括一个利用真实数据利用片断 CREM 的应用实例和三项模拟研究,以评估估计线性、二次和片断 CREM 所需的数据条件。结果表明,收集到的群体重复测量次数会影响恢复群体效应的能力。此外,函数形式的复杂性也影响了估计纵向 CREM 的数据收集要求。
{"title":"Identifiability and estimability of Bayesian linear and nonlinear crossed random effects models","authors":"Corissa T. Rohloff,&nbsp;Nidhi Kohli,&nbsp;Eric F. Lock","doi":"10.1111/bmsp.12334","DOIUrl":"10.1111/bmsp.12334","url":null,"abstract":"<p>Crossed random effects models (CREMs) are particularly useful in longitudinal data applications because they allow researchers to account for the impact of dynamic group membership on individual outcomes. However, no research has determined what data conditions need to be met to sufficiently identify these models, especially the group effects, in a longitudinal context. This is a significant gap in the current literature as future applications to real data may need to consider these conditions to yield accurate and precise model parameter estimates, specifically for the group effects on individual outcomes. Furthermore, there are no existing CREMs that can model intrinsically nonlinear growth. The goals of this study are to develop a Bayesian piecewise CREM to model intrinsically nonlinear growth and evaluate what data conditions are necessary to empirically identify both intrinsically linear and nonlinear longitudinal CREMs. This study includes an applied example that utilizes the piecewise CREM with real data and three simulation studies to assess the data conditions necessary to estimate linear, quadratic, and piecewise CREMs. Results show that the number of repeated measurements collected on groups impacts the ability to recover the group effects. Additionally, functional form complexity impacted data collection requirements for estimating longitudinal CREMs.</p>","PeriodicalId":55322,"journal":{"name":"British Journal of Mathematical & Statistical Psychology","volume":null,"pages":null},"PeriodicalIF":2.6,"publicationDate":"2024-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/bmsp.12334","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139543995","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
British Journal of Mathematical & Statistical Psychology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1