首页 > 最新文献

British Journal of Mathematical & Statistical Psychology最新文献

英文 中文
Are alternative variables in a set differently associated with a target variable? Statistical tests and practical advice for dealing with dependent correlations 一组变量中的其他变量与目标变量的相关性是否不同?处理从属相关性的统计检验和实用建议。
IF 1.8 3区 心理学 Q3 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-06-24 DOI: 10.1111/bmsp.12354
Miguel A. García-Pérez

The analysis of multiple bivariate correlations is often carried out by conducting simple tests to check whether each of them is significantly different from zero. In addition, pairwise differences are often judged by eye or by comparing the p-values of the individual tests of significance despite the existence of statistical tests for differences between correlations. This paper uses simulation methods to assess the accuracy (empirical Type I error rate), power, and robustness of 10 tests designed to check the significance of the difference between two dependent correlations with overlapping variables (i.e., the correlation between X1 and Y and the correlation between X2 and Y). Five of the tests turned out to be inadvisable because their empirical Type I error rates under normality differ greatly from the nominal alpha level of .05 either across the board or within certain sub-ranges of the parameter space. The remaining five tests were acceptable and their merits were similar in terms of all comparison criteria, although none of them was robust across all forms of non-normality explored in the study. Practical recommendations are given for the choice of a statistical test to compare dependent correlations with overlapping variables.

在分析多个二元相关性时,通常会进行简单的检验,检查每个相关性是否与零有显著 差异。此外,尽管存在相关性之间差异的统计检验,但通常通过眼睛或比较单个显著性检验的 p 值来判断成对差异。本文使用模拟方法评估了 10 个检验的准确性(经验 I 类错误率)、有效性和稳健性,这些检验旨在检查两个变量重叠的因变量相关性(即 X1 和 Y 之间的相关性以及 X2 和 Y 之间的相关性)之间差异的显著性。其中五个测试结果是不可取的,因为它们在正态性下的经验 I 类误差率与 0.05 的名义α水平相差很大,要么是全面相差,要么是在参数空间的某些子范围内相差很大。其余五种检验是可以接受的,它们在所有比较标准方面的优点相似,但没有一种检验在本研究探讨的所有非正态性形式中都是稳健的。本文就如何选择统计检验来比较具有重叠变量的因果相关性提出了实用建议。
{"title":"Are alternative variables in a set differently associated with a target variable? Statistical tests and practical advice for dealing with dependent correlations","authors":"Miguel A. García-Pérez","doi":"10.1111/bmsp.12354","DOIUrl":"10.1111/bmsp.12354","url":null,"abstract":"<p>The analysis of multiple bivariate correlations is often carried out by conducting simple tests to check whether each of them is significantly different from zero. In addition, pairwise differences are often judged by eye or by comparing the <i>p</i>-values of the individual tests of significance despite the existence of statistical tests for differences between correlations. This paper uses simulation methods to assess the accuracy (empirical Type I error rate), power, and robustness of 10 tests designed to check the significance of the difference between two dependent correlations with overlapping variables (i.e., the correlation between <i>X</i><sub>1</sub> and <i>Y</i> and the correlation between <i>X</i><sub>2</sub> and <i>Y</i>). Five of the tests turned out to be inadvisable because their empirical Type I error rates under normality differ greatly from the nominal alpha level of .05 either across the board or within certain sub-ranges of the parameter space. The remaining five tests were acceptable and their merits were similar in terms of all comparison criteria, although none of them was robust across all forms of non-normality explored in the study. Practical recommendations are given for the choice of a statistical test to compare dependent correlations with overlapping variables.</p>","PeriodicalId":55322,"journal":{"name":"British Journal of Mathematical & Statistical Psychology","volume":"78 1","pages":"112-140"},"PeriodicalIF":1.8,"publicationDate":"2024-06-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/bmsp.12354","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141460882","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Determining the number of attributes in the GDINA model 确定 GDINA 模型的属性数量。
IF 1.8 3区 心理学 Q3 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-06-18 DOI: 10.1111/bmsp.12349
Juntao Wang, Jiangtao Duan

Exploratory cognitive diagnosis models have been widely used in psychology, education and other fields. This paper focuses on determining the number of attributes in a widely used cognitive diagnosis model, the GDINA model. Under some conditions of cognitive diagnosis models, we prove that there exists a special structure for the covariance matrix of observed data. Due to the special structure of the covariance matrix, an estimator based on eigen-decomposition is proposed for the number of attributes for the GDINA model. The performance of the proposed estimator is verified by simulation studies. Finally, the proposed estimator is applied to two real data sets Examination for the Certificate of Proficiency in English (ECPE) and Big Five Personality (BFP).

探索性认知诊断模型已广泛应用于心理学、教育学和其他领域。本文主要研究如何确定一种广泛使用的认知诊断模型--GDINA 模型--中的属性数量。在认知诊断模型的某些条件下,我们证明了观察数据的协方差矩阵存在一种特殊结构。基于协方差矩阵的特殊结构,我们提出了一种基于特征分解的 GDINA 模型属性数估计器。模拟研究验证了所提估计器的性能。最后,将所提出的估计器应用于两个真实数据集:英语水平证书考试(ECPE)和大五人格(BFP)。
{"title":"Determining the number of attributes in the GDINA model","authors":"Juntao Wang,&nbsp;Jiangtao Duan","doi":"10.1111/bmsp.12349","DOIUrl":"10.1111/bmsp.12349","url":null,"abstract":"<p>Exploratory cognitive diagnosis models have been widely used in psychology, education and other fields. This paper focuses on determining the number of attributes in a widely used cognitive diagnosis model, the GDINA model. Under some conditions of cognitive diagnosis models, we prove that there exists a special structure for the covariance matrix of observed data. Due to the special structure of the covariance matrix, an estimator based on eigen-decomposition is proposed for the number of attributes for the GDINA model. The performance of the proposed estimator is verified by simulation studies. Finally, the proposed estimator is applied to two real data sets Examination for the Certificate of Proficiency in English (ECPE) and Big Five Personality (BFP).</p>","PeriodicalId":55322,"journal":{"name":"British Journal of Mathematical & Statistical Psychology","volume":"78 1","pages":"84-111"},"PeriodicalIF":1.8,"publicationDate":"2024-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141421977","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Nonparametric CD-CAT for multiple-choice items: Item selection method and Q-optimality 多选题的非参数 CD-CAT:项目选择方法和 Q-最优性。
IF 1.8 3区 心理学 Q3 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-05-25 DOI: 10.1111/bmsp.12350
Yu Wang, Chia-Yi Chiu, Hans Friedrich Köhn

Computerized adaptive testing for cognitive diagnosis (CD-CAT) achieves remarkable estimation efficiency and accuracy by adaptively selecting and then administering items tailored to each examinee. The process of item selection stands as a pivotal component of a CD-CAT algorithm, with various methods having been developed for binary responses. However, multiple-choice (MC) items, an important item type that allows for the extraction of richer diagnostic information from incorrect answers, have been underemphasized. Currently, the Jensen–Shannon divergence (JSD) index introduced by Yigit et al. (Applied Psychological Measurement, 2019, 43, 388) is the only item selection method exclusively designed for MC items. However, the JSD index requires a large sample to calibrate item parameters, which may be infeasible when there is only a small or no calibration sample. To bridge this gap, the study first proposes a nonparametric item selection method for MC items (MC-NPS) by implementing novel discrimination power that measures an item's ability to effectively distinguish among different attribute profiles. A Q-optimal procedure for MC items is also developed to improve the classification during the initial phase of a CD-CAT algorithm. The effectiveness and efficiency of the two proposed algorithms were confirmed by simulation studies.

用于认知诊断的计算机化自适应测试(CD-CAT)通过自适应地选择和实施适合每个受试者的项目,实现了显著的估计效率和准确性。项目选择过程是 CD-CAT 算法的关键组成部分,针对二元应答开发了各种方法。然而,多选题(MC)作为一种重要的题目类型,可以从错误答案中提取更丰富的诊断信息,却一直未得到足够重视。目前,Yigit 等人提出的詹森-香农分歧(JSD)指数(《应用心理测量》,2019 年,43 期,388)是唯一一种专为 MC 题项设计的题项选择方法。然而,JSD 指数需要大量样本来校准项目参数,这在只有少量校准样本或没有校准样本的情况下可能是不可行的。为了弥补这一差距,本研究首先提出了一种适用于 MC 项目的非参数项目选择方法(MC-NPS),它采用了新颖的区分度来衡量项目有效区分不同属性特征的能力。此外,还为 MC 项目开发了 Q 最佳程序,以改进 CD-CAT 算法初始阶段的分类。模拟研究证实了这两种拟议算法的有效性和效率。
{"title":"Nonparametric CD-CAT for multiple-choice items: Item selection method and Q-optimality","authors":"Yu Wang,&nbsp;Chia-Yi Chiu,&nbsp;Hans Friedrich Köhn","doi":"10.1111/bmsp.12350","DOIUrl":"10.1111/bmsp.12350","url":null,"abstract":"<p>Computerized adaptive testing for cognitive diagnosis (CD-CAT) achieves remarkable estimation efficiency and accuracy by adaptively selecting and then administering items tailored to each examinee. The process of item selection stands as a pivotal component of a CD-CAT algorithm, with various methods having been developed for binary responses. However, multiple-choice (MC) items, an important item type that allows for the extraction of richer diagnostic information from incorrect answers, have been underemphasized. Currently, the Jensen–Shannon divergence (JSD) index introduced by Yigit et al. (<i>Applied Psychological Measurement</i>, 2019, 43, 388) is the only item selection method exclusively designed for MC items. However, the JSD index requires a large sample to calibrate item parameters, which may be infeasible when there is only a small or no calibration sample. To bridge this gap, the study first proposes a nonparametric item selection method for MC items (MC-NPS) by implementing novel discrimination power that measures an item's ability to effectively distinguish among different attribute profiles. A Q-optimal procedure for MC items is also developed to improve the classification during the initial phase of a CD-CAT algorithm. The effectiveness and efficiency of the two proposed algorithms were confirmed by simulation studies.</p>","PeriodicalId":55322,"journal":{"name":"British Journal of Mathematical & Statistical Psychology","volume":"78 1","pages":"61-83"},"PeriodicalIF":1.8,"publicationDate":"2024-05-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/bmsp.12350","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141096972","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Incorporating calibration errors in oral reading fluency scoring 将校准误差纳入口语阅读流利度评分。
IF 1.8 3区 心理学 Q3 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-05-10 DOI: 10.1111/bmsp.12348
Xin Qiao, Akihito Kamata, Cornelis Potgieter

Oral reading fluency (ORF) assessments are commonly used to screen at-risk readers and evaluate interventions' effectiveness as curriculum-based measurements. Similar to the standard practice in item response theory (IRT), calibrated passage parameter estimates are currently used as if they were population values in model-based ORF scoring. However, calibration errors that are unaccounted for may bias ORF score estimates and, in particular, lead to underestimated standard errors (SEs) of ORF scores. Therefore, we consider an approach that incorporates the calibration errors in latent variable scores. We further derive the SEs of ORF scores based on the delta method to incorporate the calibration uncertainty. We conduct a simulation study to evaluate the recovery of point estimates and SEs of latent variable scores and ORF scores in various simulated conditions. Results suggest that ignoring calibration errors leads to underestimated latent variable score SEs and ORF score SEs, especially when the calibration sample is small.

口语阅读流利度(ORF)评估通常用于筛选高危读者和评估干预措施的有效性,是以课程为基础的测量方法。与项目反应理论(IRT)中的标准做法类似,目前在基于模型的口语阅读流利度评分中,校准过的段落参数估计值被当作人口值使用。然而,未考虑的校准误差可能会使 ORF 分数估计值出现偏差,特别是会导致 ORF 分数的标准误差(SE)被低估。因此,我们考虑了一种将校准误差纳入潜在变量得分的方法。我们根据德尔塔法进一步推导 ORF 分数的 SE,以纳入校准的不确定性。我们进行了一项模拟研究,以评估在各种模拟条件下潜在变量得分和 ORF 分数的点估计值和 SE 的恢复情况。结果表明,忽略校准误差会导致低估潜变量得分 SE 和 ORF 分数 SE,尤其是当校准样本较小时。
{"title":"Incorporating calibration errors in oral reading fluency scoring","authors":"Xin Qiao,&nbsp;Akihito Kamata,&nbsp;Cornelis Potgieter","doi":"10.1111/bmsp.12348","DOIUrl":"10.1111/bmsp.12348","url":null,"abstract":"<p>Oral reading fluency (ORF) assessments are commonly used to screen at-risk readers and evaluate interventions' effectiveness as curriculum-based measurements. Similar to the standard practice in item response theory (IRT), calibrated passage parameter estimates are currently used as if they were population values in model-based ORF scoring. However, calibration errors that are unaccounted for may bias ORF score estimates and, in particular, lead to underestimated standard errors (SEs) of ORF scores. Therefore, we consider an approach that incorporates the calibration errors in latent variable scores. We further derive the SEs of ORF scores based on the delta method to incorporate the calibration uncertainty. We conduct a simulation study to evaluate the recovery of point estimates and SEs of latent variable scores and ORF scores in various simulated conditions. Results suggest that ignoring calibration errors leads to underestimated latent variable score SEs and ORF score SEs, especially when the calibration sample is small.</p>","PeriodicalId":55322,"journal":{"name":"British Journal of Mathematical & Statistical Psychology","volume":"78 1","pages":"44-60"},"PeriodicalIF":1.8,"publicationDate":"2024-05-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140900014","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Pairwise stochastic approximation for confirmatory factor analysis of categorical data 用于分类数据确证因子分析的成对随机近似法
IF 1.8 3区 心理学 Q3 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-04-27 DOI: 10.1111/bmsp.12347
Giuseppe Alfonzetti, Ruggero Bellio, Yunxiao Chen, Irini Moustaki

Pairwise likelihood is a limited-information method widely used to estimate latent variable models, including factor analysis of categorical data. It can often avoid evaluating high-dimensional integrals and, thus, is computationally more efficient than relying on the full likelihood. Despite its computational advantage, the pairwise likelihood approach can still be demanding for large-scale problems that involve many observed variables. We tackle this challenge by employing an approximation of the pairwise likelihood estimator, which is derived from an optimization procedure relying on stochastic gradients. The stochastic gradients are constructed by subsampling the pairwise log-likelihood contributions, for which the subsampling scheme controls the per-iteration computational complexity. The stochastic estimator is shown to be asymptotically equivalent to the pairwise likelihood one. However, finite-sample performance can be improved by compounding the sampling variability of the data with the uncertainty introduced by the subsampling scheme. We demonstrate the performance of the proposed method using simulation studies and two real data applications.

配对似然法是一种信息有限的方法,广泛用于估计潜在变量模型,包括分类数据的因子分析。它通常可以避免评估高维积分,因此在计算上比依赖完全似然法更有效。尽管成对似然法具有计算优势,但对于涉及许多观察变量的大规模问题来说,它的要求仍然很高。我们通过使用成对似然估计器的近似值来应对这一挑战,该近似值来自于依赖随机梯度的优化程序。随机梯度通过对对数似然贡献进行子采样来构建,子采样方案控制了每次迭代的计算复杂度。结果表明,随机估计器在渐近上等同于成对似然估计器。然而,通过将数据的采样变异性与子采样方案引入的不确定性结合起来,可以提高有限样本的性能。我们通过模拟研究和两个实际数据应用来证明所提方法的性能。
{"title":"Pairwise stochastic approximation for confirmatory factor analysis of categorical data","authors":"Giuseppe Alfonzetti,&nbsp;Ruggero Bellio,&nbsp;Yunxiao Chen,&nbsp;Irini Moustaki","doi":"10.1111/bmsp.12347","DOIUrl":"10.1111/bmsp.12347","url":null,"abstract":"<p>Pairwise likelihood is a limited-information method widely used to estimate latent variable models, including factor analysis of categorical data. It can often avoid evaluating high-dimensional integrals and, thus, is computationally more efficient than relying on the full likelihood. Despite its computational advantage, the pairwise likelihood approach can still be demanding for large-scale problems that involve many observed variables. We tackle this challenge by employing an approximation of the pairwise likelihood estimator, which is derived from an optimization procedure relying on stochastic gradients. The stochastic gradients are constructed by subsampling the pairwise log-likelihood contributions, for which the subsampling scheme controls the per-iteration computational complexity. The stochastic estimator is shown to be asymptotically equivalent to the pairwise likelihood one. However, finite-sample performance can be improved by compounding the sampling variability of the data with the uncertainty introduced by the subsampling scheme. We demonstrate the performance of the proposed method using simulation studies and two real data applications.</p>","PeriodicalId":55322,"journal":{"name":"British Journal of Mathematical & Statistical Psychology","volume":"78 1","pages":"22-43"},"PeriodicalIF":1.8,"publicationDate":"2024-04-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140811401","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Advances in meta-analysis: A unifying modelling framework with measurement error correction 荟萃分析的进展:测量误差校正的统一建模框架
IF 1.8 3区 心理学 Q3 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-04-26 DOI: 10.1111/bmsp.12345
Betsy Jane Becker, Qian Zhang

In psychological studies, multivariate outcomes measured on the same individuals are often encountered. Effects originating from these outcomes are consequently dependent. Multivariate meta-analysis examines the relationships of multivariate outcomes by estimating the mean effects and their variance–covariance matrices from series of primary studies. In this paper we discuss a unified modelling framework for multivariate meta-analysis that also incorporates measurement error corrections. We focus on two types of effect sizes, standardized mean differences (d) and correlations (r), that are common in psychological studies. Using generalized least squares estimation, we outline estimated mean vectors and variance–covariance matrices for d and r that are corrected for measurement error. Given the burgeoning research involving multivariate outcomes, and the largely overlooked ramifications of measurement error, we advocate addressing measurement error while conducting multivariate meta-analysis to enhance the replicability of psychological research.

在心理学研究中,经常会遇到对同一个人进行多变量测量的结果。因此,这些结果所产生的效应具有依赖性。多元荟萃分析通过估算一系列主要研究的平均效应及其方差-协方差矩阵来研究多元结果之间的关系。本文讨论了多元荟萃分析的统一建模框架,该框架还包含测量误差校正。我们将重点放在心理学研究中常见的两种效应大小--标准化平均差(d)和相关性(r)。利用广义最小二乘法估计,我们概述了经测量误差校正的 d 和 r 的估计均值向量和方差-协方差矩阵。鉴于涉及多元结果的研究方兴未艾,而测量误差的影响在很大程度上被忽视,我们主张在进行多元荟萃分析时解决测量误差问题,以提高心理学研究的可复制性。
{"title":"Advances in meta-analysis: A unifying modelling framework with measurement error correction","authors":"Betsy Jane Becker,&nbsp;Qian Zhang","doi":"10.1111/bmsp.12345","DOIUrl":"10.1111/bmsp.12345","url":null,"abstract":"<p>In psychological studies, multivariate outcomes measured on the same individuals are often encountered. Effects originating from these outcomes are consequently dependent. Multivariate meta-analysis examines the relationships of multivariate outcomes by estimating the mean effects and their variance–covariance matrices from series of primary studies. In this paper we discuss a unified modelling framework for multivariate meta-analysis that also incorporates measurement error corrections. We focus on two types of effect sizes, standardized mean differences (<i>d</i>) and correlations (<i>r</i>), that are common in psychological studies. Using generalized least squares estimation, we outline estimated mean vectors and variance–covariance matrices for <i>d</i> and <i>r</i> that are corrected for measurement error. Given the burgeoning research involving multivariate outcomes, and the largely overlooked ramifications of measurement error, we advocate addressing measurement error while conducting multivariate meta-analysis to enhance the replicability of psychological research.</p>","PeriodicalId":55322,"journal":{"name":"British Journal of Mathematical & Statistical Psychology","volume":"77 3","pages":"395-428"},"PeriodicalIF":1.8,"publicationDate":"2024-04-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/bmsp.12345","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140798185","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Combining regularization and logistic regression model to validate the Q-matrix for cognitive diagnosis model 结合正则化和逻辑回归模型,验证认知诊断模型的 Q 矩阵。
IF 1.8 3区 心理学 Q3 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-04-22 DOI: 10.1111/bmsp.12346
Xiaojian Sun, Tongxin Zhang, Chang Nie, Naiqing Song, Tao Xin

Q-matrix is an important component of most cognitive diagnosis models (CDMs); however, it mainly relies on subject matter experts' judgements in empirical studies, which introduces the possibility of misspecified q-entries. To address this, statistical Q-matrix validation methods have been proposed to aid experts' judgement. A few of these methods, including the multiple logistic regression-based (MLR-B) method and the Hull method, can be applied to general CDMs, but they are either time-consuming or lack accuracy under certain conditions. In this study, we combine the L1 regularization and MLR model to validate the Q-matrix. Specifically, an L1 penalty term is imposed on the log-likelihood of the MLR model to select the necessary attributes for each item. A simulation study with various factors was conducted to examine the performance of the new method against the two existing methods. The results show that the regularized MLR-B method (a) produces the highest Q-matrix recovery rate (QRR) and true positive rate (TPR) for most conditions, especially with a small sample size; (b) yields a slightly higher true negative rate (TNR) than either the MLR-B or the Hull method for most conditions; and (c) requires less computation time than the MLR-B method and similar computation time as the Hull method. A real data set is analysed for illustration purposes.

Q 矩阵是大多数认知诊断模型(CDMs)的重要组成部分;然而,在实证研究中,它主要依赖于主题专家的判断,这就带来了 Q 条目被错误规范的可能性。为了解决这个问题,人们提出了统计 Q 矩阵验证方法来帮助专家做出判断。其中一些方法,包括基于多元逻辑回归(MLR-B)的方法和 Hull 方法,可以应用于一般的 CDM,但它们要么耗时长,要么在某些条件下缺乏准确性。在本研究中,我们结合了 L1 正则化和 MLR 模型来验证 Q 矩阵。具体来说,我们在 MLR 模型的对数概率上施加了 L1 惩罚项,以便为每个项目选择必要的属性。通过对各种因素进行模拟研究,考察了新方法与现有两种方法的性能对比。结果表明,正则化 MLR-B 方法(a) 在大多数情况下,尤其是样本量较小的情况下,Q 矩阵恢复率 (QRR) 和真阳性率 (TPR) 最高;(b) 在大多数情况下,真阴性率 (TNR) 略高于 MLR-B 或 Hull 方法;(c) 所需的计算时间少于 MLR-B 方法,与 Hull 方法相近。为说明起见,对一组真实数据进行了分析。
{"title":"Combining regularization and logistic regression model to validate the Q-matrix for cognitive diagnosis model","authors":"Xiaojian Sun,&nbsp;Tongxin Zhang,&nbsp;Chang Nie,&nbsp;Naiqing Song,&nbsp;Tao Xin","doi":"10.1111/bmsp.12346","DOIUrl":"10.1111/bmsp.12346","url":null,"abstract":"<p>Q-matrix is an important component of most cognitive diagnosis models (CDMs); however, it mainly relies on subject matter experts' judgements in empirical studies, which introduces the possibility of misspecified q-entries. To address this, statistical Q-matrix validation methods have been proposed to aid experts' judgement. A few of these methods, including the multiple logistic regression-based (MLR-B) method and the Hull method, can be applied to general CDMs, but they are either time-consuming or lack accuracy under certain conditions. In this study, we combine the <i>L</i><sub>1</sub> regularization and MLR model to validate the Q-matrix. Specifically, an <i>L</i><sub>1</sub> penalty term is imposed on the log-likelihood of the MLR model to select the necessary attributes for each item. A simulation study with various factors was conducted to examine the performance of the new method against the two existing methods. The results show that the regularized MLR-B method (a) produces the highest Q-matrix recovery rate (QRR) and true positive rate (TPR) for most conditions, especially with a small sample size; (b) yields a slightly higher true negative rate (TNR) than either the MLR-B or the Hull method for most conditions; and (c) requires less computation time than the MLR-B method and similar computation time as the Hull method. A real data set is analysed for illustration purposes.</p>","PeriodicalId":55322,"journal":{"name":"British Journal of Mathematical & Statistical Psychology","volume":"78 1","pages":"1-21"},"PeriodicalIF":1.8,"publicationDate":"2024-04-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140677462","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Three new corrections for standardized person-fit statistics for tests with polytomous items 针对多项式项目测试的标准化人称拟合统计的三种新修正方法
IF 1.8 3区 心理学 Q3 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-04-17 DOI: 10.1111/bmsp.12342
Kylie Gorney

Recent years have seen a growing interest in the development of person-fit statistics for tests with polytomous items. Some of the most popular person-fit statistics for such tests belong to the class of standardized person-fit statistics, T, that is assumed to have a standard normal null distribution. However, this distribution only holds when (a) the true ability parameter is known and (b) an infinite number of items are available. In practice, both conditions are violated, and the quality of person-fit results is expected to deteriorate. In this paper, we propose three new corrections for T that simultaneously account for the use of an estimated ability parameter and the use of a finite number of items. The three new corrections are direct extensions of those that were developed by Gorney et al. (Psychometrika, 2024, https://doi.org/10.1007/s11336-024-09960-x) for tests with only dichotomous items. Our simulation study reveals that the three new corrections tend to outperform not only the original statistic T but also an existing correction for T proposed by Sinharay (Psychometrika, 2016, 81, 992). Therefore, the new corrections appear to be promising tools for assessing person fit in tests with polytomous items.

近些年来,人们对开发多变量项目测验的人称拟合统计量越来越感兴趣。此类测验中一些最常用的拟合统计量属于标准化拟合统计量,即假定具有标准正态空分布的拟合统计量。然而,这种分布只有在以下情况下才成立:(a) 真正的能力参数已知;(b) 有无限多的项目可用。在实践中,这两个条件都会被违反,从而导致拟人结果的质量下降。在本文中,我们提出了三种新的修正方法,同时考虑到使用估计的能力参数和使用有限数量的项目。这三种新的修正方法是 Gorney 等人(Psychometrika, 2024, https://doi.org/10.1007/s11336-024-09960-x)针对只有二分项目的测验所开发的修正方法的直接扩展。我们的模拟研究显示,这三种新的校正不仅往往优于原始统计量,而且也优于辛哈雷(Sinharay)提出的现有校正(Psychometrika,2016,81,992)。因此,新的校正似乎是评估多项式项目测试中的人称契合度的有前途的工具。
{"title":"Three new corrections for standardized person-fit statistics for tests with polytomous items","authors":"Kylie Gorney","doi":"10.1111/bmsp.12342","DOIUrl":"10.1111/bmsp.12342","url":null,"abstract":"<p>Recent years have seen a growing interest in the development of person-fit statistics for tests with polytomous items. Some of the most popular person-fit statistics for such tests belong to the class of standardized person-fit statistics, <span></span><math>\u0000 <semantics>\u0000 <mrow>\u0000 <mi>T</mi>\u0000 </mrow>\u0000 </semantics></math>, that is assumed to have a standard normal null distribution. However, this distribution only holds when (a) the true ability parameter is known and (b) an infinite number of items are available. In practice, both conditions are violated, and the quality of person-fit results is expected to deteriorate. In this paper, we propose three new corrections for <span></span><math>\u0000 <semantics>\u0000 <mrow>\u0000 <mi>T</mi>\u0000 </mrow>\u0000 </semantics></math> that simultaneously account for the use of an estimated ability parameter and the use of a finite number of items. The three new corrections are direct extensions of those that were developed by Gorney et al. (<i>Psychometrika</i>, 2024, https://doi.org/10.1007/s11336-024-09960-x) for tests with only dichotomous items. Our simulation study reveals that the three new corrections tend to outperform not only the original statistic <span></span><math>\u0000 <semantics>\u0000 <mrow>\u0000 <mi>T</mi>\u0000 </mrow>\u0000 </semantics></math> but also an existing correction for <span></span><math>\u0000 <semantics>\u0000 <mrow>\u0000 <mi>T</mi>\u0000 </mrow>\u0000 </semantics></math> proposed by Sinharay (<i>Psychometrika</i>, 2016, 81, 992). Therefore, the new corrections appear to be promising tools for assessing person fit in tests with polytomous items.</p>","PeriodicalId":55322,"journal":{"name":"British Journal of Mathematical & Statistical Psychology","volume":"77 3","pages":"634-650"},"PeriodicalIF":1.8,"publicationDate":"2024-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/bmsp.12342","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140629021","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Modelling motion energy in psychotherapy: A dynamical systems approach 心理治疗中的运动能量建模:动态系统方法
IF 1.8 3区 心理学 Q3 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-04-16 DOI: 10.1111/bmsp.12341
Itai Dattner

In this study we introduce an innovative mathematical and statistical framework for the analysis of motion energy dynamics in psychotherapy sessions. Our method combines motion energy dynamics with coupled linear ordinary differential equations and a measurement error model, contributing new clinical parameters to enhance psychotherapy research. Our approach transforms raw motion energy data into an interpretable account of therapist–patient interactions, providing novel insights into the dynamics of these interactions. A key aspect of our framework is the development of a new measure of synchrony between the motion energies of therapists and patients, which holds significant clinical and theoretical value in psychotherapy. The practical applicability and effectiveness of our modelling and estimation framework are demonstrated through the analysis of real session data. This work advances the quantitative analysis of motion dynamics in psychotherapy, offering important implications for future research and therapeutic practice.

在本研究中,我们介绍了一种创新的数学和统计框架,用于分析心理治疗过程中的运动能量动态。我们的方法将运动能量动力学与耦合线性常微分方程和测量误差模型相结合,为加强心理治疗研究提供了新的临床参数。我们的方法将原始运动能量数据转化为治疗师与患者互动的可解释说明,为这些互动的动力学提供了新的见解。我们框架的一个关键方面是开发了一种新的治疗师与患者运动能量同步性测量方法,这在心理治疗中具有重要的临床和理论价值。通过对真实疗程数据的分析,我们展示了建模和估算框架的实际应用性和有效性。这项工作推进了心理治疗中运动动态的定量分析,对未来的研究和治疗实践具有重要意义。
{"title":"Modelling motion energy in psychotherapy: A dynamical systems approach","authors":"Itai Dattner","doi":"10.1111/bmsp.12341","DOIUrl":"10.1111/bmsp.12341","url":null,"abstract":"<p>In this study we introduce an innovative mathematical and statistical framework for the analysis of motion energy dynamics in psychotherapy sessions. Our method combines motion energy dynamics with coupled linear ordinary differential equations and a measurement error model, contributing new clinical parameters to enhance psychotherapy research. Our approach transforms raw motion energy data into an interpretable account of therapist–patient interactions, providing novel insights into the dynamics of these interactions. A key aspect of our framework is the development of a new measure of synchrony between the motion energies of therapists and patients, which holds significant clinical and theoretical value in psychotherapy. The practical applicability and effectiveness of our modelling and estimation framework are demonstrated through the analysis of real session data. This work advances the quantitative analysis of motion dynamics in psychotherapy, offering important implications for future research and therapeutic practice.</p>","PeriodicalId":55322,"journal":{"name":"British Journal of Mathematical & Statistical Psychology","volume":"77 3","pages":"613-633"},"PeriodicalIF":1.8,"publicationDate":"2024-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/bmsp.12341","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140611390","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Assessing quality of selection procedures: Lower bound of false positive rate as a function of inter-rater reliability 评估筛选程序的质量:假阳性率下限与评分者间可靠性的关系
IF 1.8 3区 心理学 Q3 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-04-15 DOI: 10.1111/bmsp.12343
František Bartoš, Patrícia Martinková

Inter-rater reliability (IRR) is one of the commonly used tools for assessing the quality of ratings from multiple raters. However, applicant selection procedures based on ratings from multiple raters usually result in a binary outcome; the applicant is either selected or not. This final outcome is not considered in IRR, which instead focuses on the ratings of the individual subjects or objects. We outline the connection between the ratings' measurement model (used for IRR) and a binary classification framework. We develop a simple way of approximating the probability of correctly selecting the best applicants which allows us to compute error probabilities of the selection procedure (i.e., false positive and false negative rate) or their lower bounds. We draw connections between the IRR and the binary classification metrics, showing that binary classification metrics depend solely on the IRR coefficient and proportion of selected applicants. We assess the performance of the approximation in a simulation study and apply it in an example comparing the reliability of multiple grant peer review selection procedures. We also discuss other possible uses of the explored connections in other contexts, such as educational testing, psychological assessment, and health-related measurement, and implement the computations in the R package IRR2FPR.

评分者之间的可靠性(IRR)是评估多个评分者评分质量的常用工具之一。然而,基于多个评分者评分的申请人甄选程序通常会产生二元结果:申请人要么被选中,要么不被选中。IRR 并不考虑这一最终结果,而是将重点放在对单个主体或对象的评分上。我们概述了评级测量模型(用于 IRR)与二元分类框架之间的联系。我们开发了一种近似正确选择最佳申请人概率的简单方法,通过这种方法,我们可以计算选择程序的错误概率(即假阳性率和假阴性率)或其下限。我们得出了 IRR 和二元分类指标之间的联系,表明二元分类指标完全取决于 IRR 系数和入选申请人的比例。我们在模拟研究中评估了近似值的性能,并将其应用于一个比较多个基金同行评审选择程序可靠性的例子中。我们还讨论了在教育测试、心理评估和健康相关测量等其他情况下探索出的联系的其他可能用途,并在 R 软件包 IRR2FPR 中实现了计算。
{"title":"Assessing quality of selection procedures: Lower bound of false positive rate as a function of inter-rater reliability","authors":"František Bartoš,&nbsp;Patrícia Martinková","doi":"10.1111/bmsp.12343","DOIUrl":"10.1111/bmsp.12343","url":null,"abstract":"<p>Inter-rater reliability (IRR) is one of the commonly used tools for assessing the quality of ratings from multiple raters. However, applicant selection procedures based on ratings from multiple raters usually result in a binary outcome; the applicant is either selected or not. This final outcome is not considered in IRR, which instead focuses on the ratings of the individual subjects or objects. We outline the connection between the ratings' measurement model (used for IRR) and a binary classification framework. We develop a simple way of approximating the probability of correctly selecting the best applicants which allows us to compute error probabilities of the selection procedure (i.e., false positive and false negative rate) or their lower bounds. We draw connections between the IRR and the binary classification metrics, showing that binary classification metrics depend solely on the IRR coefficient and proportion of selected applicants. We assess the performance of the approximation in a simulation study and apply it in an example comparing the reliability of multiple grant peer review selection procedures. We also discuss other possible uses of the explored connections in other contexts, such as educational testing, psychological assessment, and health-related measurement, and implement the computations in the R package <span>IRR2FPR</span>.</p>","PeriodicalId":55322,"journal":{"name":"British Journal of Mathematical & Statistical Psychology","volume":"77 3","pages":"651-671"},"PeriodicalIF":1.8,"publicationDate":"2024-04-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/bmsp.12343","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140592832","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
British Journal of Mathematical & Statistical Psychology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1