首页 > 最新文献

Psychometrika最新文献

英文 中文
Modeling Missing at Random Neuropsychological Test Scores Using a Mixture of Binomial Product Experts. 建模缺失随机神经心理学测试分数使用二项产品专家的混合物。
IF 3.1 2区 心理学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-10-22 DOI: 10.1017/psy.2025.10053
Daniel Suen, Yen-Chi Chen

Multivariate bounded discrete data arises in many fields. In the setting of dementia studies, such data are collected when individuals complete neuropsychological tests. We outline a modeling and inference procedure that can model the joint distribution conditional on baseline covariates, leveraging previous work on mixtures of experts and latent class models. Furthermore, we illustrate how the work can be extended when the outcome data are missing at random using a nested EM algorithm. The proposed model can incorporate covariate information and perform imputation and clustering. We apply our model to simulated data and an Alzheimer's disease data set.

多元有界离散数据在许多领域都有应用。在痴呆症研究的背景下,这些数据是在个体完成神经心理学测试时收集的。我们概述了一个建模和推理过程,该过程可以根据基线协变量对联合分布进行建模,利用以前在专家和潜在类模型混合方面的工作。此外,我们说明了当结果数据随机丢失时,如何使用嵌套EM算法扩展工作。该模型可以融合协变量信息,并进行插值和聚类。我们将我们的模型应用于模拟数据和阿尔茨海默病数据集。
{"title":"Modeling Missing at Random Neuropsychological Test Scores Using a Mixture of Binomial Product Experts.","authors":"Daniel Suen, Yen-Chi Chen","doi":"10.1017/psy.2025.10053","DOIUrl":"10.1017/psy.2025.10053","url":null,"abstract":"<p><p>Multivariate bounded discrete data arises in many fields. In the setting of dementia studies, such data are collected when individuals complete neuropsychological tests. We outline a modeling and inference procedure that can model the joint distribution conditional on baseline covariates, leveraging previous work on mixtures of experts and latent class models. Furthermore, we illustrate how the work can be extended when the outcome data are missing at random using a nested EM algorithm. The proposed model can incorporate covariate information and perform imputation and clustering. We apply our model to simulated data and an Alzheimer's disease data set.</p>","PeriodicalId":54534,"journal":{"name":"Psychometrika","volume":" ","pages":"1-24"},"PeriodicalIF":3.1,"publicationDate":"2025-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145349858","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Extending the Bicriterion Approach for Anticlustering: Exact and Hybrid Approaches. 扩展双准则方法的反聚类:精确和混合方法。
IF 3.1 2区 心理学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-10-07 DOI: 10.1017/psy.2025.10052
Martin Papenberg, Martin Breuer, Max Diekhoff, Nguyen K Tran, Gunnar W Klau
{"title":"Extending the Bicriterion Approach for Anticlustering: Exact and Hybrid Approaches.","authors":"Martin Papenberg, Martin Breuer, Max Diekhoff, Nguyen K Tran, Gunnar W Klau","doi":"10.1017/psy.2025.10052","DOIUrl":"10.1017/psy.2025.10052","url":null,"abstract":"","PeriodicalId":54534,"journal":{"name":"Psychometrika","volume":" ","pages":"1-43"},"PeriodicalIF":3.1,"publicationDate":"2025-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12805193/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145240242","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Standard Errors for Reliability Coefficients. 可靠性系数的标准误差。
IF 3.1 2区 心理学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-09-30 DOI: 10.1017/psy.2025.10050
L Andries van der Ark

Reliability analysis is one of the most conducted analyses in applied psychometrics. It entails the assessment of reliability of both item scores and scale scores using coefficients that estimate the reliability (e.g., Cronbach's alpha), measurement precision (e.g., estimated standard error of measurement), or the contribution of individual items to the reliability (e.g., corrected item-total correlations). Most statistical software packages used in social and behavioral sciences offer these reliability coefficients, whereas standard errors are generally unavailable, which is a bit ironic for coefficients about measurement precision. This article provides analytic nonparametric standard errors for coefficients used in reliability analysis. As most scores used in behavioral sciences are discrete, standard errors are derived under the relatively unrestrictive multinomial sampling scheme. Tedious derivations are presented in appendices, and R functions for computing standard errors are available from the Open Science Framework. Bias and variance of standard errors, and coverage of the corresponding Wald-based confidence intervals are studied using simulated item scores. Bias and variance, and coverage are generally satisfactory for larger sample sizes, and parameter values are not close to the boundary of the parameter space.

信度分析是应用心理测量学中应用最多的分析方法之一。它需要使用估计可靠性(例如,Cronbach's alpha)、测量精度(例如,估计的测量标准误差)或单个项目对可靠性的贡献(例如,校正的项目-总相关性)的系数来评估项目分数和量表分数的可靠性。社会和行为科学中使用的大多数统计软件包都提供了这些可靠性系数,而标准误差通常是不可用的,这对于测量精度的系数来说有点讽刺。本文给出了可靠性分析中所用系数的解析性非参数标准误差。由于行为科学中使用的大多数分数是离散的,标准误差是在相对不受限制的多项抽样方案下得出的。冗长的推导在附录中给出,计算标准误差的R函数可以从开放科学框架中获得。使用模拟项目得分研究标准误差的偏差和方差,以及相应的基于wald的置信区间的覆盖率。对于较大的样本量,偏差、方差和覆盖率通常是令人满意的,参数值并不接近参数空间的边界。
{"title":"Standard Errors for Reliability Coefficients.","authors":"L Andries van der Ark","doi":"10.1017/psy.2025.10050","DOIUrl":"10.1017/psy.2025.10050","url":null,"abstract":"<p><p>Reliability analysis is one of the most conducted analyses in applied psychometrics. It entails the assessment of reliability of both item scores and scale scores using coefficients that estimate the reliability (e.g., Cronbach's alpha), measurement precision (e.g., estimated standard error of measurement), or the contribution of individual items to the reliability (e.g., corrected item-total correlations). Most statistical software packages used in social and behavioral sciences offer these reliability coefficients, whereas standard errors are generally unavailable, which is a bit ironic for coefficients about measurement precision. This article provides analytic nonparametric standard errors for coefficients used in reliability analysis. As most scores used in behavioral sciences are discrete, standard errors are derived under the relatively unrestrictive multinomial sampling scheme. Tedious derivations are presented in appendices, and R functions for computing standard errors are available from the Open Science Framework. Bias and variance of standard errors, and coverage of the corresponding Wald-based confidence intervals are studied using simulated item scores. Bias and variance, and coverage are generally satisfactory for larger sample sizes, and parameter values are not close to the boundary of the parameter space.</p>","PeriodicalId":54534,"journal":{"name":"Psychometrika","volume":" ","pages":"1-26"},"PeriodicalIF":3.1,"publicationDate":"2025-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12805205/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145193499","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Optimally Regularized Estimator of Multilevel Latent Variable Models, with Improved MSE Performance. 一种具有改进MSE性能的多水平潜变量模型的最优正则化估计器。
IF 3.1 2区 心理学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-09-22 DOI: 10.1017/psy.2025.10045
Valerii Dashuk, Martin Hecht, Oliver Lüdtke, Alexander Robitzsch, Steffen Zitzmann
{"title":"An Optimally Regularized Estimator of Multilevel Latent Variable Models, with Improved MSE Performance.","authors":"Valerii Dashuk, Martin Hecht, Oliver Lüdtke, Alexander Robitzsch, Steffen Zitzmann","doi":"10.1017/psy.2025.10045","DOIUrl":"10.1017/psy.2025.10045","url":null,"abstract":"","PeriodicalId":54534,"journal":{"name":"Psychometrika","volume":" ","pages":"1-75"},"PeriodicalIF":3.1,"publicationDate":"2025-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12805209/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145114795","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Obituary Robert J. Mislevy (1950-2025). Robert J. Mislevy讣告(1950-2025)。
IF 3.1 2区 心理学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-09-15 DOI: 10.1017/psy.2025.10049
Roy Levy, Russell G Almond
{"title":"Obituary Robert J. Mislevy (1950-2025).","authors":"Roy Levy, Russell G Almond","doi":"10.1017/psy.2025.10049","DOIUrl":"https://doi.org/10.1017/psy.2025.10049","url":null,"abstract":"","PeriodicalId":54534,"journal":{"name":"Psychometrika","volume":" ","pages":"1-12"},"PeriodicalIF":3.1,"publicationDate":"2025-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145066451","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Novel Method for Detecting Intersectional DIF: Multilevel Random Item Effects Model with Regularized Gaussian Variational Estimation. 一种检测交叉DIF的新方法:正则化高斯变分估计的多水平随机项目效应模型。
IF 3.1 2区 心理学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-09-15 DOI: 10.1017/psy.2025.10046
He Ren, Weicong Lyu, Chun Wang, Gongjun Xu

Differential item functioning (DIF) screening has long been suggested to ensure assessment fairness. Traditional DIF methods typically focus on the main effects of demographic variables on item parameters, overlooking the interactions among multiple identities. Drawing on the intersectionality framework, we define intersectional DIF as deviations in item parameters that arise from the interactions among demographic variables beyond their main effects and propose a novel item response theory (IRT) approach for detecting intersectional DIF. Under our framework, fixed effects are used to account for traditional DIF, while random item effects are introduced to capture intersectional DIF. We further introduce the concept of intersectional impact, which refers to interaction effects on group-level mean ability. Depending on which item parameters are affected and whether intersectional impact is considered, we propose four models, which aim to detect intersectional uniform DIF (UDIF), intersectional UDIF with intersectional impact, intersectional non-uniform DIF (NUDIF), and intersectional NUDIF with intersectional impact, respectively. For efficient model estimation, a regularized Gaussian variational expectation-maximization algorithm is developed. Simulation studies demonstrate that our methods can effectively detect intersectional UDIF, although their detection of intersectional NUDIF is more limited.

差异项目功能筛选(DIF)一直被建议用于确保评估的公平性。传统的DIF方法通常关注人口统计变量对项目参数的主要影响,而忽略了多个身份之间的相互作用。在交叉性框架的基础上,我们将交叉性DIF定义为人口变量之间的相互作用所产生的项目参数偏差,并提出了一种新的项目反应理论(IRT)方法来检测交叉性DIF。在我们的框架下,使用固定效应来解释传统的DIF,而引入随机项目效应来捕获交叉DIF。我们进一步引入了交叉影响的概念,它指的是群体水平平均能力的交互效应。根据受影响的项目参数和是否考虑交叉冲击,我们提出了四种模型,分别用于检测交叉均匀DIF (UDIF)、交叉不均匀DIF (NUDIF)、交叉性NUDIF和交叉性NUDIF。为了有效地估计模型,提出了一种正则化高斯变分期望最大化算法。仿真研究表明,我们的方法可以有效地检测出交叉的UDIF,尽管它们对交叉的UDIF的检测比较有限。
{"title":"A Novel Method for Detecting Intersectional DIF: Multilevel Random Item Effects Model with Regularized Gaussian Variational Estimation.","authors":"He Ren, Weicong Lyu, Chun Wang, Gongjun Xu","doi":"10.1017/psy.2025.10046","DOIUrl":"10.1017/psy.2025.10046","url":null,"abstract":"<p><p>Differential item functioning (DIF) screening has long been suggested to ensure assessment fairness. Traditional DIF methods typically focus on the main effects of demographic variables on item parameters, overlooking the interactions among multiple identities. Drawing on the intersectionality framework, we define intersectional DIF as deviations in item parameters that arise from the interactions among demographic variables beyond their main effects and propose a novel item response theory (IRT) approach for detecting intersectional DIF. Under our framework, fixed effects are used to account for traditional DIF, while random item effects are introduced to capture intersectional DIF. We further introduce the concept of intersectional impact, which refers to interaction effects on group-level mean ability. Depending on which item parameters are affected and whether intersectional impact is considered, we propose four models, which aim to detect intersectional uniform DIF (UDIF), intersectional UDIF with intersectional impact, intersectional non-uniform DIF (NUDIF), and intersectional NUDIF with intersectional impact, respectively. For efficient model estimation, a regularized Gaussian variational expectation-maximization algorithm is developed. Simulation studies demonstrate that our methods can effectively detect intersectional UDIF, although their detection of intersectional NUDIF is more limited.</p>","PeriodicalId":54534,"journal":{"name":"Psychometrika","volume":" ","pages":"1-25"},"PeriodicalIF":3.1,"publicationDate":"2025-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145066440","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Algebraic Approach to Maximum Likelihood Factor Analysis. 极大似然因子分析的代数方法。
IF 3.1 2区 心理学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-09-15 DOI: 10.1017/psy.2025.10047
Ryoya Fukasaku, Kei Hirose, Yutaro Kabata, Keisuke Teramoto

In maximum likelihood factor analysis, we need to solve a complicated system of algebraic equations, known as the normal equation, to get maximum likelihood estimates (MLEs). Since this equation is difficult to solve analytically, its solutions are typically computed with continuous optimization methods, such as the Newton-Raphson method. With this procedure, however, the MLEs are dependent on initial values since the log-likelihood function is highly non-concave. Particularly, the estimates of unique variances can result in zero or negative, referred to as improper solutions; in this case, the MLE can be severely unstable. To delve into the issue of the instability, we algebraically compute all candidates for the MLE. We provide an algorithm based on algebraic computations that is carefully designed for maximum likelihood factor analysis. To be specific, Gröbner bases are employed, powerful tools to get simplified sub-problems for given systems of algebraic equations. Our algebraic algorithm provides the MLE independent of the initial values. While computationally demanding, our algebraic approach is applicable to small-scale problems and provides valuable insights into the characterization of improper solutions. For larger-scale problems, we provide numerical methods as practical alternatives to the algebraic approach. We perform numerical experiments to investigate the characteristics of the MLE with our two approaches.

在极大似然因子分析中,我们需要求解一个复杂的代数方程组,即正态方程,以得到极大似然估计(MLEs)。由于该方程难以解析求解,因此通常使用连续优化方法(如Newton-Raphson方法)计算其解。然而,在这个过程中,由于对数似然函数高度非凹,最大似然值依赖于初始值。特别是,唯一方差的估计可能导致零或负,称为不当解;在这种情况下,MLE可能会严重不稳定。为了深入研究不稳定性问题,我们用代数方法计算了MLE的所有候选点。我们提供了一种基于代数计算的算法,该算法经过精心设计,用于最大似然因子分析。具体地说,Gröbner基是一种强大的工具,可以得到给定代数方程组的简化子问题。我们的代数算法提供了独立于初始值的MLE。虽然计算要求高,但我们的代数方法适用于小规模问题,并为不适当解的表征提供了有价值的见解。对于更大规模的问题,我们提供数值方法作为代数方法的实用替代方案。我们用这两种方法进行了数值实验来研究MLE的特性。
{"title":"Algebraic Approach to Maximum Likelihood Factor Analysis.","authors":"Ryoya Fukasaku, Kei Hirose, Yutaro Kabata, Keisuke Teramoto","doi":"10.1017/psy.2025.10047","DOIUrl":"10.1017/psy.2025.10047","url":null,"abstract":"<p><p>In maximum likelihood factor analysis, we need to solve a complicated system of algebraic equations, known as the normal equation, to get maximum likelihood estimates (MLEs). Since this equation is difficult to solve analytically, its solutions are typically computed with continuous optimization methods, such as the Newton-Raphson method. With this procedure, however, the MLEs are dependent on initial values since the log-likelihood function is highly non-concave. Particularly, the estimates of unique variances can result in zero or negative, referred to as improper solutions; in this case, the MLE can be severely unstable. To delve into the issue of the instability, we algebraically compute all candidates for the MLE. We provide an algorithm based on algebraic computations that is carefully designed for maximum likelihood factor analysis. To be specific, Gröbner bases are employed, powerful tools to get simplified sub-problems for given systems of algebraic equations. Our algebraic algorithm provides the MLE independent of the initial values. While computationally demanding, our algebraic approach is applicable to small-scale problems and provides valuable insights into the characterization of improper solutions. For larger-scale problems, we provide numerical methods as practical alternatives to the algebraic approach. We perform numerical experiments to investigate the characteristics of the MLE with our two approaches.</p>","PeriodicalId":54534,"journal":{"name":"Psychometrika","volume":" ","pages":"1-33"},"PeriodicalIF":3.1,"publicationDate":"2025-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12805208/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145066437","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Extended Two-Parameter Logistic Item Response Model to Handle Continuous Responses and Sparse Polytomous Responses. 一种处理连续响应和稀疏多域响应的扩展双参数Logistic项响应模型。
IF 3.1 2区 心理学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-09-02 DOI: 10.1017/psy.2025.10044
Seewoo Li, Hyo Jeong Shin

The article proposes a novel item response theory model to handle continuous responses and sparse polytomous responses in psychological and educational measurement. The model extends the traditional two-parameter logistic model by incorporating a precision parameter, which, along with a beta distribution, forms an error component that accounts for the response continuity. Furthermore, transforming ordinal responses to a continuous scale enables the fitting of polytomous item responses while consistently applying three parameters per item for model parsimony. The model's accuracy, stability, and computational efficiency in parameter estimation were examined. An empirical application demonstrated the model's effectiveness in representing the characteristics of continuous item responses. Additionally, the model's applicability to sparse polytomous data was supported by cross-validation results from another empirical dataset, which indicates that the model's parsimony can enhance model-data fit compared to existing polytomous models.

本文提出了一种新的项目反应理论模型,用于处理心理和教育测量中的连续反应和稀疏多模反应。该模型扩展了传统的双参数逻辑模型,加入了一个精度参数,该参数与beta分布一起形成了解释响应连续性的误差分量。此外,将有序响应转换为连续尺度可以拟合多个项目响应,同时一致地为每个项目应用三个参数以实现模型的简约性。验证了模型在参数估计中的准确性、稳定性和计算效率。一个实证应用证明了该模型在表示连续项目反应特征方面的有效性。此外,另一个经验数据集的交叉验证结果支持了该模型对稀疏多片数据的适用性,这表明与现有多片模型相比,该模型的简约性可以增强模型-数据的拟合。
{"title":"An Extended Two-Parameter Logistic Item Response Model to Handle Continuous Responses and Sparse Polytomous Responses.","authors":"Seewoo Li, Hyo Jeong Shin","doi":"10.1017/psy.2025.10044","DOIUrl":"10.1017/psy.2025.10044","url":null,"abstract":"<p><p>The article proposes a novel item response theory model to handle continuous responses and sparse polytomous responses in psychological and educational measurement. The model extends the traditional two-parameter logistic model by incorporating a precision parameter, which, along with a beta distribution, forms an error component that accounts for the response continuity. Furthermore, transforming ordinal responses to a continuous scale enables the fitting of polytomous item responses while consistently applying three parameters per item for model parsimony. The model's accuracy, stability, and computational efficiency in parameter estimation were examined. An empirical application demonstrated the model's effectiveness in representing the characteristics of continuous item responses. Additionally, the model's applicability to sparse polytomous data was supported by cross-validation results from another empirical dataset, which indicates that the model's parsimony can enhance model-data fit compared to existing polytomous models.</p>","PeriodicalId":54534,"journal":{"name":"Psychometrika","volume":" ","pages":"1-27"},"PeriodicalIF":3.1,"publicationDate":"2025-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12805198/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144978359","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Identification and Interpretation of the Completely Oblique Rasch Bifactor Model. 完全斜拉希双因子模型的识别与解释。
IF 3.1 2区 心理学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-09-01 Epub Date: 2025-04-24 DOI: 10.1017/psy.2025.14
Denis Federiakin, Mark R Wilson

Bifactor Item Response Theory (IRT) models are the usual option for modeling composite constructs. However, in application, researchers typically must assume that all dimensions of person parameter space are orthogonal. This can result in absurd model interpretations. We propose a new bifactor model-the Completely Oblique Rasch Bifactor (CORB) model-which allows for estimation of correlations between all dimensions. We discuss relations of this model to other oblique bifactor models and study the conditions for its identification in the dichotomous case. We analytically prove that this model is identified in the case that (a) at least one item loads solely on the general factor and no items are shared between any pair of specific factors (we call this the G-structure), or (b) if no items load solely on the general factor, but at least one item is shared between every pair of the specific factors (the S-structure). Using simulated and real data, we show that this model outperforms the other partially oblique bifactor models in terms of model fit because it corresponds to the more realistic assumptions about construct structure. We also discuss possible difficulties in the interpretation of the CORB model's parameters using, by analogy, the "explaining away" phenomenon from Bayesian reasoning.

双因素项目反应理论(IRT)模型是组合构造建模的常用选择。但在实际应用中,研究人员通常必须假设人参数空间的所有维度都是正交的。这可能导致荒谬的模型解释。我们提出了一个新的双因子模型-完全斜拉希双因子(CORB)模型-它允许估计所有维度之间的相关性。讨论了该模型与其他斜双因子模型的关系,并研究了其在二分情况下的辨识条件。我们分析证明,在(a)至少有一个项目仅在一般因素上加载,没有项目在任何一对特定因素之间共享(我们称之为g结构),或(b)如果没有项目仅在一般因素上加载,但至少有一个项目在每对特定因素之间共享(s结构),则该模型是确定的。使用模拟和真实数据,我们表明该模型在模型拟合方面优于其他部分倾斜双因子模型,因为它对应于关于构造结构的更现实的假设。我们还讨论了在解释CORB模型参数时可能遇到的困难,通过类比,使用贝叶斯推理中的“解释”现象。
{"title":"Identification and Interpretation of the Completely Oblique Rasch Bifactor Model.","authors":"Denis Federiakin, Mark R Wilson","doi":"10.1017/psy.2025.14","DOIUrl":"10.1017/psy.2025.14","url":null,"abstract":"<p><p>Bifactor Item Response Theory (IRT) models are the usual option for modeling composite constructs. However, in application, researchers typically must assume that all dimensions of person parameter space are orthogonal. This can result in absurd model interpretations. We propose a new bifactor model-the Completely Oblique Rasch Bifactor (CORB) model-which allows for estimation of correlations between all dimensions. We discuss relations of this model to other oblique bifactor models and study the conditions for its identification in the dichotomous case. We analytically prove that this model is identified in the case that (a) at least one item loads solely on the general factor and no items are shared between any pair of specific factors (we call this the G-structure), or (b) if no items load solely on the general factor, but at least one item is shared between every pair of the specific factors (the S-structure). Using simulated and real data, we show that this model outperforms the other partially oblique bifactor models in terms of model fit because it corresponds to the more realistic assumptions about construct structure. We also discuss possible difficulties in the interpretation of the CORB model's parameters using, by analogy, the \"explaining away\" phenomenon from Bayesian reasoning.</p>","PeriodicalId":54534,"journal":{"name":"Psychometrika","volume":" ","pages":"1284-1318"},"PeriodicalIF":3.1,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12660021/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144029305","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Latent Markov Model for Noninvariant Measurements: An Application to Interaction Log Data From Computer-Interactive Assessments. 非不变测量的潜马尔可夫模型:在计算机交互评估的交互日志数据中的应用。
IF 3.1 2区 心理学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-09-01 Epub Date: 2025-08-26 DOI: 10.1017/psy.2025.10029
Hyeon-Ah Kang

The latent Markov model (LMM) has been increasingly used to analyze log data from computer-interactive assessments. An important consideration in applying the LMM to assessment data is measurement effects of items. In educational and psychological assessment, items exhibit distinct psychometric qualities and induce systematic variance to assessment outcome data. The current development in LMM, however, assumes that items have uniform effects and do not contribute to the variance of measurement outcomes. In this study, we propose a refinement of LMM that relaxes the measurement invariance constraint and examine empirical performance of the new framework through numerical experimentation. We modify the LMM for noninvariant measurements and refine the inferential scheme to accommodate the event-specific measurement effects. Numerical experiments are conducted to validate the proposed inference methods and evaluate the performance of the new framework. Results suggest that the proposed inferential scheme performs adequately well in retrieving the model parameters and state profiles. The new LMM framework demonstrated reliable and stable performance in modeling latent processes while appropriately accounting for items' measurement effects. Compared with the traditional scheme, the refined framework demonstrated greater relevance to real assessment data and yielded more robust inference results when the model was ill-specified. The findings from the empirical evaluations suggest that the new framework has potential for serving large-scale assessment data that exhibit distinct measurement effects.

潜马尔可夫模型(LMM)越来越多地用于分析计算机交互评估的测井数据。将LMM应用于评估数据的一个重要考虑因素是项目的测量效果。在教育和心理评估中,项目表现出不同的心理测量质量,并导致评估结果数据的系统方差。然而,当前LMM的发展假设项目具有统一的效果,并且不会导致测量结果的方差。在这项研究中,我们提出了一种改进的LMM,放宽了测量不变性约束,并通过数值实验检验了新框架的经验性能。我们修改了非不变测量的LMM,并改进了推理方案以适应特定于事件的测量效果。数值实验验证了所提出的推理方法,并对新框架的性能进行了评价。结果表明,所提出的推理方案在检索模型参数和状态概况方面表现良好。新的LMM框架在对潜在过程的建模中表现出可靠和稳定的性能,同时适当地考虑了项目的测量效应。与传统方案相比,在模型不明确的情况下,改进框架与真实评估数据的相关性更强,推理结果更稳健。实证评估结果表明,新框架具有服务于具有明显测量效果的大规模评估数据的潜力。
{"title":"A Latent Markov Model for Noninvariant Measurements: An Application to Interaction Log Data From Computer-Interactive Assessments.","authors":"Hyeon-Ah Kang","doi":"10.1017/psy.2025.10029","DOIUrl":"10.1017/psy.2025.10029","url":null,"abstract":"<p><p>The latent Markov model (LMM) has been increasingly used to analyze log data from computer-interactive assessments. An important consideration in applying the LMM to assessment data is measurement effects of items. In educational and psychological assessment, items exhibit distinct psychometric qualities and induce systematic variance to assessment outcome data. The current development in LMM, however, assumes that items have uniform effects and do not contribute to the variance of measurement outcomes. In this study, we propose a refinement of LMM that relaxes the measurement invariance constraint and examine empirical performance of the new framework through numerical experimentation. We modify the LMM for noninvariant measurements and refine the inferential scheme to accommodate the event-specific measurement effects. Numerical experiments are conducted to validate the proposed inference methods and evaluate the performance of the new framework. Results suggest that the proposed inferential scheme performs adequately well in retrieving the model parameters and state profiles. The new LMM framework demonstrated reliable and stable performance in modeling latent processes while appropriately accounting for items' measurement effects. Compared with the traditional scheme, the refined framework demonstrated greater relevance to real assessment data and yielded more robust inference results when the model was ill-specified. The findings from the empirical evaluations suggest that the new framework has potential for serving large-scale assessment data that exhibit distinct measurement effects.</p>","PeriodicalId":54534,"journal":{"name":"Psychometrika","volume":" ","pages":"1481-1505"},"PeriodicalIF":3.1,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12660023/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144978378","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Psychometrika
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1