首页 > 最新文献

Psychometrika最新文献

英文 中文
Generative Adversarial Networks for High-Dimensional Item Factor Analysis: A Deep Adversarial Learning Algorithm. 高维项目因子分析的生成对抗网络:一种深度对抗学习算法。
IF 3.1 2区 心理学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-11-11 DOI: 10.1017/psy.2025.10059
Nanyu Luo, Feng Ji

Advances in deep learning and representation learning have transformed item factor analysis (IFA) in the item response theory (IRT) literature by enabling more efficient and accurate parameter estimation. Variational autoencoders (VAEs) are widely used to model high-dimensional latent variables in this context, but the limited expressiveness of their inference networks can still hinder performance. We introduce adversarial variational Bayes (AVB) and an importance-weighted extension (IWAVB) as more flexible inference algorithms for IFA. By combining VAEs with generative adversarial networks (GANs), AVB uses an auxiliary discriminator network to frame estimation as a two-player game and removes the restrictive standard normal assumption on the latent variables. Theoretically, AVB and IWAVB can achieve likelihoods that match or exceed those of VAEs and importance-weighted autoencoders (IWAEs). In exploratory analyses of empirical data, IWAVB attained higher likelihoods than IWAE, indicating greater expressiveness. In confirmatory simulations, IWAVB achieved comparable mean-square error in parameter recovery while consistently yielding higher likelihoods, and it clearly outperformed IWAE when the latent distribution was multimodal. These findings suggest that IWAVB can scale IFA to complex, large-scale, and potentially multimodal settings, supporting closer integration of psychometrics with modern multimodal data analysis.

深度学习和表征学习的进步通过实现更有效和准确的参数估计,改变了项目反应理论(IRT)文献中的项目因素分析(IFA)。在这种情况下,变分自编码器(VAEs)被广泛用于高维潜在变量的建模,但其推理网络的有限表达能力仍然会阻碍性能。我们引入了对抗变分贝叶斯(AVB)和重要加权扩展(IWAVB)作为更灵活的IFA推理算法。AVB通过将ves与生成对抗网络(GANs)相结合,使用辅助判别器网络将估计框架为两人博弈,并消除了对潜在变量的限制性标准正态假设。理论上,AVB和IWAVB可以实现匹配或超过VAEs和重要性加权自编码器(IWAEs)的可能性。在实证数据的探索性分析中,IWAVB比IWAE获得更高的可能性,表明更强的表达能力。在验证性模拟中,IWAVB在参数恢复中获得了相当的均方误差,同时始终产生更高的似然,并且当潜在分布是多模态时,它明显优于IWAE。这些发现表明,IWAVB可以将IFA扩展到复杂、大规模和潜在的多模态环境,支持心理测量学与现代多模态数据分析的更紧密整合。
{"title":"Generative Adversarial Networks for High-Dimensional Item Factor Analysis: A Deep Adversarial Learning Algorithm.","authors":"Nanyu Luo, Feng Ji","doi":"10.1017/psy.2025.10059","DOIUrl":"10.1017/psy.2025.10059","url":null,"abstract":"<p><p>Advances in deep learning and representation learning have transformed item factor analysis (IFA) in the item response theory (IRT) literature by enabling more efficient and accurate parameter estimation. Variational autoencoders (VAEs) are widely used to model high-dimensional latent variables in this context, but the limited expressiveness of their inference networks can still hinder performance. We introduce adversarial variational Bayes (AVB) and an importance-weighted extension (IWAVB) as more flexible inference algorithms for IFA. By combining VAEs with generative adversarial networks (GANs), AVB uses an auxiliary discriminator network to frame estimation as a two-player game and removes the restrictive standard normal assumption on the latent variables. Theoretically, AVB and IWAVB can achieve likelihoods that match or exceed those of VAEs and importance-weighted autoencoders (IWAEs). In exploratory analyses of empirical data, IWAVB attained higher likelihoods than IWAE, indicating greater expressiveness. In confirmatory simulations, IWAVB achieved comparable mean-square error in parameter recovery while consistently yielding higher likelihoods, and it clearly outperformed IWAE when the latent distribution was multimodal. These findings suggest that IWAVB can scale IFA to complex, large-scale, and potentially multimodal settings, supporting closer integration of psychometrics with modern multimodal data analysis.</p>","PeriodicalId":54534,"journal":{"name":"Psychometrika","volume":" ","pages":"1-24"},"PeriodicalIF":3.1,"publicationDate":"2025-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12805202/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145490878","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bayes Factor Tests for Group Differences in Ordinal and Binary Graphical Models. 有序和二元图模型中组差异的贝叶斯因子检验。
IF 3.1 2区 心理学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-11-04 DOI: 10.1017/psy.2025.10060
M Marsman, L J Waldorp, N Sekulovski, J M B Haslbeck
{"title":"Bayes Factor Tests for Group Differences in Ordinal and Binary Graphical Models.","authors":"M Marsman, L J Waldorp, N Sekulovski, J M B Haslbeck","doi":"10.1017/psy.2025.10060","DOIUrl":"10.1017/psy.2025.10060","url":null,"abstract":"","PeriodicalId":54534,"journal":{"name":"Psychometrika","volume":" ","pages":"1-49"},"PeriodicalIF":3.1,"publicationDate":"2025-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12805206/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145439986","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Interval Consensus Model: Aggregating Continuous Bounded Interval Responses. 区间一致性模型:聚合连续有界区间响应。
IF 3.1 2区 心理学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-11-04 DOI: 10.1017/psy.2025.10058
Matthias Kloft, Björn S Siepe, Daniel W Heck

Cultural consensus theory (CCT) leverages shared knowledge between individuals to optimally aggregate answers to questions for which the underlying truth is unknown. Existing CCT models have predominantly focused on unidimensional point truths using dichotomous, polytomous, or continuous response formats. However, certain domains, such as risk assessment or interpretation of verbal quantifiers, may require a consensus focused on intervals, capturing a range of relevant values. We introduce the interval consensus model (ICM), a novel extension of CCT designed to estimate consensus intervals from continuous bounded interval responses. We use a Bayesian hierarchical modeling approach to estimate latent consensus intervals. In a simulation study, we show that, under the conditions studied, the ICM performs better than using simple means and medians of the responses. We then apply the model to empirical judgments of verbal quantifiers.

文化共识理论(CCT)利用个人之间的共享知识,对潜在真相未知的问题进行最佳汇总答案。现有的CCT模型主要关注使用二分类、多分类或连续响应格式的一维点真。然而,某些领域,如风险评估或口头量词的解释,可能需要集中于时间间隔的共识,获取一系列相关值。我们引入区间一致性模型(ICM),这是CCT的一个新扩展,用于从连续有界区间响应估计一致区间。我们使用贝叶斯分层建模方法来估计潜在的一致区间。在模拟研究中,我们表明,在所研究的条件下,ICM比使用简单的响应均值和中位数表现得更好。然后,我们将该模型应用于言语量词的经验判断。
{"title":"The Interval Consensus Model: Aggregating Continuous Bounded Interval Responses.","authors":"Matthias Kloft, Björn S Siepe, Daniel W Heck","doi":"10.1017/psy.2025.10058","DOIUrl":"10.1017/psy.2025.10058","url":null,"abstract":"<p><p>Cultural consensus theory (CCT) leverages shared knowledge between individuals to optimally aggregate answers to questions for which the underlying truth is unknown. Existing CCT models have predominantly focused on unidimensional point truths using dichotomous, polytomous, or continuous response formats. However, certain domains, such as risk assessment or interpretation of verbal quantifiers, may require a consensus focused on intervals, capturing a range of relevant values. We introduce the interval consensus model (ICM), a novel extension of CCT designed to estimate consensus intervals from continuous bounded interval responses. We use a Bayesian hierarchical modeling approach to estimate latent consensus intervals. In a simulation study, we show that, under the conditions studied, the ICM performs better than using simple means and medians of the responses. We then apply the model to empirical judgments of verbal quantifiers.</p>","PeriodicalId":54534,"journal":{"name":"Psychometrika","volume":" ","pages":"1-24"},"PeriodicalIF":3.1,"publicationDate":"2025-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145440013","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Visualization for Departures from Symmetry with the Power-Divergence-Type Measure in Square Contingency Tables. 方形列联表中幂散度测度的对称偏离可视化。
IF 3.1 2区 心理学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-11-03 DOI: 10.1017/psy.2025.10057
Wataru Urasaki, Tomoyuki Nakagawa, Jun Tsuchida, Kouji Tahata

When the row and column variables consist of the same category in a two-way contingency table, it is called a square contingency table. Since square contingency tables have an association structure due to the concentration of observed values near the main diagonal, a primary objective is to examine symmetric relationships and transitions between variables. Various models and measures have been proposed to analyze these structures to understand the changes between two variables' behavior at two-time points or cohorts. This is necessary for a detailed investigation of individual categories and their interrelationships, such as shifts in brand preferences. We propose a novel approach to correspondence analysis (CA) for evaluating departures from symmetry in square contingency tables with nominal categories, using a modified divergence statistic. This approach ensures that well-known divergence statistics can also be visualized and regardless of the divergence statistics used, the CA plot consists of two principal axes with equal contribution rates. Notably, the scaling of the departures from symmetry provided by the modified divergence statistic is independent of sample size, allowing for meaningful comparisons and unification of results across different tables. Confidence regions are also constructed to enhance the accuracy of the CA plot.

当行变量和列变量在双向列联表中由同一类别组成时,称为方形列联表。由于在主对角线附近观测值的集中,方形列联表具有关联结构,因此主要目标是检查变量之间的对称关系和转换。已经提出了各种模型和措施来分析这些结构,以了解两个变量在两个时间点或队列中的行为变化。这对于详细调查单个类别及其相互关系(如品牌偏好的变化)是必要的。我们提出了一种新的方法来对应分析(CA)评估偏离对称的平方列联表与名义范畴,使用修改散度统计。这种方法确保了众所周知的散度统计也可以可视化,并且无论使用哪种散度统计,CA图都由两个贡献率相等的主轴组成。值得注意的是,由修改的散度统计提供的偏离对称的比例与样本量无关,允许有意义的比较和跨不同表的结果统一。为了提高CA图的精度,还构造了置信区域。
{"title":"Visualization for Departures from Symmetry with the Power-Divergence-Type Measure in Square Contingency Tables.","authors":"Wataru Urasaki, Tomoyuki Nakagawa, Jun Tsuchida, Kouji Tahata","doi":"10.1017/psy.2025.10057","DOIUrl":"10.1017/psy.2025.10057","url":null,"abstract":"<p><p>When the row and column variables consist of the same category in a two-way contingency table, it is called a square contingency table. Since square contingency tables have an association structure due to the concentration of observed values near the main diagonal, a primary objective is to examine symmetric relationships and transitions between variables. Various models and measures have been proposed to analyze these structures to understand the changes between two variables' behavior at two-time points or cohorts. This is necessary for a detailed investigation of individual categories and their interrelationships, such as shifts in brand preferences. We propose a novel approach to correspondence analysis (CA) for evaluating departures from symmetry in square contingency tables with nominal categories, using a modified divergence statistic. This approach ensures that well-known divergence statistics can also be visualized and regardless of the divergence statistics used, the CA plot consists of two principal axes with equal contribution rates. Notably, the scaling of the departures from symmetry provided by the modified divergence statistic is independent of sample size, allowing for meaningful comparisons and unification of results across different tables. Confidence regions are also constructed to enhance the accuracy of the CA plot.</p>","PeriodicalId":54534,"journal":{"name":"Psychometrika","volume":" ","pages":"1-20"},"PeriodicalIF":3.1,"publicationDate":"2025-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12805207/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145432909","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MODELING MISSING AT RANDOM NEUROPSYCHOLOGICAL TEST SCORES USING A MIXTURE OF BINOMIAL PRODUCT EXPERTS. 建模缺失随机神经心理学测试分数使用二项产品专家的混合物。
IF 3.1 2区 心理学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-10-22 DOI: 10.1017/psy.2025.10053
Daniel Suen, Yen-Chi Chen
{"title":"MODELING MISSING AT RANDOM NEUROPSYCHOLOGICAL TEST SCORES USING A MIXTURE OF BINOMIAL PRODUCT EXPERTS.","authors":"Daniel Suen, Yen-Chi Chen","doi":"10.1017/psy.2025.10053","DOIUrl":"https://doi.org/10.1017/psy.2025.10053","url":null,"abstract":"","PeriodicalId":54534,"journal":{"name":"Psychometrika","volume":" ","pages":"1-74"},"PeriodicalIF":3.1,"publicationDate":"2025-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145349858","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Extending the Bicriterion Approach for Anticlustering: Exact and Hybrid Approaches. 扩展双准则方法的反聚类:精确和混合方法。
IF 3.1 2区 心理学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-10-07 DOI: 10.1017/psy.2025.10052
Martin Papenberg, Martin Breuer, Max Diekhoff, Nguyen K Tran, Gunnar W Klau
{"title":"Extending the Bicriterion Approach for Anticlustering: Exact and Hybrid Approaches.","authors":"Martin Papenberg, Martin Breuer, Max Diekhoff, Nguyen K Tran, Gunnar W Klau","doi":"10.1017/psy.2025.10052","DOIUrl":"10.1017/psy.2025.10052","url":null,"abstract":"","PeriodicalId":54534,"journal":{"name":"Psychometrika","volume":" ","pages":"1-43"},"PeriodicalIF":3.1,"publicationDate":"2025-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12805193/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145240242","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Standard Errors for Reliability Coefficients. 可靠性系数的标准误差。
IF 3.1 2区 心理学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-09-30 DOI: 10.1017/psy.2025.10050
L Andries van der Ark

Reliability analysis is one of the most conducted analyses in applied psychometrics. It entails the assessment of reliability of both item scores and scale scores using coefficients that estimate the reliability (e.g., Cronbach's alpha), measurement precision (e.g., estimated standard error of measurement), or the contribution of individual items to the reliability (e.g., corrected item-total correlations). Most statistical software packages used in social and behavioral sciences offer these reliability coefficients, whereas standard errors are generally unavailable, which is a bit ironic for coefficients about measurement precision. This article provides analytic nonparametric standard errors for coefficients used in reliability analysis. As most scores used in behavioral sciences are discrete, standard errors are derived under the relatively unrestrictive multinomial sampling scheme. Tedious derivations are presented in appendices, and R functions for computing standard errors are available from the Open Science Framework. Bias and variance of standard errors, and coverage of the corresponding Wald-based confidence intervals are studied using simulated item scores. Bias and variance, and coverage are generally satisfactory for larger sample sizes, and parameter values are not close to the boundary of the parameter space.

信度分析是应用心理测量学中应用最多的分析方法之一。它需要使用估计可靠性(例如,Cronbach's alpha)、测量精度(例如,估计的测量标准误差)或单个项目对可靠性的贡献(例如,校正的项目-总相关性)的系数来评估项目分数和量表分数的可靠性。社会和行为科学中使用的大多数统计软件包都提供了这些可靠性系数,而标准误差通常是不可用的,这对于测量精度的系数来说有点讽刺。本文给出了可靠性分析中所用系数的解析性非参数标准误差。由于行为科学中使用的大多数分数是离散的,标准误差是在相对不受限制的多项抽样方案下得出的。冗长的推导在附录中给出,计算标准误差的R函数可以从开放科学框架中获得。使用模拟项目得分研究标准误差的偏差和方差,以及相应的基于wald的置信区间的覆盖率。对于较大的样本量,偏差、方差和覆盖率通常是令人满意的,参数值并不接近参数空间的边界。
{"title":"Standard Errors for Reliability Coefficients.","authors":"L Andries van der Ark","doi":"10.1017/psy.2025.10050","DOIUrl":"10.1017/psy.2025.10050","url":null,"abstract":"<p><p>Reliability analysis is one of the most conducted analyses in applied psychometrics. It entails the assessment of reliability of both item scores and scale scores using coefficients that estimate the reliability (e.g., Cronbach's alpha), measurement precision (e.g., estimated standard error of measurement), or the contribution of individual items to the reliability (e.g., corrected item-total correlations). Most statistical software packages used in social and behavioral sciences offer these reliability coefficients, whereas standard errors are generally unavailable, which is a bit ironic for coefficients about measurement precision. This article provides analytic nonparametric standard errors for coefficients used in reliability analysis. As most scores used in behavioral sciences are discrete, standard errors are derived under the relatively unrestrictive multinomial sampling scheme. Tedious derivations are presented in appendices, and R functions for computing standard errors are available from the Open Science Framework. Bias and variance of standard errors, and coverage of the corresponding Wald-based confidence intervals are studied using simulated item scores. Bias and variance, and coverage are generally satisfactory for larger sample sizes, and parameter values are not close to the boundary of the parameter space.</p>","PeriodicalId":54534,"journal":{"name":"Psychometrika","volume":" ","pages":"1-26"},"PeriodicalIF":3.1,"publicationDate":"2025-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12805205/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145193499","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Optimally Regularized Estimator of Multilevel Latent Variable Models, with Improved MSE Performance. 一种具有改进MSE性能的多水平潜变量模型的最优正则化估计器。
IF 3.1 2区 心理学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-09-22 DOI: 10.1017/psy.2025.10045
Valerii Dashuk, Martin Hecht, Oliver Lüdtke, Alexander Robitzsch, Steffen Zitzmann
{"title":"An Optimally Regularized Estimator of Multilevel Latent Variable Models, with Improved MSE Performance.","authors":"Valerii Dashuk, Martin Hecht, Oliver Lüdtke, Alexander Robitzsch, Steffen Zitzmann","doi":"10.1017/psy.2025.10045","DOIUrl":"10.1017/psy.2025.10045","url":null,"abstract":"","PeriodicalId":54534,"journal":{"name":"Psychometrika","volume":" ","pages":"1-75"},"PeriodicalIF":3.1,"publicationDate":"2025-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12805209/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145114795","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Obituary Robert J. Mislevy (1950-2025). Robert J. Mislevy讣告(1950-2025)。
IF 3.1 2区 心理学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-09-15 DOI: 10.1017/psy.2025.10049
Roy Levy, Russell G Almond
{"title":"Obituary Robert J. Mislevy (1950-2025).","authors":"Roy Levy, Russell G Almond","doi":"10.1017/psy.2025.10049","DOIUrl":"https://doi.org/10.1017/psy.2025.10049","url":null,"abstract":"","PeriodicalId":54534,"journal":{"name":"Psychometrika","volume":" ","pages":"1-12"},"PeriodicalIF":3.1,"publicationDate":"2025-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145066451","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Novel Method for Detecting Intersectional DIF: Multilevel Random Item Effects Model with Regularized Gaussian Variational Estimation. 一种检测交叉DIF的新方法:正则化高斯变分估计的多水平随机项目效应模型。
IF 3.1 2区 心理学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-09-15 DOI: 10.1017/psy.2025.10046
He Ren, Weicong Lyu, Chun Wang, Gongjun Xu

Differential item functioning (DIF) screening has long been suggested to ensure assessment fairness. Traditional DIF methods typically focus on the main effects of demographic variables on item parameters, overlooking the interactions among multiple identities. Drawing on the intersectionality framework, we define intersectional DIF as deviations in item parameters that arise from the interactions among demographic variables beyond their main effects and propose a novel item response theory (IRT) approach for detecting intersectional DIF. Under our framework, fixed effects are used to account for traditional DIF, while random item effects are introduced to capture intersectional DIF. We further introduce the concept of intersectional impact, which refers to interaction effects on group-level mean ability. Depending on which item parameters are affected and whether intersectional impact is considered, we propose four models, which aim to detect intersectional uniform DIF (UDIF), intersectional UDIF with intersectional impact, intersectional non-uniform DIF (NUDIF), and intersectional NUDIF with intersectional impact, respectively. For efficient model estimation, a regularized Gaussian variational expectation-maximization algorithm is developed. Simulation studies demonstrate that our methods can effectively detect intersectional UDIF, although their detection of intersectional NUDIF is more limited.

差异项目功能筛选(DIF)一直被建议用于确保评估的公平性。传统的DIF方法通常关注人口统计变量对项目参数的主要影响,而忽略了多个身份之间的相互作用。在交叉性框架的基础上,我们将交叉性DIF定义为人口变量之间的相互作用所产生的项目参数偏差,并提出了一种新的项目反应理论(IRT)方法来检测交叉性DIF。在我们的框架下,使用固定效应来解释传统的DIF,而引入随机项目效应来捕获交叉DIF。我们进一步引入了交叉影响的概念,它指的是群体水平平均能力的交互效应。根据受影响的项目参数和是否考虑交叉冲击,我们提出了四种模型,分别用于检测交叉均匀DIF (UDIF)、交叉不均匀DIF (NUDIF)、交叉性NUDIF和交叉性NUDIF。为了有效地估计模型,提出了一种正则化高斯变分期望最大化算法。仿真研究表明,我们的方法可以有效地检测出交叉的UDIF,尽管它们对交叉的UDIF的检测比较有限。
{"title":"A Novel Method for Detecting Intersectional DIF: Multilevel Random Item Effects Model with Regularized Gaussian Variational Estimation.","authors":"He Ren, Weicong Lyu, Chun Wang, Gongjun Xu","doi":"10.1017/psy.2025.10046","DOIUrl":"10.1017/psy.2025.10046","url":null,"abstract":"<p><p>Differential item functioning (DIF) screening has long been suggested to ensure assessment fairness. Traditional DIF methods typically focus on the main effects of demographic variables on item parameters, overlooking the interactions among multiple identities. Drawing on the intersectionality framework, we define intersectional DIF as deviations in item parameters that arise from the interactions among demographic variables beyond their main effects and propose a novel item response theory (IRT) approach for detecting intersectional DIF. Under our framework, fixed effects are used to account for traditional DIF, while random item effects are introduced to capture intersectional DIF. We further introduce the concept of intersectional impact, which refers to interaction effects on group-level mean ability. Depending on which item parameters are affected and whether intersectional impact is considered, we propose four models, which aim to detect intersectional uniform DIF (UDIF), intersectional UDIF with intersectional impact, intersectional non-uniform DIF (NUDIF), and intersectional NUDIF with intersectional impact, respectively. For efficient model estimation, a regularized Gaussian variational expectation-maximization algorithm is developed. Simulation studies demonstrate that our methods can effectively detect intersectional UDIF, although their detection of intersectional NUDIF is more limited.</p>","PeriodicalId":54534,"journal":{"name":"Psychometrika","volume":" ","pages":"1-25"},"PeriodicalIF":3.1,"publicationDate":"2025-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145066440","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Psychometrika
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1