Differential Item Functioning via Robust Scaling.

IF 3.1 2区心理学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Psychometrika Pub Date : 2024-09-01 Epub Date: 2024-05-04 DOI:10.1007/s11336-024-09957-6

Peter F Halpin

引用次数: 0

Abstract

This paper proposes a method for assessing differential item functioning (DIF) in item response theory (IRT) models. The method does not require pre-specification of anchor items, which is its main virtue. It is developed in two main steps: first by showing how DIF can be re-formulated as a problem of outlier detection in IRT-based scaling and then tackling the latter using methods from robust statistics. The proposal is a redescending M-estimator of IRT scaling parameters that is tuned to flag items with DIF at the desired asymptotic type I error rate. Theoretical results describe the efficiency of the estimator in the absence of DIF and its robustness in the presence of DIF. Simulation studies show that the proposed method compares favorably to currently available approaches for DIF detection, and a real data example illustrates its application in a research context where pre-specification of anchor items is infeasible. The focus of the paper is the two-parameter logistic model in two independent groups, with extensions to other settings considered in the conclusion.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

通过稳健缩放实现差异项目功能。

本文提出了一种在项目反应理论（IRT）模型中评估差异项目功能（DIF）的方法。该方法无需预先指定锚项目，这是它的主要优点。该方法的开发主要分两步：首先说明如何将 DIF 重新表述为基于 IRT 的缩放中的离群点检测问题，然后使用稳健统计的方法解决后者。本文提出了一种 IRT 缩放参数的重新降序 M-估计器，该估计器经过调整，可以在所需的渐近 I 类错误率下标记出具有 DIF 的项目。理论结果描述了该估计器在无 DIF 时的效率以及在有 DIF 时的稳健性。模拟研究表明，与目前可用的 DIF 检测方法相比，所提出的方法更胜一筹，而且一个真实数据示例说明了该方法在预设锚点项目不可行的研究环境中的应用。本文的重点是两个独立组中的双参数逻辑模型，并在结论中考虑了对其他设置的扩展。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Psychometrika 数学-数学跨学科应用

CiteScore

4.40

自引率

10.00%

发文量

审稿时长

>12 weeks

期刊介绍： The journal Psychometrika is devoted to the advancement of theory and methodology for behavioral data in psychology, education and the social and behavioral sciences generally. Its coverage is offered in two sections: Theory and Methods (T& M), and Application Reviews and Case Studies (ARCS). T&M articles present original research and reviews on the development of quantitative models, statistical methods, and mathematical techniques for evaluating data from psychology, the social and behavioral sciences and related fields. Application Reviews can be integrative, drawing together disparate methodologies for applications, or comparative and evaluative, discussing advantages and disadvantages of one or more methodologies in applications. Case Studies highlight methodology that deepens understanding of substantive phenomena through more informative data analysis, or more elegant data description.