检验链等式的差异

IF 0.8 3区数学 Q2 STATISTICS & PROBABILITY Statistica Neerlandica Pub Date : 2022-07-22 DOI:10.1111/stan.12277

Michela Battauz

{"title":"检验链等式的差异","authors":"Michela Battauz","doi":"10.1111/stan.12277","DOIUrl":null,"url":null,"abstract":"The comparability of the scores obtained in different forms of a test is certainly an essential requirement. This paper proposes a statistical test for the detection of noncomparable scores based on item response theory (IRT) methods. When the IRT model is fit separately for different forms of a test, the item parameter estimates are expressed on different measurement scales. The first step to obtain comparable scores is to convert the item parameters to a common metric using two constants, called equating coefficients. The equating coefficients can be estimated for two forms with common items, or derived through a chain of forms. The proposal of this paper is a statistical test to verify whether the scale conversions provided by the equating coefficients are as expected when the assumptions of the model are satisfied, hence leading to comparable scores. The method is illustrated through simulation studies and a real‐data example.","PeriodicalId":51178,"journal":{"name":"Statistica Neerlandica","volume":"2 1","pages":"134 - 145"},"PeriodicalIF":0.8000,"publicationDate":"2022-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Testing for differences in chain equating\",\"authors\":\"Michela Battauz\",\"doi\":\"10.1111/stan.12277\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The comparability of the scores obtained in different forms of a test is certainly an essential requirement. This paper proposes a statistical test for the detection of noncomparable scores based on item response theory (IRT) methods. When the IRT model is fit separately for different forms of a test, the item parameter estimates are expressed on different measurement scales. The first step to obtain comparable scores is to convert the item parameters to a common metric using two constants, called equating coefficients. The equating coefficients can be estimated for two forms with common items, or derived through a chain of forms. The proposal of this paper is a statistical test to verify whether the scale conversions provided by the equating coefficients are as expected when the assumptions of the model are satisfied, hence leading to comparable scores. The method is illustrated through simulation studies and a real‐data example.\",\"PeriodicalId\":51178,\"journal\":{\"name\":\"Statistica Neerlandica\",\"volume\":\"2 1\",\"pages\":\"134 - 145\"},\"PeriodicalIF\":0.8000,\"publicationDate\":\"2022-07-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Statistica Neerlandica\",\"FirstCategoryId\":\"100\",\"ListUrlMain\":\"https://doi.org/10.1111/stan.12277\",\"RegionNum\":3,\"RegionCategory\":\"数学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"STATISTICS & PROBABILITY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Statistica Neerlandica","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1111/stan.12277","RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}

引用次数: 1

摘要

在不同形式的考试中获得的分数的可比性当然是一项基本要求。本文提出了一种基于项目反应理论(IRT)方法的非可比分数检测的统计检验方法。当对不同形式的测试分别拟合IRT模型时，项目参数估计在不同的测量尺度上表示。获得可比分数的第一步是使用两个常量(称为相等系数)将项目参数转换为公共度量。相等系数可以用共同项估计两种形式，或通过一系列形式推导。本文的提议是一个统计检验，验证在满足模型假设的情况下，等式系数提供的尺度转换是否如预期的那样，从而得到可比较的分数。通过仿真研究和一个实际数据实例说明了该方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Testing for differences in chain equating

The comparability of the scores obtained in different forms of a test is certainly an essential requirement. This paper proposes a statistical test for the detection of noncomparable scores based on item response theory (IRT) methods. When the IRT model is fit separately for different forms of a test, the item parameter estimates are expressed on different measurement scales. The first step to obtain comparable scores is to convert the item parameters to a common metric using two constants, called equating coefficients. The equating coefficients can be estimated for two forms with common items, or derived through a chain of forms. The proposal of this paper is a statistical test to verify whether the scale conversions provided by the equating coefficients are as expected when the assumptions of the model are satisfied, hence leading to comparable scores. The method is illustrated through simulation studies and a real‐data example.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Statistica Neerlandica 数学-统计学与概率论

CiteScore

2.60

自引率

6.70%

发文量

审稿时长

>12 weeks

期刊介绍： Statistica Neerlandica has been the journal of the Netherlands Society for Statistics and Operations Research since 1946. It covers all areas of statistics, from theoretical to applied, with a special emphasis on mathematical statistics, statistics for the behavioural sciences and biostatistics. This wide scope is reflected by the expertise of the journal’s editors representing these areas. The diverse editorial board is committed to a fast and fair reviewing process, and will judge submissions on quality, correctness, relevance and originality. Statistica Neerlandica encourages transparency and reproducibility, and offers online resources to make data, code, simulation results and other additional materials publicly available.

期刊最新文献

Efficient estimation for the multivariate Cox model with missing covariates. On global robustness of an adversarial risk analysis solution Heterogeneous dense subhypergraph detection General adapted‐threshold monitoring in discrete environments and rules for imbalanced classes VC‐PCR: A prediction method based on variable selection and clustering