Testing for differences in chain equating

IF 1.4 3区 数学 Q2 STATISTICS & PROBABILITY Statistica Neerlandica Pub Date : 2022-07-22 DOI:10.1111/stan.12277
Michela Battauz
{"title":"Testing for differences in chain equating","authors":"Michela Battauz","doi":"10.1111/stan.12277","DOIUrl":null,"url":null,"abstract":"The comparability of the scores obtained in different forms of a test is certainly an essential requirement. This paper proposes a statistical test for the detection of noncomparable scores based on item response theory (IRT) methods. When the IRT model is fit separately for different forms of a test, the item parameter estimates are expressed on different measurement scales. The first step to obtain comparable scores is to convert the item parameters to a common metric using two constants, called equating coefficients. The equating coefficients can be estimated for two forms with common items, or derived through a chain of forms. The proposal of this paper is a statistical test to verify whether the scale conversions provided by the equating coefficients are as expected when the assumptions of the model are satisfied, hence leading to comparable scores. The method is illustrated through simulation studies and a real‐data example.","PeriodicalId":51178,"journal":{"name":"Statistica Neerlandica","volume":null,"pages":null},"PeriodicalIF":1.4000,"publicationDate":"2022-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Statistica Neerlandica","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1111/stan.12277","RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}
引用次数: 1

Abstract

The comparability of the scores obtained in different forms of a test is certainly an essential requirement. This paper proposes a statistical test for the detection of noncomparable scores based on item response theory (IRT) methods. When the IRT model is fit separately for different forms of a test, the item parameter estimates are expressed on different measurement scales. The first step to obtain comparable scores is to convert the item parameters to a common metric using two constants, called equating coefficients. The equating coefficients can be estimated for two forms with common items, or derived through a chain of forms. The proposal of this paper is a statistical test to verify whether the scale conversions provided by the equating coefficients are as expected when the assumptions of the model are satisfied, hence leading to comparable scores. The method is illustrated through simulation studies and a real‐data example.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
检验链等式的差异
在不同形式的考试中获得的分数的可比性当然是一项基本要求。本文提出了一种基于项目反应理论(IRT)方法的非可比分数检测的统计检验方法。当对不同形式的测试分别拟合IRT模型时,项目参数估计在不同的测量尺度上表示。获得可比分数的第一步是使用两个常量(称为相等系数)将项目参数转换为公共度量。相等系数可以用共同项估计两种形式,或通过一系列形式推导。本文的提议是一个统计检验,验证在满足模型假设的情况下,等式系数提供的尺度转换是否如预期的那样,从而得到可比较的分数。通过仿真研究和一个实际数据实例说明了该方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Statistica Neerlandica
Statistica Neerlandica 数学-统计学与概率论
CiteScore
2.60
自引率
6.70%
发文量
26
审稿时长
>12 weeks
期刊介绍: Statistica Neerlandica has been the journal of the Netherlands Society for Statistics and Operations Research since 1946. It covers all areas of statistics, from theoretical to applied, with a special emphasis on mathematical statistics, statistics for the behavioural sciences and biostatistics. This wide scope is reflected by the expertise of the journal’s editors representing these areas. The diverse editorial board is committed to a fast and fair reviewing process, and will judge submissions on quality, correctness, relevance and originality. Statistica Neerlandica encourages transparency and reproducibility, and offers online resources to make data, code, simulation results and other additional materials publicly available.
期刊最新文献
Poisson average maximum likelihood‐centred penalized estimator: A new estimator to better address multicollinearity in Poisson regression Orthogonal Contrasts for both Balanced and Unbalanced Designs and both Ordered and Unordered Treatments Estimating function method for nonnegative autoregressive models A partial posterior p value test for multilevel mediation A portmanteau test for the iid hypothesis
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1