Inconsistencies in rater-based assessments mainly affect borderline candidates: but using simple heuristics might improve pass-fail decisions

Advances in Health Sciences Education, 29(5), pp. 1749-1767. Pub Date: 2024-04-23. DOI: 10.1007/s10459-024-10328-0
Impact factor: 3.0. Zone 2 (Education). JCR Q1 (Education & Educational Research).
Stefan K. Schauber, Anne O. Olsen, Erik L. Werner, Morten Magelssen
Citations: 0

Abstract

Introduction

Research in various areas indicates that expert judgment can be highly inconsistent. However, expert judgment is indispensable in many contexts. In medical education, experts often function as examiners in rater-based assessments. Here, disagreement between examiners can have far-reaching consequences. The literature suggests that inconsistencies in ratings depend on the level of performance a to-be-evaluated candidate shows. This possibility has not been addressed deliberately and with appropriate statistical methods. By adopting the theoretical lens of ecological rationality, we evaluate if easily implementable strategies can enhance decision making in real-world assessment contexts.

Methods

We address two objectives. First, we investigate the dependence of rater consistency on performance levels. We recorded videos of mock exams, had examiners (N=10) evaluate four students’ performances, and compared inconsistencies in performance ratings between examiner pairs using a bootstrapping procedure. Our second objective is to provide an approach that aids decision making by implementing simple heuristics.
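
The abstract does not spell out the bootstrapping procedure, so the following is only a minimal sketch of one plausible reading: compute the absolute rating difference for every examiner pair, then resample those pair differences with replacement to get a percentile interval for the mean discrepancy per candidate. The function names, the 0-100 rating scale, and the ratings themselves are hypothetical and synthetic; they are not the study's data or code. Pair differences that share an examiner are not independent, so a real analysis might resample examiners instead.

```python
import random
from itertools import combinations
from statistics import mean


def pair_discrepancies(scores):
    """Absolute rating difference for every pair of examiners."""
    return [abs(a - b) for a, b in combinations(scores, 2)]


def bootstrap_mean_ci(values, n_boot=2000, alpha=0.05, seed=0):
    """Percentile bootstrap interval for the mean of `values`.

    Returns (point_estimate, lower_bound, upper_bound).
    """
    rng = random.Random(seed)
    n = len(values)
    # Resample with replacement and record the mean of each resample.
    boot_means = sorted(mean(rng.choices(values, k=n)) for _ in range(n_boot))
    lower = boot_means[int((alpha / 2) * n_boot)]
    upper = boot_means[int((1 - alpha / 2) * n_boot) - 1]
    return mean(values), lower, upper


# Synthetic ratings (NOT study data): 10 examiners, hypothetical 0-100 scale.
ratings = {
    "weaker_performance": [48, 62, 55, 70, 41, 66, 58, 52, 73, 45],
    "excellent_performance": [88, 90, 86, 91, 89, 87, 92, 90, 88, 89],
}

for candidate, scores in ratings.items():
    estimate, lower, upper = bootstrap_mean_ci(pair_discrepancies(scores))
    print(f"{candidate}: mean pair discrepancy {estimate:.1f} "
          f"(95% interval {lower:.1f} to {upper:.1f})")
```

With 10 examiners there are 45 examiner pairs per candidate, which is the unit being resampled here.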

Results

We found that discrepancies were largely a function of the level of performance the candidates showed. Lower performances were rated more inconsistently than excellent performances. Furthermore, our analyses indicated that the use of simple heuristics might improve decisions in examiner pairs.
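
The abstract does not say which heuristics were evaluated. Purely as an illustration of what a simple decision rule for an examiner pair could look like, the sketch below accepts the outcome when both examiners land on the same side of a cut score and refers the candidate for further review when they disagree. The cut score of 60, the 0-100 scale, the `"refer"` outcome, and the function name are all assumptions made for this example, not the authors' method.

```python
HYPOTHETICAL_PASS_MARK = 60  # assumed cut score on an assumed 0-100 scale


def pair_decision(score_a, score_b, pass_mark=HYPOTHETICAL_PASS_MARK):
    """Return 'pass', 'fail', or 'refer' for a pair of examiner scores.

    Hypothetical rule: agreement on either side of the cut score decides
    the outcome; disagreement sends the candidate to further review.
    """
    a_passes = score_a >= pass_mark
    b_passes = score_b >= pass_mark
    if a_passes and b_passes:
        return "pass"
    if not a_passes and not b_passes:
        return "fail"
    return "refer"


print(pair_decision(70, 65))
print(pair_decision(40, 50))
print(pair_decision(62, 58))
```

A rule of this kind concentrates extra effort on borderline candidates, which is where the reported inconsistencies are largest.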

Discussion

Inconsistencies in performance judgments continue to be a matter of concern, and we provide empirical evidence that they are related to candidate performance. We discuss implications for research and the advantages of adopting the perspective of ecological rationality. We point to directions both for further research and for the development of assessment practices.

Source journal
CiteScore: 6.90
Self-citation rate: 12.50%
Articles published: 86
Review time: >12 weeks
About the journal: Advances in Health Sciences Education is a forum for scholarly and state-of-the-art research into all aspects of health sciences education. It will publish empirical studies as well as discussions of theoretical issues and practical implications. The primary focus of the Journal is linking theory to practice, thus priority will be given to papers that have a sound theoretical basis and strong methodology.