What Type of Self-Assessment Is Best for Your Educational Activity? A Review of Pre-Post, Now-Then, and Post-Only Designs.

IF 4.3 2区医学 Q1 HEALTH CARE SCIENCES & SERVICES Journal of General Internal Medicine Pub Date : 2024-11-04 DOI:10.1007/s11606-024-09176-w

James H Wykowski, Helene Starks

{"title":"What Type of Self-Assessment Is Best for Your Educational Activity? A Review of Pre-Post, Now-Then, and Post-Only Designs.","authors":"James H Wykowski, Helene Starks","doi":"10.1007/s11606-024-09176-w","DOIUrl":null,"url":null,"abstract":"<p><p>In medical education, learner self-assessments are standard methods used to evaluate the impact of curricula and workshops. Due to the subjective nature of self-assessment, these measures are prone to known biases including framing, recall, social desirability, and response-shift bias. These biases can contribute to floor and ceiling effects of measurement, which can lead to false conclusions about whether the intended learning objectives were achieved. Ideal assessments of skills-based educational activities would include standardized tests and structured observations of learners demonstrating skill use before and after the educational intervention. However, educators often lack the necessary resources, time, and expertise to routinely conduct these appraisals and rely on self-assessment as a pragmatic approach to obtaining curriculum feedback and evaluation data. In this review, we describe three common designs for self-assessments: the pre-post, now-then, and post-only designs. We then give recommendations for choosing between each design to minimize bias. The choice of the best design is based on alignments with four considerations: (1) the educational objectives (e.g., demonstrate skill competency and/or change in skill level); (2) participants' prior experience and shared understanding of levels of skill performance; (3) the nature of the educational activity; and (4) available resources. For each design, we review strengths, weaknesses, and known biases and discuss examples to highlight trade-offs between options. We also discuss the use of control groups and follow-up surveys to measure retention over time as additional methods to address bias and related confounding. The guidance presented here is intended to raise educators' awareness of common pitfalls in self-assessment; minimize the impact of known biases when possible; provide evidence, examples, and rationales for optimal design choices; and increase the rigor of self-assessment evaluations.</p>","PeriodicalId":15860,"journal":{"name":"Journal of General Internal Medicine","volume":" ","pages":""},"PeriodicalIF":4.3000,"publicationDate":"2024-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of General Internal Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s11606-024-09176-w","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}

引用次数: 0

Abstract

In medical education, learner self-assessments are standard methods used to evaluate the impact of curricula and workshops. Due to the subjective nature of self-assessment, these measures are prone to known biases including framing, recall, social desirability, and response-shift bias. These biases can contribute to floor and ceiling effects of measurement, which can lead to false conclusions about whether the intended learning objectives were achieved. Ideal assessments of skills-based educational activities would include standardized tests and structured observations of learners demonstrating skill use before and after the educational intervention. However, educators often lack the necessary resources, time, and expertise to routinely conduct these appraisals and rely on self-assessment as a pragmatic approach to obtaining curriculum feedback and evaluation data. In this review, we describe three common designs for self-assessments: the pre-post, now-then, and post-only designs. We then give recommendations for choosing between each design to minimize bias. The choice of the best design is based on alignments with four considerations: (1) the educational objectives (e.g., demonstrate skill competency and/or change in skill level); (2) participants' prior experience and shared understanding of levels of skill performance; (3) the nature of the educational activity; and (4) available resources. For each design, we review strengths, weaknesses, and known biases and discuss examples to highlight trade-offs between options. We also discuss the use of control groups and follow-up surveys to measure retention over time as additional methods to address bias and related confounding. The guidance presented here is intended to raise educators' awareness of common pitfalls in self-assessment; minimize the impact of known biases when possible; provide evidence, examples, and rationales for optimal design choices; and increase the rigor of self-assessment evaluations.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

哪种类型的自我评估最适合您的教育活动？对事前、事中和事后设计的回顾。

在医学教育中，学习者自我评估是用于评估课程和研讨会影响的标准方法。由于自我评估的主观性，这些测量方法很容易出现已知的偏差，包括框架偏差、回忆偏差、社会可取性偏差和反应偏移偏差。这些偏差会造成测量的下限和上限效应，从而导致对是否达到预期学习目标的错误结论。以技能为基础的教育活动的理想评估包括标准化测试和对学习者在教育干预前后展示技能使用情况的结构化观察。然而，教育工作者往往缺乏必要的资源、时间和专业知识来定期进行这些评估，而只能依靠自我评估作为获取课程反馈和评估数据的实用方法。在这篇综述中，我们介绍了三种常见的自我评估设计：前-后设计、现-后设计和仅后设计。然后，我们就如何选择每种设计以尽量减少偏差提出了建议。最佳设计的选择基于以下四个方面的考虑：(1) 教育目标（例如，展示技能能力和/或技能水平的变化）；(2) 参与者的先前经验和对技能水平的共同理解；(3) 教育活动的性质；(4) 可用资源。对于每种设计，我们都会回顾其优点、缺点和已知的偏差，并通过实例讨论来突出不同方案之间的权衡。我们还讨论了使用对照组和跟踪调查来衡量一段时间内的保留率，以此作为解决偏差和相关混杂问题的其他方法。本文提供的指导旨在提高教育工作者对自我评估中常见误区的认识；尽可能减少已知偏差的影响；提供最佳设计选择的证据、实例和理由；以及提高自我评估评价的严谨性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Journal of General Internal Medicine 医学-医学：内科

CiteScore

7.70

自引率

5.30%

发文量

749

审稿时长

3-6 weeks

期刊介绍： The Journal of General Internal Medicine is the official journal of the Society of General Internal Medicine. It promotes improved patient care, research, and education in primary care, general internal medicine, and hospital medicine. Its articles focus on topics such as clinical medicine, epidemiology, prevention, health care delivery, curriculum development, and numerous other non-traditional themes, in addition to classic clinical research on problems in internal medicine.