{"title":"What Type of Self-Assessment Is Best for Your Educational Activity? A Review of Pre-Post, Now-Then, and Post-Only Designs.","authors":"James H Wykowski, Helene Starks","doi":"10.1007/s11606-024-09176-w","DOIUrl":null,"url":null,"abstract":"<p><p>In medical education, learner self-assessments are standard methods used to evaluate the impact of curricula and workshops. Due to the subjective nature of self-assessment, these measures are prone to known biases including framing, recall, social desirability, and response-shift bias. These biases can contribute to floor and ceiling effects of measurement, which can lead to false conclusions about whether the intended learning objectives were achieved. Ideal assessments of skills-based educational activities would include standardized tests and structured observations of learners demonstrating skill use before and after the educational intervention. However, educators often lack the necessary resources, time, and expertise to routinely conduct these appraisals and rely on self-assessment as a pragmatic approach to obtaining curriculum feedback and evaluation data. In this review, we describe three common designs for self-assessments: the pre-post, now-then, and post-only designs. We then give recommendations for choosing between each design to minimize bias. The choice of the best design is based on alignments with four considerations: (1) the educational objectives (e.g., demonstrate skill competency and/or change in skill level); (2) participants' prior experience and shared understanding of levels of skill performance; (3) the nature of the educational activity; and (4) available resources. For each design, we review strengths, weaknesses, and known biases and discuss examples to highlight trade-offs between options. We also discuss the use of control groups and follow-up surveys to measure retention over time as additional methods to address bias and related confounding. The guidance presented here is intended to raise educators' awareness of common pitfalls in self-assessment; minimize the impact of known biases when possible; provide evidence, examples, and rationales for optimal design choices; and increase the rigor of self-assessment evaluations.</p>","PeriodicalId":15860,"journal":{"name":"Journal of General Internal Medicine","volume":" ","pages":""},"PeriodicalIF":4.3000,"publicationDate":"2024-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of General Internal Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s11606-024-09176-w","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
引用次数: 0
Abstract
In medical education, learner self-assessments are standard methods used to evaluate the impact of curricula and workshops. Due to the subjective nature of self-assessment, these measures are prone to known biases including framing, recall, social desirability, and response-shift bias. These biases can contribute to floor and ceiling effects of measurement, which can lead to false conclusions about whether the intended learning objectives were achieved. Ideal assessments of skills-based educational activities would include standardized tests and structured observations of learners demonstrating skill use before and after the educational intervention. However, educators often lack the necessary resources, time, and expertise to routinely conduct these appraisals and rely on self-assessment as a pragmatic approach to obtaining curriculum feedback and evaluation data. In this review, we describe three common designs for self-assessments: the pre-post, now-then, and post-only designs. We then give recommendations for choosing between each design to minimize bias. The choice of the best design is based on alignments with four considerations: (1) the educational objectives (e.g., demonstrate skill competency and/or change in skill level); (2) participants' prior experience and shared understanding of levels of skill performance; (3) the nature of the educational activity; and (4) available resources. For each design, we review strengths, weaknesses, and known biases and discuss examples to highlight trade-offs between options. We also discuss the use of control groups and follow-up surveys to measure retention over time as additional methods to address bias and related confounding. The guidance presented here is intended to raise educators' awareness of common pitfalls in self-assessment; minimize the impact of known biases when possible; provide evidence, examples, and rationales for optimal design choices; and increase the rigor of self-assessment evaluations.
期刊介绍:
The Journal of General Internal Medicine is the official journal of the Society of General Internal Medicine. It promotes improved patient care, research, and education in primary care, general internal medicine, and hospital medicine. Its articles focus on topics such as clinical medicine, epidemiology, prevention, health care delivery, curriculum development, and numerous other non-traditional themes, in addition to classic clinical research on problems in internal medicine.