Alok Kumar, Colette George, Michael Harry Campbell, Kandamaran Krishnamurthy, Paula Michele Lashley, V. Singh, S. Motilal, S. Sakhamuri, Tamara Thompson, Corrine Sinquee-Brown, B. Sa, Md Anbarul Azim Mujumder
{"title":"Item Analysis of Multiple Choice and Extended Matching Questions in the Final MBBS Medicine and Therapeutics Examination","authors":"Alok Kumar, Colette George, Michael Harry Campbell, Kandamaran Krishnamurthy, Paula Michele Lashley, V. Singh, S. Motilal, S. Sakhamuri, Tamara Thompson, Corrine Sinquee-Brown, B. Sa, Md Anbarul Azim Mujumder","doi":"10.5812/jme-129450","DOIUrl":null,"url":null,"abstract":"Background: Most universities around the world use the multiple-choice question (MCQ) examination format to evaluate medical education. However, the suitability and advantages of traditional MCQs and extended matching questions (EMQs) continue to be debated. Objectives: This study mainly aimed to perform a comprehensive comparative analysis of the performance of the EMQ and traditional MCQ formats in the final MBBS exit examination. Methods: We conducted an item analysis of 80 EMQs, and 200 MCQs administered to 532 examinees across the four campuses of the University of the West Indies during the final MBBS medicine and therapeutics examination of 2019. Exam performance measures included central tendency, item discrimination, reliability, item difficulty, and distractor efficacy. Results: For the 532 students who sat the exam, the highest, lowest, and mean (± SD) scores for the EMQs were 93, 41, and 69.0 (± 9.8), respectively; for the MCQs, the respective values were 82, 41, and 62.7 (± 7.4). The predictive value of the EMQ and MCQ grades individually in the overall failure was 0.67 (95% CI = 0.39, 0.87) and 0.89 (95% CI = 0.65, 0.98), respectively. KR-20 coefficients for the EMQs and MCQs ranged from 0.52 to 0.70 and 0.71 to 0.79, respectively. The proportion of questions with two or more functional distractors was consistently higher for the MCQs than for the EMQs in all four cohorts of students. Conclusions: The MCQs were more predictive of the overall failure and had higher inter-item reliability, making the MCQ format more suitable for high-stakes examinations.","PeriodicalId":30594,"journal":{"name":"Journal of Medical Education Development","volume":"10 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2022-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Medical Education Development","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5812/jme-129450","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Most universities around the world use the multiple-choice question (MCQ) examination format to evaluate medical education. However, the suitability and advantages of traditional MCQs and extended matching questions (EMQs) continue to be debated. Objectives: This study mainly aimed to perform a comprehensive comparative analysis of the performance of the EMQ and traditional MCQ formats in the final MBBS exit examination. Methods: We conducted an item analysis of 80 EMQs, and 200 MCQs administered to 532 examinees across the four campuses of the University of the West Indies during the final MBBS medicine and therapeutics examination of 2019. Exam performance measures included central tendency, item discrimination, reliability, item difficulty, and distractor efficacy. Results: For the 532 students who sat the exam, the highest, lowest, and mean (± SD) scores for the EMQs were 93, 41, and 69.0 (± 9.8), respectively; for the MCQs, the respective values were 82, 41, and 62.7 (± 7.4). The predictive value of the EMQ and MCQ grades individually in the overall failure was 0.67 (95% CI = 0.39, 0.87) and 0.89 (95% CI = 0.65, 0.98), respectively. KR-20 coefficients for the EMQs and MCQs ranged from 0.52 to 0.70 and 0.71 to 0.79, respectively. The proportion of questions with two or more functional distractors was consistently higher for the MCQs than for the EMQs in all four cohorts of students. Conclusions: The MCQs were more predictive of the overall failure and had higher inter-item reliability, making the MCQ format more suitable for high-stakes examinations.