ChatGPT as a teaching tool: Preparing pathology residents for board examination with AI-generated digestive system pathology tests.

Pub Date: 2024-11-04 · DOI: 10.1093/ajcp/aqae062
Thiyaphat Laohawetwanit, Sompon Apornvirat, Charinee Kantasiripitak
{"title":"ChatGPT as a teaching tool: Preparing pathology residents for board examination with AI-generated digestive system pathology tests.","authors":"Thiyaphat Laohawetwanit, Sompon Apornvirat, Charinee Kantasiripitak","doi":"10.1093/ajcp/aqae062","DOIUrl":null,"url":null,"abstract":"<p><strong>Objectives: </strong>To evaluate the effectiveness of ChatGPT 4 in generating multiple-choice questions (MCQs) with explanations for pathology board examinations, specifically for digestive system pathology.</p><p><strong>Methods: </strong>The customized ChatGPT 4 model was developed for MCQ and explanation generation. Expert pathologists evaluated content accuracy and relevance. These MCQs were then administered to pathology residents, followed by an analysis focusing on question difficulty, accuracy, item discrimination, and internal consistency.</p><p><strong>Results: </strong>The customized ChatGPT 4 generated 80 MCQs covering various gastrointestinal and hepatobiliary topics. While the MCQs demonstrated moderate to high agreement in evaluation parameters such as content accuracy, clinical relevance, and overall quality, there were issues in cognitive level and distractor quality. The explanations were generally acceptable. Involving 9 residents with a median experience of 1 year, the average score was 57.4 (71.8%). Pairwise comparisons revealed a significant difference in performance between each year group (P < .01). The test analysis showed moderate difficulty, effective item discrimination (index = 0.15), and good internal consistency (Cronbach's α = 0.74).</p><p><strong>Conclusions: </strong>ChatGPT 4 demonstrated significant potential as a supplementary educational tool in medical education, especially in generating MCQs with explanations similar to those seen in board examinations. While artificial intelligence-generated content was of high quality, it necessitated refinement and expert review.</p>","PeriodicalId":2,"journal":{"name":"ACS Applied Bio Materials","volume":null,"pages":null},"PeriodicalIF":4.6000,"publicationDate":"2024-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Applied Bio Materials","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1093/ajcp/aqae062","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MATERIALS SCIENCE, BIOMATERIALS","Score":null,"Total":0}
引用次数: 0

Abstract

Objectives: To evaluate the effectiveness of ChatGPT 4 in generating multiple-choice questions (MCQs) with explanations for pathology board examinations, specifically for digestive system pathology.

Methods: A customized ChatGPT 4 model was developed to generate MCQs and explanations. Expert pathologists evaluated the content for accuracy and relevance. The MCQs were then administered to pathology residents, and the results were analyzed for question difficulty, accuracy, item discrimination, and internal consistency.
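For illustration only, the sketch below shows how a board-style MCQ with explanations might be requested from a GPT-4-class model through the OpenAI Python SDK. The study's actual custom GPT configuration and prompts are not described in this abstract, so the model name and prompt wording here are assumptions.

```python
# Minimal sketch (not the authors' actual prompt or custom GPT setup):
# request one board-examination-style MCQ with explanations using the
# OpenAI Python SDK. Model name and prompt wording are assumptions.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

prompt = (
    "Write one board-examination-style multiple-choice question on "
    "digestive system pathology with five options (A-E), indicate the "
    "correct answer, and give a brief explanation for each option."
)

response = client.chat.completions.create(
    model="gpt-4",  # placeholder; the study used a customized ChatGPT 4 model
    messages=[
        {"role": "system", "content": "You are a pathology educator writing exam questions."},
        {"role": "user", "content": prompt},
    ],
)

print(response.choices[0].message.content)
```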

Results: The customized ChatGPT 4 model generated 80 MCQs covering various gastrointestinal and hepatobiliary topics. While the MCQs showed moderate to high agreement on evaluation parameters such as content accuracy, clinical relevance, and overall quality, there were issues with cognitive level and distractor quality. The explanations were generally acceptable. Nine residents with a median of 1 year of experience took the test; the average score was 57.4 of 80 (71.8%). Pairwise comparisons revealed significant differences in performance between year groups (P < .01). Test analysis showed moderate difficulty, effective item discrimination (index = 0.15), and good internal consistency (Cronbach's α = 0.74).
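As a rough illustration of the item analysis reported above (difficulty, discrimination index, Cronbach's α), the following sketch applies the standard formulas to a hypothetical residents-by-items score matrix; the data are simulated and do not come from the study.

```python
# Minimal sketch of a classical item analysis: difficulty index,
# upper-lower discrimination index, and Cronbach's alpha.
# The score matrix is simulated (9 residents x 80 items, 1 = correct).
import numpy as np

rng = np.random.default_rng(0)
scores = rng.integers(0, 2, size=(9, 80))  # illustrative data only

# Difficulty index: proportion of examinees answering each item correctly.
difficulty = scores.mean(axis=0)

# Discrimination index: proportion correct in the top third of examinees
# (by total score) minus the proportion correct in the bottom third.
totals = scores.sum(axis=1)
order = np.argsort(totals)
k = max(1, len(totals) // 3)
discrimination = scores[order[-k:]].mean(axis=0) - scores[order[:k]].mean(axis=0)

# Cronbach's alpha: internal consistency across items.
n_items = scores.shape[1]
item_variance = scores.var(axis=0, ddof=1).sum()
total_variance = totals.var(ddof=1)
alpha = (n_items / (n_items - 1)) * (1 - item_variance / total_variance)

print(difficulty.mean(), discrimination.mean(), alpha)
```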

Conclusions: ChatGPT 4 demonstrated significant potential as a supplementary educational tool in medical education, especially for generating MCQs with explanations similar to those seen in board examinations. Although the artificial intelligence-generated content was of high quality, it still required refinement and expert review.
