Evaluation of the performance of ChatGPT-4 and ChatGPT-4o as a learning tool in endodontics.

IF 5.4 1区医学 Q1 DENTISTRY, ORAL SURGERY & MEDICINE International endodontic journal Pub Date : 2025-03-02 DOI:10.1111/iej.14217

Esra Arılı Öztürk, Ceren Turan Gökduman, Burhan Can Çanakçi

{"title":"Evaluation of the performance of ChatGPT-4 and ChatGPT-4o as a learning tool in endodontics.","authors":"Esra Arılı Öztürk, Ceren Turan Gökduman, Burhan Can Çanakçi","doi":"10.1111/iej.14217","DOIUrl":null,"url":null,"abstract":"Aims: The aim of this study was to evaluate the accuracy and consistency of responses given by two different versions of Chat Generative Pre-trained Transformer (ChatGPT), ChatGPT-4, and ChatGPT-4o, to multiple-choice questions prepared from undergraduate endodontic education topics at different times of the day and on different days.Methodology: In total, 60 multiple-choice, text-based questions from 6 topics of undergraduate endodontic education were prepared. Each question was asked to ChatGPT-4 and ChatGPT-4o 3 times a day (morning, noon, and evening) and for 3 consecutive days. The accuracy and consistency of AIs were compared using SPSS and R programs (p < .05, 95% confidence interval).Results: The accuracy rate of ChatGPT-4o (92.8%) was significantly higher than that of ChatGPT-4 (81.7%; p < .001). The question groups affected the accuracy rates of both AIs (p < .001). The times at which the questions were asked did not affect the accuracy of either AI (p > .05). There was no statistically significant difference in the consistency rate between ChatGPT-4 and ChatGPT-4o (p = .123). The question groups did not affect the consistency of either AI, too (p > .05).Conclusions: According to the results of this study, the accuracy of ChatGPT-4o was better than that of ChatGPT-4. These findings demonstrate that AI chatbots can be used in dental education. However, it is also necessary to consider the limitations and potential risks associated with AI.","PeriodicalId":13724,"journal":{"name":"International endodontic journal","volume":" ","pages":""},"PeriodicalIF":5.4000,"publicationDate":"2025-03-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International endodontic journal","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1111/iej.14217","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"DENTISTRY, ORAL SURGERY & MEDICINE","Score":null,"Total":0}

引用次数: 0

Abstract

Aims: The aim of this study was to evaluate the accuracy and consistency of responses given by two different versions of Chat Generative Pre-trained Transformer (ChatGPT), ChatGPT-4, and ChatGPT-4o, to multiple-choice questions prepared from undergraduate endodontic education topics at different times of the day and on different days.

Methodology: In total, 60 multiple-choice, text-based questions from 6 topics of undergraduate endodontic education were prepared. Each question was asked to ChatGPT-4 and ChatGPT-4o 3 times a day (morning, noon, and evening) and for 3 consecutive days. The accuracy and consistency of AIs were compared using SPSS and R programs (p < .05, 95% confidence interval).

Results: The accuracy rate of ChatGPT-4o (92.8%) was significantly higher than that of ChatGPT-4 (81.7%; p < .001). The question groups affected the accuracy rates of both AIs (p < .001). The times at which the questions were asked did not affect the accuracy of either AI (p > .05). There was no statistically significant difference in the consistency rate between ChatGPT-4 and ChatGPT-4o (p = .123). The question groups did not affect the consistency of either AI, too (p > .05).

Conclusions: According to the results of this study, the accuracy of ChatGPT-4o was better than that of ChatGPT-4. These findings demonstrate that AI chatbots can be used in dental education. However, it is also necessary to consider the limitations and potential risks associated with AI.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

求助全文

约1分钟内获得全文去求助

来源期刊

International endodontic journal 医学-牙科与口腔外科

CiteScore

10.20

自引率

28.00%

发文量

195

审稿时长

4-8 weeks

期刊介绍： The International Endodontic Journal is published monthly and strives to publish original articles of the highest quality to disseminate scientific and clinical knowledge; all manuscripts are subjected to peer review. Original scientific articles are published in the areas of biomedical science, applied materials science, bioengineering, epidemiology and social science relevant to endodontic disease and its management, and to the restoration of root-treated teeth. In addition, review articles, reports of clinical cases, book reviews, summaries and abstracts of scientific meetings and news items are accepted. The International Endodontic Journal is essential reading for general dental practitioners, specialist endodontists, research, scientists and dental teachers.