Performance of Novel GPT-4 in Otolaryngology Knowledge Assessment.

Indian Journal of Otolaryngology and Head and Neck Surgery (IF 0.6, JCR Q4, Surgery). Pub Date: 2024-12-01. Epub Date: 2024-08-03. DOI: 10.1007/s12070-024-04935-x
Lucy Revercomb, Aman M Patel, Daniel Fu, Andrey Filimonov
Volume 76, issue 6, pages 6112-6114. Publication type: Journal Article. PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11569072/pdf/
Citations: 0

Abstract

Purpose: GPT-4, recently released by OpenAI, improves upon GPT-3.5 with increased reliability and expanded capabilities, including user-specified, customizable GPT-4 models. This study compares the performance of GPT-4 with that of GPT-3.5 on Otolaryngology board-style questions.

Methods: 150 Otolaryngology board-style questions were obtained from the BoardVitals question bank. These questions, which had previously been assessed with GPT-3.5, were entered into standard GPT-4 and into a custom GPT-4 model designed to specialize in Otolaryngology board-style questions, emphasize precision, and provide evidence-based explanations.

Results: Standard GPT-4 correctly answered 72.0% of the questions and custom GPT-4 correctly answered 81.3%, whereas GPT-3.5 had answered 51.3% of the same questions correctly. On multivariable analysis, custom GPT-4 had higher odds of correctly answering questions than standard GPT-4 (adjusted odds ratio 2.19, P = 0.015). Both standard GPT-4 and custom GPT-4 performed worse on questions rated as hard than on questions rated as easy (P < 0.001).
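
The reported percentages can be checked with a little arithmetic. The sketch below is illustrative only: the whole-number counts are inferred from the percentages and the 150-question total rather than taken from the paper, and the names `reported_accuracy`, `implied_correct`, and `odds` are hypothetical. It also computes a crude (unadjusted) odds ratio for custom versus standard GPT-4. That value is not expected to match the adjusted odds ratio of 2.19, which comes from a multivariable model whose covariates are not listed in the abstract.

```python
# Illustrative check of the reported accuracies.
# Assumption: counts are reconstructed from the percentages and n = 150;
# the abstract itself does not state raw counts.
TOTAL_QUESTIONS = 150

reported_accuracy = {
    "GPT-3.5": 0.513,
    "standard GPT-4": 0.720,
    "custom GPT-4": 0.813,
}


def implied_correct(accuracy: float, total: int = TOTAL_QUESTIONS) -> int:
    """Nearest whole number of correct answers consistent with a percentage."""
    return round(accuracy * total)


def odds(correct: int, total: int = TOTAL_QUESTIONS) -> float:
    """Odds of a correct answer: correct answers divided by incorrect answers."""
    return correct / (total - correct)


counts = {name: implied_correct(acc) for name, acc in reported_accuracy.items()}

# Crude (unadjusted) odds ratio, custom GPT-4 versus standard GPT-4.
crude_or = odds(counts["custom GPT-4"]) / odds(counts["standard GPT-4"])

for name, correct in counts.items():
    print(f"{name}: {correct}/{TOTAL_QUESTIONS} = {correct / TOTAL_QUESTIONS:.1%}")
print(f"crude odds ratio (custom vs. standard GPT-4): {crude_or:.2f}")
```

Under these assumptions the implied counts are 77, 108, and 122 correct answers, which round back to the reported 51.3%, 72.0%, and 81.3%. The crude odds ratio comes out near 1.69, smaller than the adjusted figure, so the two should not be treated as interchangeable.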

Conclusions: Our study suggests that GPT-4 has higher accuracy than GPT-3.5 in answering Otolaryngology board-style questions. Our custom GPT-4 model demonstrated higher accuracy than standard GPT-4, potentially as a result of its instructions to specialize in Otolaryngology board-style questions, select exactly one answer, and emphasize precision. This suggests that custom models may further enhance the use of ChatGPT in medical education.

Source journal
CiteScore: 0.80
Self-citation rate: 0.00%
Articles published: 226
Review time: 6-12 weeks
Journal description: The Indian Journal of Otolaryngology and Head & Neck Surgery was founded in 1949 as the Indian Journal of Otolaryngology, a scientific journal published by the Association of Otolaryngologists of India, and was later renamed IJOHNS to reflect changes and progress in the field. IJOHNS, one of the oldest journals in India, is the official publication of the Association of Otolaryngologists of India and is about to publish its 67th volume in 2015. The journal is published quarterly and accepts articles in general Oto-Rhino-Laryngology and its subspecialties, such as Otology, Rhinology, Laryngology and Phonosurgery, Neurotology, and Head and Neck Surgery. It serves as a window on the clinical and research work done by the otolaryngology community in India and around the world, and it is a continuing source of useful clinical information, peer reviewed by eminent otolaryngologists in their respective fields. The journal accepts clinical reports, clinical studies, research articles in basic and applied Otolaryngology, short communications, and clinical records reporting unusual presentations or lesions and new surgical techniques. It acts as a catalyst for, and mirrors, the active interests and pursuits of Indian otolaryngologists. The journal also invites articles from senior and experienced authors worldwide on interesting topics in Otolaryngology and allied sciences. The print version is distributed free to about 4000 members of the Association of Otolaryngologists of India, and the e-Journal, which is shortly to appear on the Springer Board, can be accessed by all members. The Association of Otolaryngologists of India and M/s Springer India group have co-published IJOHNS since January 2007, and this partnership is expected to give the journal greater international presence and global exposure.
Latest articles from this journal
The Effect of Total Thyroidectomy on Trace Elements, Namely Zinc and Copper in Patients with Thyroid Diseases.
Could DTI Unlock the Mystery of Subjective Tinnitus: It's Time for Parameters That Go A Little Out of the Routine.
The Expressions of p16, HPV16-L1 and HPV18-E6 in Salivary Gland Tumours.
Demographic Variations in VEMP Responses: A Cross-Sectional Study of Normative Data from an Indian Population.
The Global Prevalence of Noise Induced Hearing Impairment Among Industrial Workers: A Systematic Review and Meta-Analysis.