Evaluation of Responses to Questions About Keratoconus Using ChatGPT-4.0, Google Gemini and Microsoft Copilot: A Comparative Study of Large Language Models on Keratoconus.
{"title":"Evaluation of Responses to Questions About Keratoconus Using ChatGPT-4.0, Google Gemini and Microsoft Copilot: A Comparative Study of Large Language Models on Keratoconus.","authors":"Suleyman Demir","doi":"10.1097/ICL.0000000000001158","DOIUrl":null,"url":null,"abstract":"<p><strong>Objectives: </strong>Large language models (LLMs) are increasingly being used today and are becoming increasingly important for providing accurate clinical information to patients and physicians. This study aimed to evaluate the effectiveness of generative pre-trained transforme-4.0 (ChatGPT-4.0), Google Gemini, and Microsoft Copilot LLMs in responding to patient questions regarding keratoconus.</p><p><strong>Methods: </strong>The LLMs' responses to the 25 most common questions about keratoconus asked by real-life patients were blindly rated by two ophthalmologists using a 5-point Likert scale. In addition, the DISCERN scale was used to evaluate the responses of the language models in terms of reliability, and the Flesch reading ease and Flesch-Kincaid grade level indices were used to determine readability.</p><p><strong>Results: </strong>ChatGPT-4.0 provided more detailed and accurate answers to patients' questions about keratoconus than Google Gemini and Microsoft Copilot, with 92% of the answers belonging to the \"agree\" or \"strongly agree\" categories. Significant differences were observed between all three LLMs on the Likert scale (P<0.001).</p><p><strong>Conclusions: </strong>Although the answers of ChatGPT-4.0 to questions about keratoconus were more complex for patients than those of other language programs, the information provided was reliable and accurate.</p>","PeriodicalId":50457,"journal":{"name":"Eye & Contact Lens-Science and Clinical Practice","volume":" ","pages":""},"PeriodicalIF":2.0000,"publicationDate":"2024-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Eye & Contact Lens-Science and Clinical Practice","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1097/ICL.0000000000001158","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"OPHTHALMOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Objectives: Large language models (LLMs) are increasingly being used today and are becoming increasingly important for providing accurate clinical information to patients and physicians. This study aimed to evaluate the effectiveness of generative pre-trained transforme-4.0 (ChatGPT-4.0), Google Gemini, and Microsoft Copilot LLMs in responding to patient questions regarding keratoconus.
Methods: The LLMs' responses to the 25 most common questions about keratoconus asked by real-life patients were blindly rated by two ophthalmologists using a 5-point Likert scale. In addition, the DISCERN scale was used to evaluate the responses of the language models in terms of reliability, and the Flesch reading ease and Flesch-Kincaid grade level indices were used to determine readability.
Results: ChatGPT-4.0 provided more detailed and accurate answers to patients' questions about keratoconus than Google Gemini and Microsoft Copilot, with 92% of the answers belonging to the "agree" or "strongly agree" categories. Significant differences were observed between all three LLMs on the Likert scale (P<0.001).
Conclusions: Although the answers of ChatGPT-4.0 to questions about keratoconus were more complex for patients than those of other language programs, the information provided was reliable and accurate.
期刊介绍:
Eye & Contact Lens: Science and Clinical Practice is the official journal of the Contact Lens Association of Ophthalmologists (CLAO), an international educational association for anterior segment research and clinical practice of interest to ophthalmologists, optometrists, and other vision care providers and researchers. Focusing especially on contact lenses, it also covers dry eye disease, MGD, infections, toxicity of drops and contact lens care solutions, topography, cornea surgery and post-operative care, optics, refractive surgery and corneal stability (eg, UV cross-linking). Peer-reviewed and published six times annually, it is a highly respected scientific journal in its field.