Accuracy of ChatGPT responses on tracheotomy for patient education.

IF 2.2 3区医学 Q2 OTORHINOLARYNGOLOGY European Archives of Oto-Rhino-Laryngology Pub Date : 2024-11-01 Epub Date: 2024-10-02 DOI:10.1007/s00405-024-08859-8

Amina Khaldi, Shahram Machayekhi, Michele Salvagno, Antonino Maniaci, Luigi A Vaira, Luigi La Via, Fabio S Taccone, Jerome R Lechien

{"title":"Accuracy of ChatGPT responses on tracheotomy for patient education.","authors":"Amina Khaldi, Shahram Machayekhi, Michele Salvagno, Antonino Maniaci, Luigi A Vaira, Luigi La Via, Fabio S Taccone, Jerome R Lechien","doi":"10.1007/s00405-024-08859-8","DOIUrl":null,"url":null,"abstract":"Objective: To investigate the accuracy of information provided by ChatGPT-4o to patients about tracheotomy.Methods: Twenty common questions of patients about tracheotomy were presented to ChatGPT-4o twice (7-day intervals). The accuracy, clarity, relevance, completeness, referencing, and usefulness of responses were assessed by a board-certified otolaryngologist and a board-certified intensive care unit practitioner with the Quality Analysis of Medical Artificial Intelligence (QAMAI) tool. The interrater reliability and the stability of the ChatGPT-4o responses were evaluated with intraclass correlation coefficient (ICC) and Pearson correlation analysis.Results: The total scores of QAMAI were 22.85 ± 4.75 for the intensive care practitioner and 21.45 ± 3.95 for the otolaryngologist, which consists of moderate-to-high accuracy. The otolaryngologist and the ICU practitioner reported high ICC (0.807; 95%CI: 0.655-0.911). The highest QAMAI scores have been found for clarity and completeness of explanations. The QAMAI scores for the accuracy of the information and the referencing were the lowest. The information related to the post-laryngectomy tracheostomy remains incomplete or erroneous. ChatGPT-4o did not provide references for their responses. The stability analysis reported high stability in regenerated questions.Conclusion: The accuracy of ChatGPT-4o is moderate-to-high in providing information related to the tracheotomy. However, patients using ChatGPT-4o need to be cautious about the information related to tracheotomy care, steps, and the differences between temporary and permanent tracheotomies.","PeriodicalId":11952,"journal":{"name":"European Archives of Oto-Rhino-Laryngology","volume":" ","pages":"6167-6172"},"PeriodicalIF":2.2000,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Archives of Oto-Rhino-Laryngology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s00405-024-08859-8","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/10/2 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"OTORHINOLARYNGOLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

Objective: To investigate the accuracy of information provided by ChatGPT-4o to patients about tracheotomy.

Methods: Twenty common questions of patients about tracheotomy were presented to ChatGPT-4o twice (7-day intervals). The accuracy, clarity, relevance, completeness, referencing, and usefulness of responses were assessed by a board-certified otolaryngologist and a board-certified intensive care unit practitioner with the Quality Analysis of Medical Artificial Intelligence (QAMAI) tool. The interrater reliability and the stability of the ChatGPT-4o responses were evaluated with intraclass correlation coefficient (ICC) and Pearson correlation analysis.

Results: The total scores of QAMAI were 22.85 ± 4.75 for the intensive care practitioner and 21.45 ± 3.95 for the otolaryngologist, which consists of moderate-to-high accuracy. The otolaryngologist and the ICU practitioner reported high ICC (0.807; 95%CI: 0.655-0.911). The highest QAMAI scores have been found for clarity and completeness of explanations. The QAMAI scores for the accuracy of the information and the referencing were the lowest. The information related to the post-laryngectomy tracheostomy remains incomplete or erroneous. ChatGPT-4o did not provide references for their responses. The stability analysis reported high stability in regenerated questions.

Conclusion: The accuracy of ChatGPT-4o is moderate-to-high in providing information related to the tracheotomy. However, patients using ChatGPT-4o need to be cautious about the information related to tracheotomy care, steps, and the differences between temporary and permanent tracheotomies.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

用于患者教育的气管切开术 ChatGPT 响应的准确性。

目的调查 ChatGPT-4o 向患者提供的气管切开术相关信息的准确性：向 ChatGPT-4o 提交患者关于气管切开术的 20 个常见问题两次（间隔 7 天）。一位获得医学委员会认证的耳鼻喉科医生和一位获得医学委员会认证的重症监护室医生使用医学人工智能质量分析（QAMAI）工具对回答的准确性、清晰度、相关性、完整性、参考性和实用性进行了评估。通过类内相关系数（ICC）和皮尔逊相关分析评估了聊天GPT-4o回答的交互可靠性和稳定性：结果：重症监护医生的 QAMAI 总分为 22.85 ± 4.75，耳鼻喉科医生的总分为 21.45 ± 3.95，准确度为中高。耳鼻喉科医生和重症监护室医生报告的 ICC 较高（0.807；95%CI：0.655-0.911）。解释的清晰度和完整性的 QAMAI 得分最高。信息准确性和参考性的 QAMAI 得分最低。与喉切除术后气管切开术相关的信息仍不完整或存在错误。ChatGPT-4o 没有为其答复提供参考资料。稳定性分析表明，再生问题的稳定性较高：结论：在提供与气管切开术相关的信息方面，ChatGPT-4o 的准确性为中高水平。然而，使用 ChatGPT-4o 的患者需要谨慎对待与气管切开术护理、步骤以及临时和永久气管切开术之间的区别相关的信息。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

European Archives of Oto-Rhino-Laryngology 医学-耳鼻喉科学

CiteScore

5.30

自引率

7.70%

发文量

537

审稿时长

2-4 weeks

期刊介绍： Official Journal of European Union of Medical Specialists – ORL Section and Board Official Journal of Confederation of European Oto-Rhino-Laryngology Head and Neck Surgery "European Archives of Oto-Rhino-Laryngology" publishes original clinical reports and clinically relevant experimental studies, as well as short communications presenting new results of special interest. With peer review by a respected international editorial board and prompt English-language publication, the journal provides rapid dissemination of information by authors from around the world. This particular feature makes it the journal of choice for readers who want to be informed about the continuing state of the art concerning basic sciences and the diagnosis and management of diseases of the head and neck on an international level. European Archives of Oto-Rhino-Laryngology was founded in 1864 as "Archiv für Ohrenheilkunde" by A. von Tröltsch, A. Politzer and H. Schwartze.