Comparing ChatGPT's and Surgeon's Responses to Thyroid-related Questions From Patients.

IF 5.1 2区医学 Q1 ENDOCRINOLOGY & METABOLISM Journal of Clinical Endocrinology & Metabolism Pub Date : 2025-02-18 DOI:10.1210/clinem/dgae235

Siyin Guo, Ruicen Li, Genpeng Li, Wenjie Chen, Jing Huang, Linye He, Yu Ma, Liying Wang, Hongping Zheng, Chunxiang Tian, Yatong Zhao, Xinmin Pan, Hongxing Wan, Dasheng Liu, Zhihui Li, Jianyong Lei

{"title":"Comparing ChatGPT's and Surgeon's Responses to Thyroid-related Questions From Patients.","authors":"Siyin Guo, Ruicen Li, Genpeng Li, Wenjie Chen, Jing Huang, Linye He, Yu Ma, Liying Wang, Hongping Zheng, Chunxiang Tian, Yatong Zhao, Xinmin Pan, Hongxing Wan, Dasheng Liu, Zhihui Li, Jianyong Lei","doi":"10.1210/clinem/dgae235","DOIUrl":null,"url":null,"abstract":"Context: For some common thyroid-related conditions with high prevalence and long follow-up times, ChatGPT can be used to respond to common thyroid-related questions.Objective: In this cross-sectional study, we assessed the ability of ChatGPT (version GPT-4.0) to provide accurate, comprehensive, compassionate, and satisfactory responses to common thyroid-related questions.Methods: First, we obtained 28 thyroid-related questions from the Huayitong app, which together with the 2 interfering questions eventually formed 30 questions. Then, these questions were responded to by ChatGPT (on July 19, 2023), a junior specialist, and a senior specialist (on July 20, 2023) separately. Finally, 26 patients and 11 thyroid surgeons evaluated those responses on 4 dimensions: accuracy, comprehensiveness, compassion, and satisfaction.Results: Among the 30 questions and responses, ChatGPT's speed of response was faster than that of the junior specialist (8.69 [7.53-9.48] vs 4.33 [4.05-4.60]; P < .001) and the senior specialist (8.69 [7.53-9.48] vs 4.22 [3.36-4.76]; P < .001). The word count of the ChatGPT's responses was greater than that of both the junior specialist (341.50 [301.00-384.25] vs 74.50 [51.75-84.75]; P < .001) and senior specialist (341.50 [301.00-384.25] vs 104.00 [63.75-177.75]; P < .001). ChatGPT received higher scores than the junior specialist and senior specialist in terms of accuracy, comprehensiveness, compassion, and satisfaction in responding to common thyroid-related questions.Conclusion: ChatGPT performed better than a junior specialist and senior specialist in answering common thyroid-related questions, but further research is needed to validate the logical ability of the ChatGPT for complex thyroid questions.","PeriodicalId":50238,"journal":{"name":"Journal of Clinical Endocrinology & Metabolism","volume":" ","pages":"e841-e850"},"PeriodicalIF":5.1000,"publicationDate":"2025-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Clinical Endocrinology & Metabolism","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1210/clinem/dgae235","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENDOCRINOLOGY & METABOLISM","Score":null,"Total":0}

引用次数: 0

Abstract

Context: For some common thyroid-related conditions with high prevalence and long follow-up times, ChatGPT can be used to respond to common thyroid-related questions.

Objective: In this cross-sectional study, we assessed the ability of ChatGPT (version GPT-4.0) to provide accurate, comprehensive, compassionate, and satisfactory responses to common thyroid-related questions.

Methods: First, we obtained 28 thyroid-related questions from the Huayitong app, which together with the 2 interfering questions eventually formed 30 questions. Then, these questions were responded to by ChatGPT (on July 19, 2023), a junior specialist, and a senior specialist (on July 20, 2023) separately. Finally, 26 patients and 11 thyroid surgeons evaluated those responses on 4 dimensions: accuracy, comprehensiveness, compassion, and satisfaction.

Results: Among the 30 questions and responses, ChatGPT's speed of response was faster than that of the junior specialist (8.69 [7.53-9.48] vs 4.33 [4.05-4.60]; P < .001) and the senior specialist (8.69 [7.53-9.48] vs 4.22 [3.36-4.76]; P < .001). The word count of the ChatGPT's responses was greater than that of both the junior specialist (341.50 [301.00-384.25] vs 74.50 [51.75-84.75]; P < .001) and senior specialist (341.50 [301.00-384.25] vs 104.00 [63.75-177.75]; P < .001). ChatGPT received higher scores than the junior specialist and senior specialist in terms of accuracy, comprehensiveness, compassion, and satisfaction in responding to common thyroid-related questions.

Conclusion: ChatGPT performed better than a junior specialist and senior specialist in answering common thyroid-related questions, but further research is needed to validate the logical ability of the ChatGPT for complex thyroid questions.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

比较 ChatGPT 和外科医生对患者提出的甲状腺相关问题的回答。

背景：对于一些发病率高、随访时间长的常见甲状腺相关疾病，可以使用 ChatGPT 来回答常见的甲状腺相关问题。在这项横断面研究中，我们评估了 ChatGPT（GPT-4.0 版）为常见甲状腺相关问题提供准确、全面、体贴和满意答复的能力：研究设计：首先，我们从 "华亿通 "应用程序中获取了 28 个甲状腺相关问题，这些问题与两个干扰问题最终组成了 30 个问题。然后，这些问题分别由ChatGPT（2023年7月19日）、初级专家和高级专家（2023年7月20日）进行回复。最后，26 名患者和 11 名甲状腺外科医生从准确性、全面性、同情心和满意度四个方面对这些回答进行了评价：结果：在 30 个问题和回答中，ChatGPT 的回答速度快于初级专家（8.69 [7.53-9.48] vs. 4.33 [4.05-4.60]，P 结论：ChatGPT 的表现优于初级专家：在回答常见的甲状腺相关问题时，ChatGPT 的表现优于初级专家和高级专家，但还需要进一步的研究来验证 ChatGPT 处理复杂甲状腺问题的逻辑能力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Journal of Clinical Endocrinology & Metabolism 医学-内分泌学与代谢

CiteScore

11.40

自引率

5.20%

发文量

673

审稿时长

1 months

期刊介绍： The Journal of Clinical Endocrinology & Metabolism is the world"s leading peer-reviewed journal for endocrine clinical research and cutting edge clinical practice reviews. Each issue provides the latest in-depth coverage of new developments enhancing our understanding, diagnosis and treatment of endocrine and metabolic disorders. Regular features of special interest to endocrine consultants include clinical trials, clinical reviews, clinical practice guidelines, case seminars, and controversies in clinical endocrinology, as well as original reports of the most important advances in patient-oriented endocrine and metabolic research. According to the latest Thomson Reuters Journal Citation Report, JCE&M articles were cited 64,185 times in 2008.