Comparing ChatGPT's and Surgeon's Responses to Thyroid-related Questions From Patients.

IF 5 2区 医学 Q1 ENDOCRINOLOGY & METABOLISM Journal of Clinical Endocrinology & Metabolism Pub Date : 2025-02-18 DOI:10.1210/clinem/dgae235
Siyin Guo, Ruicen Li, Genpeng Li, Wenjie Chen, Jing Huang, Linye He, Yu Ma, Liying Wang, Hongping Zheng, Chunxiang Tian, Yatong Zhao, Xinmin Pan, Hongxing Wan, Dasheng Liu, Zhihui Li, Jianyong Lei
{"title":"Comparing ChatGPT's and Surgeon's Responses to Thyroid-related Questions From Patients.","authors":"Siyin Guo, Ruicen Li, Genpeng Li, Wenjie Chen, Jing Huang, Linye He, Yu Ma, Liying Wang, Hongping Zheng, Chunxiang Tian, Yatong Zhao, Xinmin Pan, Hongxing Wan, Dasheng Liu, Zhihui Li, Jianyong Lei","doi":"10.1210/clinem/dgae235","DOIUrl":null,"url":null,"abstract":"<p><strong>Context: </strong>For some common thyroid-related conditions with high prevalence and long follow-up times, ChatGPT can be used to respond to common thyroid-related questions.</p><p><strong>Objective: </strong>In this cross-sectional study, we assessed the ability of ChatGPT (version GPT-4.0) to provide accurate, comprehensive, compassionate, and satisfactory responses to common thyroid-related questions.</p><p><strong>Methods: </strong>First, we obtained 28 thyroid-related questions from the Huayitong app, which together with the 2 interfering questions eventually formed 30 questions. Then, these questions were responded to by ChatGPT (on July 19, 2023), a junior specialist, and a senior specialist (on July 20, 2023) separately. Finally, 26 patients and 11 thyroid surgeons evaluated those responses on 4 dimensions: accuracy, comprehensiveness, compassion, and satisfaction.</p><p><strong>Results: </strong>Among the 30 questions and responses, ChatGPT's speed of response was faster than that of the junior specialist (8.69 [7.53-9.48] vs 4.33 [4.05-4.60]; P < .001) and the senior specialist (8.69 [7.53-9.48] vs 4.22 [3.36-4.76]; P < .001). The word count of the ChatGPT's responses was greater than that of both the junior specialist (341.50 [301.00-384.25] vs 74.50 [51.75-84.75]; P < .001) and senior specialist (341.50 [301.00-384.25] vs 104.00 [63.75-177.75]; P < .001). ChatGPT received higher scores than the junior specialist and senior specialist in terms of accuracy, comprehensiveness, compassion, and satisfaction in responding to common thyroid-related questions.</p><p><strong>Conclusion: </strong>ChatGPT performed better than a junior specialist and senior specialist in answering common thyroid-related questions, but further research is needed to validate the logical ability of the ChatGPT for complex thyroid questions.</p>","PeriodicalId":50238,"journal":{"name":"Journal of Clinical Endocrinology & Metabolism","volume":" ","pages":"e841-e850"},"PeriodicalIF":5.0000,"publicationDate":"2025-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Clinical Endocrinology & Metabolism","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1210/clinem/dgae235","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENDOCRINOLOGY & METABOLISM","Score":null,"Total":0}
引用次数: 0

Abstract

Context: For some common thyroid-related conditions with high prevalence and long follow-up times, ChatGPT can be used to respond to common thyroid-related questions.

Objective: In this cross-sectional study, we assessed the ability of ChatGPT (version GPT-4.0) to provide accurate, comprehensive, compassionate, and satisfactory responses to common thyroid-related questions.

Methods: First, we obtained 28 thyroid-related questions from the Huayitong app, which together with the 2 interfering questions eventually formed 30 questions. Then, these questions were responded to by ChatGPT (on July 19, 2023), a junior specialist, and a senior specialist (on July 20, 2023) separately. Finally, 26 patients and 11 thyroid surgeons evaluated those responses on 4 dimensions: accuracy, comprehensiveness, compassion, and satisfaction.

Results: Among the 30 questions and responses, ChatGPT's speed of response was faster than that of the junior specialist (8.69 [7.53-9.48] vs 4.33 [4.05-4.60]; P < .001) and the senior specialist (8.69 [7.53-9.48] vs 4.22 [3.36-4.76]; P < .001). The word count of the ChatGPT's responses was greater than that of both the junior specialist (341.50 [301.00-384.25] vs 74.50 [51.75-84.75]; P < .001) and senior specialist (341.50 [301.00-384.25] vs 104.00 [63.75-177.75]; P < .001). ChatGPT received higher scores than the junior specialist and senior specialist in terms of accuracy, comprehensiveness, compassion, and satisfaction in responding to common thyroid-related questions.

Conclusion: ChatGPT performed better than a junior specialist and senior specialist in answering common thyroid-related questions, but further research is needed to validate the logical ability of the ChatGPT for complex thyroid questions.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
比较 ChatGPT 和外科医生对患者提出的甲状腺相关问题的回答。
背景:对于一些发病率高、随访时间长的常见甲状腺相关疾病,可以使用 ChatGPT 来回答常见的甲状腺相关问题。在这项横断面研究中,我们评估了 ChatGPT(GPT-4.0 版)为常见甲状腺相关问题提供准确、全面、体贴和满意答复的能力:研究设计:首先,我们从 "华亿通 "应用程序中获取了 28 个甲状腺相关问题,这些问题与两个干扰问题最终组成了 30 个问题。然后,这些问题分别由ChatGPT(2023年7月19日)、初级专家和高级专家(2023年7月20日)进行回复。最后,26 名患者和 11 名甲状腺外科医生从准确性、全面性、同情心和满意度四个方面对这些回答进行了评价:结果:在 30 个问题和回答中,ChatGPT 的回答速度快于初级专家(8.69 [7.53-9.48] vs. 4.33 [4.05-4.60],P 结论:ChatGPT 的表现优于初级专家:在回答常见的甲状腺相关问题时,ChatGPT 的表现优于初级专家和高级专家,但还需要进一步的研究来验证 ChatGPT 处理复杂甲状腺问题的逻辑能力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Journal of Clinical Endocrinology & Metabolism
Journal of Clinical Endocrinology & Metabolism 医学-内分泌学与代谢
CiteScore
11.40
自引率
5.20%
发文量
673
审稿时长
1 months
期刊介绍: The Journal of Clinical Endocrinology & Metabolism is the world"s leading peer-reviewed journal for endocrine clinical research and cutting edge clinical practice reviews. Each issue provides the latest in-depth coverage of new developments enhancing our understanding, diagnosis and treatment of endocrine and metabolic disorders. Regular features of special interest to endocrine consultants include clinical trials, clinical reviews, clinical practice guidelines, case seminars, and controversies in clinical endocrinology, as well as original reports of the most important advances in patient-oriented endocrine and metabolic research. According to the latest Thomson Reuters Journal Citation Report, JCE&M articles were cited 64,185 times in 2008.
期刊最新文献
The Risk of Adrenal Insufficiency after Treatment with Relatlimab in Combination with Nivolumab is Higher than Expected. Tailoring Exercise Prescription for Effective Diabetes Glucose Management. Association of Maternal Thyroglobulin Antibody with Preterm Birth in Euthyroid Women. Response to Letter to the Editor from Prickett and Espiner: 'Dynamic Response of Musclin, a Myokine, to Aerobic Exercise and Its Interplay with Natriuretic Peptides and Receptor C'. SGLT2i and Cardiovascular Events in Patients With Concomitant Atrial Fibrillation and Diabetes: A TriNetX Cohort Study.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1