The evaluation of the performance of ChatGPT in the management of labor analgesia

IF 5 2区 医学 Q1 ANESTHESIOLOGY Journal of Clinical Anesthesia Pub Date : 2024-08-20 DOI:10.1016/j.jclinane.2024.111582
{"title":"The evaluation of the performance of ChatGPT in the management of labor analgesia","authors":"","doi":"10.1016/j.jclinane.2024.111582","DOIUrl":null,"url":null,"abstract":"<div><p><em>ChatGPT4</em> is a leading large language model (LLM) chatbot released by OpenAI in 2023. <em>ChatGPT4</em> can respond to free-text queries, answer questions and make suggestions regarding virtually any topic. <em>ChatGPT4</em> has successfully answered anesthesia and even obstetric anesthesia knowledge-based questions with reasonable accuracy. However, <em>ChatGPT4</em> has yet to be challenged in obstetric anesthesia clinical decision-making. <strong>Study Objective:</strong> In this study, we evaluated the performance of <em>ChatGPT4</em> in the management of clinical labor analgesia scenarios compared to expert obstetric anesthesiologists. <strong>Intervention:</strong> Eight clinical questions with progressively increasing medical complexity were posed to <em>ChatGPT4</em>. <strong>Measurements:</strong> The <em>ChatGPT4</em> responses were rated by seven expert obstetric anesthesiologists based on safety, accuracy and completeness of each response using a five-point Likert rating scale. <strong>Main Results:</strong> <em>ChatGPT4</em> was deemed safe in 73% of responses to the presented obstetric anesthesia clinical scenarios (27% of responses were deemed unsafe). None of the <em>ChatGPT4</em> responses were unanimously deemed to be safe by all seven expert obstetric anesthesiologists. Moreover, <em>ChatGPT4</em> responses were overall partly accurate (score 4 out of 5) and somewhat incomplete (score 3.5 out of 5). <strong>Conclusions:</strong> In summary, approximately one quarter of all responses by <em>ChatGPT4</em> were deemed unsafe by expert obstetric anesthesiologists. These findings may suggest the need for more fine-tuning and training of LLMs such as <em>ChatGPT4</em> specifically for clinical decision making in obstetric anesthesia or other specialized medical fields. These LLMs may come to play an important future role in assisting obstetric anesthesiologists in clinical decision making and enhancing overall patient care.</p></div>","PeriodicalId":15506,"journal":{"name":"Journal of Clinical Anesthesia","volume":null,"pages":null},"PeriodicalIF":5.0000,"publicationDate":"2024-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Clinical Anesthesia","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952818024002113","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ANESTHESIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

ChatGPT4 is a leading large language model (LLM) chatbot released by OpenAI in 2023. ChatGPT4 can respond to free-text queries, answer questions and make suggestions regarding virtually any topic. ChatGPT4 has successfully answered anesthesia and even obstetric anesthesia knowledge-based questions with reasonable accuracy. However, ChatGPT4 has yet to be challenged in obstetric anesthesia clinical decision-making. Study Objective: In this study, we evaluated the performance of ChatGPT4 in the management of clinical labor analgesia scenarios compared to expert obstetric anesthesiologists. Intervention: Eight clinical questions with progressively increasing medical complexity were posed to ChatGPT4. Measurements: The ChatGPT4 responses were rated by seven expert obstetric anesthesiologists based on safety, accuracy and completeness of each response using a five-point Likert rating scale. Main Results: ChatGPT4 was deemed safe in 73% of responses to the presented obstetric anesthesia clinical scenarios (27% of responses were deemed unsafe). None of the ChatGPT4 responses were unanimously deemed to be safe by all seven expert obstetric anesthesiologists. Moreover, ChatGPT4 responses were overall partly accurate (score 4 out of 5) and somewhat incomplete (score 3.5 out of 5). Conclusions: In summary, approximately one quarter of all responses by ChatGPT4 were deemed unsafe by expert obstetric anesthesiologists. These findings may suggest the need for more fine-tuning and training of LLMs such as ChatGPT4 specifically for clinical decision making in obstetric anesthesia or other specialized medical fields. These LLMs may come to play an important future role in assisting obstetric anesthesiologists in clinical decision making and enhancing overall patient care.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
评估 ChatGPT 在分娩镇痛管理中的作用
ChatGPT4 是 OpenAI 于 2023 年发布的一款领先的大型语言模型(LLM)聊天机器人。ChatGPT4 可以回复自由文本查询,回答几乎任何主题的问题并提出建议。ChatGPT4 已经成功回答了麻醉甚至产科麻醉知识方面的问题,准确率相当高。然而,ChatGPT4 在产科麻醉临床决策方面还未受到挑战。研究目的:在本研究中,我们评估了 ChatGPT4 与产科麻醉专家相比在临床分娩镇痛情景管理中的表现。干预:向 ChatGPT4 提出八个临床问题,这些问题的医学复杂性逐渐增加。测量:七位产科麻醉专家根据每个回答的安全性、准确性和完整性,采用五点李克特评分法对 ChatGPT4 的回答进行评分。主要结果:73%的产科麻醉临床场景回复认为 ChatGPT4 是安全的(27% 的回复被认为是不安全的)。所有七位产科麻醉专家一致认为 ChatGPT4 的回答都不安全。此外,ChatGPT4 的回答总体上部分准确(满分 5 分,得 4 分),部分不完整(满分 5 分,得 3.5 分)。结论:总之,在 ChatGPT4 的所有回复中,约有四分之一被产科麻醉专家认为是不安全的。这些发现可能表明,有必要对 ChatGPT4 等 LLM 进行更多的微调和培训,以专门用于产科麻醉或其他专业医疗领域的临床决策。这些 LLMs 未来可能会在协助产科麻醉医师进行临床决策和加强整体患者护理方面发挥重要作用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
7.40
自引率
4.50%
发文量
346
审稿时长
23 days
期刊介绍: The Journal of Clinical Anesthesia (JCA) addresses all aspects of anesthesia practice, including anesthetic administration, pharmacokinetics, preoperative and postoperative considerations, coexisting disease and other complicating factors, cost issues, and similar concerns anesthesiologists contend with daily. Exceptionally high standards of presentation and accuracy are maintained. The core of the journal is original contributions on subjects relevant to clinical practice, and rigorously peer-reviewed. Highly respected international experts have joined together to form the Editorial Board, sharing their years of experience and clinical expertise. Specialized section editors cover the various subspecialties within the field. To keep your practical clinical skills current, the journal bridges the gap between the laboratory and the clinical practice of anesthesiology and critical care to clarify how new insights can improve daily practice.
期刊最新文献
Benefit of intraoperative intravenous lidocaine on cognitive function following noncardiac surgery: An updated meta-analysis. Esketamine in postoperative recovery: Reliable for negative emotional relief, ambiguous for cognitive function. National trends in perioperative epidural analgesia use for surgical patients Response to comment on: “Effect of remimazolam versus propofol on hypotension after anesthetic induction in patients undergoing coronary artery bypass grafting: A randomized controlled trial” Letter to the editor regarding “Effect of remimazolam versus propofol on hypotension after anesthetic induction in patients undergoing coronary artery bypass grafting: A randomized controlled trial”
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1