Accuracy of Spanish and English-generated ChatGPT responses to commonly asked patient questions about labor epidurals: a survey-based study among bilingual obstetric anesthesia experts

IF 2.6 3区 医学 Q2 ANESTHESIOLOGY International journal of obstetric anesthesia Pub Date : 2024-11-06 DOI:10.1016/j.ijoa.2024.104290
Antonio Gonzalez Fiol , Allison A. Mootz , Zili He , Carlos Delgado , Vilma Ortiz , Sharon C. Reale
{"title":"Accuracy of Spanish and English-generated ChatGPT responses to commonly asked patient questions about labor epidurals: a survey-based study among bilingual obstetric anesthesia experts","authors":"Antonio Gonzalez Fiol ,&nbsp;Allison A. Mootz ,&nbsp;Zili He ,&nbsp;Carlos Delgado ,&nbsp;Vilma Ortiz ,&nbsp;Sharon C. Reale","doi":"10.1016/j.ijoa.2024.104290","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>Large language models (LLMs), of which ChatGPT is the most well known, are now available to patients to seek medical advice in various languages. However, the accuracy of the information utilized to train these models remains unknown.</div></div><div><h3>Methods</h3><div>Ten commonly asked questions regarding labor epidurals were translated from English to Spanish, and all 20 questions were entered into ChatGPT version 3.5. The answers were transcribed. A survey was then sent to 10 bilingual fellowship-trained obstetric anesthesiologists to assess the accuracy of these answers utilizing a 5-point Likert scale.</div></div><div><h3>Results</h3><div>Overall, the accuracy scores for the ChatGPT-generated answers in Spanish were lower than for the English answers with a median score of 34 (IQR 33–36.5) versus 40.5 (IQR 39–44.3), respectively (<em>P</em> value 0.02). Answers to two questions were scored significantly lower: “Do epidurals prolong labor?” (2 (IQR 2–2.5) versus 4 (IQR 4–4.5), <em>P</em> value 0.03) and “Do epidurals increase the risk of needing cesarean delivery?” (3(IQR 2–4) versus 4 (IQR 4–5); P value 0.03). There was a strong agreement that answers to the question “Do epidurals cause autism” were accurate in both Spanish and English.</div></div><div><h3>Conclusion</h3><div>ChatGPT-generated answers in Spanish to ten questions about labor epidurals scored lower for accuracy<!--> <!-->than<!--> <!-->answers generated in English, particularly regarding the effect of labor epidurals on labor course and mode of delivery. This disparity in ChatGPT-generated information may extend already-known health inequities among non-English-speaking patients and perpetuate misinformation.</div></div>","PeriodicalId":14250,"journal":{"name":"International journal of obstetric anesthesia","volume":"61 ","pages":"Article 104290"},"PeriodicalIF":2.6000,"publicationDate":"2024-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International journal of obstetric anesthesia","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0959289X24003029","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ANESTHESIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Background

Large language models (LLMs), of which ChatGPT is the most well known, are now available to patients to seek medical advice in various languages. However, the accuracy of the information utilized to train these models remains unknown.

Methods

Ten commonly asked questions regarding labor epidurals were translated from English to Spanish, and all 20 questions were entered into ChatGPT version 3.5. The answers were transcribed. A survey was then sent to 10 bilingual fellowship-trained obstetric anesthesiologists to assess the accuracy of these answers utilizing a 5-point Likert scale.

Results

Overall, the accuracy scores for the ChatGPT-generated answers in Spanish were lower than for the English answers with a median score of 34 (IQR 33–36.5) versus 40.5 (IQR 39–44.3), respectively (P value 0.02). Answers to two questions were scored significantly lower: “Do epidurals prolong labor?” (2 (IQR 2–2.5) versus 4 (IQR 4–4.5), P value 0.03) and “Do epidurals increase the risk of needing cesarean delivery?” (3(IQR 2–4) versus 4 (IQR 4–5); P value 0.03). There was a strong agreement that answers to the question “Do epidurals cause autism” were accurate in both Spanish and English.

Conclusion

ChatGPT-generated answers in Spanish to ten questions about labor epidurals scored lower for accuracy than answers generated in English, particularly regarding the effect of labor epidurals on labor course and mode of delivery. This disparity in ChatGPT-generated information may extend already-known health inequities among non-English-speaking patients and perpetuate misinformation.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
西班牙语和英语生成的 ChatGPT 回答患者有关分娩硬膜外麻醉的常见问题的准确性:一项针对双语产科麻醉专家的调查研究。
背景:大语言模型(LLMs),其中最著名的是 ChatGPT,现在病人可以用各种语言寻求医疗建议。然而,用于训练这些模型的信息的准确性仍是未知数:将有关分娩硬膜外麻醉的 10 个常见问题从英语翻译成西班牙语,并将所有 20 个问题输入 ChatGPT 3.5 版。对答案进行了转录。然后向 10 位接受过双语研究培训的产科麻醉师发送了一份调查问卷,采用 5 点李克特量表评估这些答案的准确性:总体而言,ChatGPT 生成的西班牙语答案的准确性得分低于英语答案,中位数分别为 34(IQR 33-36.5)和 40.5(IQR 39-44.3)(P 值 0.02)。有两个问题的答案得分明显较低:"硬膜外麻醉会延长产程吗?"(2 (IQR 2-2.5) 对 4 (IQR 4-4.5),P 值 0.03)和 "硬膜外麻醉会增加需要剖宫产的风险吗?3(IQR 2-4)对 4(IQR 4-5);P 值 0.03)。对于 "硬膜外麻醉会导致自闭症吗 "这一问题,西班牙文和英文的答案都非常准确:结论:对于有关分娩镇痛剂的十个问题,用西班牙语通过 ChatGPT 生成的答案在准确性方面得分低于用英语生成的答案,尤其是在分娩镇痛剂对产程和分娩方式的影响方面。在 ChatGPT 生成的信息中存在的这种差异可能会扩大非英语患者中已知的健康不公平现象,并使错误信息长期存在。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
4.70
自引率
7.10%
发文量
285
审稿时长
58 days
期刊介绍: The International Journal of Obstetric Anesthesia is the only journal publishing original articles devoted exclusively to obstetric anesthesia and bringing together all three of its principal components; anesthesia care for operative delivery and the perioperative period, pain relief in labour and care of the critically ill obstetric patient. • Original research (both clinical and laboratory), short reports and case reports will be considered. • The journal also publishes invited review articles and debates on topical and controversial subjects in the area of obstetric anesthesia. • Articles on related topics such as perinatal physiology and pharmacology and all subjects of importance to obstetric anaesthetists/anesthesiologists are also welcome. The journal is peer-reviewed by international experts. Scholarship is stressed to include the focus on discovery, application of knowledge across fields, and informing the medical community. Through the peer-review process, we hope to attest to the quality of scholarships and guide the Journal to extend and transform knowledge in this important and expanding area.
期刊最新文献
Artificial intelligence-created personal statements compared with applicant-written personal statements: a survey of obstetric anesthesia fellowship program directors in the United States Patients’ perspectives on pain relief during childbirth and labor epidurals: A pilot qualitative study among women who chose to deliver without neuraxial labor analgesia Inhaled epoprostenol via high-flow nasal cannula and intravenous treprostinil for management of severe pulmonary arterial hypertension during cesarean delivery with epidural anesthesia: a case report Accuracy of Spanish and English-generated ChatGPT responses to commonly asked patient questions about labor epidurals: a survey-based study among bilingual obstetric anesthesia experts Labor epidural analgesia among Han and Uyghur parturients: a prospective observational study in China
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1