Evaluation of online chat-based artificial intelligence responses about inflammatory bowel disease and diet.

IF 4.6 Q2 MATERIALS SCIENCE, BIOMATERIALS ACS Applied Bio Materials Pub Date : 2024-09-01 Epub Date: 2024-07-08 DOI:10.1097/MEG.0000000000002815
Haider A Naqvi, Thilini Delungahawatta, Joseph O Atarere, Sumanth Kumar Bandaru, Jasmine B Barrow, Mark C Mattar

Abstract

Introduction: The USA has the highest age-standardized prevalence of inflammatory bowel disease (IBD). Both genetic and environmental factors have been implicated in IBD flares, and multiple strategies center on avoiding dietary triggers to maintain remission. Chat-based artificial intelligence (CB-AI) has shown great potential in enhancing patient education in medicine. We evaluate the role of CB-AI in patient education on dietary management of IBD.

Methods: Six questions evaluating important concepts about the dietary management of IBD were posed to three CB-AI models (ChatGPT, BingChat, and YouChat) three different times. All responses were graded for appropriateness and reliability by two physicians using dietary information from the Crohn's and Colitis Foundation. The responses were graded as reliably appropriate, reliably inappropriate, or unreliable. The expert assessment of the reviewing physicians was validated by the joint probability of agreement for two raters.

Results: ChatGPT provided reliably appropriate responses to questions on dietary management of IBD more often than BingChat and YouChat. For two questions, more than one CB-AI provided unreliable responses. Each CB-AI provided examples within its responses, but the examples were not always appropriate. Whether the response was appropriate or not, CB-AIs mentioned consulting with an expert in the field. The inter-rater reliability was 88.9%.

Discussion: CB-AIs have the potential to improve patient education and outcomes, but studies evaluating their appropriateness for various health conditions are sparse. Our study showed that CB-AIs can provide appropriate answers to most questions regarding the dietary management of IBD.
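
The Methods mention the joint probability of agreement for two raters. A minimal sketch of that statistic follows, assuming each rater assigns one of the three grades to every item. The function name and all grades below are hypothetical and are not the study's data. The 18 items assume one grade per question-model pair (six questions, three models), which the abstract does not confirm; the illustrative lists were chosen to disagree on 2 of 18 items.

```python
from typing import Sequence


def joint_probability_of_agreement(rater_a: Sequence[str], rater_b: Sequence[str]) -> float:
    """Return the fraction of items on which two raters gave the same grade."""
    if len(rater_a) != len(rater_b):
        raise ValueError("both raters must grade the same items")
    if not rater_a:
        raise ValueError("at least one graded item is required")
    matches = sum(a == b for a, b in zip(rater_a, rater_b))
    return matches / len(rater_a)


# Grade labels taken from the abstract's three categories.
RA = "reliably appropriate"
RI = "reliably inappropriate"
UN = "unreliable"

# Hypothetical grades for 18 items (NOT the study's data).
rater_a = [RA] * 12 + [UN] * 4 + [RI] * 2
rater_b = [RA] * 12 + [UN] * 3 + [RA] + [RI] + [UN]

agreement = joint_probability_of_agreement(rater_a, rater_b)
print(f"{agreement:.1%}")  # 16 of 18 items match
```

The joint probability of agreement is the simplest agreement measure: it does not correct for agreement expected by chance, which statistics such as Cohen's kappa do.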
