Evaluation of online chat-based artificial intelligence responses about inflammatory bowel disease and diet.

European Journal of Gastroenterology & Hepatology; IF 1.8; Zone 4 (Medicine); JCR Q3 (Gastroenterology & Hepatology); Pub Date: 2024-09-01; Epub Date: 2024-07-08; DOI: 10.1097/MEG.0000000000002815

Haider A Naqvi, Thilini Delungahawatta, Joseph O Atarere, Sumanth Kumar Bandaru, Jasmine B Barrow, Mark C Mattar

Pages: 1109-1112; Publication type: Journal Article

Citation count: 0

Abstract

Introduction: The USA has the highest age-standardized prevalence of inflammatory bowel disease (IBD). Both genetic and environmental factors have been implicated in IBD flares and multiple strategies are centered around avoiding dietary triggers to maintain remission. Chat-based artificial intelligence (CB-AI) has shown great potential in enhancing patient education in medicine. We evaluate the role of CB-AI in patient education on dietary management of IBD.

Methods: Six questions evaluating important concepts about the dietary management of IBD were posed to three CB-AI models (ChatGPT, BingChat, and YouChat) three different times. All responses were graded for appropriateness and reliability by two physicians using dietary information from the Crohn's and Colitis Foundation. Each response was graded as reliably appropriate, reliably inappropriate, or unreliable. The expert assessment of the reviewing physicians was validated by the joint probability of agreement for two raters.

Results: ChatGPT provided reliably appropriate responses to questions on dietary management of IBD more often than BingChat and YouChat. For two questions, more than one CB-AI provided unreliable responses. Each CB-AI provided examples within its responses, but the examples were not always appropriate. Whether or not the response was appropriate, the CB-AIs mentioned consulting with an expert in the field. The inter-rater reliability was 88.9%.

Discussion: CB-AIs have the potential to improve patient education and outcomes but studies evaluating their appropriateness for various health conditions are sparse. Our study showed that CB-AIs have the ability to provide appropriate answers to most questions regarding the dietary management of IBD.
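The joint probability of agreement named in the Methods is the fraction of graded items on which both raters assigned the same grade. The sketch below illustrates that calculation only. The abstract does not report how many items were graded or how the raters' grades were distributed, so the 18 items (assuming one grade per question and CB-AI pair) and the grade lists are hypothetical, chosen so that 16 of 18 agreements reproduces a value of 88.9%. The function name is likewise an assumption.

```python
from typing import Sequence


def joint_probability_of_agreement(
    rater_a: Sequence[str], rater_b: Sequence[str]
) -> float:
    """Return the fraction of items on which two raters gave the same grade."""
    if len(rater_a) != len(rater_b):
        raise ValueError("Both raters must grade the same number of items")
    if len(rater_a) == 0:
        raise ValueError("At least one graded item is required")
    matches = sum(a == b for a, b in zip(rater_a, rater_b))
    return matches / len(rater_a)


# Grade labels taken from the abstract.
RA = "reliably appropriate"
RI = "reliably inappropriate"
UN = "unreliable"

# Hypothetical grades, NOT study data: 18 items, assuming one grade per
# question (6) and CB-AI (3) pair. The raters disagree on two items.
rater_a = [RA] * 12 + [UN] * 4 + [RI] * 2
rater_b = [RA] * 12 + [UN] * 3 + [RA] + [RI] + [UN]

agreement = joint_probability_of_agreement(rater_a, rater_b)
print(f"Joint probability of agreement: {agreement:.1%}")
```

Unlike chance-corrected statistics such as Cohen's kappa, this measure does not adjust for agreement expected by chance, so it tends to look more favorable when one grade dominates.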

Source journal

European Journal of Gastroenterology & Hepatology (Medicine: Gastroenterology & Hepatology)

CiteScore: 4.40

Self-citation rate: 4.80%

Articles published: 269

Review time: 1 month

Journal description: European Journal of Gastroenterology & Hepatology publishes papers reporting original clinical and scientific research which are of a high standard and which contribute to the advancement of knowledge in the field of gastroenterology and hepatology. The journal publishes three types of manuscript: in-depth reviews (by invitation only), full papers and case reports. Manuscripts submitted to the journal will be accepted on the understanding that the author has not previously submitted the paper to another journal or had the material published elsewhere. Authors are asked to disclose any affiliations, including financial, consultant, or institutional associations, that might lead to bias or a conflict of interest.