ChatGPT’s ability to comprehend and answer cirrhosis related questions in Arabic

IF 1.1 4区 医学 Q4 GASTROENTEROLOGY & HEPATOLOGY Arab Journal of Gastroenterology Pub Date : 2023-08-01 DOI:10.1016/j.ajg.2023.08.001
Jamil S. Samaan , Yee Hui Yeo , Wee Han Ng , Peng-Sheng Ting , Hirsh Trivedi , Aarshi Vipani , Ju Dong Yang , Omer Liran , Brennan Spiegel , Alexander Kuo , Walid S. Ayoub
{"title":"ChatGPT’s ability to comprehend and answer cirrhosis related questions in Arabic","authors":"Jamil S. Samaan ,&nbsp;Yee Hui Yeo ,&nbsp;Wee Han Ng ,&nbsp;Peng-Sheng Ting ,&nbsp;Hirsh Trivedi ,&nbsp;Aarshi Vipani ,&nbsp;Ju Dong Yang ,&nbsp;Omer Liran ,&nbsp;Brennan Spiegel ,&nbsp;Alexander Kuo ,&nbsp;Walid S. Ayoub","doi":"10.1016/j.ajg.2023.08.001","DOIUrl":null,"url":null,"abstract":"<div><h3>Background and study aims</h3><p>Cirrhosis is a chronic progressive disease which requires complex care. Its incidence is rising in the Arab countries making it the 7th leading cause of death in the Arab League in 2010. ChatGPT is a large language model with a growing body of literature demonstrating its ability to answer clinical questions. We examined ChatGPT’s accuracy in responding to cirrhosis related questions in Arabic and compared its performance to English.</p></div><div><h3>Materials and methods</h3><p>ChatGPTs responses to 91 questions in Arabic and English were graded by a transplant hepatologist fluent in both languages. Accuracy of responses was assessed using the scale: 1. Comprehensive, 2. Correct but inadequate, 3. Mixed with correct and incorrect/outdated data, and 4. Completely incorrect.<!--> <!-->Accuracy of Arabic compared to English responses was assessed using the scale: 1. Arabic response is more accurate, 2. Similar accuracy, 3. Arabic response is less accurate.</p></div><div><h3>Results</h3><p>The model provided 22 (24.2%) comprehensive, 44 (48.4%) correct but inadequate, 13 (14.3%) mixed with correct and incorrect/outdated data and 12 (13.2%) completely incorrect Arabic responses. When comparing the accuracy of Arabic and English responses, 9 (9.9%) of the Arabic responses were graded as more accurate, 52 (57.1%) similar in accuracy and 30 (33.0%) as less accurate compared to English.</p></div><div><h3>Conclusion</h3><p>ChatGPT has the potential to serve as an adjunct source of information for Arabic speaking patients with cirrhosis. The model provided correct responses in Arabic to 72.5% of questions, although its performance in Arabic was less accurate than in English. The model produced completely incorrect responses to 13.2% of questions, reinforcing its potential role as an adjunct and not replacement of care by licensed healthcare professionals. Future studies to refine this technology are needed to help Arabic speaking patients with cirrhosis across the globe understand their disease and improve their outcomes.</p></div>","PeriodicalId":48674,"journal":{"name":"Arab Journal of Gastroenterology","volume":null,"pages":null},"PeriodicalIF":1.1000,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Arab Journal of Gastroenterology","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1687197923000588","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"GASTROENTEROLOGY & HEPATOLOGY","Score":null,"Total":0}
引用次数: 3

Abstract

Background and study aims

Cirrhosis is a chronic progressive disease which requires complex care. Its incidence is rising in the Arab countries making it the 7th leading cause of death in the Arab League in 2010. ChatGPT is a large language model with a growing body of literature demonstrating its ability to answer clinical questions. We examined ChatGPT’s accuracy in responding to cirrhosis related questions in Arabic and compared its performance to English.

Materials and methods

ChatGPTs responses to 91 questions in Arabic and English were graded by a transplant hepatologist fluent in both languages. Accuracy of responses was assessed using the scale: 1. Comprehensive, 2. Correct but inadequate, 3. Mixed with correct and incorrect/outdated data, and 4. Completely incorrect. Accuracy of Arabic compared to English responses was assessed using the scale: 1. Arabic response is more accurate, 2. Similar accuracy, 3. Arabic response is less accurate.

Results

The model provided 22 (24.2%) comprehensive, 44 (48.4%) correct but inadequate, 13 (14.3%) mixed with correct and incorrect/outdated data and 12 (13.2%) completely incorrect Arabic responses. When comparing the accuracy of Arabic and English responses, 9 (9.9%) of the Arabic responses were graded as more accurate, 52 (57.1%) similar in accuracy and 30 (33.0%) as less accurate compared to English.

Conclusion

ChatGPT has the potential to serve as an adjunct source of information for Arabic speaking patients with cirrhosis. The model provided correct responses in Arabic to 72.5% of questions, although its performance in Arabic was less accurate than in English. The model produced completely incorrect responses to 13.2% of questions, reinforcing its potential role as an adjunct and not replacement of care by licensed healthcare professionals. Future studies to refine this technology are needed to help Arabic speaking patients with cirrhosis across the globe understand their disease and improve their outcomes.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
ChatGPT用阿拉伯语理解和回答肝硬化相关问题的能力。
背景和研究目的:肝硬化是一种需要复杂护理的慢性进行性疾病。它在阿拉伯国家的发病率正在上升,使其成为2010年阿拉伯联盟第7大死亡原因。ChatGPT是一个大型语言模型,越来越多的文献证明了它回答临床问题的能力。我们检查了ChatGPT在回答阿拉伯语肝硬化相关问题方面的准确性,并将其表现与英语进行了比较。材料和方法:ChatGPT对91个阿拉伯语和英语问题的回答由一位精通两种语言的移植肝病学家进行了评分。使用量表评估了回答的准确性:1。综合,2。正确但不充分,3。与正确和不正确/过时的数据混合,以及4。完全不正确。阿拉伯语与英语回答的准确性使用量表进行评估:1。阿拉伯语的回答更准确,2。类似的准确性,3。阿拉伯语的回答不太准确。结果:该模型提供了22个(24.2%)全面的,44个(48.4%)正确但不充分的,13个(14.3%)混合了正确和不正确/过时的数据,12个(13.2%)完全不正确的阿拉伯语回答。当比较阿拉伯语和英语回答的准确性时,与英语相比,9个(9.9%)阿拉伯语回答被评为更准确,52个(57.1%)准确性相似,30个(33.0%)准确性较低。结论:ChatGPT有潜力作为阿拉伯语肝硬化患者的辅助信息来源。该模型对72.5%的问题提供了正确的阿拉伯语回答,尽管其阿拉伯语表现不如英语准确。该模型对13.2%的问题做出了完全错误的回答,强化了其作为辅助而非取代持照医疗专业人员护理的潜在作用。需要进一步完善这项技术的研究,以帮助全球讲阿拉伯语的肝硬化患者了解他们的疾病并改善他们的预后。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Arab Journal of Gastroenterology
Arab Journal of Gastroenterology Medicine-Gastroenterology
CiteScore
2.70
自引率
0.00%
发文量
52
期刊介绍: Arab Journal of Gastroenterology (AJG) publishes different studies related to the digestive system. It aims to be the foremost scientific peer reviewed journal encompassing diverse studies related to the digestive system and its disorders, and serving the Pan-Arab and wider community working on gastrointestinal disorders.
期刊最新文献
"Mitigating tuberculosis reactivation risk in IBD patients on anti-TNF therapy". Epidemiological and anatomopathological profile of colorectal cancer in Northern Morocco between 2017 and 2019. Ginsenoside Rg3 enhances the anticancer effects of 5-fluorouracil in colorectal cancer and reduces drug resistance and the Hedgehog pathway activation. Effect of Lactobacillus acidophilus, Calcium, and Moringa oleifera leaves extract co-administration can prevent chemical-induced carcinogenesis. Current trends and research hotspots in the study of flavonoids for ulcerative colitis: A bibliometric study.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1