ChatGPT’s ability to comprehend and answer cirrhosis related questions in Arabic

IF 1.1 4区医学 Q4 GASTROENTEROLOGY & HEPATOLOGY Arab Journal of Gastroenterology Pub Date : 2023-08-01 Epub Date: 2023-09-04 DOI:10.1016/j.ajg.2023.08.001

Jamil S. Samaan , Yee Hui Yeo , Wee Han Ng , Peng-Sheng Ting , Hirsh Trivedi , Aarshi Vipani , Ju Dong Yang , Omer Liran , Brennan Spiegel , Alexander Kuo , Walid S. Ayoub

{"title":"ChatGPT’s ability to comprehend and answer cirrhosis related questions in Arabic","authors":"Jamil S. Samaan , Yee Hui Yeo , Wee Han Ng , Peng-Sheng Ting , Hirsh Trivedi , Aarshi Vipani , Ju Dong Yang , Omer Liran , Brennan Spiegel , Alexander Kuo , Walid S. Ayoub","doi":"10.1016/j.ajg.2023.08.001","DOIUrl":null,"url":null,"abstract":"<div><h3>Background and study aims</h3><p>Cirrhosis is a chronic progressive disease which requires complex care. Its incidence is rising in the Arab countries making it the 7th leading cause of death in the Arab League in 2010. ChatGPT is a large language model with a growing body of literature demonstrating its ability to answer clinical questions. We examined ChatGPT’s accuracy in responding to cirrhosis related questions in Arabic and compared its performance to English.</p></div><div><h3>Materials and methods</h3><p>ChatGPTs responses to 91 questions in Arabic and English were graded by a transplant hepatologist fluent in both languages. Accuracy of responses was assessed using the scale: 1. Comprehensive, 2. Correct but inadequate, 3. Mixed with correct and incorrect/outdated data, and 4. Completely incorrect.Accuracy of Arabic compared to English responses was assessed using the scale: 1. Arabic response is more accurate, 2. Similar accuracy, 3. Arabic response is less accurate.</p></div><div><h3>Results</h3><p>The model provided 22 (24.2%) comprehensive, 44 (48.4%) correct but inadequate, 13 (14.3%) mixed with correct and incorrect/outdated data and 12 (13.2%) completely incorrect Arabic responses. When comparing the accuracy of Arabic and English responses, 9 (9.9%) of the Arabic responses were graded as more accurate, 52 (57.1%) similar in accuracy and 30 (33.0%) as less accurate compared to English.</p></div><div><h3>Conclusion</h3><p>ChatGPT has the potential to serve as an adjunct source of information for Arabic speaking patients with cirrhosis. The model provided correct responses in Arabic to 72.5% of questions, although its performance in Arabic was less accurate than in English. The model produced completely incorrect responses to 13.2% of questions, reinforcing its potential role as an adjunct and not replacement of care by licensed healthcare professionals. Future studies to refine this technology are needed to help Arabic speaking patients with cirrhosis across the globe understand their disease and improve their outcomes.</p></div>","PeriodicalId":48674,"journal":{"name":"Arab Journal of Gastroenterology","volume":"24 3","pages":"Pages 145-148"},"PeriodicalIF":1.1000,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Arab Journal of Gastroenterology","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1687197923000588","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2023/9/4 0:00:00","PubModel":"Epub","JCR":"Q4","JCRName":"GASTROENTEROLOGY & HEPATOLOGY","Score":null,"Total":0}

引用次数: 3

Abstract

Background and study aims

Cirrhosis is a chronic progressive disease which requires complex care. Its incidence is rising in the Arab countries making it the 7th leading cause of death in the Arab League in 2010. ChatGPT is a large language model with a growing body of literature demonstrating its ability to answer clinical questions. We examined ChatGPT’s accuracy in responding to cirrhosis related questions in Arabic and compared its performance to English.

Materials and methods

ChatGPTs responses to 91 questions in Arabic and English were graded by a transplant hepatologist fluent in both languages. Accuracy of responses was assessed using the scale: 1. Comprehensive, 2. Correct but inadequate, 3. Mixed with correct and incorrect/outdated data, and 4. Completely incorrect. Accuracy of Arabic compared to English responses was assessed using the scale: 1. Arabic response is more accurate, 2. Similar accuracy, 3. Arabic response is less accurate.

Results

The model provided 22 (24.2%) comprehensive, 44 (48.4%) correct but inadequate, 13 (14.3%) mixed with correct and incorrect/outdated data and 12 (13.2%) completely incorrect Arabic responses. When comparing the accuracy of Arabic and English responses, 9 (9.9%) of the Arabic responses were graded as more accurate, 52 (57.1%) similar in accuracy and 30 (33.0%) as less accurate compared to English.

Conclusion

ChatGPT has the potential to serve as an adjunct source of information for Arabic speaking patients with cirrhosis. The model provided correct responses in Arabic to 72.5% of questions, although its performance in Arabic was less accurate than in English. The model produced completely incorrect responses to 13.2% of questions, reinforcing its potential role as an adjunct and not replacement of care by licensed healthcare professionals. Future studies to refine this technology are needed to help Arabic speaking patients with cirrhosis across the globe understand their disease and improve their outcomes.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

ChatGPT用阿拉伯语理解和回答肝硬化相关问题的能力。

背景和研究目的：肝硬化是一种需要复杂护理的慢性进行性疾病。它在阿拉伯国家的发病率正在上升，使其成为2010年阿拉伯联盟第7大死亡原因。ChatGPT是一个大型语言模型，越来越多的文献证明了它回答临床问题的能力。我们检查了ChatGPT在回答阿拉伯语肝硬化相关问题方面的准确性，并将其表现与英语进行了比较。材料和方法：ChatGPT对91个阿拉伯语和英语问题的回答由一位精通两种语言的移植肝病学家进行了评分。使用量表评估了回答的准确性：1。综合，2。正确但不充分，3。与正确和不正确/过时的数据混合，以及4。完全不正确。阿拉伯语与英语回答的准确性使用量表进行评估：1。阿拉伯语的回答更准确，2。类似的准确性，3。阿拉伯语的回答不太准确。结果：该模型提供了22个（24.2%）全面的，44个（48.4%）正确但不充分的，13个（14.3%）混合了正确和不正确/过时的数据，12个（13.2%）完全不正确的阿拉伯语回答。当比较阿拉伯语和英语回答的准确性时，与英语相比，9个（9.9%）阿拉伯语回答被评为更准确，52个（57.1%）准确性相似，30个（33.0%）准确性较低。结论：ChatGPT有潜力作为阿拉伯语肝硬化患者的辅助信息来源。该模型对72.5%的问题提供了正确的阿拉伯语回答，尽管其阿拉伯语表现不如英语准确。该模型对13.2%的问题做出了完全错误的回答，强化了其作为辅助而非取代持照医疗专业人员护理的潜在作用。需要进一步完善这项技术的研究，以帮助全球讲阿拉伯语的肝硬化患者了解他们的疾病并改善他们的预后。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Arab Journal of Gastroenterology Medicine-Gastroenterology

CiteScore

2.70

自引率

0.00%

发文量

期刊介绍： Arab Journal of Gastroenterology (AJG) publishes different studies related to the digestive system. It aims to be the foremost scientific peer reviewed journal encompassing diverse studies related to the digestive system and its disorders, and serving the Pan-Arab and wider community working on gastrointestinal disorders.