Comparing patient education tools for chronic pain medications: Artificial intelligence chatbot versus traditional patient information leaflets

IF 2.9 Q1 ANESTHESIOLOGY Indian Journal of Anaesthesia Pub Date : 2024-06-07 DOI:10.4103/ija.ija_204_24

P. Gondode, Sakshi Duggal, Neha Garg, Surrender Sethupathy, Omshubham Asai, Pooja Lohakare

{"title":"Comparing patient education tools for chronic pain medications: Artificial intelligence chatbot versus traditional patient information leaflets","authors":"P. Gondode, Sakshi Duggal, Neha Garg, Surrender Sethupathy, Omshubham Asai, Pooja Lohakare","doi":"10.4103/ija.ija_204_24","DOIUrl":null,"url":null,"abstract":"\n \n Artificial intelligence (AI) chatbots like Conversational Generative Pre-trained Transformer (ChatGPT) have recently created much buzz, especially regarding patient education. Such informed patients understand and adhere to the management and get involved in shared decision making. The accuracy and understandability of the generated educational material are prime concerns. Thus, we compared ChatGPT with traditional patient information leaflets (PILs) about chronic pain medications.\n \n \n \n Patients' frequently asked questions were generated from PILs available on the official websites of the British Pain Society (BPS) and the Faculty of Pain Medicine. Eight blinded annexures were prepared for evaluation, consisting of traditional PILs from the BPS and AI-generated patient information materials structured similar to PILs by ChatGPT. The authors performed a comparative analysis to assess materials’ readability, emotional tone, accuracy, actionability, and understandability. Readability was measured using Flesch Reading Ease (FRE), Gunning Fog Index (GFI), and Flesch-Kincaid Grade Level (FKGL). Sentiment analysis determined emotional tone. An expert panel evaluated accuracy and completeness. Actionability and understandability were assessed with the Patient Education Materials Assessment Tool.\n \n \n \n Traditional PILs generally exhibited higher readability (P values < 0.05), with [mean (standard deviation)] FRE [62.25 (1.6) versus 48 (3.7)], GFI [11.85 (0.9) versus 13.65 (0.7)], and FKGL [8.33 (0.5) versus 10.23 (0.5)] but varied emotional tones, often negative, compared to more positive sentiments in ChatGPT-generated texts. Accuracy and completeness did not significantly differ between the two. Actionability and understandability scores were comparable.\n \n \n \n While AI chatbots offer efficient information delivery, ensuring accuracy and readability, patient-centeredness remains crucial. It is imperative to balance innovation with evidence-based practice.\n","PeriodicalId":13339,"journal":{"name":"Indian Journal of Anaesthesia","volume":null,"pages":null},"PeriodicalIF":2.9000,"publicationDate":"2024-06-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Indian Journal of Anaesthesia","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4103/ija.ija_204_24","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ANESTHESIOLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

Artificial intelligence (AI) chatbots like Conversational Generative Pre-trained Transformer (ChatGPT) have recently created much buzz, especially regarding patient education. Such informed patients understand and adhere to the management and get involved in shared decision making. The accuracy and understandability of the generated educational material are prime concerns. Thus, we compared ChatGPT with traditional patient information leaflets (PILs) about chronic pain medications. Patients' frequently asked questions were generated from PILs available on the official websites of the British Pain Society (BPS) and the Faculty of Pain Medicine. Eight blinded annexures were prepared for evaluation, consisting of traditional PILs from the BPS and AI-generated patient information materials structured similar to PILs by ChatGPT. The authors performed a comparative analysis to assess materials’ readability, emotional tone, accuracy, actionability, and understandability. Readability was measured using Flesch Reading Ease (FRE), Gunning Fog Index (GFI), and Flesch-Kincaid Grade Level (FKGL). Sentiment analysis determined emotional tone. An expert panel evaluated accuracy and completeness. Actionability and understandability were assessed with the Patient Education Materials Assessment Tool. Traditional PILs generally exhibited higher readability (P values < 0.05), with [mean (standard deviation)] FRE [62.25 (1.6) versus 48 (3.7)], GFI [11.85 (0.9) versus 13.65 (0.7)], and FKGL [8.33 (0.5) versus 10.23 (0.5)] but varied emotional tones, often negative, compared to more positive sentiments in ChatGPT-generated texts. Accuracy and completeness did not significantly differ between the two. Actionability and understandability scores were comparable. While AI chatbots offer efficient information delivery, ensuring accuracy and readability, patient-centeredness remains crucial. It is imperative to balance innovation with evidence-based practice.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

比较慢性疼痛药物的患者教育工具：人工智能聊天机器人与传统患者信息宣传单的比较

人工智能（AI）聊天机器人（如会话生成预训练转换器（ChatGPT））最近引起了热议，尤其是在患者教育方面。这些知情患者能够理解并坚持管理，并参与共同决策。生成的教育材料的准确性和可理解性是人们最关心的问题。因此，我们将 ChatGPT 与有关慢性疼痛药物的传统患者信息单（PILs）进行了比较。患者的常见问题来自英国疼痛协会（BPS）和疼痛医学学院官方网站上的 PIL。为评估准备了八份盲法附件，包括英国疼痛学会的传统 PIL 和 ChatGPT 的人工智能生成的患者信息资料，其结构与 PIL 相似。作者进行了对比分析，以评估材料的可读性、情感基调、准确性、可操作性和可理解性。可读性是通过弗莱什阅读容易度（FRE）、冈宁雾指数（GFI）和弗莱什-金凯德等级水平（FKGL）来衡量的。情感分析确定情感基调。专家小组评估了准确性和完整性。患者教育材料评估工具对可操作性和可理解性进行了评估。传统的 PIL 一般表现出更高的可读性（P 值小于 0.05），[平均值（标准差）] FRE [62.25 (1.6) 对 48 (3.7)]、GFI [11.85 (0.9) 对 13.65 (0.7)] 和 FKGL [8.33 (0.5) 对 10.23 (0.5)]，但与 ChatGPT 生成的文本中更积极的情感相比，情感基调各不相同，通常是消极的。两者在准确性和完整性方面没有明显差异。可操作性和可理解性得分相当。虽然人工智能聊天机器人能提供高效的信息传递，确保准确性和可读性，但以患者为中心仍然至关重要。必须在创新与循证实践之间取得平衡。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊