提高健康素养：评估经 ChatGPT 大语言模型修订的患者手册的可读性。

IF 2.6 3区医学 Q1 OTORHINOLARYNGOLOGY Otolaryngology- Head and Neck Surgery Pub Date : 2024-12-01 Epub Date: 2024-08-06 DOI:10.1002/ohn.927

Austin R Swisher, Arthur W Wu, Gene C Liu, Matthew K Lee, Taylor R Carle, Dennis M Tang

{"title":"提高健康素养：评估经 ChatGPT 大语言模型修订的患者手册的可读性。","authors":"Austin R Swisher, Arthur W Wu, Gene C Liu, Matthew K Lee, Taylor R Carle, Dennis M Tang","doi":"10.1002/ohn.927","DOIUrl":null,"url":null,"abstract":"Objective: To use an artificial intelligence (AI)-powered large language model (LLM) to improve readability of patient handouts.Study design: Review of online material modified by AI.Setting: Academic center.Methods: Five handout materials obtained from the American Rhinologic Society (ARS) and the American Academy of Facial Plastic and Reconstructive Surgery websites were assessed using validated readability metrics. The handouts were inputted into OpenAI's ChatGPT-4 after prompting: \"Rewrite the following at a 6th-grade reading level.\" The understandability and actionability of both native and LLM-revised versions were evaluated using the Patient Education Materials Assessment Tool (PEMAT). Results were compared using Wilcoxon rank-sum tests.Results: The mean readability scores of the standard (ARS, American Academy of Facial Plastic and Reconstructive Surgery) materials corresponded to \"difficult,\" with reading categories ranging between high school and university grade levels. Conversely, the LLM-revised handouts had an average seventh-grade reading level. LLM-revised handouts had better readability in nearly all metrics tested: Flesch-Kincaid Reading Ease (70.8 vs 43.9; P < .05), Gunning Fog Score (10.2 vs 14.42; P < .05), Simple Measure of Gobbledygook (9.9 vs 13.1; P < .05), Coleman-Liau (8.8 vs 12.6; P < .05), and Automated Readability Index (8.2 vs 10.7; P = .06). PEMAT scores were significantly higher in the LLM-revised handouts for understandability (91 vs 74%; P < .05) with similar actionability (42 vs 34%; P = .15) when compared to the standard materials.Conclusion: Patient-facing handouts can be augmented by ChatGPT with simple prompting to tailor information with improved readability. This study demonstrates the utility of LLMs to aid in rewriting patient handouts and may serve as a tool to help optimize education materials.Level of evidence: Level VI.","PeriodicalId":19707,"journal":{"name":"Otolaryngology- Head and Neck Surgery","volume":" ","pages":"1751-1757"},"PeriodicalIF":2.6000,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Enhancing Health Literacy: Evaluating the Readability of Patient Handouts Revised by ChatGPT's Large Language Model.\",\"authors\":\"Austin R Swisher, Arthur W Wu, Gene C Liu, Matthew K Lee, Taylor R Carle, Dennis M Tang\",\"doi\":\"10.1002/ohn.927\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Objective: To use an artificial intelligence (AI)-powered large language model (LLM) to improve readability of patient handouts.Study design: Review of online material modified by AI.Setting: Academic center.Methods: Five handout materials obtained from the American Rhinologic Society (ARS) and the American Academy of Facial Plastic and Reconstructive Surgery websites were assessed using validated readability metrics. The handouts were inputted into OpenAI's ChatGPT-4 after prompting: \\\"Rewrite the following at a 6th-grade reading level.\\\" The understandability and actionability of both native and LLM-revised versions were evaluated using the Patient Education Materials Assessment Tool (PEMAT). Results were compared using Wilcoxon rank-sum tests.Results: The mean readability scores of the standard (ARS, American Academy of Facial Plastic and Reconstructive Surgery) materials corresponded to \\\"difficult,\\\" with reading categories ranging between high school and university grade levels. Conversely, the LLM-revised handouts had an average seventh-grade reading level. LLM-revised handouts had better readability in nearly all metrics tested: Flesch-Kincaid Reading Ease (70.8 vs 43.9; P < .05), Gunning Fog Score (10.2 vs 14.42; P < .05), Simple Measure of Gobbledygook (9.9 vs 13.1; P < .05), Coleman-Liau (8.8 vs 12.6; P < .05), and Automated Readability Index (8.2 vs 10.7; P = .06). PEMAT scores were significantly higher in the LLM-revised handouts for understandability (91 vs 74%; P < .05) with similar actionability (42 vs 34%; P = .15) when compared to the standard materials.Conclusion: Patient-facing handouts can be augmented by ChatGPT with simple prompting to tailor information with improved readability. This study demonstrates the utility of LLMs to aid in rewriting patient handouts and may serve as a tool to help optimize education materials.Level of evidence: Level VI.\",\"PeriodicalId\":19707,\"journal\":{\"name\":\"Otolaryngology- Head and Neck Surgery\",\"volume\":\" \",\"pages\":\"1751-1757\"},\"PeriodicalIF\":2.6000,\"publicationDate\":\"2024-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Otolaryngology- Head and Neck Surgery\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1002/ohn.927\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/8/6 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"OTORHINOLARYNGOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Otolaryngology- Head and Neck Surgery","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1002/ohn.927","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/8/6 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"OTORHINOLARYNGOLOGY","Score":null,"Total":0}

引用次数: 0

摘要

目的：使用人工智能（AI）驱动的大型语言模型（LLM）提高患者手册的可读性：使用人工智能（AI）驱动的大型语言模型（LLM）提高患者手册的可读性：研究设计：审查经人工智能修改的在线资料：学术中心：使用经过验证的可读性指标对从美国鼻科学会（ARS）和美国面部整形外科学会网站上获取的五份讲义进行评估。根据提示将讲义输入到 OpenAI 的 ChatGPT-4 中："以六年级的阅读水平重写以下内容"。使用患者教育材料评估工具（PEMAT）对原生版本和 LLM 修订版本的可理解性和可操作性进行了评估。结果采用 Wilcoxon 秩和检验进行比较：结果：标准版（ARS，美国面部整形与重建外科学会）材料的平均可读性评分为 "困难"，阅读类别介于高中和大学年级之间。相反，LLM 修订版讲义的平均阅读水平为七年级水平。在几乎所有测试指标中，LLM 修订版讲义的可读性都更好：Flesch-Kincaid 阅读轻松度（70.8 vs 43.9；P 结论：LLM 修订版讲义的可读性更好：面向患者的讲义可以通过 ChatGPT 进行扩充，并通过简单的提示来定制信息，从而提高可读性。这项研究证明了 LLMs 在帮助改写患者手册方面的实用性，可作为帮助优化教育材料的工具：证据等级：VI 级。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Enhancing Health Literacy: Evaluating the Readability of Patient Handouts Revised by ChatGPT's Large Language Model.

Objective: To use an artificial intelligence (AI)-powered large language model (LLM) to improve readability of patient handouts.

Study design: Review of online material modified by AI.

Setting: Academic center.

Methods: Five handout materials obtained from the American Rhinologic Society (ARS) and the American Academy of Facial Plastic and Reconstructive Surgery websites were assessed using validated readability metrics. The handouts were inputted into OpenAI's ChatGPT-4 after prompting: "Rewrite the following at a 6th-grade reading level." The understandability and actionability of both native and LLM-revised versions were evaluated using the Patient Education Materials Assessment Tool (PEMAT). Results were compared using Wilcoxon rank-sum tests.

Results: The mean readability scores of the standard (ARS, American Academy of Facial Plastic and Reconstructive Surgery) materials corresponded to "difficult," with reading categories ranging between high school and university grade levels. Conversely, the LLM-revised handouts had an average seventh-grade reading level. LLM-revised handouts had better readability in nearly all metrics tested: Flesch-Kincaid Reading Ease (70.8 vs 43.9; P < .05), Gunning Fog Score (10.2 vs 14.42; P < .05), Simple Measure of Gobbledygook (9.9 vs 13.1; P < .05), Coleman-Liau (8.8 vs 12.6; P < .05), and Automated Readability Index (8.2 vs 10.7; P = .06). PEMAT scores were significantly higher in the LLM-revised handouts for understandability (91 vs 74%; P < .05) with similar actionability (42 vs 34%; P = .15) when compared to the standard materials.

Conclusion: Patient-facing handouts can be augmented by ChatGPT with simple prompting to tailor information with improved readability. This study demonstrates the utility of LLMs to aid in rewriting patient handouts and may serve as a tool to help optimize education materials.

Level of evidence: Level VI.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Otolaryngology- Head and Neck Surgery 医学-耳鼻喉科学

CiteScore

6.70

自引率

2.90%

发文量

250

审稿时长

2-4 weeks

期刊介绍： Otolaryngology–Head and Neck Surgery (OTO-HNS) is the official peer-reviewed publication of the American Academy of Otolaryngology–Head and Neck Surgery Foundation. The mission of Otolaryngology–Head and Neck Surgery is to publish contemporary, ethical, clinically relevant information in otolaryngology, head and neck surgery (ear, nose, throat, head, and neck disorders) that can be used by otolaryngologists, clinicians, scientists, and specialists to improve patient care and public health.