Evaluation of the quality and readability of ChatGPT responses to frequently asked questions about myopia in traditional Chinese language.

DIGITAL HEALTH (IF 3.3; Zone 3, Medicine; JCR Q2, HEALTH CARE SCIENCES & SERVICES) Pub Date: 2024-09-02 eCollection Date: 2024-01-01 DOI: 10.1177/20552076241277021

Li-Chun Chang, Chi-Chin Sun, Ting-Han Chen, Der-Chong Tsai, Hui-Ling Lin, Li-Ling Liao

DIGITAL HEALTH, volume 10, article 20552076241277021. Publication type: Journal Article. Full text (PDF): https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11369861/pdf/

Citations: 0

Abstract

Introduction: ChatGPT can serve as an adjunct informational tool for ophthalmologists and their patients. However, the reliability and readability of its responses to myopia-related queries in the Chinese language remain underexplored.

Purpose: This study aimed to evaluate the ability of ChatGPT to answer questions about myopia that are frequently asked by parents and caregivers (FAQs).

Method: Myopia-related FAQs were input three times into fresh ChatGPT sessions, and the responses were evaluated by 10 ophthalmologists using a Likert scale for appropriateness, usability, and clarity. The Chinese Readability Index Explorer (CRIE) was used to evaluate the readability of each response. Inter-rater reliability among the reviewers was examined using Cohen's kappa coefficient, and Spearman's rank correlation analysis and one-way analysis of variance were used to investigate the relationship between CRIE scores and each criterion.

Results: Forty-five percent of ChatGPT's Chinese-language responses were appropriate and usable, and only 35% met all the set criteria. The CRIE scores for 20 ChatGPT responses ranged from 7.29 to 12.09, indicating a readability level equivalent to middle-to-high school. Responses about treatment efficacy and side effects were deficient on all three criteria.

Conclusions: The performance of ChatGPT in addressing pediatric myopia-related questions is currently suboptimal. As parents increasingly utilize digital resources to obtain health information, it has become crucial for eye care professionals to familiarize themselves with artificial intelligence-driven information on pediatric myopia.


Source journal

DIGITAL HEALTH Multiple-

CiteScore

2.90

Self-citation rate

7.70%

Articles published

302