GPT-4 as a Source of Patient Information for Anterior Cervical Discectomy and Fusion: A Comparative Analysis Against Google Web Search.

IF 4.6 Q2 MATERIALS SCIENCE, BIOMATERIALS ACS Applied Bio Materials Pub Date : 2024-11-01 Epub Date: 2024-03-21 DOI:10.1177/21925682241241241

Paul G Mastrokostas, Leonidas E Mastrokostas, Ahmed K Emara, Ian J Wellington, Elizabeth Ginalis, John K Houten, Amrit S Khalsa, Ahmed Saleh, Afshin E Razi, Mitchell K Ng

{"title":"GPT-4 as a Source of Patient Information for Anterior Cervical Discectomy and Fusion: A Comparative Analysis Against Google Web Search.","authors":"Paul G Mastrokostas, Leonidas E Mastrokostas, Ahmed K Emara, Ian J Wellington, Elizabeth Ginalis, John K Houten, Amrit S Khalsa, Ahmed Saleh, Afshin E Razi, Mitchell K Ng","doi":"10.1177/21925682241241241","DOIUrl":null,"url":null,"abstract":"Study design: Comparative study.Objectives: This study aims to compare Google and GPT-4 in terms of (1) question types, (2) response readability, (3) source quality, and (4) numerical response accuracy for the top 10 most frequently asked questions (FAQs) about anterior cervical discectomy and fusion (ACDF).Methods: \"Anterior cervical discectomy and fusion\" was searched on Google and GPT-4 on December 18, 2023. Top 10 FAQs were classified according to the Rothwell system. Source quality was evaluated using JAMA benchmark criteria and readability was assessed using Flesch Reading Ease and Flesch-Kincaid grade level. Differences in JAMA scores, Flesch-Kincaid grade level, Flesch Reading Ease, and word count between platforms were analyzed using Student's t-tests. Statistical significance was set at the .05 level.Results: Frequently asked questions from Google were varied, while GPT-4 focused on technical details and indications/management. GPT-4 showed a higher Flesch-Kincaid grade level (12.96 vs 9.28, P = .003), lower Flesch Reading Ease score (37.07 vs 54.85, P = .005), and higher JAMA scores for source quality (3.333 vs 1.800, P = .016). Numerically, 6 out of 10 responses varied between platforms, with GPT-4 providing broader recovery timelines for ACDF.Conclusions: This study demonstrates GPT-4's ability to elevate patient education by providing high-quality, diverse information tailored to those with advanced literacy levels. As AI technology evolves, refining these tools for accuracy and user-friendliness remains crucial, catering to patients' varying literacy levels and information needs in spine surgery.","PeriodicalId":2,"journal":{"name":"ACS Applied Bio Materials","volume":null,"pages":null},"PeriodicalIF":4.6000,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11529100/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Applied Bio Materials","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1177/21925682241241241","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/3/21 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"MATERIALS SCIENCE, BIOMATERIALS","Score":null,"Total":0}

引用次数: 0

Abstract

Study design: Comparative study.

Objectives: This study aims to compare Google and GPT-4 in terms of (1) question types, (2) response readability, (3) source quality, and (4) numerical response accuracy for the top 10 most frequently asked questions (FAQs) about anterior cervical discectomy and fusion (ACDF).

Methods: "Anterior cervical discectomy and fusion" was searched on Google and GPT-4 on December 18, 2023. Top 10 FAQs were classified according to the Rothwell system. Source quality was evaluated using JAMA benchmark criteria and readability was assessed using Flesch Reading Ease and Flesch-Kincaid grade level. Differences in JAMA scores, Flesch-Kincaid grade level, Flesch Reading Ease, and word count between platforms were analyzed using Student's t-tests. Statistical significance was set at the .05 level.

Results: Frequently asked questions from Google were varied, while GPT-4 focused on technical details and indications/management. GPT-4 showed a higher Flesch-Kincaid grade level (12.96 vs 9.28, P = .003), lower Flesch Reading Ease score (37.07 vs 54.85, P = .005), and higher JAMA scores for source quality (3.333 vs 1.800, P = .016). Numerically, 6 out of 10 responses varied between platforms, with GPT-4 providing broader recovery timelines for ACDF.

Conclusions: This study demonstrates GPT-4's ability to elevate patient education by providing high-quality, diverse information tailored to those with advanced literacy levels. As AI technology evolves, refining these tools for accuracy and user-friendliness remains crucial, catering to patients' varying literacy levels and information needs in spine surgery.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

GPT-4 作为颈椎前路切除术和融合术的患者信息来源：与谷歌网络搜索的对比分析。

研究设计比较研究：本研究旨在从以下几个方面对谷歌和 GPT-4 进行比较：(1) 问题类型；(2) 回答的可读性；(3) 来源质量；(4) 关于颈椎前路椎间盘切除和融合术（ACDF）的前 10 个最常见问题（FAQ）的数字回答准确性：2023年12月18日，在谷歌和GPT-4上搜索 "颈椎前路椎间盘切除术和融合术"。根据 Rothwell 系统对排名前 10 的常见问题进行了分类。采用《美国医学会杂志》基准标准评估来源质量，采用 Flesch Reading Ease 和 Flesch-Kincaid grade level 评估可读性。采用学生 t 检验法分析了不同平台之间在 JAMA 分数、Flesch-Kincaid 等级、Flesch 阅读易读性和字数方面的差异。统计显著性设定为 0.05：结果：谷歌的常见问题多种多样，而 GPT-4 则侧重于技术细节和适应症/管理。GPT-4显示出更高的Flesch-Kincaid等级（12.96 vs 9.28，P = .003）、更低的Flesch阅读容易度得分（37.07 vs 54.85，P = .005）和更高的JAMA来源质量得分（3.333 vs 1.800，P = .016）。从数字上看，10 个回答中有 6 个在平台之间存在差异，GPT-4 为 ACDF 提供了更宽的恢复时间：本研究表明，GPT-4 有能力通过为文化水平较高的患者提供高质量、多样化的信息来提升患者教育水平。随着人工智能技术的发展，针对脊柱手术中患者不同的文化水平和信息需求，完善这些工具的准确性和用户友好性仍然至关重要。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊