Appropriateness and Consistency of an Online Artificial Intelligence System's Response to Common Questions Regarding Cervical Fusion.

IF 1.7 4区 医学 Q3 CLINICAL NEUROLOGY Clinical Spine Surgery Pub Date : 2025-12-01 Epub Date: 2025-02-10 DOI:10.1097/BSD.0000000000001768
Mark Miller, William T DiCiurcio, Matthew Meade, Levi Buchan, Jeffrey Gleimer, Barrett Woods, Christopher Kepler
{"title":"Appropriateness and Consistency of an Online Artificial Intelligence System's Response to Common Questions Regarding Cervical Fusion.","authors":"Mark Miller, William T DiCiurcio, Matthew Meade, Levi Buchan, Jeffrey Gleimer, Barrett Woods, Christopher Kepler","doi":"10.1097/BSD.0000000000001768","DOIUrl":null,"url":null,"abstract":"<p><strong>Study design: </strong>Prospective survey study.</p><p><strong>Objective: </strong>To address a gap that exists concerning ChatGPT's ability to respond to various types of questions regarding cervical surgery.</p><p><strong>Summary of background data: </strong>Artificial Intelligence (AI) and machine learning have been creating great change in the landscape of scientific research. Chat Generative Pre-trained Transformer(ChatGPT), an online AI language model, has emerged as a powerful tool in clinical medicine and surgery. Previous studies have demonstrated appropriate and reliable responses from ChatGPT concerning patient questions regarding total joint arthroplasty, distal radius fractures, and lumbar laminectomy. However, there is a gap that exists in examining how accurate and reliable ChatGPT responses are to common questions related to cervical surgery.</p><p><strong>Materials and methods: </strong>Twenty questions regarding cervical surgery were presented to the online ChatGPT-3.5 web application 3 separate times, creating 60 responses. Responses were then analyzed by 3 fellowship-trained spine surgeons across 2 institutions using a modified Global Quality Scale (1-5 rating) to evaluate accuracy and utility. Descriptive statistics were reported based on responses, and intraclass correlation coefficients were then calculated to assess the consistency of response quality.</p><p><strong>Results: </strong>Out of all questions proposed to the AI platform, the average score was 3.17 (95% CI, 2.92, 3.42), with 66.7% of responses being recorded to be of at least \"moderate\" quality by 1 reviewer. Nine (45%) questions yielded responses that were graded at least \"moderate\" quality by all 3 reviewers. The test-retest reliability was poor with the intraclass correlation coefficient (ICC) calculated as 0.0941 (-0.222, 0.135).</p><p><strong>Conclusion: </strong>This study demonstrated that ChatGPT can answer common patient questions concerning cervical surgery with moderate quality during the majority of responses. Further research within AI is necessary to increase response.</p>","PeriodicalId":10457,"journal":{"name":"Clinical Spine Surgery","volume":" ","pages":"498-505"},"PeriodicalIF":1.7000,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Clinical Spine Surgery","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1097/BSD.0000000000001768","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/2/10 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"CLINICAL NEUROLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Study design: Prospective survey study.

Objective: To address a gap that exists concerning ChatGPT's ability to respond to various types of questions regarding cervical surgery.

Summary of background data: Artificial Intelligence (AI) and machine learning have been creating great change in the landscape of scientific research. Chat Generative Pre-trained Transformer(ChatGPT), an online AI language model, has emerged as a powerful tool in clinical medicine and surgery. Previous studies have demonstrated appropriate and reliable responses from ChatGPT concerning patient questions regarding total joint arthroplasty, distal radius fractures, and lumbar laminectomy. However, there is a gap that exists in examining how accurate and reliable ChatGPT responses are to common questions related to cervical surgery.

Materials and methods: Twenty questions regarding cervical surgery were presented to the online ChatGPT-3.5 web application 3 separate times, creating 60 responses. Responses were then analyzed by 3 fellowship-trained spine surgeons across 2 institutions using a modified Global Quality Scale (1-5 rating) to evaluate accuracy and utility. Descriptive statistics were reported based on responses, and intraclass correlation coefficients were then calculated to assess the consistency of response quality.

Results: Out of all questions proposed to the AI platform, the average score was 3.17 (95% CI, 2.92, 3.42), with 66.7% of responses being recorded to be of at least "moderate" quality by 1 reviewer. Nine (45%) questions yielded responses that were graded at least "moderate" quality by all 3 reviewers. The test-retest reliability was poor with the intraclass correlation coefficient (ICC) calculated as 0.0941 (-0.222, 0.135).

Conclusion: This study demonstrated that ChatGPT can answer common patient questions concerning cervical surgery with moderate quality during the majority of responses. Further research within AI is necessary to increase response.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
在线人工智能系统对颈椎融合常见问题响应的适当性和一致性。
研究设计:前瞻性调查研究。目的:解决ChatGPT在回答有关颈椎手术的各种问题方面存在的差距。背景数据总结:人工智能(AI)和机器学习正在给科学研究领域带来巨大变化。聊天生成预训练转换器(ChatGPT)是一种在线人工智能语言模型,已成为临床医学和外科手术的强大工具。先前的研究表明,ChatGPT对患者关于全关节置换术、桡骨远端骨折和腰椎椎板切除术的问题做出了适当和可靠的回应。然而,在检查ChatGPT回答与颈椎手术相关的常见问题的准确性和可靠性方面存在差距。材料与方法:将20个关于颈椎外科的问题分别3次提交到ChatGPT-3.5在线web应用程序中,产生60个回复。然后由来自2家机构的3名接受过奖学金培训的脊柱外科医生使用改良的全球质量量表(1-5分)对反馈进行分析,以评估准确性和实用性。根据反应报告描述性统计,然后计算类内相关系数以评估反应质量的一致性。结果:在向AI平台提出的所有问题中,平均得分为3.17分(95% CI, 2.92, 3.42),其中66.7%的回答被1名审稿人记录为至少“中等”质量。9个(45%)问题的回答被所有3位评论者评为至少“中等”质量。重测信度差,类内相关系数(ICC)为0.0941(-0.222,0.135)。结论:本研究表明,ChatGPT在大多数回答中都能以中等质量回答患者关于颈椎手术的常见问题。有必要在人工智能领域进行进一步研究,以提高反应。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Clinical Spine Surgery
Clinical Spine Surgery Medicine-Surgery
CiteScore
3.00
自引率
5.30%
发文量
236
期刊介绍: Clinical Spine Surgery is the ideal journal for the busy practicing spine surgeon or trainee, as it is the only journal necessary to keep up to date with new clinical research and surgical techniques. Readers get to watch leaders in the field debate controversial topics in a new controversies section, and gain access to evidence-based reviews of important pathologies in the systematic reviews section. The journal features a surgical technique complete with a video, and a tips and tricks section that allows surgeons to review the important steps prior to a complex procedure. Clinical Spine Surgery provides readers with primary research studies, specifically level 1, 2 and 3 studies, ensuring that articles that may actually change a surgeon’s practice will be read and published. Each issue includes a brief article that will help a surgeon better understand the business of healthcare, as well as an article that will help a surgeon understand how to interpret increasingly complex research methodology. Clinical Spine Surgery is your single source for up-to-date, evidence-based recommendations for spine care.
期刊最新文献
A Comparison of Demographic and Microorganism Differences Between De Novo and Postoperative Infections. Twelve-Month Comparative Efficacy of Lumbar Fusion and Radiofrequency Ablation in the Management of Lumbar Disc Herniation: A Retrospective Cohort Study. A Surgical Technique Guide for C1-2 Fixation By the Cervical Spine Research Society. Implant Migration After Anterior Cervical Spine Surgery: A Systematic Literature Review. Postoperative Footdrop Following Posterior Lumbar Spinal Fusion: Epidemiology, Risk Factors, and Associated Complications.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1