Large language models: Are artificial intelligence-based chatbots a reliable source of patient information for spinal surgery?

IF 2.7 3区医学 Q2 CLINICAL NEUROLOGY European Spine Journal Pub Date : 2024-11-01 Epub Date: 2023-10-11 DOI:10.1007/s00586-023-07975-z

Anna Stroop, Tabea Stroop, Samer Zawy Alsofy, Makoto Nakamura, Frank Möllmann, Christoph Greiner, Ralf Stroop

{"title":"Large language models: Are artificial intelligence-based chatbots a reliable source of patient information for spinal surgery?","authors":"Anna Stroop, Tabea Stroop, Samer Zawy Alsofy, Makoto Nakamura, Frank Möllmann, Christoph Greiner, Ralf Stroop","doi":"10.1007/s00586-023-07975-z","DOIUrl":null,"url":null,"abstract":"Purpose: Large language models (LLM) have recently attracted attention because of their enormous performance. Based on artificial intelligence, LLM enable dialogic communication using quasi-natural language that approximates the quality of human communication. Thus, LLM could play an important role for patients to become informed. To evaluate the validity of an LLM in providing medical information, we used one of the first high-performance LLM (ChatGPT) on the clinical example of acute lumbar disc herniation (LDH).Methods: Twenty-four spinal surgeons experienced in LDH surgery directed questions to ChatGPT about the clinical picture of LDH from a patient's perspective. They evaluated the quality of ChatGPT responses and its potential use in medical communication. The responses were compared with the information content of a standard informed consent form.Results: ChatGPT provided good results in terms of comprehensibility, specificity, and satisfaction of responses and in terms of medical accuracy and completeness. ChatGPT was not able to provide all the information that was provided in the informed consent form, but did communicate information that was not listed there. In some cases, albeit minor, ChatGPT made medically inaccurate claims, such as listing kyphoplasty and vertebroplasty as surgical options for LDH.Conclusion: With the incipient use of artificial intelligence in communication, LLM will certainly become increasingly important to patients. Even if LLM are unlikely to play a role in clinical communication between physicians and patients at the moment, the opportunities-but also the risks-of this novel technology should be alertly monitored.","PeriodicalId":12323,"journal":{"name":"European Spine Journal","volume":" ","pages":"4135-4143"},"PeriodicalIF":2.7000,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Spine Journal","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s00586-023-07975-z","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2023/10/11 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"CLINICAL NEUROLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

Purpose: Large language models (LLM) have recently attracted attention because of their enormous performance. Based on artificial intelligence, LLM enable dialogic communication using quasi-natural language that approximates the quality of human communication. Thus, LLM could play an important role for patients to become informed. To evaluate the validity of an LLM in providing medical information, we used one of the first high-performance LLM (ChatGPT) on the clinical example of acute lumbar disc herniation (LDH).

Methods: Twenty-four spinal surgeons experienced in LDH surgery directed questions to ChatGPT about the clinical picture of LDH from a patient's perspective. They evaluated the quality of ChatGPT responses and its potential use in medical communication. The responses were compared with the information content of a standard informed consent form.

Results: ChatGPT provided good results in terms of comprehensibility, specificity, and satisfaction of responses and in terms of medical accuracy and completeness. ChatGPT was not able to provide all the information that was provided in the informed consent form, but did communicate information that was not listed there. In some cases, albeit minor, ChatGPT made medically inaccurate claims, such as listing kyphoplasty and vertebroplasty as surgical options for LDH.

Conclusion: With the incipient use of artificial intelligence in communication, LLM will certainly become increasingly important to patients. Even if LLM are unlikely to play a role in clinical communication between physicians and patients at the moment, the opportunities-but also the risks-of this novel technology should be alertly monitored.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

大型语言模型：基于人工智能的聊天机器人是脊柱手术患者信息的可靠来源吗？

目的：大型语言模型（LLM）最近因其巨大的性能而受到关注。LLM基于人工智能，使用接近人类交流质量的准自然语言实现对话交流。因此，LLM可以在患者知情方面发挥重要作用。为了评估LLM在提供医学信息方面的有效性，我们在急性腰椎间盘突出症（LDH）的临床例子中使用了第一种高性能LLM（ChatGPT）。方法：24名有LDH手术经验的脊柱外科医生从患者的角度向ChatGPT提出了关于LDH临床情况的问题。他们评估了ChatGPT反应的质量及其在医学交流中的潜在用途。将回复与标准知情同意书的信息内容进行了比较。结果：ChatGPT在回复的可理解性、特异性和满意度以及医疗准确性和完整性方面提供了良好的结果。ChatGPT无法提供知情同意书中提供的所有信息，但确实传达了未列出的信息。在某些情况下，尽管很小，但ChatGPT提出了医学上不准确的说法，例如将后凸成形术和椎体成形术列为LDH的手术选择。结论：随着人工智能在通信中的初步应用，LLM对患者来说肯定会变得越来越重要。即使LLM目前不太可能在医生和患者之间的临床沟通中发挥作用，也应该警惕地监测这项新技术的机会和风险。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

European Spine Journal 医学-临床神经学

CiteScore

4.80

自引率

10.70%

发文量

373

审稿时长

2-4 weeks

期刊介绍： "European Spine Journal" is a publication founded in response to the increasing trend toward specialization in spinal surgery and spinal pathology in general. The Journal is devoted to all spine related disciplines, including functional and surgical anatomy of the spine, biomechanics and pathophysiology, diagnostic procedures, and neurology, surgery and outcomes. The aim of "European Spine Journal" is to support the further development of highly innovative spine treatments including but not restricted to surgery and to provide an integrated and balanced view of diagnostic, research and treatment procedures as well as outcomes that will enhance effective collaboration among specialists worldwide. The “European Spine Journal” also participates in education by means of videos, interactive meetings and the endorsement of educative efforts. Official publication of EUROSPINE, The Spine Society of Europe