评估大型语言模型的性能：ChatGPT 和 Google Bard 在神经退行性疾病临床病理会议中生成鉴别诊断结果的作用

IF 6.2 2区医学 Q1 CLINICAL NEUROLOGY Brain Pathology Pub Date : 2023-08-08 DOI:10.1111/bpa.13207

Shunsuke Koga, Nicholas B. Martin, Dennis W. Dickson

{"title":"评估大型语言模型的性能：ChatGPT 和 Google Bard 在神经退行性疾病临床病理会议中生成鉴别诊断结果的作用","authors":"Shunsuke Koga, Nicholas B. Martin, Dennis W. Dickson","doi":"10.1111/bpa.13207","DOIUrl":null,"url":null,"abstract":"<p>This study explores the utility of the large language models (LLMs), specifically ChatGPT and Google Bard, in predicting neuropathologic diagnoses from clinical summaries. A total of 25 cases of neurodegenerative disorders presented at Mayo Clinic brain bank Clinico-Pathological Conferences were analyzed. The LLMs provided multiple pathologic diagnoses and their rationales, which were compared with the final clinical diagnoses made by physicians. ChatGPT-3.5, ChatGPT-4, and Google Bard correctly made primary diagnoses in 32%, 52%, and 40% of cases, respectively, while correct diagnoses were included in 76%, 84%, and 76% of cases, respectively. These findings highlight the potential of artificial intelligence tools like ChatGPT in neuropathology, suggesting they may facilitate more comprehensive discussions in clinicopathological conferences.</p>","PeriodicalId":9290,"journal":{"name":"Brain Pathology","volume":"34 3","pages":""},"PeriodicalIF":6.2000,"publicationDate":"2023-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/bpa.13207","citationCount":"0","resultStr":"{\"title\":\"Evaluating the performance of large language models: ChatGPT and Google Bard in generating differential diagnoses in clinicopathological conferences of neurodegenerative disorders\",\"authors\":\"Shunsuke Koga, Nicholas B. Martin, Dennis W. Dickson\",\"doi\":\"10.1111/bpa.13207\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>This study explores the utility of the large language models (LLMs), specifically ChatGPT and Google Bard, in predicting neuropathologic diagnoses from clinical summaries. A total of 25 cases of neurodegenerative disorders presented at Mayo Clinic brain bank Clinico-Pathological Conferences were analyzed. The LLMs provided multiple pathologic diagnoses and their rationales, which were compared with the final clinical diagnoses made by physicians. ChatGPT-3.5, ChatGPT-4, and Google Bard correctly made primary diagnoses in 32%, 52%, and 40% of cases, respectively, while correct diagnoses were included in 76%, 84%, and 76% of cases, respectively. These findings highlight the potential of artificial intelligence tools like ChatGPT in neuropathology, suggesting they may facilitate more comprehensive discussions in clinicopathological conferences.</p>\",\"PeriodicalId\":9290,\"journal\":{\"name\":\"Brain Pathology\",\"volume\":\"34 3\",\"pages\":\"\"},\"PeriodicalIF\":6.2000,\"publicationDate\":\"2023-08-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1111/bpa.13207\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Brain Pathology\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1111/bpa.13207\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CLINICAL NEUROLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Brain Pathology","FirstCategoryId":"3","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/bpa.13207","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CLINICAL NEUROLOGY","Score":null,"Total":0}

引用次数: 0

摘要

本研究探讨了大型语言模型（LLM），特别是 ChatGPT 和 Google Bard，在从临床摘要中预测神经病理学诊断方面的实用性。研究分析了在梅奥诊所脑库临床病理会议上提交的 25 例神经退行性疾病病例。LLM 提供了多种病理诊断及其依据，并与医生的最终临床诊断进行了比较。ChatGPT-3.5、ChatGPT-4 和 Google Bard 分别有 32%、52% 和 40% 的病例做出了正确的初步诊断，同时分别有 76%、84% 和 76% 的病例纳入了正确的诊断。这些发现凸显了 ChatGPT 等人工智能工具在神经病理学领域的潜力，表明它们可以促进临床病理学会议进行更全面的讨论。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Evaluating the performance of large language models: ChatGPT and Google Bard in generating differential diagnoses in clinicopathological conferences of neurodegenerative disorders

This study explores the utility of the large language models (LLMs), specifically ChatGPT and Google Bard, in predicting neuropathologic diagnoses from clinical summaries. A total of 25 cases of neurodegenerative disorders presented at Mayo Clinic brain bank Clinico-Pathological Conferences were analyzed. The LLMs provided multiple pathologic diagnoses and their rationales, which were compared with the final clinical diagnoses made by physicians. ChatGPT-3.5, ChatGPT-4, and Google Bard correctly made primary diagnoses in 32%, 52%, and 40% of cases, respectively, while correct diagnoses were included in 76%, 84%, and 76% of cases, respectively. These findings highlight the potential of artificial intelligence tools like ChatGPT in neuropathology, suggesting they may facilitate more comprehensive discussions in clinicopathological conferences.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Brain Pathology 医学-病理学

CiteScore

13.20

自引率

3.10%

发文量

审稿时长

6-12 weeks

期刊介绍： Brain Pathology is the journal of choice for biomedical scientists investigating diseases of the nervous system. The official journal of the International Society of Neuropathology, Brain Pathology is a peer-reviewed quarterly publication that includes original research, review articles and symposia focuses on the pathogenesis of neurological disease.