葡萄膜炎决策过程中的人工智能大语言模型。

Inès Schumacher, Virginie Manuela Marie Bühler, Damian Jaggi, Janice Roth
{"title":"葡萄膜炎决策过程中的人工智能大语言模型。","authors":"Inès Schumacher, Virginie Manuela Marie Bühler, Damian Jaggi, Janice Roth","doi":"10.1186/s40942-024-00581-1","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Uveitis is the ophthalmic subfield dealing with a broad range of intraocular inflammatory diseases. With the raising importance of LLM such as ChatGPT and their potential use in the medical field, this research explores the strengths and weaknesses of its applicability in the subfield of uveitis.</p><p><strong>Methods: </strong>A series of highly clinically relevant questions were asked three consecutive times (attempts 1, 2 and 3) of the LLM regarding current uveitis cases. The answers were classified on whether they were accurate and sufficient, partially accurate and sufficient or inaccurate and insufficient. Statistical analysis included descriptive analysis, normality distribution, non-parametric test and reliability tests. References were checked for their correctness in different medical databases.</p><p><strong>Results: </strong>The data showed non-normal distribution. Data between subgroups (attempts 1, 2 and 3) was comparable (Kruskal-Wallis H test, p-value = 0.7338). There was a moderate agreement between attempt 1 and attempt 2 (Cohen's kappa, ĸ = 0.5172) as well as between attempt 2 and attempt 3 (Cohen's kappa, ĸ = 0.4913). There was a fair agreement between attempt 1 and attempt 3 (Cohen's kappa, ĸ = 0.3647). The average agreement was moderate (Cohen's kappa, ĸ = 0.4577). Between the three attempts together, there was a moderate agreement (Fleiss' kappa, ĸ = 0.4534). A total of 52 references were generated by the LLM. 22 references (42.3%) were found to be accurate and correctly cited. Another 22 references (42.3%) could not be located in any of the searched databases. The remaining 8 references (15.4%) were found to exist, but were either misinterpreted or incorrectly cited by the LLM.</p><p><strong>Conclusion: </strong>Our results demonstrate the significant potential of LLMs in uveitis. However, their implementation requires rigorous training and comprehensive testing for specific medical tasks. We also found out that the references made by ChatGPT 4.o were in most cases incorrect. LLMs are likely to become invaluable tools in shaping the future of ophthalmology, enhancing clinical decision-making and patient care.</p>","PeriodicalId":14289,"journal":{"name":"International Journal of Retina and Vitreous","volume":"10 1","pages":"63"},"PeriodicalIF":1.9000,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11389245/pdf/","citationCount":"0","resultStr":"{\"title\":\"Artificial intelligence derived large language model in decision-making process in uveitis.\",\"authors\":\"Inès Schumacher, Virginie Manuela Marie Bühler, Damian Jaggi, Janice Roth\",\"doi\":\"10.1186/s40942-024-00581-1\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Uveitis is the ophthalmic subfield dealing with a broad range of intraocular inflammatory diseases. With the raising importance of LLM such as ChatGPT and their potential use in the medical field, this research explores the strengths and weaknesses of its applicability in the subfield of uveitis.</p><p><strong>Methods: </strong>A series of highly clinically relevant questions were asked three consecutive times (attempts 1, 2 and 3) of the LLM regarding current uveitis cases. The answers were classified on whether they were accurate and sufficient, partially accurate and sufficient or inaccurate and insufficient. Statistical analysis included descriptive analysis, normality distribution, non-parametric test and reliability tests. References were checked for their correctness in different medical databases.</p><p><strong>Results: </strong>The data showed non-normal distribution. Data between subgroups (attempts 1, 2 and 3) was comparable (Kruskal-Wallis H test, p-value = 0.7338). There was a moderate agreement between attempt 1 and attempt 2 (Cohen's kappa, ĸ = 0.5172) as well as between attempt 2 and attempt 3 (Cohen's kappa, ĸ = 0.4913). There was a fair agreement between attempt 1 and attempt 3 (Cohen's kappa, ĸ = 0.3647). The average agreement was moderate (Cohen's kappa, ĸ = 0.4577). Between the three attempts together, there was a moderate agreement (Fleiss' kappa, ĸ = 0.4534). A total of 52 references were generated by the LLM. 22 references (42.3%) were found to be accurate and correctly cited. Another 22 references (42.3%) could not be located in any of the searched databases. The remaining 8 references (15.4%) were found to exist, but were either misinterpreted or incorrectly cited by the LLM.</p><p><strong>Conclusion: </strong>Our results demonstrate the significant potential of LLMs in uveitis. However, their implementation requires rigorous training and comprehensive testing for specific medical tasks. We also found out that the references made by ChatGPT 4.o were in most cases incorrect. LLMs are likely to become invaluable tools in shaping the future of ophthalmology, enhancing clinical decision-making and patient care.</p>\",\"PeriodicalId\":14289,\"journal\":{\"name\":\"International Journal of Retina and Vitreous\",\"volume\":\"10 1\",\"pages\":\"63\"},\"PeriodicalIF\":1.9000,\"publicationDate\":\"2024-09-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11389245/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Retina and Vitreous\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1186/s40942-024-00581-1\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"OPHTHALMOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Retina and Vitreous","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1186/s40942-024-00581-1","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"OPHTHALMOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

背景:葡萄膜炎是眼科的一个子领域,涉及范围广泛的眼内炎症性疾病。随着 ChatGPT 等 LLM 的重要性不断提高及其在医学领域的潜在应用,本研究探讨了其在葡萄膜炎子领域应用的优缺点:方法:就当前葡萄膜炎病例连续三次(尝试 1、2 和 3)向 LLM 提出一系列与临床高度相关的问题。答案按准确和充分、部分准确和充分或不准确和不充分进行分类。统计分析包括描述性分析、正态分布、非参数检验和可靠性检验。在不同的医学数据库中检查了参考文献的正确性:结果:数据呈非正态分布。分组(尝试 1、2 和 3)之间的数据具有可比性(Kruskal-Wallis H 检验,P 值 = 0.7338)。尝试 1 和尝试 2 之间(Cohen's kappa, ĸ = 0.5172)以及尝试 2 和尝试 3 之间(Cohen's kappa, ĸ = 0.4913)的一致性适中。尝试 1 和尝试 3 之间的一致性一般(Cohen's kappa, ĸ = 0.3647)。平均一致性为中等(Cohen's kappa, ĸ = 0.4577)。三次尝试的平均一致性为中等(Fleiss'kappa, ĸ = 0.4534)。LLM 共生成了 52 篇参考文献。其中 22 篇参考文献(42.3%)被认为是准确和正确引用的。另有 22 篇参考文献(42.3%)无法在任何检索数据库中找到。其余 8 篇参考文献(15.4%)被发现存在,但被 LLM 误读或错误引用:我们的研究结果表明,LLM 在葡萄膜炎方面具有巨大的潜力。结论:我们的研究结果表明了 LLM 在葡萄膜炎方面的巨大潜力,但其实施需要针对特定的医疗任务进行严格的培训和全面的测试。我们还发现,ChatGPT 4.o 的引用在大多数情况下都是错误的。LLM 很可能会成为塑造眼科未来、加强临床决策和患者护理的宝贵工具。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Artificial intelligence derived large language model in decision-making process in uveitis.

Background: Uveitis is the ophthalmic subfield dealing with a broad range of intraocular inflammatory diseases. With the raising importance of LLM such as ChatGPT and their potential use in the medical field, this research explores the strengths and weaknesses of its applicability in the subfield of uveitis.

Methods: A series of highly clinically relevant questions were asked three consecutive times (attempts 1, 2 and 3) of the LLM regarding current uveitis cases. The answers were classified on whether they were accurate and sufficient, partially accurate and sufficient or inaccurate and insufficient. Statistical analysis included descriptive analysis, normality distribution, non-parametric test and reliability tests. References were checked for their correctness in different medical databases.

Results: The data showed non-normal distribution. Data between subgroups (attempts 1, 2 and 3) was comparable (Kruskal-Wallis H test, p-value = 0.7338). There was a moderate agreement between attempt 1 and attempt 2 (Cohen's kappa, ĸ = 0.5172) as well as between attempt 2 and attempt 3 (Cohen's kappa, ĸ = 0.4913). There was a fair agreement between attempt 1 and attempt 3 (Cohen's kappa, ĸ = 0.3647). The average agreement was moderate (Cohen's kappa, ĸ = 0.4577). Between the three attempts together, there was a moderate agreement (Fleiss' kappa, ĸ = 0.4534). A total of 52 references were generated by the LLM. 22 references (42.3%) were found to be accurate and correctly cited. Another 22 references (42.3%) could not be located in any of the searched databases. The remaining 8 references (15.4%) were found to exist, but were either misinterpreted or incorrectly cited by the LLM.

Conclusion: Our results demonstrate the significant potential of LLMs in uveitis. However, their implementation requires rigorous training and comprehensive testing for specific medical tasks. We also found out that the references made by ChatGPT 4.o were in most cases incorrect. LLMs are likely to become invaluable tools in shaping the future of ophthalmology, enhancing clinical decision-making and patient care.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
3.50
自引率
4.30%
发文量
81
审稿时长
19 weeks
期刊介绍: International Journal of Retina and Vitreous focuses on the ophthalmic subspecialty of vitreoretinal disorders. The journal presents original articles on new approaches to diagnosis, outcomes of clinical trials, innovations in pharmacological therapy and surgical techniques, as well as basic science advances that impact clinical practice. Topical areas include, but are not limited to: -Imaging of the retina, choroid and vitreous -Innovations in optical coherence tomography (OCT) -Small-gauge vitrectomy, retinal detachment, chromovitrectomy -Electroretinography (ERG), microperimetry, other functional tests -Intraocular tumors -Retinal pharmacotherapy & drug delivery -Diabetic retinopathy & other vascular diseases -Age-related macular degeneration (AMD) & other macular entities
期刊最新文献
Choriocapillaris flow deficit is associated with disease duration in type 2 diabetic patients without retinopathy: a cross-sectional study. Retinal and choroidal microvascular assessment of children receiving recombinant growth hormone therapy : Study design: a prospective observational comparative study. Self-reported visual function and in-depth swept-source optical coherence tomography features of cystoid macular edema in retinitis pigmentosa. Evaluation of post-operative foveal location and microstructural changes after pars plana vitrectomy for rhegmatogenous retinal detachment using enhanced-depth imaging optical coherence tomography. Choroidal vascularity index in health and systemic diseases: a systematic review.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1