评估从缺血性中风患者机械血栓切除术非结构化报告中提取数据的本地开源大型语言模型。

IF 4.5 1区 医学 Q1 NEUROIMAGING Journal of NeuroInterventional Surgery Pub Date : 2025-01-26 DOI:10.1136/jnis-2024-022078
Aymen Meddeb, Philipe Ebert, Keno Kyrill Bressem, Dmitriy Desser, Andrea Dell'Orco, Georg Bohner, Justus F Kleine, Eberhard Siebert, Nils Grauhan, Marc A Brockmann, Ahmed Othman, Michael Scheel, Jawed Nawabi
{"title":"评估从缺血性中风患者机械血栓切除术非结构化报告中提取数据的本地开源大型语言模型。","authors":"Aymen Meddeb, Philipe Ebert, Keno Kyrill Bressem, Dmitriy Desser, Andrea Dell'Orco, Georg Bohner, Justus F Kleine, Eberhard Siebert, Nils Grauhan, Marc A Brockmann, Ahmed Othman, Michael Scheel, Jawed Nawabi","doi":"10.1136/jnis-2024-022078","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>A study was undertaken to assess the effectiveness of open-source large language models (LLMs) in extracting clinical data from unstructured mechanical thrombectomy reports in patients with ischemic stroke caused by a vessel occlusion.</p><p><strong>Methods: </strong>We deployed local open-source LLMs to extract data points from free-text procedural reports in patients who underwent mechanical thrombectomy between September 2020 and June 2023 in our institution. The external dataset was obtained from a second university hospital and comprised consecutive cases treated between September 2023 and March 2024. Ground truth labeling was facilitated by a human-in-the-loop (HITL) approach, with time metrics recorded for both automated and manual data extractions. We tested three models-Mixtral, Qwen, and BioMistral-assessing their performance on precision, recall, and F1 score across 15 clinical categories such as National Institute of Health Stroke Scale (NIHSS) scores, occluded vessels, and medication details.</p><p><strong>Results: </strong>The study included 1000 consecutive reports from our primary institution and 50 reports from a secondary institution. Mixtral showed the highest precision, achieving 0.99 for first series time extraction and 0.69 for occluded vessel identification within the internal dataset. In the external dataset, precision ranged from 1.00 for NIHSS scores to 0.70 for occluded vessels. Qwen showed moderate precision with a high of 0.85 for NIHSS scores and a low of 0.28 for occluded vessels. BioMistral had the broadest range of precision, from 0.81 for first series times to 0.14 for medication details. The HITL approach yielded an average time savings of 65.6% per case, with variations from 45.95% to 79.56%.</p><p><strong>Conclusion: </strong>This study highlights the potential of using LLMs for automated clinical data extraction from medical reports. Incorporating HITL annotations enhances precision and also ensures the reliability of the extracted data. This methodology presents a scalable privacy-preserving option that can significantly support clinical documentation and research endeavors.</p>","PeriodicalId":16411,"journal":{"name":"Journal of NeuroInterventional Surgery","volume":" ","pages":""},"PeriodicalIF":4.5000,"publicationDate":"2025-01-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Evaluating local open-source large language models for data extraction from unstructured reports on mechanical thrombectomy in patients with ischemic stroke.\",\"authors\":\"Aymen Meddeb, Philipe Ebert, Keno Kyrill Bressem, Dmitriy Desser, Andrea Dell'Orco, Georg Bohner, Justus F Kleine, Eberhard Siebert, Nils Grauhan, Marc A Brockmann, Ahmed Othman, Michael Scheel, Jawed Nawabi\",\"doi\":\"10.1136/jnis-2024-022078\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>A study was undertaken to assess the effectiveness of open-source large language models (LLMs) in extracting clinical data from unstructured mechanical thrombectomy reports in patients with ischemic stroke caused by a vessel occlusion.</p><p><strong>Methods: </strong>We deployed local open-source LLMs to extract data points from free-text procedural reports in patients who underwent mechanical thrombectomy between September 2020 and June 2023 in our institution. The external dataset was obtained from a second university hospital and comprised consecutive cases treated between September 2023 and March 2024. Ground truth labeling was facilitated by a human-in-the-loop (HITL) approach, with time metrics recorded for both automated and manual data extractions. We tested three models-Mixtral, Qwen, and BioMistral-assessing their performance on precision, recall, and F1 score across 15 clinical categories such as National Institute of Health Stroke Scale (NIHSS) scores, occluded vessels, and medication details.</p><p><strong>Results: </strong>The study included 1000 consecutive reports from our primary institution and 50 reports from a secondary institution. Mixtral showed the highest precision, achieving 0.99 for first series time extraction and 0.69 for occluded vessel identification within the internal dataset. In the external dataset, precision ranged from 1.00 for NIHSS scores to 0.70 for occluded vessels. Qwen showed moderate precision with a high of 0.85 for NIHSS scores and a low of 0.28 for occluded vessels. BioMistral had the broadest range of precision, from 0.81 for first series times to 0.14 for medication details. The HITL approach yielded an average time savings of 65.6% per case, with variations from 45.95% to 79.56%.</p><p><strong>Conclusion: </strong>This study highlights the potential of using LLMs for automated clinical data extraction from medical reports. Incorporating HITL annotations enhances precision and also ensures the reliability of the extracted data. This methodology presents a scalable privacy-preserving option that can significantly support clinical documentation and research endeavors.</p>\",\"PeriodicalId\":16411,\"journal\":{\"name\":\"Journal of NeuroInterventional Surgery\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":4.5000,\"publicationDate\":\"2025-01-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of NeuroInterventional Surgery\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1136/jnis-2024-022078\",\"RegionNum\":1,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"NEUROIMAGING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of NeuroInterventional Surgery","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1136/jnis-2024-022078","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"NEUROIMAGING","Score":null,"Total":0}
引用次数: 0

摘要

研究背景本研究旨在评估开源大型语言模型(LLMs)从非结构化机械血栓切除术报告中提取血管闭塞所致缺血性卒中患者临床数据的有效性:我们部署了本地开源 LLMs,从本机构 2020 年 9 月至 2023 年 6 月期间接受机械血栓切除术的患者的自由文本程序报告中提取数据点。外部数据集来自第二所大学医院,包括 2023 年 9 月至 2024 年 3 月间接受治疗的连续病例。通过人在环路(HITL)方法促进了地面实况标注,并记录了自动和手动数据提取的时间指标。我们测试了三种模型--Mixtral、Qwen 和 BioMistral--评估了它们在精确度、召回率和 F1 分数上的表现,涉及 15 个临床类别,如美国国立卫生研究院卒中量表(NIHSS)评分、闭塞血管和用药详情:研究包括来自我们主要机构的 1000 份连续报告和来自二级机构的 50 份报告。Mixtral 的精确度最高,在内部数据集中,第一序列时间提取的精确度为 0.99,闭塞血管识别的精确度为 0.69。在外部数据集中,NIHSS 评分的精确度为 1.00,闭塞血管的精确度为 0.70。Qwen 的精确度适中,NIHSS 评分的精确度最高为 0.85,闭塞血管的精确度最低为 0.28。BioMistral 的精确度范围最广,从首次序列时间的 0.81 到药物细节的 0.14。HITL 方法每个病例平均节省 65.6% 的时间,变化范围从 45.95% 到 79.56%:本研究强调了使用 LLM 从医疗报告中自动提取临床数据的潜力。纳入 HITL 注释可提高精确度,还能确保提取数据的可靠性。该方法提供了一种可扩展的隐私保护方案,可极大地支持临床文档和研究工作。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Evaluating local open-source large language models for data extraction from unstructured reports on mechanical thrombectomy in patients with ischemic stroke.

Background: A study was undertaken to assess the effectiveness of open-source large language models (LLMs) in extracting clinical data from unstructured mechanical thrombectomy reports in patients with ischemic stroke caused by a vessel occlusion.

Methods: We deployed local open-source LLMs to extract data points from free-text procedural reports in patients who underwent mechanical thrombectomy between September 2020 and June 2023 in our institution. The external dataset was obtained from a second university hospital and comprised consecutive cases treated between September 2023 and March 2024. Ground truth labeling was facilitated by a human-in-the-loop (HITL) approach, with time metrics recorded for both automated and manual data extractions. We tested three models-Mixtral, Qwen, and BioMistral-assessing their performance on precision, recall, and F1 score across 15 clinical categories such as National Institute of Health Stroke Scale (NIHSS) scores, occluded vessels, and medication details.

Results: The study included 1000 consecutive reports from our primary institution and 50 reports from a secondary institution. Mixtral showed the highest precision, achieving 0.99 for first series time extraction and 0.69 for occluded vessel identification within the internal dataset. In the external dataset, precision ranged from 1.00 for NIHSS scores to 0.70 for occluded vessels. Qwen showed moderate precision with a high of 0.85 for NIHSS scores and a low of 0.28 for occluded vessels. BioMistral had the broadest range of precision, from 0.81 for first series times to 0.14 for medication details. The HITL approach yielded an average time savings of 65.6% per case, with variations from 45.95% to 79.56%.

Conclusion: This study highlights the potential of using LLMs for automated clinical data extraction from medical reports. Incorporating HITL annotations enhances precision and also ensures the reliability of the extracted data. This methodology presents a scalable privacy-preserving option that can significantly support clinical documentation and research endeavors.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
9.50
自引率
14.60%
发文量
291
审稿时长
4-8 weeks
期刊介绍: The Journal of NeuroInterventional Surgery (JNIS) is a leading peer review journal for scientific research and literature pertaining to the field of neurointerventional surgery. The journal launch follows growing professional interest in neurointerventional techniques for the treatment of a range of neurological and vascular problems including stroke, aneurysms, brain tumors, and spinal compression.The journal is owned by SNIS and is also the official journal of the Interventional Chapter of the Australian and New Zealand Society of Neuroradiology (ANZSNR), the Canadian Interventional Neuro Group, the Hong Kong Neurological Society (HKNS) and the Neuroradiological Society of Taiwan.
期刊最新文献
Factors affecting visual outcomes after dural venous sinus stenting in idiopathic intracranial hypertension. Partial transvenous embolization as an intermediate strategy for large brain arteriovenous malformations: A technical note. Predicting symptomatic intracranial hemorrhage after endovascular treatment of vertebrobasilar artery occlusion: PEACE score. Pre-operative internal maxillary artery embolization for temporomandibular joint replacement: safety and outcomes from a 10-year institutional experience. SCUBA evacuation within 24 hours of basal ganglia hemorrhage results in promising functional outcomes.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1