Evaluation of a BERT Natural Language Processing Model for Automating CT and MRI Triage and Protocol Selection.

IF 2.9 3区 医学 Q2 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING Canadian Association of Radiologists Journal-Journal De L Association Canadienne Des Radiologistes Pub Date : 2024-06-04 DOI:10.1177/08465371241255895
Jason Yao, Abdullah Alabousi, Oleg Mironov
{"title":"Evaluation of a BERT Natural Language Processing Model for Automating CT and MRI Triage and Protocol Selection.","authors":"Jason Yao, Abdullah Alabousi, Oleg Mironov","doi":"10.1177/08465371241255895","DOIUrl":null,"url":null,"abstract":"<p><p><b>Purpose:</b> To evaluate the accuracy of a Bidirectional Encoder Representations for Transformers (BERT) Natural Language Processing (NLP) model for automating triage and protocol selection of cross-sectional image requisitions. <b>Methods:</b> A retrospective study was completed using 222 392 CT and MRI studies from a single Canadian university hospital database (January 2018-September 2022). Three hundred unique protocols (116 CT and 184 MRI) were included. A BERT model was trained, validated, and tested using an 80%-10%-10% stratified split. Naive Bayes (NB) and Support Vector Machine (SVM) machine learning models were used as comparators. Models were assessed using F1 score, precision, recall, and area under the receiver operating characteristic curve (AUROC). The BERT model was also assessed for multi-class protocol suggestion and subgroups based on referral location, modality, and imaging section. <b>Results:</b> BERT was superior to SVM for protocol selection (F1 score: BERT-0.901 vs SVM-0.881). However, was not significantly different from SVM for triage prediction (F1 score: BERT-0.844 vs SVM-0.845). Both models outperformed NB for protocol and triage. BERT had superior performance on minority classes compared to SVM and NB. For multiclass prediction, BERT accuracy was up to 0.991 for top-5 protocol suggestion, and 0.981 for top-2 triage suggestion. Emergency department patients had the highest F1 scores for both protocol (0.957) and triage (0.986), compared to inpatients and outpatients. <b>Conclusion:</b> The BERT NLP model demonstrated strong performance in automating the triage and protocol selection of radiology studies, showing potential to enhance radiologist workflows. These findings suggest the feasibility of using advanced NLP models to streamline radiology operations.</p>","PeriodicalId":55290,"journal":{"name":"Canadian Association of Radiologists Journal-Journal De L Association Canadienne Des Radiologistes","volume":null,"pages":null},"PeriodicalIF":2.9000,"publicationDate":"2024-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Canadian Association of Radiologists Journal-Journal De L Association Canadienne Des Radiologistes","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1177/08465371241255895","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}
引用次数: 0

Abstract

Purpose: To evaluate the accuracy of a Bidirectional Encoder Representations for Transformers (BERT) Natural Language Processing (NLP) model for automating triage and protocol selection of cross-sectional image requisitions. Methods: A retrospective study was completed using 222 392 CT and MRI studies from a single Canadian university hospital database (January 2018-September 2022). Three hundred unique protocols (116 CT and 184 MRI) were included. A BERT model was trained, validated, and tested using an 80%-10%-10% stratified split. Naive Bayes (NB) and Support Vector Machine (SVM) machine learning models were used as comparators. Models were assessed using F1 score, precision, recall, and area under the receiver operating characteristic curve (AUROC). The BERT model was also assessed for multi-class protocol suggestion and subgroups based on referral location, modality, and imaging section. Results: BERT was superior to SVM for protocol selection (F1 score: BERT-0.901 vs SVM-0.881). However, was not significantly different from SVM for triage prediction (F1 score: BERT-0.844 vs SVM-0.845). Both models outperformed NB for protocol and triage. BERT had superior performance on minority classes compared to SVM and NB. For multiclass prediction, BERT accuracy was up to 0.991 for top-5 protocol suggestion, and 0.981 for top-2 triage suggestion. Emergency department patients had the highest F1 scores for both protocol (0.957) and triage (0.986), compared to inpatients and outpatients. Conclusion: The BERT NLP model demonstrated strong performance in automating the triage and protocol selection of radiology studies, showing potential to enhance radiologist workflows. These findings suggest the feasibility of using advanced NLP models to streamline radiology operations.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
评估 BERT 自然语言处理模型,实现 CT 和 MRI 分诊及方案选择自动化。
目的:评估双向变换器编码器表示法(BERT)自然语言处理(NLP)模型的准确性,以实现横断面图像申请单的自动分流和方案选择。研究方法利用加拿大一所大学医院数据库中的 222 392 项 CT 和 MRI 研究(2018 年 1 月至 2022 年 9 月)完成了一项回顾性研究。其中包括 300 份独特的协议(116 份 CT 和 184 份 MRI)。采用 80%-10%-10% 的分层方法对 BERT 模型进行了训练、验证和测试。使用 Naive Bayes (NB) 和支持向量机 (SVM) 机器学习模型进行比较。使用 F1 分数、精确度、召回率和接收者操作特征曲线下面积 (AUROC) 对模型进行评估。此外,还根据转诊地点、方式和成像部分对 BERT 模型进行了多类方案建议和分组评估。结果显示在方案选择方面,BERT 优于 SVM(F1 分数:BERT-0.901 vs SVM-0.881)。但是,在分流预测方面,BERT 与 SVM 没有明显差异(F1 分数:BERT-0.844 vs SVM-0.845)。两个模型在协议和分流方面的表现都优于 NB。与 SVM 和 NB 相比,BERT 在少数类别上的表现更为出色。在多类预测方面,BERT 对前 5 名协议建议的准确率高达 0.991,对前 2 名分诊建议的准确率高达 0.981。与住院病人和门诊病人相比,急诊科病人在协议(0.957)和分诊(0.986)方面的 F1 分数最高。结论BERT NLP 模型在放射学研究的自动分诊和方案选择方面表现出色,显示出增强放射科医生工作流程的潜力。这些研究结果表明,使用先进的 NLP 模型来简化放射科操作是可行的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
6.20
自引率
12.90%
发文量
98
审稿时长
6-12 weeks
期刊介绍: The Canadian Association of Radiologists Journal is a peer-reviewed, Medline-indexed publication that presents a broad scientific review of radiology in Canada. The Journal covers such topics as abdominal imaging, cardiovascular radiology, computed tomography, continuing professional development, education and training, gastrointestinal radiology, health policy and practice, magnetic resonance imaging, musculoskeletal radiology, neuroradiology, nuclear medicine, pediatric radiology, radiology history, radiology practice guidelines and advisories, thoracic and cardiac imaging, trauma and emergency room imaging, ultrasonography, and vascular and interventional radiology. Article types considered for publication include original research articles, critically appraised topics, review articles, guest editorials, pictorial essays, technical notes, and letter to the Editor.
期刊最新文献
Addressing Gender Disparities for Equitable Practice in Radiology. Cinematic Rendering of Pancreatic Neuroendocrine Tumours: Opportunities for Clinical Implementation: Part 1: Tumour Detection and Characterization. Cinematic Rendering of Pancreatic Neuroendocrine Tumours: Opportunities for Clinical Implementation: Part 2: Preoperative Planning and Evaluation of Metastatic Disease. Correlative Assessment of Machine Learning-Based Cobb Angle Measurements and Human-Based Measurements in Adolescent Idiopathic and Congenital Scoliosis. Driving Change: Direct Patient Access to Medical Imaging Reports and the Need for Radiologist Involvement in Decision-Making.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1