AI for fracture diagnosis in clinical practice: Four approaches to systematic AI-implementation and their impact on AI-effectiveness

IF 3.3 3区 医学 Q1 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING European Journal of Radiology Pub Date : 2025-04-14 DOI:10.1016/j.ejrad.2025.112113
Daan V. Loeffen , Frank M. Zijta , Tim A. Boymans , Joachim E. Wildberger , Estelle C. Nijssen
{"title":"AI for fracture diagnosis in clinical practice: Four approaches to systematic AI-implementation and their impact on AI-effectiveness","authors":"Daan V. Loeffen ,&nbsp;Frank M. Zijta ,&nbsp;Tim A. Boymans ,&nbsp;Joachim E. Wildberger ,&nbsp;Estelle C. Nijssen","doi":"10.1016/j.ejrad.2025.112113","DOIUrl":null,"url":null,"abstract":"<div><h3>Purpose</h3><div>Artificial Intelligence (AI) has been shown to enhance fracture-detection-accuracy, but the most effective AI-implementation in clinical practice is less well understood. In the current study, four approaches to AI-implementation are evaluated for their impact on AI-effectiveness.</div></div><div><h3>Materials and Methods</h3><div>Retrospective single-center study based on all consecutive, around-the-clock radiographic examinations for suspected fractures, and accompanying clinical-practice radiologist-diagnoses, between January and March 2023. These image-sets were independently analysed by a dedicated bone-fracture-detection-AI. Findings were combined with radiologist clinical-practice diagnoses to simulate the four AI-implementation methods deemed most relevant to clinical workflows: AI-standalone (radiologist-findings not consulted); AI-problem-solving (AI-findings consulted when radiologist in doubt); AI-triage (radiologist-findings consulted when AI in doubt); and AI-safety net (AI-findings consulted when radiologist diagnosis negative). Reference-standard diagnoses were established by two senior musculoskeletal-radiologists (by consensus in cases of disagreement). Radiologist- and radiologist + AI diagnoses were compared for false negatives (FN), false positives (FP) and their clinical consequences. Experience-level-subgroups radiologists-in-training-, non-musculoskeletal-radiologists, and dedicated musculoskeletal-radiologists were analysed separately.</div></div><div><h3>Results</h3><div>1508 image-sets were included (1227 unique patients; 40 radiologist-readers). Radiologist results were: 2.7 % FN (40/1508), 28 with clinical consequences; 1.2 % FP (18/1508), 2 received full-fracture treatments (11.1 %). All AI-implementation methods changed overall FN and FP with statistical significance (p &lt; 0.001): AI-standalone 1.5 % FN (23/1508; 11 consequences), 6.8 % FP (103/1508); AI-problem-solving 3.2 % FN (48/1508; 31 consequences), 0.6 % FP (9/1508); AI-triage 2.1 % FN (32/1508; 18 consequences), 1.7 % FP (26/1508); AI-safety net 0.07 % FN (1/1508; 1 consequence), 7.6 % FP (115/1508). Subgroups show similar trends, except AI-triage increased FN for all subgroups except radiologists-in-training.</div></div><div><h3>Conclusion</h3><div>Implementation methods have a large impact on AI-effectiveness. These results suggest AI should not be considered for problem-solving or triage at this time; AI standalone performs better than either and may be a source of assistance where radiologists are unavailable. Best results were obtained implementing AI as safety net, which eliminates missed fractures with serious clinical consequences; even though false positives are increased, unnecessary treatments are limited.</div></div>","PeriodicalId":12063,"journal":{"name":"European Journal of Radiology","volume":"187 ","pages":"Article 112113"},"PeriodicalIF":3.3000,"publicationDate":"2025-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Journal of Radiology","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0720048X25001998","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}
引用次数: 0

Abstract

Purpose

Artificial Intelligence (AI) has been shown to enhance fracture-detection-accuracy, but the most effective AI-implementation in clinical practice is less well understood. In the current study, four approaches to AI-implementation are evaluated for their impact on AI-effectiveness.

Materials and Methods

Retrospective single-center study based on all consecutive, around-the-clock radiographic examinations for suspected fractures, and accompanying clinical-practice radiologist-diagnoses, between January and March 2023. These image-sets were independently analysed by a dedicated bone-fracture-detection-AI. Findings were combined with radiologist clinical-practice diagnoses to simulate the four AI-implementation methods deemed most relevant to clinical workflows: AI-standalone (radiologist-findings not consulted); AI-problem-solving (AI-findings consulted when radiologist in doubt); AI-triage (radiologist-findings consulted when AI in doubt); and AI-safety net (AI-findings consulted when radiologist diagnosis negative). Reference-standard diagnoses were established by two senior musculoskeletal-radiologists (by consensus in cases of disagreement). Radiologist- and radiologist + AI diagnoses were compared for false negatives (FN), false positives (FP) and their clinical consequences. Experience-level-subgroups radiologists-in-training-, non-musculoskeletal-radiologists, and dedicated musculoskeletal-radiologists were analysed separately.

Results

1508 image-sets were included (1227 unique patients; 40 radiologist-readers). Radiologist results were: 2.7 % FN (40/1508), 28 with clinical consequences; 1.2 % FP (18/1508), 2 received full-fracture treatments (11.1 %). All AI-implementation methods changed overall FN and FP with statistical significance (p < 0.001): AI-standalone 1.5 % FN (23/1508; 11 consequences), 6.8 % FP (103/1508); AI-problem-solving 3.2 % FN (48/1508; 31 consequences), 0.6 % FP (9/1508); AI-triage 2.1 % FN (32/1508; 18 consequences), 1.7 % FP (26/1508); AI-safety net 0.07 % FN (1/1508; 1 consequence), 7.6 % FP (115/1508). Subgroups show similar trends, except AI-triage increased FN for all subgroups except radiologists-in-training.

Conclusion

Implementation methods have a large impact on AI-effectiveness. These results suggest AI should not be considered for problem-solving or triage at this time; AI standalone performs better than either and may be a source of assistance where radiologists are unavailable. Best results were obtained implementing AI as safety net, which eliminates missed fractures with serious clinical consequences; even though false positives are increased, unnecessary treatments are limited.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
临床实践中骨折诊断的人工智能:系统实施人工智能的四种方法及其对人工智能有效性的影响
人工智能(AI)已被证明可以提高骨折检测的准确性,但在临床实践中最有效的人工智能应用尚不清楚。在目前的研究中,评估了四种人工智能实施方法对人工智能有效性的影响。材料和方法回顾性单中心研究,基于2023年1月至3月期间所有疑似骨折的连续24小时放射学检查,以及伴随的临床实践放射学诊断。这些图像集由专用的骨折检测人工智能独立分析。研究结果与放射科医生的临床实践诊断相结合,以模拟与临床工作流程最相关的四种人工智能实施方法:人工智能独立(不咨询放射科医生的发现);人工智能解决问题(当放射科医生有疑问时,咨询人工智能的发现);人工智能分类(当对人工智能有疑问时咨询放射科医生的检查结果);人工智能安全网(当放射科医生诊断为阴性时咨询人工智能结果)。参考标准诊断是由两名高级肌肉骨骼放射科医生建立的(在有分歧的情况下达成共识)。比较放射科医生和放射科医生+人工智能诊断的假阴性(FN)、假阳性(FP)及其临床后果。经验水平分组放射医师——培训中的放射医师、非肌肉骨骼放射医师和专门的肌肉骨骼放射医师分别进行了分析。结果共纳入1508组图像集(1227例独特患者;40 radiologist-readers)。放射科结果为:2.7% FN(40/1508), 28例有临床后果;1.2% FP(18/1508), 2例全骨折治疗(11.1%)。所有人工智能实施方法对FN和FP的总体改变均有统计学意义(p <;0.001): AI-standalone 1.5% FN (23/1508;11个结果),6.8% FP (103/1508);ai问题解决3.2% FN (48/1508;31个结果),0.6% FP (9/1508);人工智能分类2.1% FN (32/1508;18个结果),1.7% FP (26/1508);ai -安全网0.07% FN (1/1508;1个结果),7.6% FP(115/1508)。除了人工智能分类增加了除在职放射科医生外所有亚组的FN外,亚组显示出类似的趋势。结论实施方式对人工智能的有效性影响较大。这些结果表明,目前不应将人工智能用于解决问题或分类;独立的人工智能比两者都表现得更好,并且可能是放射科医生无法获得帮助的来源。采用人工智能作为安全网可获得最佳效果,消除了遗漏骨折造成的严重临床后果;尽管假阳性增加,但不必要的治疗是有限的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
6.70
自引率
3.00%
发文量
398
审稿时长
42 days
期刊介绍: European Journal of Radiology is an international journal which aims to communicate to its readers, state-of-the-art information on imaging developments in the form of high quality original research articles and timely reviews on current developments in the field. Its audience includes clinicians at all levels of training including radiology trainees, newly qualified imaging specialists and the experienced radiologist. Its aim is to inform efficient, appropriate and evidence-based imaging practice to the benefit of patients worldwide.
期刊最新文献
Role of magnetic resonance imaging in evaluation of urogenital sinus malformations Elevated 18F-FDG uptake in non-metastatic lymph nodes of POLE-mutated endometrial cancer on PET/CT Changes in the cerebral glymphatic system after rehabilitation of COVID-19 patients: A five-year follow-up study Diagnostic value of ultrasound in different positions for detecting nutcracker syndrome during physical examination of flying cadets Visualisation of lenticulostriate arteries using photon counting detector CT angiography: A retrospective cohort study
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1