AI for fracture diagnosis in clinical practice: Four approaches to systematic AI-implementation and their impact on AI-effectiveness

IF 3.3 3区医学 Q1 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING European Journal of Radiology Pub Date : 2025-04-14 DOI:10.1016/j.ejrad.2025.112113

Daan V. Loeffen , Frank M. Zijta , Tim A. Boymans , Joachim E. Wildberger , Estelle C. Nijssen

{"title":"AI for fracture diagnosis in clinical practice: Four approaches to systematic AI-implementation and their impact on AI-effectiveness","authors":"Daan V. Loeffen , Frank M. Zijta , Tim A. Boymans , Joachim E. Wildberger , Estelle C. Nijssen","doi":"10.1016/j.ejrad.2025.112113","DOIUrl":null,"url":null,"abstract":"<div><h3>Purpose</h3><div>Artificial Intelligence (AI) has been shown to enhance fracture-detection-accuracy, but the most effective AI-implementation in clinical practice is less well understood. In the current study, four approaches to AI-implementation are evaluated for their impact on AI-effectiveness.</div></div><div><h3>Materials and Methods</h3><div>Retrospective single-center study based on all consecutive, around-the-clock radiographic examinations for suspected fractures, and accompanying clinical-practice radiologist-diagnoses, between January and March 2023. These image-sets were independently analysed by a dedicated bone-fracture-detection-AI. Findings were combined with radiologist clinical-practice diagnoses to simulate the four AI-implementation methods deemed most relevant to clinical workflows: AI-standalone (radiologist-findings not consulted); AI-problem-solving (AI-findings consulted when radiologist in doubt); AI-triage (radiologist-findings consulted when AI in doubt); and AI-safety net (AI-findings consulted when radiologist diagnosis negative). Reference-standard diagnoses were established by two senior musculoskeletal-radiologists (by consensus in cases of disagreement). Radiologist- and radiologist + AI diagnoses were compared for false negatives (FN), false positives (FP) and their clinical consequences. Experience-level-subgroups radiologists-in-training-, non-musculoskeletal-radiologists, and dedicated musculoskeletal-radiologists were analysed separately.</div></div><div><h3>Results</h3><div>1508 image-sets were included (1227 unique patients; 40 radiologist-readers). Radiologist results were: 2.7 % FN (40/1508), 28 with clinical consequences; 1.2 % FP (18/1508), 2 received full-fracture treatments (11.1 %). All AI-implementation methods changed overall FN and FP with statistical significance (p < 0.001): AI-standalone 1.5 % FN (23/1508; 11 consequences), 6.8 % FP (103/1508); AI-problem-solving 3.2 % FN (48/1508; 31 consequences), 0.6 % FP (9/1508); AI-triage 2.1 % FN (32/1508; 18 consequences), 1.7 % FP (26/1508); AI-safety net 0.07 % FN (1/1508; 1 consequence), 7.6 % FP (115/1508). Subgroups show similar trends, except AI-triage increased FN for all subgroups except radiologists-in-training.</div></div><div><h3>Conclusion</h3><div>Implementation methods have a large impact on AI-effectiveness. These results suggest AI should not be considered for problem-solving or triage at this time; AI standalone performs better than either and may be a source of assistance where radiologists are unavailable. Best results were obtained implementing AI as safety net, which eliminates missed fractures with serious clinical consequences; even though false positives are increased, unnecessary treatments are limited.</div></div>","PeriodicalId":12063,"journal":{"name":"European Journal of Radiology","volume":"187 ","pages":"Article 112113"},"PeriodicalIF":3.3000,"publicationDate":"2025-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Journal of Radiology","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0720048X25001998","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}

引用次数: 0

Abstract

Purpose

Artificial Intelligence (AI) has been shown to enhance fracture-detection-accuracy, but the most effective AI-implementation in clinical practice is less well understood. In the current study, four approaches to AI-implementation are evaluated for their impact on AI-effectiveness.

Materials and Methods

Retrospective single-center study based on all consecutive, around-the-clock radiographic examinations for suspected fractures, and accompanying clinical-practice radiologist-diagnoses, between January and March 2023. These image-sets were independently analysed by a dedicated bone-fracture-detection-AI. Findings were combined with radiologist clinical-practice diagnoses to simulate the four AI-implementation methods deemed most relevant to clinical workflows: AI-standalone (radiologist-findings not consulted); AI-problem-solving (AI-findings consulted when radiologist in doubt); AI-triage (radiologist-findings consulted when AI in doubt); and AI-safety net (AI-findings consulted when radiologist diagnosis negative). Reference-standard diagnoses were established by two senior musculoskeletal-radiologists (by consensus in cases of disagreement). Radiologist- and radiologist + AI diagnoses were compared for false negatives (FN), false positives (FP) and their clinical consequences. Experience-level-subgroups radiologists-in-training-, non-musculoskeletal-radiologists, and dedicated musculoskeletal-radiologists were analysed separately.

Results

1508 image-sets were included (1227 unique patients; 40 radiologist-readers). Radiologist results were: 2.7 % FN (40/1508), 28 with clinical consequences; 1.2 % FP (18/1508), 2 received full-fracture treatments (11.1 %). All AI-implementation methods changed overall FN and FP with statistical significance (p < 0.001): AI-standalone 1.5 % FN (23/1508; 11 consequences), 6.8 % FP (103/1508); AI-problem-solving 3.2 % FN (48/1508; 31 consequences), 0.6 % FP (9/1508); AI-triage 2.1 % FN (32/1508; 18 consequences), 1.7 % FP (26/1508); AI-safety net 0.07 % FN (1/1508; 1 consequence), 7.6 % FP (115/1508). Subgroups show similar trends, except AI-triage increased FN for all subgroups except radiologists-in-training.

Conclusion

Implementation methods have a large impact on AI-effectiveness. These results suggest AI should not be considered for problem-solving or triage at this time; AI standalone performs better than either and may be a source of assistance where radiologists are unavailable. Best results were obtained implementing AI as safety net, which eliminates missed fractures with serious clinical consequences; even though false positives are increased, unnecessary treatments are limited.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

临床实践中骨折诊断的人工智能：系统实施人工智能的四种方法及其对人工智能有效性的影响

人工智能（AI）已被证明可以提高骨折检测的准确性，但在临床实践中最有效的人工智能应用尚不清楚。在目前的研究中，评估了四种人工智能实施方法对人工智能有效性的影响。材料和方法回顾性单中心研究，基于2023年1月至3月期间所有疑似骨折的连续24小时放射学检查，以及伴随的临床实践放射学诊断。这些图像集由专用的骨折检测人工智能独立分析。研究结果与放射科医生的临床实践诊断相结合，以模拟与临床工作流程最相关的四种人工智能实施方法：人工智能独立（不咨询放射科医生的发现）；人工智能解决问题（当放射科医生有疑问时，咨询人工智能的发现）；人工智能分类（当对人工智能有疑问时咨询放射科医生的检查结果）；人工智能安全网（当放射科医生诊断为阴性时咨询人工智能结果）。参考标准诊断是由两名高级肌肉骨骼放射科医生建立的（在有分歧的情况下达成共识）。比较放射科医生和放射科医生+人工智能诊断的假阴性（FN）、假阳性（FP）及其临床后果。经验水平分组放射医师——培训中的放射医师、非肌肉骨骼放射医师和专门的肌肉骨骼放射医师分别进行了分析。结果共纳入1508组图像集(1227例独特患者；40 radiologist-readers)。放射科结果为：2.7% FN(40/1508)， 28例有临床后果；1.2% FP(18/1508)， 2例全骨折治疗（11.1%）。所有人工智能实施方法对FN和FP的总体改变均有统计学意义(p <；0.001): AI-standalone 1.5% FN (23/1508；11个结果),6.8% FP (103/1508)；ai问题解决3.2% FN (48/1508；31个结果),0.6% FP (9/1508)；人工智能分类2.1% FN (32/1508；18个结果),1.7% FP (26/1508)；ai -安全网0.07% FN (1/1508；1个结果),7.6% FP（115/1508）。除了人工智能分类增加了除在职放射科医生外所有亚组的FN外，亚组显示出类似的趋势。结论实施方式对人工智能的有效性影响较大。这些结果表明，目前不应将人工智能用于解决问题或分类；独立的人工智能比两者都表现得更好，并且可能是放射科医生无法获得帮助的来源。采用人工智能作为安全网可获得最佳效果，消除了遗漏骨折造成的严重临床后果；尽管假阳性增加，但不必要的治疗是有限的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

European Journal of Radiology 医学-核医学

CiteScore

6.70

自引率

3.00%

发文量

398

审稿时长

42 days

期刊介绍： European Journal of Radiology is an international journal which aims to communicate to its readers, state-of-the-art information on imaging developments in the form of high quality original research articles and timely reviews on current developments in the field. Its audience includes clinicians at all levels of training including radiology trainees, newly qualified imaging specialists and the experienced radiologist. Its aim is to inform efficient, appropriate and evidence-based imaging practice to the benefit of patients worldwide.