Artificial intelligence and pelvic fracture diagnosis on X-rays: a preliminary study on performance, workflow integration and radiologists' feedback assessment in a spoke emergency hospital

IF 1.8 Q3 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING European Journal of Radiology Open Pub Date : 2023-07-06 DOI:10.1016/j.ejro.2023.100504
Francesca Rosa , Duccio Buccicardi , Adolfo Romano , Fabio Borda , Maria Chiara D’Auria , Alessandro Gastaldo
{"title":"Artificial intelligence and pelvic fracture diagnosis on X-rays: a preliminary study on performance, workflow integration and radiologists' feedback assessment in a spoke emergency hospital","authors":"Francesca Rosa ,&nbsp;Duccio Buccicardi ,&nbsp;Adolfo Romano ,&nbsp;Fabio Borda ,&nbsp;Maria Chiara D’Auria ,&nbsp;Alessandro Gastaldo","doi":"10.1016/j.ejro.2023.100504","DOIUrl":null,"url":null,"abstract":"<div><h3>Purpose</h3><p>The aim of our study is to evaluate artificial intelligence (AI) support in pelvic fracture diagnosis on X-rays, focusing on performance, workflow integration and radiologists’ feedback in a spoke emergency hospital.</p></div><div><h3>Materials and methods</h3><p>Between August and November 2021, a total of 235 sites of fracture or suspected fracture were evaluated and enrolled in the prospective study. Radiologist’s specificity, sensibility accuracy, positive and negative predictive values were compared to AI. Cohen's kappa was used to calculate the agreement between AI and radiologist. We also reviewed the AI workflow integration process, focusing on potential issues and assessed radiologists’ opinion on AI via a survey.</p></div><div><h3>Results</h3><p>The radiologist performance in accuracy, sensitivity and specificity was better than AI but McNemar test demonstrated no statistically significant difference between AI and radiologist’s performance (<em>p</em> = 0.32). Calculated Cohen’s K of 0.64.</p></div><div><h3>Conclusion</h3><p>Contrary to expectations, our preliminary results did not prove a real improvement of patient outcome nor in reporting time but demonstrated AI high NPV (94,62%) and non-inferiority to radiologist performance. Moreover, the commercially available AI algorithm used in our study automatically learn from data and so we expect a progressive performance improvement. AI could be considered as a promising tool to rule-out fractures (especially when used as a “second reader”) and to prioritize positive cases, especially in increasing workload scenarios (ED, nightshifts) but further research is needed to evaluate the real impact on the clinical practice.</p></div>","PeriodicalId":38076,"journal":{"name":"European Journal of Radiology Open","volume":null,"pages":null},"PeriodicalIF":1.8000,"publicationDate":"2023-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10359726/pdf/","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Journal of Radiology Open","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352047723000308","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}
引用次数: 1

Abstract

Purpose

The aim of our study is to evaluate artificial intelligence (AI) support in pelvic fracture diagnosis on X-rays, focusing on performance, workflow integration and radiologists’ feedback in a spoke emergency hospital.

Materials and methods

Between August and November 2021, a total of 235 sites of fracture or suspected fracture were evaluated and enrolled in the prospective study. Radiologist’s specificity, sensibility accuracy, positive and negative predictive values were compared to AI. Cohen's kappa was used to calculate the agreement between AI and radiologist. We also reviewed the AI workflow integration process, focusing on potential issues and assessed radiologists’ opinion on AI via a survey.

Results

The radiologist performance in accuracy, sensitivity and specificity was better than AI but McNemar test demonstrated no statistically significant difference between AI and radiologist’s performance (p = 0.32). Calculated Cohen’s K of 0.64.

Conclusion

Contrary to expectations, our preliminary results did not prove a real improvement of patient outcome nor in reporting time but demonstrated AI high NPV (94,62%) and non-inferiority to radiologist performance. Moreover, the commercially available AI algorithm used in our study automatically learn from data and so we expect a progressive performance improvement. AI could be considered as a promising tool to rule-out fractures (especially when used as a “second reader”) and to prioritize positive cases, especially in increasing workload scenarios (ED, nightshifts) but further research is needed to evaluate the real impact on the clinical practice.

Abstract Image

Abstract Image

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
人工智能与骨盆骨折X光诊断——辐射急救医院绩效、工作流程集成和放射科医生反馈评估的初步研究
目的本研究的目的是评估人工智能(AI)在X光骨盆骨折诊断中的支持,重点关注辐射急救医院的绩效、工作流程集成和放射科医生的反馈。材料和方法在2021年8月至11月期间,共评估了235个骨折或疑似骨折部位,并将其纳入前瞻性研究。将放射科医生的特异性、敏感性、准确性、阳性和阴性预测值与AI进行比较。Cohen’s kappa用于计算AI与放射科医生之间的一致性。我们还回顾了人工智能工作流程集成过程,重点关注潜在问题,并通过调查评估了放射科医生对人工智能的看法。结果放射科医生在准确性、敏感性和特异性方面的表现优于AI,但McNemar检验显示AI和放射科医生的表现之间没有统计学上的显著差异(p=0.32)。计算Cohen’s K为0.64。结论与预期相反,我们的初步结果并没有证明患者的预后和报告时间有真正的改善,但证明了AI高NPV(94,62%),并且与放射科医生的表现相比没有劣势。此外,我们研究中使用的商用人工智能算法会自动从数据中学习,因此我们预计性能会逐步提高。人工智能可以被认为是一种很有前途的工具,可以排除骨折(尤其是当用作“第二读者”时),并优先考虑阳性病例,特别是在工作量增加的情况下(ED、夜班),但还需要进一步的研究来评估对临床实践的真正影响。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
European Journal of Radiology Open
European Journal of Radiology Open Medicine-Radiology, Nuclear Medicine and Imaging
CiteScore
4.10
自引率
5.00%
发文量
55
审稿时长
51 days
期刊最新文献
Deep learning model for diagnosis of thyroid nodules with size less than 1 cm: A multicenter, retrospective study MRI-based radiomics machine learning model to differentiate non-clear cell renal cell carcinoma from benign renal tumors Post-deployment performance of a deep learning algorithm for normal and abnormal chest X-ray classification: A study at visa screening centers in the United Arab Emirates Study on the classification of benign and malignant breast lesions using a multi-sequence breast MRI fusion radiomics and deep learning model True cost estimation of common imaging procedures for cost-effectiveness analysis - insights from a Singapore hospital emergency department
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1