以 CT 为参考标准,比较人工智能解决方案和放射科医生对成人骨盆、髋部和四肢骨折放射影像的检测结果。

IF 4.9 2区 医学 Q1 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING Diagnostic and Interventional Imaging Pub Date : 2024-09-18 DOI:10.1016/j.diii.2024.09.004
Maxime Pastor, Djamel Dabli, Raphaël Lonjon, Chris Serrand, Fehmi Snene, Fayssal Trad, Fabien de Oliveira, Jean-Paul Beregi, Joël Greffier
{"title":"以 CT 为参考标准,比较人工智能解决方案和放射科医生对成人骨盆、髋部和四肢骨折放射影像的检测结果。","authors":"Maxime Pastor, Djamel Dabli, Raphaël Lonjon, Chris Serrand, Fehmi Snene, Fayssal Trad, Fabien de Oliveira, Jean-Paul Beregi, Joël Greffier","doi":"10.1016/j.diii.2024.09.004","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>The purpose of this study was to compare the diagnostic performance of an artificial intelligence (AI) solution for the detection of fractures of pelvic, proximal femur or extremity fractures in adults with radiologist interpretation of radiographs, using standard dose CT examination as the standard of reference.</p><p><strong>Materials and methods: </strong>This retrospective study included 94 adult patients with suspected bone fractures who underwent a standard dose CT examination and radiographs of the pelvis and/or hip and extremities at our institution between January 2022 and August 2023. For all patients, an AI solution was used retrospectively on the radiographs to detect and localize bone fractures of the pelvis and/or hip and extremities. Results of the AI solution were compared to the reading of each radiograph by a radiologist using McNemar test. The results of standard dose CT examination as interpreted by a senior radiologist were used as the standard of reference.</p><p><strong>Result: </strong>A total of 94 patients (63 women; mean age, 56.4 ± 22.5 [standard deviation] years) were included. Forty-seven patients had at least one fracture, and a total of 71 fractures were deemed present using the standard of reference (25 hand/wrist, 16 pelvis, 30 foot/ankle). Using the standard of reference, the analysis of radiographs by the AI solution resulted in 58 true positive, 13 false negative, 33 true negative and 15 false positive findings, yielding 82 % sensitivity (58/71; 95 % confidence interval [CI]: 71-89 %), 69 % specificity (33/48; 95 % CI: 55-80 %), and 76 % accuracy (91/119; 95 % CI: 69-84 %). Using the standard of reference, the reading of the radiologist resulted in 65 true positive, 6 false negative, 42 true negative and 6 false positive findings, yielding 92 % sensitivity (65/71; 95 % CI: 82-96 %), 88 % specificity (42/48; 95 % CI: 75-94 %), and 90 % accuracy (107/119; 95 % CI: 85-95 %). The radiologist outperformed the AI solution in terms of sensitivity (P = 0.045), specificity (P = 0.016), and accuracy (P < 0.001).</p><p><strong>Conclusion: </strong>In this study, the radiologist outperformed the AI solution for the diagnosis of pelvic, hip and extremity fractures of the using radiographs. This raises the question of whether a strong standard of reference for evaluating AI solutions should be used in future studies comparing AI and human reading in fracture detection using radiographs.</p>","PeriodicalId":48656,"journal":{"name":"Diagnostic and Interventional Imaging","volume":null,"pages":null},"PeriodicalIF":4.9000,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Comparison between artificial intelligence solution and radiologist for the detection of pelvic, hip and extremity fractures on radiographs in adult using CT as standard of reference.\",\"authors\":\"Maxime Pastor, Djamel Dabli, Raphaël Lonjon, Chris Serrand, Fehmi Snene, Fayssal Trad, Fabien de Oliveira, Jean-Paul Beregi, Joël Greffier\",\"doi\":\"10.1016/j.diii.2024.09.004\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Purpose: </strong>The purpose of this study was to compare the diagnostic performance of an artificial intelligence (AI) solution for the detection of fractures of pelvic, proximal femur or extremity fractures in adults with radiologist interpretation of radiographs, using standard dose CT examination as the standard of reference.</p><p><strong>Materials and methods: </strong>This retrospective study included 94 adult patients with suspected bone fractures who underwent a standard dose CT examination and radiographs of the pelvis and/or hip and extremities at our institution between January 2022 and August 2023. For all patients, an AI solution was used retrospectively on the radiographs to detect and localize bone fractures of the pelvis and/or hip and extremities. Results of the AI solution were compared to the reading of each radiograph by a radiologist using McNemar test. The results of standard dose CT examination as interpreted by a senior radiologist were used as the standard of reference.</p><p><strong>Result: </strong>A total of 94 patients (63 women; mean age, 56.4 ± 22.5 [standard deviation] years) were included. Forty-seven patients had at least one fracture, and a total of 71 fractures were deemed present using the standard of reference (25 hand/wrist, 16 pelvis, 30 foot/ankle). Using the standard of reference, the analysis of radiographs by the AI solution resulted in 58 true positive, 13 false negative, 33 true negative and 15 false positive findings, yielding 82 % sensitivity (58/71; 95 % confidence interval [CI]: 71-89 %), 69 % specificity (33/48; 95 % CI: 55-80 %), and 76 % accuracy (91/119; 95 % CI: 69-84 %). Using the standard of reference, the reading of the radiologist resulted in 65 true positive, 6 false negative, 42 true negative and 6 false positive findings, yielding 92 % sensitivity (65/71; 95 % CI: 82-96 %), 88 % specificity (42/48; 95 % CI: 75-94 %), and 90 % accuracy (107/119; 95 % CI: 85-95 %). The radiologist outperformed the AI solution in terms of sensitivity (P = 0.045), specificity (P = 0.016), and accuracy (P < 0.001).</p><p><strong>Conclusion: </strong>In this study, the radiologist outperformed the AI solution for the diagnosis of pelvic, hip and extremity fractures of the using radiographs. This raises the question of whether a strong standard of reference for evaluating AI solutions should be used in future studies comparing AI and human reading in fracture detection using radiographs.</p>\",\"PeriodicalId\":48656,\"journal\":{\"name\":\"Diagnostic and Interventional Imaging\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":4.9000,\"publicationDate\":\"2024-09-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Diagnostic and Interventional Imaging\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1016/j.diii.2024.09.004\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Diagnostic and Interventional Imaging","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.diii.2024.09.004","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}
引用次数: 0

摘要

目的:本研究的目的是以标准剂量CT检查为参考标准,比较人工智能(AI)解决方案在检测成人骨盆、股骨近端或四肢骨折方面的诊断性能与放射科医生对X光片的判读:这项回顾性研究纳入了2022年1月至2023年8月期间在我院接受标准剂量CT检查和骨盆和/或髋关节及四肢X光片检查的94例疑似骨折成人患者。对所有患者的射线照片都采用了人工智能解决方案,以检测和定位骨盆和/或髋关节及四肢骨折。使用 McNemar 检验将人工智能解决方案的结果与放射科医生对每张射线照片的判读结果进行比较。由资深放射科医生解读的标准剂量 CT 检查结果作为参考标准:共纳入 94 名患者(63 名女性;平均年龄为 56.4 ± 22.5 [标准差]岁)。47 名患者至少有一处骨折,根据参考标准,共有 71 处骨折被认定为存在(25 处手部/腕部骨折、16 处骨盆骨折、30 处足部/踝部骨折)。使用参考标准,通过人工智能解决方案对射线照片进行分析,得出 58 个真阳性、13 个假阴性、33 个真阴性和 15 个假阳性结果,灵敏度为 82%(58/71;95% 置信区间 [CI]:71-89%),特异度为 69%(33/48;95% CI:55-80%),准确度为 76%(91/119;95% CI:69-84%)。使用参考标准时,放射科医生的读片结果为 65 个真阳性、6 个假阴性、42 个真阴性和 6 个假阳性,灵敏度为 92%(65/71;95% CI:82-96%),特异度为 88%(42/48;95% CI:75-94%),准确度为 90%(107/119;95% CI:85-95%)。在灵敏度(P = 0.045)、特异性(P = 0.016)和准确性(P < 0.001)方面,放射科医生的表现优于人工智能解决方案:结论:在本研究中,放射科医生在使用放射影像诊断骨盆、髋部和四肢骨折方面的表现优于人工智能解决方案。这就提出了一个问题,即在今后比较人工智能和人类读片在使用射线照片检测骨折方面的效果时,是否应该使用一个强有力的参考标准来评估人工智能解决方案。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Comparison between artificial intelligence solution and radiologist for the detection of pelvic, hip and extremity fractures on radiographs in adult using CT as standard of reference.

Purpose: The purpose of this study was to compare the diagnostic performance of an artificial intelligence (AI) solution for the detection of fractures of pelvic, proximal femur or extremity fractures in adults with radiologist interpretation of radiographs, using standard dose CT examination as the standard of reference.

Materials and methods: This retrospective study included 94 adult patients with suspected bone fractures who underwent a standard dose CT examination and radiographs of the pelvis and/or hip and extremities at our institution between January 2022 and August 2023. For all patients, an AI solution was used retrospectively on the radiographs to detect and localize bone fractures of the pelvis and/or hip and extremities. Results of the AI solution were compared to the reading of each radiograph by a radiologist using McNemar test. The results of standard dose CT examination as interpreted by a senior radiologist were used as the standard of reference.

Result: A total of 94 patients (63 women; mean age, 56.4 ± 22.5 [standard deviation] years) were included. Forty-seven patients had at least one fracture, and a total of 71 fractures were deemed present using the standard of reference (25 hand/wrist, 16 pelvis, 30 foot/ankle). Using the standard of reference, the analysis of radiographs by the AI solution resulted in 58 true positive, 13 false negative, 33 true negative and 15 false positive findings, yielding 82 % sensitivity (58/71; 95 % confidence interval [CI]: 71-89 %), 69 % specificity (33/48; 95 % CI: 55-80 %), and 76 % accuracy (91/119; 95 % CI: 69-84 %). Using the standard of reference, the reading of the radiologist resulted in 65 true positive, 6 false negative, 42 true negative and 6 false positive findings, yielding 92 % sensitivity (65/71; 95 % CI: 82-96 %), 88 % specificity (42/48; 95 % CI: 75-94 %), and 90 % accuracy (107/119; 95 % CI: 85-95 %). The radiologist outperformed the AI solution in terms of sensitivity (P = 0.045), specificity (P = 0.016), and accuracy (P < 0.001).

Conclusion: In this study, the radiologist outperformed the AI solution for the diagnosis of pelvic, hip and extremity fractures of the using radiographs. This raises the question of whether a strong standard of reference for evaluating AI solutions should be used in future studies comparing AI and human reading in fracture detection using radiographs.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Diagnostic and Interventional Imaging
Diagnostic and Interventional Imaging Medicine-Radiology, Nuclear Medicine and Imaging
CiteScore
8.50
自引率
29.10%
发文量
126
审稿时长
11 days
期刊介绍: Diagnostic and Interventional Imaging accepts publications originating from any part of the world based only on their scientific merit. The Journal focuses on illustrated articles with great iconographic topics and aims at aiding sharpening clinical decision-making skills as well as following high research topics. All articles are published in English. Diagnostic and Interventional Imaging publishes editorials, technical notes, letters, original and review articles on abdominal, breast, cancer, cardiac, emergency, forensic medicine, head and neck, musculoskeletal, gastrointestinal, genitourinary, interventional, obstetric, pediatric, thoracic and vascular imaging, neuroradiology, nuclear medicine, as well as contrast material, computer developments, health policies and practice, and medical physics relevant to imaging.
期刊最新文献
Diagnostic and prognostic value of MRI-based Node-RADS for the assessment of regional lymph node metastasis in renal cell carcinoma. Evaluation of a deep learning-based software to automatically detect and quantify breast arterial calcifications on digital mammogram. Artificial intelligence for bone fracture detection: A promising tool but no substitute for human expertise. CT, MRI and contrast-enhanced ultrasound features of mucinous cystic neoplasm of the liver. Generative AI smartphones: From entertainment to potentially serious risks in radiology.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1