Kellen L Mulford, Austin F Grove, Elizabeth S Kaji, Pouria Rouzrokh, Ryan Roman, Mete Kremers, Hilal Maradit Kremers, Michael J Taunton, Cody C Wyles
{"title":"感知不确定性的膝关节 X 光片深度学习特征描述,用于大规模登记创建。","authors":"Kellen L Mulford, Austin F Grove, Elizabeth S Kaji, Pouria Rouzrokh, Ryan Roman, Mete Kremers, Hilal Maradit Kremers, Michael J Taunton, Cody C Wyles","doi":"10.1016/j.arth.2024.10.103","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>We present an automated image ingestion pipeline for a knee radiography registry, integrating a multilabel image-semantic classifier with conformal prediction-based uncertainty quantification and an object detection model for knee hardware.</p><p><strong>Methods: </strong>Annotators retrospectively classified 26,000 knee images detailing presence, laterality, prostheses, and radiographic views. They further annotated surgical construct locations in 11,841 knee radiographs. An uncertainty-aware multilabel EfficientNet-based classifier was trained to identify the knee laterality, implants, and radiographic view. A classifier trained with embeddings from the EfficientNet model detected out-of-domain images. An object detection model was trained to identify 20 different knee implants. Model performance was assessed against a held-out internal and an external dataset using per-class F1 score, accuracy, sensitivity, and specificity. Conformal prediction was evaluated with marginal coverage and efficiency.</p><p><strong>Results: </strong>Classification Model with Conformal Prediction: F1 scores for each label output > 0.98. Coverage of each label output was >0.99 and the average efficiency was 0.97.</p><p><strong>Domain detection model: </strong>The F1 score was 0.99, with precision and recall for knee radiographs of 0.99.</p><p><strong>Object detection model: </strong>Mean average precision across all classes was 0.945 and ranged from 0.695 to 1.000. Average precision and recall across all classes were 0.950 and 0.886.</p><p><strong>Conclusions: </strong>We present a multilabel classifier with domain detection and an object detection model to characterize knee radiographs. Conformal prediction enhances transparency in cases when the model is uncertain.</p>","PeriodicalId":51077,"journal":{"name":"Journal of Arthroplasty","volume":null,"pages":null},"PeriodicalIF":3.4000,"publicationDate":"2024-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Uncertainty-Aware Deep Learning Characterization of Knee Radiographs for Large-Scale Registry Creation.\",\"authors\":\"Kellen L Mulford, Austin F Grove, Elizabeth S Kaji, Pouria Rouzrokh, Ryan Roman, Mete Kremers, Hilal Maradit Kremers, Michael J Taunton, Cody C Wyles\",\"doi\":\"10.1016/j.arth.2024.10.103\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>We present an automated image ingestion pipeline for a knee radiography registry, integrating a multilabel image-semantic classifier with conformal prediction-based uncertainty quantification and an object detection model for knee hardware.</p><p><strong>Methods: </strong>Annotators retrospectively classified 26,000 knee images detailing presence, laterality, prostheses, and radiographic views. They further annotated surgical construct locations in 11,841 knee radiographs. An uncertainty-aware multilabel EfficientNet-based classifier was trained to identify the knee laterality, implants, and radiographic view. A classifier trained with embeddings from the EfficientNet model detected out-of-domain images. An object detection model was trained to identify 20 different knee implants. Model performance was assessed against a held-out internal and an external dataset using per-class F1 score, accuracy, sensitivity, and specificity. Conformal prediction was evaluated with marginal coverage and efficiency.</p><p><strong>Results: </strong>Classification Model with Conformal Prediction: F1 scores for each label output > 0.98. Coverage of each label output was >0.99 and the average efficiency was 0.97.</p><p><strong>Domain detection model: </strong>The F1 score was 0.99, with precision and recall for knee radiographs of 0.99.</p><p><strong>Object detection model: </strong>Mean average precision across all classes was 0.945 and ranged from 0.695 to 1.000. Average precision and recall across all classes were 0.950 and 0.886.</p><p><strong>Conclusions: </strong>We present a multilabel classifier with domain detection and an object detection model to characterize knee radiographs. Conformal prediction enhances transparency in cases when the model is uncertain.</p>\",\"PeriodicalId\":51077,\"journal\":{\"name\":\"Journal of Arthroplasty\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":3.4000,\"publicationDate\":\"2024-10-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Arthroplasty\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1016/j.arth.2024.10.103\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ORTHOPEDICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Arthroplasty","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.arth.2024.10.103","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ORTHOPEDICS","Score":null,"Total":0}
引用次数: 0
摘要
背景:我们为膝关节放射摄影登记处提供了一个自动图像摄取管道,将多标签图像语义分类器与基于保形预测的不确定性量化和膝关节硬件对象检测模型整合在一起:注释者对 26,000 张膝关节图像进行了回顾性分类,详细说明了存在、侧位、假体和射线视图。他们进一步注释了 11,841 张膝关节 X 光片中的手术结构位置。他们训练了基于 EfficientNet 的不确定性感知多标签分类器,以识别膝关节侧位、假体和放射学视图。使用来自 EfficientNet 模型的嵌入进行训练的分类器可检测域外图像。对物体检测模型进行了训练,以识别 20 种不同的膝关节植入物。使用每类的 F1 分数、准确性、灵敏度和特异性,针对保留的内部和外部数据集对模型性能进行了评估。根据边缘覆盖率和效率对共形预测进行了评估:带有共形预测的分类模型:每个标签输出的 F1 分数大于 0.98。每个标签输出的覆盖率大于 0.99,平均效率为 0.97:F1 分数为 0.99,膝关节 X 光片的精确度和召回率均为 0.99:所有类别的平均精确度为 0.945,范围在 0.695 至 1.000 之间。所有类别的平均精确度和召回率分别为 0.950 和 0.886:我们提出了一种具有领域检测和物体检测模型的多标签分类器,用于描述膝关节X光片。在模型不确定的情况下,共形预测提高了透明度。
Uncertainty-Aware Deep Learning Characterization of Knee Radiographs for Large-Scale Registry Creation.
Background: We present an automated image ingestion pipeline for a knee radiography registry, integrating a multilabel image-semantic classifier with conformal prediction-based uncertainty quantification and an object detection model for knee hardware.
Methods: Annotators retrospectively classified 26,000 knee images detailing presence, laterality, prostheses, and radiographic views. They further annotated surgical construct locations in 11,841 knee radiographs. An uncertainty-aware multilabel EfficientNet-based classifier was trained to identify the knee laterality, implants, and radiographic view. A classifier trained with embeddings from the EfficientNet model detected out-of-domain images. An object detection model was trained to identify 20 different knee implants. Model performance was assessed against a held-out internal and an external dataset using per-class F1 score, accuracy, sensitivity, and specificity. Conformal prediction was evaluated with marginal coverage and efficiency.
Results: Classification Model with Conformal Prediction: F1 scores for each label output > 0.98. Coverage of each label output was >0.99 and the average efficiency was 0.97.
Domain detection model: The F1 score was 0.99, with precision and recall for knee radiographs of 0.99.
Object detection model: Mean average precision across all classes was 0.945 and ranged from 0.695 to 1.000. Average precision and recall across all classes were 0.950 and 0.886.
Conclusions: We present a multilabel classifier with domain detection and an object detection model to characterize knee radiographs. Conformal prediction enhances transparency in cases when the model is uncertain.
期刊介绍:
The Journal of Arthroplasty brings together the clinical and scientific foundations for joint replacement. This peer-reviewed journal publishes original research and manuscripts of the highest quality from all areas relating to joint replacement or the treatment of its complications, including those dealing with clinical series and experience, prosthetic design, biomechanics, biomaterials, metallurgy, biologic response to arthroplasty materials in vivo and in vitro.