Artificial intelligence can be used in the identification and classification of shoulder osteoarthritis and avascular necrosis on plain radiographs: a training study of 7,139 radiograph sets.
Martin Magnéli, Michael Axenhus, Johan Fagrell, Petter Ling, Jacob Gislén, Yilmaz Demir, Erica Domeij-Arverud, Kristofer Hallberg, Björn Salomonsson, Max Gordon
{"title":"Artificial intelligence can be used in the identification and classification of shoulder osteoarthritis and avascular necrosis on plain radiographs: a training study of 7,139 radiograph sets.","authors":"Martin Magnéli, Michael Axenhus, Johan Fagrell, Petter Ling, Jacob Gislén, Yilmaz Demir, Erica Domeij-Arverud, Kristofer Hallberg, Björn Salomonsson, Max Gordon","doi":"10.2340/17453674.2024.40905","DOIUrl":null,"url":null,"abstract":"<p><strong>Background and purpose: </strong>Knowledge concerning the use AI models for the classification of glenohumeral osteoarthritis (GHOA) and avascular necrosis (AVN) of the humeral head is lacking. We aimed to analyze how a deep learning (DL) model trained to identify and grade GHOA on plain radiographs performs. Our secondary aim was to train a DL model to identify and grade AVN on plain radiographs.</p><p><strong>Patients and methods: </strong>A modified ResNet-type network was trained on a dataset of radiographic shoulder examinations from a large tertiary hospital. A total of 7,139 radiographs were included. The dataset included various projections of the shoulder, and the network was trained using stochastic gradient descent. Performance evaluation metrics, area under the receiver operating characteristic curve (AUC), sensitivity, and specificity were used to assess the network's performance for each outcome.</p><p><strong>Results: </strong>The network demonstrated AUC values ranging from 0.73 to 0.93 for GHOA classification and > 0.90 for all AVN classification classes. The network exhibited lower AUC for mild cases compared with definitive cases of GHOA. When none and mild grades were combined, the AUC increased, suggesting difficulties in distinguishing between these 2 grades.</p><p><strong>Conclusion: </strong>We found that a DL model can be trained to identify and grade GHOA on plain radiographs. Furthermore, we show that a DL model can identify and grade AVN on plain radiographs. The network performed well, particularly for definitive cases of GHOA and any level of AVN. However, challenges remain in distinguishing between none and mild GHOA grades.</p>","PeriodicalId":6916,"journal":{"name":"Acta Orthopaedica","volume":"95 ","pages":"319-324"},"PeriodicalIF":2.5000,"publicationDate":"2024-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11182033/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Acta Orthopaedica","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.2340/17453674.2024.40905","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ORTHOPEDICS","Score":null,"Total":0}
引用次数: 0
Abstract
Background and purpose: Knowledge concerning the use AI models for the classification of glenohumeral osteoarthritis (GHOA) and avascular necrosis (AVN) of the humeral head is lacking. We aimed to analyze how a deep learning (DL) model trained to identify and grade GHOA on plain radiographs performs. Our secondary aim was to train a DL model to identify and grade AVN on plain radiographs.
Patients and methods: A modified ResNet-type network was trained on a dataset of radiographic shoulder examinations from a large tertiary hospital. A total of 7,139 radiographs were included. The dataset included various projections of the shoulder, and the network was trained using stochastic gradient descent. Performance evaluation metrics, area under the receiver operating characteristic curve (AUC), sensitivity, and specificity were used to assess the network's performance for each outcome.
Results: The network demonstrated AUC values ranging from 0.73 to 0.93 for GHOA classification and > 0.90 for all AVN classification classes. The network exhibited lower AUC for mild cases compared with definitive cases of GHOA. When none and mild grades were combined, the AUC increased, suggesting difficulties in distinguishing between these 2 grades.
Conclusion: We found that a DL model can be trained to identify and grade GHOA on plain radiographs. Furthermore, we show that a DL model can identify and grade AVN on plain radiographs. The network performed well, particularly for definitive cases of GHOA and any level of AVN. However, challenges remain in distinguishing between none and mild GHOA grades.
期刊介绍:
Acta Orthopaedica (previously Acta Orthopaedica Scandinavica) presents original articles of basic research interest, as well as clinical studies in the field of orthopedics and related sub disciplines. Ever since the journal was founded in 1930, by a group of Scandinavian orthopedic surgeons, the journal has been published for an international audience. Acta Orthopaedica is owned by the Nordic Orthopaedic Federation and is the official publication of this federation.