The value of deep learning-based X-ray techniques in detecting and classifying K-L grades of knee osteoarthritis: a systematic review and meta-analysis.
Haoming Zhao, Liang Ou, Ziming Zhang, Le Zhang, Ke Liu, Jianjun Kuang
{"title":"The value of deep learning-based X-ray techniques in detecting and classifying K-L grades of knee osteoarthritis: a systematic review and meta-analysis.","authors":"Haoming Zhao, Liang Ou, Ziming Zhang, Le Zhang, Ke Liu, Jianjun Kuang","doi":"10.1007/s00330-024-10928-9","DOIUrl":null,"url":null,"abstract":"<p><strong>Objectives: </strong>Knee osteoarthritis (KOA), a prevalent degenerative joint disease, is primarily diagnosed through X-ray imaging. The Kellgren-Lawrence grading system (K-L) is the gold standard for evaluating KOA severity through X-ray analysis. However, this method is highly subjective and non-quantifiable, limiting its effectiveness in detecting subtle joint changes on X-rays. Recent researchers have been directed towards developing deep-learning (DL) techniques for a more accurate diagnosis of KOA using X-ray images. Despite advancements in these intelligent methods, the debate over their diagnostic sensitivity continues. Hence, we conducted the current meta-analysis.</p><p><strong>Methods: </strong>A comprehensive search was conducted in PubMed, Cochrane, Embase, Web of Science, and IEEE up to July 11, 2023. The QUADAS-2 tool was employed to assess the risk of bias in the included studies. Given the multi-classification nature of DL tasks, the sensitivity of DL across different K-L grades was meta-analyzed.</p><p><strong>Results: </strong>A total of 19 studies were included, encompassing 62,158 images. These images consisted of 22,388 for K-L<sub>0</sub>, 13,415 for K-L<sub>1</sub>, 15,597 for K-L<sub>2</sub>, 7768 for K-L<sub>3</sub>, and 2990 for K-L<sub>4</sub>. The meta-analysis demonstrated that the sensitivity of DL was 86.74% for K-L<sub>0</sub> (95% CI: 80.01%-92.28%), 64.00% for K-L<sub>1</sub> (95% CI: 51.81%-75.35%), 75.03% for K-L<sub>2</sub> (95% CI: 66.00%-83.09%), 84.76% for K-L<sub>3</sub> (95% CI: 78.34%-90.25%), and 90.32% for K-L<sub>4</sub> (95% CI: 85.39%-94.40%).</p><p><strong>Conclusions: </strong>The DL multi-classification methods based on X-ray imaging generally demonstrate a favorable sensitivity rate (over 50%) in distinguishing between K-L<sub>0</sub>-K-L<sub>4</sub>. Specifically, for K-L<sub>4</sub>, the sensitivity is highly satisfactory at 90.32%. In contrast, the sensitivity rates for K-L<sub>1-2</sub> still need improvement.</p><p><strong>Clinical relevance statement: </strong>Deep-learning methods have been useful to some extent in assessing the effectiveness of X-rays for osteoarthritis of the knee. However, this requires further research and reliable data to provide specific recommendations for clinical practice.</p><p><strong>Key points: </strong>X-ray deep-learning (DL) methods are debatable for evaluating knee osteoarthritis (KOA) under The Kellgren-Lawrence system (K-L). Multi-classification deep-learning methods are more clinically relevant for assessing K-L grading than dichotomous results. For K-L3 and K-L4, X-ray-based DL has high diagnostic performance; early KOA needs to be further improved.</p>","PeriodicalId":12076,"journal":{"name":"European Radiology","volume":" ","pages":"327-340"},"PeriodicalIF":4.7000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11631813/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Radiology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s00330-024-10928-9","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/7/12 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}
引用次数: 0
Abstract
Objectives: Knee osteoarthritis (KOA), a prevalent degenerative joint disease, is primarily diagnosed through X-ray imaging. The Kellgren-Lawrence grading system (K-L) is the gold standard for evaluating KOA severity through X-ray analysis. However, this method is highly subjective and non-quantifiable, limiting its effectiveness in detecting subtle joint changes on X-rays. Recent researchers have been directed towards developing deep-learning (DL) techniques for a more accurate diagnosis of KOA using X-ray images. Despite advancements in these intelligent methods, the debate over their diagnostic sensitivity continues. Hence, we conducted the current meta-analysis.
Methods: A comprehensive search was conducted in PubMed, Cochrane, Embase, Web of Science, and IEEE up to July 11, 2023. The QUADAS-2 tool was employed to assess the risk of bias in the included studies. Given the multi-classification nature of DL tasks, the sensitivity of DL across different K-L grades was meta-analyzed.
Results: A total of 19 studies were included, encompassing 62,158 images. These images consisted of 22,388 for K-L0, 13,415 for K-L1, 15,597 for K-L2, 7768 for K-L3, and 2990 for K-L4. The meta-analysis demonstrated that the sensitivity of DL was 86.74% for K-L0 (95% CI: 80.01%-92.28%), 64.00% for K-L1 (95% CI: 51.81%-75.35%), 75.03% for K-L2 (95% CI: 66.00%-83.09%), 84.76% for K-L3 (95% CI: 78.34%-90.25%), and 90.32% for K-L4 (95% CI: 85.39%-94.40%).
Conclusions: The DL multi-classification methods based on X-ray imaging generally demonstrate a favorable sensitivity rate (over 50%) in distinguishing between K-L0-K-L4. Specifically, for K-L4, the sensitivity is highly satisfactory at 90.32%. In contrast, the sensitivity rates for K-L1-2 still need improvement.
Clinical relevance statement: Deep-learning methods have been useful to some extent in assessing the effectiveness of X-rays for osteoarthritis of the knee. However, this requires further research and reliable data to provide specific recommendations for clinical practice.
Key points: X-ray deep-learning (DL) methods are debatable for evaluating knee osteoarthritis (KOA) under The Kellgren-Lawrence system (K-L). Multi-classification deep-learning methods are more clinically relevant for assessing K-L grading than dichotomous results. For K-L3 and K-L4, X-ray-based DL has high diagnostic performance; early KOA needs to be further improved.
期刊介绍:
European Radiology (ER) continuously updates scientific knowledge in radiology by publication of strong original articles and state-of-the-art reviews written by leading radiologists. A well balanced combination of review articles, original papers, short communications from European radiological congresses and information on society matters makes ER an indispensable source for current information in this field.
This is the Journal of the European Society of Radiology, and the official journal of a number of societies.
From 2004-2008 supplements to European Radiology were published under its companion, European Radiology Supplements, ISSN 1613-3749.