Deep learning in gonarthrosis classification: a comparative study of model architectures and single vs. multi-model methods.

IF 4.7 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Frontiers in Artificial Intelligence Pub Date : 2025-02-05 eCollection Date: 2025-01-01 DOI:10.3389/frai.2025.1413820
Sahika Betul Yayli, Kutay Kılıç, Salih Beyaz
{"title":"Deep learning in gonarthrosis classification: a comparative study of model architectures and single vs. multi-model methods.","authors":"Sahika Betul Yayli, Kutay Kılıç, Salih Beyaz","doi":"10.3389/frai.2025.1413820","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>This study aims to classify Kellgren-Lawrence (KL) osteoarthritis stages using knee anteroposterior X-ray images by comparing two deep learning (DL) methodologies: a traditional single-model approach and a proposed multi-model approach. We addressed three core research questions in this study: (1) How effective are single-model and multi-model deep learning approaches in classifying KL stages? (2) How do seven convolutional neural network (CNN) architectures perform across four distinct deep learning tasks? (3) What is the impact of CLAHE (Contrast Limited Adaptive Histogram Equalization) on classification performance?</p><p><strong>Approach: </strong>We created a dataset of 14,607 annotated knee AP X-rays from three hospitals. The knee joint region was isolated using a YOLOv5 object detection model. The multi-model approach utilized three DL models: one for osteophyte detection, another for joint space narrowing analysis, and a third to combine these outputs with demographic and image data for KL classification. The single-model approach directly classified KL stages as a benchmark. Seven CNN architectures (NfNet-F0/F1, EfficientNet-B0/B3, Inception-ResNet-v2, VGG16) were trained with and without CLAHE augmentation.</p><p><strong>Results: </strong>The single-model approach achieved an F1-score of 0.763 and accuracy of 0.767, outperforming the multi-model strategy, which scored 0.736 and 0.740. Different models performed best across tasks, underscoring the need for task-specific architecture selection. CLAHE negatively impacted most models, with only one showing a marginal improvement of 0.3%.</p><p><strong>Conclusion: </strong>The single-model approach was more effective for KL grading, surpassing metrics in existing literature. These findings emphasize the importance of task-specific architectures and preprocessing. Future studies should explore ensemble modeling, advanced augmentations, and clinical validation to enhance applicability.</p>","PeriodicalId":33315,"journal":{"name":"Frontiers in Artificial Intelligence","volume":"8 ","pages":"1413820"},"PeriodicalIF":4.7000,"publicationDate":"2025-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11835854/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/frai.2025.1413820","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Purpose: This study aims to classify Kellgren-Lawrence (KL) osteoarthritis stages using knee anteroposterior X-ray images by comparing two deep learning (DL) methodologies: a traditional single-model approach and a proposed multi-model approach. We addressed three core research questions in this study: (1) How effective are single-model and multi-model deep learning approaches in classifying KL stages? (2) How do seven convolutional neural network (CNN) architectures perform across four distinct deep learning tasks? (3) What is the impact of CLAHE (Contrast Limited Adaptive Histogram Equalization) on classification performance?

Approach: We created a dataset of 14,607 annotated knee AP X-rays from three hospitals. The knee joint region was isolated using a YOLOv5 object detection model. The multi-model approach utilized three DL models: one for osteophyte detection, another for joint space narrowing analysis, and a third to combine these outputs with demographic and image data for KL classification. The single-model approach directly classified KL stages as a benchmark. Seven CNN architectures (NfNet-F0/F1, EfficientNet-B0/B3, Inception-ResNet-v2, VGG16) were trained with and without CLAHE augmentation.

Results: The single-model approach achieved an F1-score of 0.763 and accuracy of 0.767, outperforming the multi-model strategy, which scored 0.736 and 0.740. Different models performed best across tasks, underscoring the need for task-specific architecture selection. CLAHE negatively impacted most models, with only one showing a marginal improvement of 0.3%.

Conclusion: The single-model approach was more effective for KL grading, surpassing metrics in existing literature. These findings emphasize the importance of task-specific architectures and preprocessing. Future studies should explore ensemble modeling, advanced augmentations, and clinical validation to enhance applicability.

Abstract Image

Abstract Image

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
关节病分类中的深度学习:模型架构和单模型与多模型方法的比较研究。
目的:本研究旨在通过比较两种深度学习(DL)方法:传统的单模型方法和提出的多模型方法,利用膝关节正位x线图像对Kellgren-Lawrence (KL)骨关节炎分期进行分类。在本研究中,我们解决了三个核心研究问题:(1)单模型和多模型深度学习方法在KL阶段分类中的有效性如何?(2)七种卷积神经网络(CNN)架构如何在四种不同的深度学习任务中执行?(3) CLAHE(对比度有限自适应直方图均衡化)对分类性能的影响是什么?方法:我们创建了来自三家医院的14,607张带注释的膝关节AP x射线数据集。使用YOLOv5目标检测模型分离膝关节区域。多模型方法利用了三个深度学习模型:一个用于骨赘检测,另一个用于关节间隙狭窄分析,第三个将这些输出与人口统计学和图像数据结合起来进行KL分类。单一模型方法直接将KL阶段分类为基准。7个CNN架构(NfNet-F0/F1、EfficientNet-B0/B3、Inception-ResNet-v2、VGG16)在CLAHE增强和不增强的情况下进行了训练。结果:单模型方法的f1得分为0.763,准确率为0.767,优于多模型策略的得分0.736和0.740。不同的模型在不同的任务中表现最好,这强调了特定于任务的体系结构选择的必要性。CLAHE对大多数车型产生了负面影响,只有一款车型表现出0.3%的边际改善。结论:单模型方法对KL评分更有效,优于现有文献中的指标。这些发现强调了任务特定架构和预处理的重要性。未来的研究应探索集成模型、高级增强和临床验证,以提高适用性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
6.10
自引率
2.50%
发文量
272
审稿时长
13 weeks
期刊最新文献
ORCH: many analyses, one merge-a deterministic multi-agent orchestrator for discrete-choice reasoning with EMA-guided routing. Graph-enhanced multimodal fusion of vascular biomarkers and deep features for diabetic retinopathy detection. A systematic review and future directions for AI-driven detection of fraud patterns in SACCO transactions. Augmenting large language models with psychologically grounded models of causal reasoning for planning under uncertainty. KmPred: prediction of Michaelis constants (Km) using an integrative machine learning framework.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1