Yaqi Liu , Tingting Wang , Li Yang , Jianhong Wu , Tao He
{"title":"Automatic Joint Lesion Detection by enhancing local feature interaction","authors":"Yaqi Liu , Tingting Wang , Li Yang , Jianhong Wu , Tao He","doi":"10.1016/j.compmedimag.2025.102509","DOIUrl":null,"url":null,"abstract":"<div><div>Recently, deep learning models have demonstrated impressive performance in Automatic Joint Lesion Detection (AJLD), yet balancing accuracy and efficiency remains a significant challenge. This paper focuses on achieving end-to-end lesion detection while improving accuracy to meet clinical requirements. To enhance the overall performance of AJLD, we propose novel modules: Local Attention Feature Fusion (LAFF) and Gaussian Positional Encoding (GPE). These modules are extensively integrated into YOLO, resulting in an improved YOLO model by enhancing <strong>L</strong>ocal <strong>F</strong>eature interaction, named <span><math><msub><mrow><mi>YOLO</mi></mrow><mrow><mi>lf</mi></mrow></msub></math></span> for short. The LAFF module, based on pathological features presented by arthritis, strengthens the implicit connections between joints by acquiring local attention information. The GPE module enhances the connections between joints by encoding their local positional information. In this paper, we validate our approach using two arthritis datasets, including the largest AJLD dataset in the literature (960 X-ray images annotated by two arthritis specialists and one radiologist) and another arthritis dataset with 216 X-ray images, supplemented by the MURA dataset, a more general dataset for abnormality detection in musculoskeletal radiographs. In various series of YOLO models, the improved <span><math><msub><mrow><mi>YOLO</mi></mrow><mrow><mi>lf</mi></mrow></msub></math></span> shows a significant increase in detection accuracy. Taking YOLOv8 as an example, the improved <span><math><mrow><msub><mrow><mi>YOLO</mi></mrow><mrow><mi>lf</mi></mrow></msub><mi>v8</mi></mrow></math></span> increases mAP@50 from 0.765 to 0.785 and from 0.831 to 0.859 on two arthritis datasets, demonstrating the plug-and-play nature and clinical applicability of the proposed LAFF and GPE modules.</div></div>","PeriodicalId":50631,"journal":{"name":"Computerized Medical Imaging and Graphics","volume":"121 ","pages":"Article 102509"},"PeriodicalIF":5.4000,"publicationDate":"2025-02-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computerized Medical Imaging and Graphics","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0895611125000187","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0
Abstract
Recently, deep learning models have demonstrated impressive performance in Automatic Joint Lesion Detection (AJLD), yet balancing accuracy and efficiency remains a significant challenge. This paper focuses on achieving end-to-end lesion detection while improving accuracy to meet clinical requirements. To enhance the overall performance of AJLD, we propose novel modules: Local Attention Feature Fusion (LAFF) and Gaussian Positional Encoding (GPE). These modules are extensively integrated into YOLO, resulting in an improved YOLO model by enhancing Local Feature interaction, named for short. The LAFF module, based on pathological features presented by arthritis, strengthens the implicit connections between joints by acquiring local attention information. The GPE module enhances the connections between joints by encoding their local positional information. In this paper, we validate our approach using two arthritis datasets, including the largest AJLD dataset in the literature (960 X-ray images annotated by two arthritis specialists and one radiologist) and another arthritis dataset with 216 X-ray images, supplemented by the MURA dataset, a more general dataset for abnormality detection in musculoskeletal radiographs. In various series of YOLO models, the improved shows a significant increase in detection accuracy. Taking YOLOv8 as an example, the improved increases mAP@50 from 0.765 to 0.785 and from 0.831 to 0.859 on two arthritis datasets, demonstrating the plug-and-play nature and clinical applicability of the proposed LAFF and GPE modules.
期刊介绍:
The purpose of the journal Computerized Medical Imaging and Graphics is to act as a source for the exchange of research results concerning algorithmic advances, development, and application of digital imaging in disease detection, diagnosis, intervention, prevention, precision medicine, and population health. Included in the journal will be articles on novel computerized imaging or visualization techniques, including artificial intelligence and machine learning, augmented reality for surgical planning and guidance, big biomedical data visualization, computer-aided diagnosis, computerized-robotic surgery, image-guided therapy, imaging scanning and reconstruction, mobile and tele-imaging, radiomics, and imaging integration and modeling with other information relevant to digital health. The types of biomedical imaging include: magnetic resonance, computed tomography, ultrasound, nuclear medicine, X-ray, microwave, optical and multi-photon microscopy, video and sensory imaging, and the convergence of biomedical images with other non-imaging datasets.