An Integrated Approach using YOLOv8 and ResNet, SeResNet & Vision Transformer (ViT) Algorithms based on ROI Fracture Prediction in X-ray Images of the Elbow.
Taukir Alam, Wei-Cheng Yeh, Fang Rong Hsu, Wei-Chung Shia, Robert Singh, Taimoor Hassan, Wenru Lin, Hong-Ye Yang, Tahir Hussain
{"title":"An Integrated Approach using YOLOv8 and ResNet, SeResNet & Vision Transformer (ViT) Algorithms based on ROI Fracture Prediction in X-ray Images of the Elbow.","authors":"Taukir Alam, Wei-Cheng Yeh, Fang Rong Hsu, Wei-Chung Shia, Robert Singh, Taimoor Hassan, Wenru Lin, Hong-Ye Yang, Tahir Hussain","doi":"10.2174/0115734056309890240912054616","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>In this study, we harnessed three cutting-edge algorithms' capabilities to refine the elbow fracture prediction process through X-ray image analysis. Employing the YOLOv8 (You only look once) algorithm, we first identified Regions of Interest (ROI) within the X-ray images, significantly augmenting fracture prediction accuracy.</p><p><strong>Methods: </strong>Subsequently, we integrated and compared the ResNet, the SeResNet (Squeeze-and-Excitation Residual Network) ViT (Vision Transformer) algorithms to refine our predictive capabilities. Furthermore, to ensure optimal precision, we implemented a series of meticulous refinements. This included recalibrating ROI regions to enable finer-grained identification of diagnostically significant areas within the X-ray images. Additionally, advanced image enhancement techniques were applied to optimize the X-ray images' visual quality and structural clarity.</p><p><strong>Results: </strong>These methodological enhancements synergistically contributed to a substantial improvement in the overall accuracy of our fracture predictions. The dataset utilized for training, testing & validation, and comprehensive evaluation exclusively comprised elbow X-ray images, where predicting the fracture with three algorithms: Resnet50; accuracy 0.97, precision 1, recall 0.95, SeResnet50; accuracy 0.97, precision 1, recall 0.95 & ViTB- 16 with high accuracy of 0.99, precision same as the other two algorithms, with a recall of 0.95.</p><p><strong>Conclusion: </strong>This approach has the potential to increase the precision of diagnoses, lessen the burden of radiologists, easily integrate into current medical imaging systems, and assist clinical decision-making, all of which could lead to better patient care and health outcomes overall.</p>","PeriodicalId":54215,"journal":{"name":"Current Medical Imaging Reviews","volume":null,"pages":null},"PeriodicalIF":1.1000,"publicationDate":"2024-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Current Medical Imaging Reviews","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.2174/0115734056309890240912054616","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}
引用次数: 0
Abstract
Introduction: In this study, we harnessed three cutting-edge algorithms' capabilities to refine the elbow fracture prediction process through X-ray image analysis. Employing the YOLOv8 (You only look once) algorithm, we first identified Regions of Interest (ROI) within the X-ray images, significantly augmenting fracture prediction accuracy.
Methods: Subsequently, we integrated and compared the ResNet, the SeResNet (Squeeze-and-Excitation Residual Network) ViT (Vision Transformer) algorithms to refine our predictive capabilities. Furthermore, to ensure optimal precision, we implemented a series of meticulous refinements. This included recalibrating ROI regions to enable finer-grained identification of diagnostically significant areas within the X-ray images. Additionally, advanced image enhancement techniques were applied to optimize the X-ray images' visual quality and structural clarity.
Results: These methodological enhancements synergistically contributed to a substantial improvement in the overall accuracy of our fracture predictions. The dataset utilized for training, testing & validation, and comprehensive evaluation exclusively comprised elbow X-ray images, where predicting the fracture with three algorithms: Resnet50; accuracy 0.97, precision 1, recall 0.95, SeResnet50; accuracy 0.97, precision 1, recall 0.95 & ViTB- 16 with high accuracy of 0.99, precision same as the other two algorithms, with a recall of 0.95.
Conclusion: This approach has the potential to increase the precision of diagnoses, lessen the burden of radiologists, easily integrate into current medical imaging systems, and assist clinical decision-making, all of which could lead to better patient care and health outcomes overall.
基于肘部 X 射线图像 ROI 骨折预测的 YOLOv8 与 ResNet、SeResNet 和 Vision Transformer (ViT) 算法集成方法。
简介在这项研究中,我们利用三种尖端算法的功能,通过X光图像分析改进了肘部骨折预测过程。利用 YOLOv8(只看一次)算法,我们首先确定了 X 光图像中的感兴趣区(ROI),从而显著提高了骨折预测的准确性:随后,我们整合并比较了 ResNet、SeResNet(挤压-激发残余网络)和 ViT(视觉转换器)算法,以完善我们的预测能力。此外,为了确保最佳精度,我们还进行了一系列细致的改进。这包括重新校准 ROI 区域,以便更精细地识别 X 射线图像中具有诊断意义的区域。此外,我们还采用了先进的图像增强技术,以优化 X 光图像的视觉质量和结构清晰度:结果:这些方法上的改进协同作用,大大提高了骨折预测的整体准确性。用于训练、测试和验证以及综合评估的数据集完全由肘部 X 光图像组成,其中使用三种算法预测骨折:Resnet50的准确率为0.97,精确度为1,召回率为0.95;SeResnet50的准确率为0.97,精确度为1,召回率为0.95;ViTB- 16的准确率为0.99,精确度与其他两种算法相同,召回率为0.95:这种方法有可能提高诊断的精确度,减轻放射科医生的负担,轻松集成到当前的医学影像系统中,并辅助临床决策,所有这些都能带来更好的病人护理和整体健康结果。
期刊介绍:
Current Medical Imaging Reviews publishes frontier review articles, original research articles, drug clinical trial studies and guest edited thematic issues on all the latest advances on medical imaging dedicated to clinical research. All relevant areas are covered by the journal, including advances in the diagnosis, instrumentation and therapeutic applications related to all modern medical imaging techniques.
The journal is essential reading for all clinicians and researchers involved in medical imaging and diagnosis.