Improving Tuberculosis Detection in Chest X-ray Images through Transfer Learning and Deep Learning: A Comparative Study of CNN Architectures

medRxiv - Radiology and Imaging Pub Date : 2024-08-03 DOI:10.1101/2024.08.02.24311396

Alex Mirugwe, Lillian Tamale, Juwa Nyirenda

{"title":"Improving Tuberculosis Detection in Chest X-ray Images through Transfer Learning and Deep Learning: A Comparative Study of CNN Architectures","authors":"Alex Mirugwe, Lillian Tamale, Juwa Nyirenda","doi":"10.1101/2024.08.02.24311396","DOIUrl":null,"url":null,"abstract":"Introduction: Tuberculosis remains a significant global health challenge, necessitating more efficient and accurate diagnostic methods.\nMethods: This study evaluates the performance of various convolutional neural network (CNN) architectures VGG16, VGG19, ResNet50, ResNet101, ResNet152, and Inception-ResNet-V2 in classifying chest X-ray (CXR) images as either normal or TB positive. The dataset comprised 4,200 CXR images, with 700 labeled as TB-positive and 3,500 as normal. We also examined the impact of data augmentation on model performance and analyzed the training times and the number of parameters for each architecture.\nResults: Our results showed that VGG16 outperformed the other models across all evaluation metrics, achieving an accuracy of 99.4%, precision of 97.9%, recall of 98.6%, F1-score of 98.3%, and AUC-ROC of 98.25%. Surprisingly, data augmentation did not improve performance, suggesting that the original dataset's diversity was sufficient. Furthermore, models with large numbers of parameters, such as ResNet152 and Inception-ResNet-V2, required longer training times without yielding proportionally better performance.\nDiscussion: These findings highlight the importance of selecting the appropriate model architecture based on task-specific requirements. While more complex models with larger parameter counts may seem advantageous, they do not necessarily offer superior performance and often come with increased computational costs.\nConclusion: The study demonstrates the potential of simpler models such as VGG16 to effectively diagnose TB from CXR images, providing a balance between performance and computational efficiency. This insight can guide future research and practical implementations in medical image classification.","PeriodicalId":501358,"journal":{"name":"medRxiv - Radiology and Imaging","volume":"10 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"medRxiv - Radiology and Imaging","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2024.08.02.24311396","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Introduction: Tuberculosis remains a significant global health challenge, necessitating more efficient and accurate diagnostic methods. Methods: This study evaluates the performance of various convolutional neural network (CNN) architectures VGG16, VGG19, ResNet50, ResNet101, ResNet152, and Inception-ResNet-V2 in classifying chest X-ray (CXR) images as either normal or TB positive. The dataset comprised 4,200 CXR images, with 700 labeled as TB-positive and 3,500 as normal. We also examined the impact of data augmentation on model performance and analyzed the training times and the number of parameters for each architecture. Results: Our results showed that VGG16 outperformed the other models across all evaluation metrics, achieving an accuracy of 99.4%, precision of 97.9%, recall of 98.6%, F1-score of 98.3%, and AUC-ROC of 98.25%. Surprisingly, data augmentation did not improve performance, suggesting that the original dataset's diversity was sufficient. Furthermore, models with large numbers of parameters, such as ResNet152 and Inception-ResNet-V2, required longer training times without yielding proportionally better performance. Discussion: These findings highlight the importance of selecting the appropriate model architecture based on task-specific requirements. While more complex models with larger parameter counts may seem advantageous, they do not necessarily offer superior performance and often come with increased computational costs. Conclusion: The study demonstrates the potential of simpler models such as VGG16 to effectively diagnose TB from CXR images, providing a balance between performance and computational efficiency. This insight can guide future research and practical implementations in medical image classification.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

通过迁移学习和深度学习改进胸部 X 光图像中的结核病检测：CNN 架构比较研究

简介：结核病仍然是全球健康的重大挑战：结核病仍然是全球健康面临的重大挑战，需要更高效、更准确的诊断方法：本研究评估了各种卷积神经网络（CNN）架构 VGG16、VGG19、ResNet50、ResNet101、ResNet152 和 Inception-ResNet-V2 在将胸部 X 光（CXR）图像分类为正常或肺结核阳性方面的性能。数据集包括 4,200 张 CXR 图像，其中 700 张标记为肺结核阳性，3,500 张标记为正常。我们还研究了数据增强对模型性能的影响，并分析了每种架构的训练时间和参数数量：结果显示，VGG16 在所有评估指标上都优于其他模型，准确率达到 99.4%，精确率达到 97.9%，召回率达到 98.6%，F1 分数达到 98.3%，AUC-ROC 达到 98.25%。令人惊讶的是，数据扩充并没有提高性能，这表明原始数据集的多样性已经足够。此外，具有大量参数的模型，如 ResNet152 和 Inception-ResNet-V2 需要更长的训练时间，但性能却没有相应提高：讨论：这些发现强调了根据特定任务要求选择适当模型架构的重要性。虽然参数数越多的复杂模型似乎越有利，但它们并不一定能提供更优越的性能，而且往往会增加计算成本：这项研究表明，VGG16 等较简单的模型具有通过 CXR 图像有效诊断肺结核的潜力，可在性能和计算效率之间取得平衡。这一见解可以指导未来医学图像分类的研究和实际应用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

medRxiv - Radiology and Imaging

自引率

0.00%

发文量