Augmenting a spine CT scans dataset using VAEs, GANs, and transfer learning for improved detection of vertebral compression fractures

IF 7 2区医学 Q1 BIOLOGY Computers in biology and medicine Pub Date : 2024-11-16 DOI:10.1016/j.compbiomed.2024.109446

Zeina El Kojok, Hadi Al Khansa, Fouad Trad, Ali Chehab

{"title":"Augmenting a spine CT scans dataset using VAEs, GANs, and transfer learning for improved detection of vertebral compression fractures","authors":"Zeina El Kojok, Hadi Al Khansa, Fouad Trad, Ali Chehab","doi":"10.1016/j.compbiomed.2024.109446","DOIUrl":null,"url":null,"abstract":"<div><div>In recent years, deep learning has become a popular tool to analyze and classify medical images. However, challenges such as limited data availability, high labeling costs, and privacy concerns remain significant obstacles. As such, generative models have been extensively explored as a solution to generate new images and overcome the stated challenges. In this paper, we augment a dataset of chest CT scans for Vertebral Compression Fractures (VCFs) collected from the American University of Beirut Medical Center (AUBMC), specifically targeting the detection of incidental fractures that are often overlooked in routine chest CTs, as these scans are not typically focused on spinal analysis. Our goal is to enhance AI systems to enable automated early detection of such incidental fractures, addressing a critical healthcare gap and leading to improved patient outcomes by catching fractures that might otherwise go undiagnosed. We first generate a synthetic dataset based on the segmented CTSpine1K dataset to simulate real grayscale data that aligns with our specific scenario. Then, we use this generated data to evaluate the generative capabilities of Deep Convolutional Generative Adverserial Networks (DCGANs), variational autoencoders (VAEs), and VAE-GAN models. The VAE-GAN model demonstrated the highest performance, achieving a Fréchet Inception Distance (FID) five times lower than the other architectures. To adapt this model to real-image scenarios, we perform transfer learning on the GAN, training it with the real dataset collected from AUBMC and generating additional samples. Finally, we train a CNN using augmented datasets that include both real and generated synthetic data and compare its performance to training on real data alone. We then evaluate the model exclusively on a test set composed of real images to assess the effect of the generated data on real-world performance. We find that training on augmented datasets significantly improves the classification accuracy on a test set composed of real images by 16 %, increasing it from 73 % to 89 %. This improvement demonstrates that the generated data is of high quality and enhances the model's ability to perform well against unseen, real data.</div></div>","PeriodicalId":10578,"journal":{"name":"Computers in biology and medicine","volume":"184 ","pages":"Article 109446"},"PeriodicalIF":7.0000,"publicationDate":"2024-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers in biology and medicine","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0010482524015312","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

In recent years, deep learning has become a popular tool to analyze and classify medical images. However, challenges such as limited data availability, high labeling costs, and privacy concerns remain significant obstacles. As such, generative models have been extensively explored as a solution to generate new images and overcome the stated challenges. In this paper, we augment a dataset of chest CT scans for Vertebral Compression Fractures (VCFs) collected from the American University of Beirut Medical Center (AUBMC), specifically targeting the detection of incidental fractures that are often overlooked in routine chest CTs, as these scans are not typically focused on spinal analysis. Our goal is to enhance AI systems to enable automated early detection of such incidental fractures, addressing a critical healthcare gap and leading to improved patient outcomes by catching fractures that might otherwise go undiagnosed. We first generate a synthetic dataset based on the segmented CTSpine1K dataset to simulate real grayscale data that aligns with our specific scenario. Then, we use this generated data to evaluate the generative capabilities of Deep Convolutional Generative Adverserial Networks (DCGANs), variational autoencoders (VAEs), and VAE-GAN models. The VAE-GAN model demonstrated the highest performance, achieving a Fréchet Inception Distance (FID) five times lower than the other architectures. To adapt this model to real-image scenarios, we perform transfer learning on the GAN, training it with the real dataset collected from AUBMC and generating additional samples. Finally, we train a CNN using augmented datasets that include both real and generated synthetic data and compare its performance to training on real data alone. We then evaluate the model exclusively on a test set composed of real images to assess the effect of the generated data on real-world performance. We find that training on augmented datasets significantly improves the classification accuracy on a test set composed of real images by 16 %, increasing it from 73 % to 89 %. This improvement demonstrates that the generated data is of high quality and enhances the model's ability to perform well against unseen, real data.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

利用 VAE、GAN 和迁移学习增强脊柱 CT 扫描数据集，以改进椎体压缩性骨折的检测。

近年来，深度学习已成为分析和分类医学图像的热门工具。然而，有限的数据可用性、高昂的标记成本和隐私问题等挑战仍然是重大障碍。因此，生成模型作为一种生成新图像并克服上述挑战的解决方案，受到了广泛的探讨。在本文中，我们增强了从贝鲁特美国大学医学中心（AUBMC）收集的胸部 CT 扫描数据集，以检测椎体压缩性骨折（VCF），特别是检测常规胸部 CT 扫描中经常被忽视的偶然骨折，因为这些扫描通常不侧重于脊柱分析。我们的目标是增强人工智能系统，以实现对此类偶然骨折的自动早期检测，从而解决关键的医疗缺口，并通过捕捉可能无法诊断的骨折来改善患者的预后。我们首先根据分割后的 CTSpine1K 数据集生成一个合成数据集，以模拟符合我们特定场景的真实灰度数据。然后，我们使用生成的数据来评估深度卷积生成逆向网络（DCGAN）、变异自动编码器（VAE）和 VAE-GAN 模型的生成能力。VAE-GAN 模型的性能最高，其弗雷谢特起始距离（FID）比其他架构低五倍。为使该模型适应真实图像场景，我们对 GAN 进行了迁移学习，使用从 AUBMC 收集的真实数据集对其进行训练，并生成额外样本。最后，我们使用包含真实数据和生成的合成数据的增强数据集来训练 CNN，并将其性能与仅在真实数据上进行的训练进行比较。然后，我们只在由真实图像组成的测试集上对模型进行评估，以评估生成数据对实际性能的影响。我们发现，在由真实图像组成的测试集上进行增强数据集训练后，分类准确率明显提高了 16%，从 73% 提高到 89%。这一提高表明生成的数据质量很高，并增强了模型在未见过的真实数据上的表现能力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Computers in biology and medicine 工程技术-工程：生物医学

CiteScore

11.70

自引率

10.40%

发文量

1086

审稿时长

74 days

期刊介绍： Computers in Biology and Medicine is an international forum for sharing groundbreaking advancements in the use of computers in bioscience and medicine. This journal serves as a medium for communicating essential research, instruction, ideas, and information regarding the rapidly evolving field of computer applications in these domains. By encouraging the exchange of knowledge, we aim to facilitate progress and innovation in the utilization of computers in biology and medicine.