Evi M.C. Huijben, Josien P.W. Pluim, Maureen A.J.M. van Eijnatten
{"title":"用于解决胸部 X 光片分类中数据限制的去噪扩散概率模型","authors":"Evi M.C. Huijben, Josien P.W. Pluim, Maureen A.J.M. van Eijnatten","doi":"10.1016/j.imu.2024.101575","DOIUrl":null,"url":null,"abstract":"<div><p>Deep learning plays a crucial role in medical imaging analysis, particularly in tasks such as image classification and segmentation. However, learning from medical imaging datasets presents challenges, including scarcity of labeled examples, class imbalances, and inadequate representation of diverse patient populations. To address these challenges, there has been a growing interest in the use of deep generative models to create synthetic training data, with denoising diffusion probabilistic models (DDPMs) recently gaining attention for their ability to produce realistic and high-quality images. This study explores the potential of a DDPM to generate synthetic chest X-rays for multi-label classifier training. The results indicate that the use of a conditional DDPM has the potential to produce a realistic training set of synthetic chest X-rays. In addition, the study analyzes the impact on classification performance of addressing class imbalance. Balancing the synthetic training set increased the overall classification sensitivity from 0.02 to 0.59, but decreased the overall specificity from 0.99 to 0.71. Furthermore, we investigated the potential of unconditional pre-training to learn general representations, followed by conditional fine-tuning of the DDPM. The results indicate that this approach allows the amount of labeled training data to be reduced to 25% of the original set. Finally, we demonstrate that fidelity and classification metrics do not consistently exhibit the same trends. Integrating a DDPM into the classification pipeline underscores the benefits of having optimal control over the data and efficient use of available unlabeled data. Our research provides insights for making informed decisions about integrating generative models into medical image analysis.</p></div>","PeriodicalId":13953,"journal":{"name":"Informatics in Medicine Unlocked","volume":"50 ","pages":"Article 101575"},"PeriodicalIF":0.0000,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S235291482400131X/pdfft?md5=629db3cc19c06c57d9e66726c73db9a2&pid=1-s2.0-S235291482400131X-main.pdf","citationCount":"0","resultStr":"{\"title\":\"Denoising diffusion probabilistic models for addressing data limitations in chest X-ray classification\",\"authors\":\"Evi M.C. Huijben, Josien P.W. Pluim, Maureen A.J.M. van Eijnatten\",\"doi\":\"10.1016/j.imu.2024.101575\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Deep learning plays a crucial role in medical imaging analysis, particularly in tasks such as image classification and segmentation. However, learning from medical imaging datasets presents challenges, including scarcity of labeled examples, class imbalances, and inadequate representation of diverse patient populations. To address these challenges, there has been a growing interest in the use of deep generative models to create synthetic training data, with denoising diffusion probabilistic models (DDPMs) recently gaining attention for their ability to produce realistic and high-quality images. This study explores the potential of a DDPM to generate synthetic chest X-rays for multi-label classifier training. The results indicate that the use of a conditional DDPM has the potential to produce a realistic training set of synthetic chest X-rays. In addition, the study analyzes the impact on classification performance of addressing class imbalance. Balancing the synthetic training set increased the overall classification sensitivity from 0.02 to 0.59, but decreased the overall specificity from 0.99 to 0.71. Furthermore, we investigated the potential of unconditional pre-training to learn general representations, followed by conditional fine-tuning of the DDPM. The results indicate that this approach allows the amount of labeled training data to be reduced to 25% of the original set. Finally, we demonstrate that fidelity and classification metrics do not consistently exhibit the same trends. Integrating a DDPM into the classification pipeline underscores the benefits of having optimal control over the data and efficient use of available unlabeled data. Our research provides insights for making informed decisions about integrating generative models into medical image analysis.</p></div>\",\"PeriodicalId\":13953,\"journal\":{\"name\":\"Informatics in Medicine Unlocked\",\"volume\":\"50 \",\"pages\":\"Article 101575\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S235291482400131X/pdfft?md5=629db3cc19c06c57d9e66726c73db9a2&pid=1-s2.0-S235291482400131X-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Informatics in Medicine Unlocked\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S235291482400131X\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"Medicine\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Informatics in Medicine Unlocked","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S235291482400131X","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 0
摘要
深度学习在医学影像分析中发挥着至关重要的作用,尤其是在图像分类和分割等任务中。然而,从医学影像数据集进行学习面临着各种挑战,包括标记示例稀缺、类不平衡以及对不同患者群体的代表性不足。为了应对这些挑战,人们对使用深度生成模型创建合成训练数据越来越感兴趣,去噪扩散概率模型(DDPM)最近因其生成逼真和高质量图像的能力而备受关注。本研究探索了 DDPM 生成合成胸部 X 光片用于多标签分类器训练的潜力。结果表明,使用条件 DDPM 有可能生成逼真的合成胸部 X 光片训练集。此外,研究还分析了解决类不平衡问题对分类性能的影响。平衡合成训练集可将整体分类灵敏度从 0.02 提高到 0.59,但将整体特异性从 0.99 降低到 0.71。此外,我们还研究了无条件预训练学习一般表征,然后对 DDPM 进行有条件微调的潜力。结果表明,这种方法可以将标记训练数据量减少到原始数据集的 25%。最后,我们证明了保真度和分类指标并不总是表现出相同的趋势。将 DDPM 集成到分类流水线中凸显了优化数据控制和有效利用可用非标记数据的好处。我们的研究为将生成模型集成到医学图像分析中的明智决策提供了启示。
Denoising diffusion probabilistic models for addressing data limitations in chest X-ray classification
Deep learning plays a crucial role in medical imaging analysis, particularly in tasks such as image classification and segmentation. However, learning from medical imaging datasets presents challenges, including scarcity of labeled examples, class imbalances, and inadequate representation of diverse patient populations. To address these challenges, there has been a growing interest in the use of deep generative models to create synthetic training data, with denoising diffusion probabilistic models (DDPMs) recently gaining attention for their ability to produce realistic and high-quality images. This study explores the potential of a DDPM to generate synthetic chest X-rays for multi-label classifier training. The results indicate that the use of a conditional DDPM has the potential to produce a realistic training set of synthetic chest X-rays. In addition, the study analyzes the impact on classification performance of addressing class imbalance. Balancing the synthetic training set increased the overall classification sensitivity from 0.02 to 0.59, but decreased the overall specificity from 0.99 to 0.71. Furthermore, we investigated the potential of unconditional pre-training to learn general representations, followed by conditional fine-tuning of the DDPM. The results indicate that this approach allows the amount of labeled training data to be reduced to 25% of the original set. Finally, we demonstrate that fidelity and classification metrics do not consistently exhibit the same trends. Integrating a DDPM into the classification pipeline underscores the benefits of having optimal control over the data and efficient use of available unlabeled data. Our research provides insights for making informed decisions about integrating generative models into medical image analysis.
期刊介绍:
Informatics in Medicine Unlocked (IMU) is an international gold open access journal covering a broad spectrum of topics within medical informatics, including (but not limited to) papers focusing on imaging, pathology, teledermatology, public health, ophthalmological, nursing and translational medicine informatics. The full papers that are published in the journal are accessible to all who visit the website.