Denoising diffusion probabilistic models for addressing data limitations in chest X-ray classification

Evi M.C. Huijben, Josien P.W. Pluim, Maureen A.J.M. van Eijnatten
{"title":"Denoising diffusion probabilistic models for addressing data limitations in chest X-ray classification","authors":"Evi M.C. Huijben,&nbsp;Josien P.W. Pluim,&nbsp;Maureen A.J.M. van Eijnatten","doi":"10.1016/j.imu.2024.101575","DOIUrl":null,"url":null,"abstract":"<div><p>Deep learning plays a crucial role in medical imaging analysis, particularly in tasks such as image classification and segmentation. However, learning from medical imaging datasets presents challenges, including scarcity of labeled examples, class imbalances, and inadequate representation of diverse patient populations. To address these challenges, there has been a growing interest in the use of deep generative models to create synthetic training data, with denoising diffusion probabilistic models (DDPMs) recently gaining attention for their ability to produce realistic and high-quality images. This study explores the potential of a DDPM to generate synthetic chest X-rays for multi-label classifier training. The results indicate that the use of a conditional DDPM has the potential to produce a realistic training set of synthetic chest X-rays. In addition, the study analyzes the impact on classification performance of addressing class imbalance. Balancing the synthetic training set increased the overall classification sensitivity from 0.02 to 0.59, but decreased the overall specificity from 0.99 to 0.71. Furthermore, we investigated the potential of unconditional pre-training to learn general representations, followed by conditional fine-tuning of the DDPM. The results indicate that this approach allows the amount of labeled training data to be reduced to 25% of the original set. Finally, we demonstrate that fidelity and classification metrics do not consistently exhibit the same trends. Integrating a DDPM into the classification pipeline underscores the benefits of having optimal control over the data and efficient use of available unlabeled data. Our research provides insights for making informed decisions about integrating generative models into medical image analysis.</p></div>","PeriodicalId":13953,"journal":{"name":"Informatics in Medicine Unlocked","volume":"50 ","pages":"Article 101575"},"PeriodicalIF":0.0000,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S235291482400131X/pdfft?md5=629db3cc19c06c57d9e66726c73db9a2&pid=1-s2.0-S235291482400131X-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Informatics in Medicine Unlocked","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S235291482400131X","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 0

Abstract

Deep learning plays a crucial role in medical imaging analysis, particularly in tasks such as image classification and segmentation. However, learning from medical imaging datasets presents challenges, including scarcity of labeled examples, class imbalances, and inadequate representation of diverse patient populations. To address these challenges, there has been a growing interest in the use of deep generative models to create synthetic training data, with denoising diffusion probabilistic models (DDPMs) recently gaining attention for their ability to produce realistic and high-quality images. This study explores the potential of a DDPM to generate synthetic chest X-rays for multi-label classifier training. The results indicate that the use of a conditional DDPM has the potential to produce a realistic training set of synthetic chest X-rays. In addition, the study analyzes the impact on classification performance of addressing class imbalance. Balancing the synthetic training set increased the overall classification sensitivity from 0.02 to 0.59, but decreased the overall specificity from 0.99 to 0.71. Furthermore, we investigated the potential of unconditional pre-training to learn general representations, followed by conditional fine-tuning of the DDPM. The results indicate that this approach allows the amount of labeled training data to be reduced to 25% of the original set. Finally, we demonstrate that fidelity and classification metrics do not consistently exhibit the same trends. Integrating a DDPM into the classification pipeline underscores the benefits of having optimal control over the data and efficient use of available unlabeled data. Our research provides insights for making informed decisions about integrating generative models into medical image analysis.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
用于解决胸部 X 光片分类中数据限制的去噪扩散概率模型
深度学习在医学影像分析中发挥着至关重要的作用,尤其是在图像分类和分割等任务中。然而,从医学影像数据集进行学习面临着各种挑战,包括标记示例稀缺、类不平衡以及对不同患者群体的代表性不足。为了应对这些挑战,人们对使用深度生成模型创建合成训练数据越来越感兴趣,去噪扩散概率模型(DDPM)最近因其生成逼真和高质量图像的能力而备受关注。本研究探索了 DDPM 生成合成胸部 X 光片用于多标签分类器训练的潜力。结果表明,使用条件 DDPM 有可能生成逼真的合成胸部 X 光片训练集。此外,研究还分析了解决类不平衡问题对分类性能的影响。平衡合成训练集可将整体分类灵敏度从 0.02 提高到 0.59,但将整体特异性从 0.99 降低到 0.71。此外,我们还研究了无条件预训练学习一般表征,然后对 DDPM 进行有条件微调的潜力。结果表明,这种方法可以将标记训练数据量减少到原始数据集的 25%。最后,我们证明了保真度和分类指标并不总是表现出相同的趋势。将 DDPM 集成到分类流水线中凸显了优化数据控制和有效利用可用非标记数据的好处。我们的研究为将生成模型集成到医学图像分析中的明智决策提供了启示。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Informatics in Medicine Unlocked
Informatics in Medicine Unlocked Medicine-Health Informatics
CiteScore
9.50
自引率
0.00%
发文量
282
审稿时长
39 days
期刊介绍: Informatics in Medicine Unlocked (IMU) is an international gold open access journal covering a broad spectrum of topics within medical informatics, including (but not limited to) papers focusing on imaging, pathology, teledermatology, public health, ophthalmological, nursing and translational medicine informatics. The full papers that are published in the journal are accessible to all who visit the website.
期刊最新文献
Usability and accessibility in mHealth stroke apps: An empirical assessment Spatiotemporal chest wall movement analysis using depth sensor imaging for detecting respiratory asynchrony Regression and classification of Windkessel parameters from non-invasive cardiovascular quantities using a fully connected neural network Patient2Trial: From patient to participant in clinical trials using large language models Structural modification of Naproxen; physicochemical, spectral, medicinal, and pharmacological evaluation
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1