{"title":"AugDiff: Diffusion-Based Feature Augmentation for Multiple Instance Learning in Whole Slide Image","authors":"Zhuchen Shao;Liuxi Dai;Yifeng Wang;Haoqian Wang;Yongbing Zhang","doi":"10.1109/TAI.2024.3454591","DOIUrl":null,"url":null,"abstract":"Multiple instance learning (MIL), a powerful strategy for weakly supervised learning, is able to perform various prediction tasks on gigapixel whole slide images (WSIs). However, the tens of thousands of patches in WSIs usually incur a vast computational burden for image augmentation, limiting the performance improvement in MIL. Currently, the feature augmentation-based MIL framework is a promising solution, while existing methods such as mixup often produce unrealistic features. To explore a more efficient and practical augmentation method, we introduce the diffusion model (DM) into MIL for the first time and propose a feature augmentation framework called AugDiff. The diverse generation capabilities of DM guarantee a various range of feature augmentations, while its iterative generation approach effectively preserves semantic integrity during these augmentations. We conduct extensive experiments over four distinct cancer datasets, two different feature extractors, and three prevalent MIL algorithms to evaluate the performance of AugDiff. Ablation study and visualization further verify the effectiveness. Moreover, we highlight AugDiff's higher quality augmented feature over image augmentation and its superiority over self-supervised learning. The generalization over external datasets indicates its broader applications. The code is open-sourced on \n<uri>https://github.com/szc19990412/AugDiff</uri>\n.","PeriodicalId":73305,"journal":{"name":"IEEE transactions on artificial intelligence","volume":"5 12","pages":"6617-6628"},"PeriodicalIF":0.0000,"publicationDate":"2024-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on artificial intelligence","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10666706/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Multiple instance learning (MIL), a powerful strategy for weakly supervised learning, is able to perform various prediction tasks on gigapixel whole slide images (WSIs). However, the tens of thousands of patches in WSIs usually incur a vast computational burden for image augmentation, limiting the performance improvement in MIL. Currently, the feature augmentation-based MIL framework is a promising solution, while existing methods such as mixup often produce unrealistic features. To explore a more efficient and practical augmentation method, we introduce the diffusion model (DM) into MIL for the first time and propose a feature augmentation framework called AugDiff. The diverse generation capabilities of DM guarantee a various range of feature augmentations, while its iterative generation approach effectively preserves semantic integrity during these augmentations. We conduct extensive experiments over four distinct cancer datasets, two different feature extractors, and three prevalent MIL algorithms to evaluate the performance of AugDiff. Ablation study and visualization further verify the effectiveness. Moreover, we highlight AugDiff's higher quality augmented feature over image augmentation and its superiority over self-supervised learning. The generalization over external datasets indicates its broader applications. The code is open-sourced on
https://github.com/szc19990412/AugDiff
.
多实例学习(MIL)是一种强大的弱监督学习策略,能够在千兆像素的整张幻灯片图像(WSI)上执行各种预测任务。然而,WSI 中数以万计的斑块通常会给图像增强带来巨大的计算负担,从而限制了 MIL 性能的提高。目前,基于特征增强的 MIL 框架是一种很有前景的解决方案,而现有的方法(如 mixup)往往会产生不切实际的特征。为了探索一种更高效、更实用的增强方法,我们首次在 MIL 中引入了扩散模型(DM),并提出了名为 AugDiff 的特征增强框架。DM 多样化的生成能力保证了各种特征增强,而其迭代生成方法在这些增强过程中有效地保持了语义的完整性。我们在四个不同的癌症数据集、两种不同的特征提取器和三种流行的 MIL 算法上进行了广泛的实验,以评估 AugDiff 的性能。消融研究和可视化进一步验证了其有效性。此外,我们还强调了 AugDiff 比图像增强具有更高质量的增强特征,而且比自我监督学习更具优势。在外部数据集上的泛化表明其应用范围更广。代码开源于 https://github.com/szc19990412/AugDiff。