用于医学图像分类的提示驱动潜域泛化。

IEEE transactions on medical imaging Pub Date : 2024-08-13 DOI:10.1109/TMI.2024.3443119

Siyuan Yan;Zhen Yu;Chi Liu;Lie Ju;Dwarikanath Mahapatra;Brigid Betz-Stablein;Victoria Mar;Monika Janda;Peter Soyer;Zongyuan Ge

{"title":"用于医学图像分类的提示驱动潜域泛化。","authors":"Siyuan Yan;Zhen Yu;Chi Liu;Lie Ju;Dwarikanath Mahapatra;Brigid Betz-Stablein;Victoria Mar;Monika Janda;Peter Soyer;Zongyuan Ge","doi":"10.1109/TMI.2024.3443119","DOIUrl":null,"url":null,"abstract":"Deep learning models for medical image analysis easily suffer from distribution shifts caused by dataset artifact bias, camera variations, differences in the imaging station, etc., leading to unreliable diagnoses in real-world clinical settings. Domain generalization (DG) methods, which aim to train models on multiple domains to perform well on unseen domains, offer a promising direction to solve the problem. However, existing DG methods assume domain labels of each image are available and accurate, which is typically feasible for only a limited number of medical datasets. To address these challenges, we propose a unified DG framework for medical image classification without relying on domain labels, called Prompt-driven Latent Domain Generalization (PLDG). PLDG consists of unsupervised domain discovery and prompt learning. This framework first discovers pseudo domain labels by clustering the bias-associated style features, then leverages collaborative domain prompts to guide a Vision Transformer to learn knowledge from discovered diverse domains. To facilitate cross-domain knowledge learning between different prompts, we introduce a domain prompt generator that enables knowledge sharing between domain prompts and a shared prompt. A domain mixup strategy is additionally employed for more flexible decision margins and mitigates the risk of incorrect domain assignments. Extensive experiments on three medical image classification tasks and one debiasing task demonstrate that our method can achieve comparable or even superior performance than conventional DG algorithms without relying on domain labels. Our code is publicly available at \n<uri>https://github.com/SiyuanYan1/PLDG/tree/main</uri>\n.","PeriodicalId":94033,"journal":{"name":"IEEE transactions on medical imaging","volume":"44 1","pages":"348-360"},"PeriodicalIF":0.0000,"publicationDate":"2024-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Prompt-Driven Latent Domain Generalization for Medical Image Classification\",\"authors\":\"Siyuan Yan;Zhen Yu;Chi Liu;Lie Ju;Dwarikanath Mahapatra;Brigid Betz-Stablein;Victoria Mar;Monika Janda;Peter Soyer;Zongyuan Ge\",\"doi\":\"10.1109/TMI.2024.3443119\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Deep learning models for medical image analysis easily suffer from distribution shifts caused by dataset artifact bias, camera variations, differences in the imaging station, etc., leading to unreliable diagnoses in real-world clinical settings. Domain generalization (DG) methods, which aim to train models on multiple domains to perform well on unseen domains, offer a promising direction to solve the problem. However, existing DG methods assume domain labels of each image are available and accurate, which is typically feasible for only a limited number of medical datasets. To address these challenges, we propose a unified DG framework for medical image classification without relying on domain labels, called Prompt-driven Latent Domain Generalization (PLDG). PLDG consists of unsupervised domain discovery and prompt learning. This framework first discovers pseudo domain labels by clustering the bias-associated style features, then leverages collaborative domain prompts to guide a Vision Transformer to learn knowledge from discovered diverse domains. To facilitate cross-domain knowledge learning between different prompts, we introduce a domain prompt generator that enables knowledge sharing between domain prompts and a shared prompt. A domain mixup strategy is additionally employed for more flexible decision margins and mitigates the risk of incorrect domain assignments. Extensive experiments on three medical image classification tasks and one debiasing task demonstrate that our method can achieve comparable or even superior performance than conventional DG algorithms without relying on domain labels. Our code is publicly available at \\n<uri>https://github.com/SiyuanYan1/PLDG/tree/main</uri>\\n.\",\"PeriodicalId\":94033,\"journal\":{\"name\":\"IEEE transactions on medical imaging\",\"volume\":\"44 1\",\"pages\":\"348-360\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-08-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE transactions on medical imaging\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10634512/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on medical imaging","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10634512/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

用于医学图像分析的深度学习模型很容易受到数据集伪装偏差、相机变化、成像站差异等因素造成的分布偏移的影响，从而导致实际临床环境中诊断结果不可靠。领域泛化（DG）方法旨在训练多个领域的模型，使其在未见领域中表现良好，为解决这一问题提供了一个很有前景的方向。然而，现有的领域泛化方法假定每张图像的领域标签都是可用且准确的，而这通常只对有限的医疗数据集可行。为了应对这些挑战，我们提出了一种不依赖域标签的统一医学图像分类 DG 框架，称为提示驱动潜域泛化（Prompt-driven Latent Domain Generalization，PLDG）。PLDG 包括无监督领域发现和提示学习。该框架首先通过聚类与偏差相关的风格特征来发现伪领域标签，然后利用协作领域提示来引导视觉转换器从发现的不同领域中学习知识。为了促进不同提示之间的跨领域知识学习，我们引入了一个领域提示生成器，使领域提示和共享提示之间能够共享知识。此外，我们还采用了领域混合策略，以获得更灵活的决策空间，并降低错误领域分配的风险。在三个医学图像分类任务和一个除杂任务上的广泛实验表明，我们的方法无需依赖领域标签，就能获得与传统 DG 算法相当甚至更优的性能。我们的代码可通过 https://github.com/SiyuanYan1/PLDG/tree/main 公开获取。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Prompt-Driven Latent Domain Generalization for Medical Image Classification

Deep learning models for medical image analysis easily suffer from distribution shifts caused by dataset artifact bias, camera variations, differences in the imaging station, etc., leading to unreliable diagnoses in real-world clinical settings. Domain generalization (DG) methods, which aim to train models on multiple domains to perform well on unseen domains, offer a promising direction to solve the problem. However, existing DG methods assume domain labels of each image are available and accurate, which is typically feasible for only a limited number of medical datasets. To address these challenges, we propose a unified DG framework for medical image classification without relying on domain labels, called Prompt-driven Latent Domain Generalization (PLDG). PLDG consists of unsupervised domain discovery and prompt learning. This framework first discovers pseudo domain labels by clustering the bias-associated style features, then leverages collaborative domain prompts to guide a Vision Transformer to learn knowledge from discovered diverse domains. To facilitate cross-domain knowledge learning between different prompts, we introduce a domain prompt generator that enables knowledge sharing between domain prompts and a shared prompt. A domain mixup strategy is additionally employed for more flexible decision margins and mitigates the risk of incorrect domain assignments. Extensive experiments on three medical image classification tasks and one debiasing task demonstrate that our method can achieve comparable or even superior performance than conventional DG algorithms without relying on domain labels. Our code is publicly available at https://github.com/SiyuanYan1/PLDG/tree/main .

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IEEE transactions on medical imaging

自引率

0.00%

发文量