Enhancing global sensitivity and uncertainty quantification in medical image reconstruction with Monte Carlo arbitrary-masked mamba

IF 10.7 1区医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Medical image analysis Pub Date : 2024-09-03 DOI:10.1016/j.media.2024.103334

{"title":"Enhancing global sensitivity and uncertainty quantification in medical image reconstruction with Monte Carlo arbitrary-masked mamba","authors":"","doi":"10.1016/j.media.2024.103334","DOIUrl":null,"url":null,"abstract":"<div><p>Deep learning has been extensively applied in medical image reconstruction, where Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs) represent the predominant paradigms, each possessing distinct advantages and inherent limitations: CNNs exhibit linear complexity with local sensitivity, whereas ViTs demonstrate quadratic complexity with global sensitivity. The emerging Mamba has shown superiority in learning visual representation, which combines the advantages of linear scalability and global sensitivity. In this study, we introduce MambaMIR, an Arbitrary-Masked Mamba-based model with wavelet decomposition for joint medical image reconstruction and uncertainty estimation. A novel Arbitrary Scan Masking (ASM) mechanism “masks out” redundant information to introduce randomness for further uncertainty estimation. Compared to the commonly used Monte Carlo (MC) dropout, our proposed MC-ASM provides an uncertainty map without the need for hyperparameter tuning and mitigates the performance drop typically observed when applying dropout to low-level tasks. For further texture preservation and better perceptual quality, we employ the wavelet transformation into MambaMIR and explore its variant based on the Generative Adversarial Network, namely MambaMIR-GAN. Comprehensive experiments have been conducted for multiple representative medical image reconstruction tasks, demonstrating that the proposed MambaMIR and MambaMIR-GAN outperform other baseline and state-of-the-art methods in different reconstruction tasks, where MambaMIR achieves the best reconstruction fidelity and MambaMIR-GAN has the best perceptual quality. In addition, our MC-ASM provides uncertainty maps as an additional tool for clinicians, while mitigating the typical performance drop caused by the commonly used dropout.</p></div>","PeriodicalId":18328,"journal":{"name":"Medical image analysis","volume":null,"pages":null},"PeriodicalIF":10.7000,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1361841524002597/pdfft?md5=a57fc8f3e3e9d07f16079fe5caf02411&pid=1-s2.0-S1361841524002597-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Medical image analysis","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1361841524002597","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Deep learning has been extensively applied in medical image reconstruction, where Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs) represent the predominant paradigms, each possessing distinct advantages and inherent limitations: CNNs exhibit linear complexity with local sensitivity, whereas ViTs demonstrate quadratic complexity with global sensitivity. The emerging Mamba has shown superiority in learning visual representation, which combines the advantages of linear scalability and global sensitivity. In this study, we introduce MambaMIR, an Arbitrary-Masked Mamba-based model with wavelet decomposition for joint medical image reconstruction and uncertainty estimation. A novel Arbitrary Scan Masking (ASM) mechanism “masks out” redundant information to introduce randomness for further uncertainty estimation. Compared to the commonly used Monte Carlo (MC) dropout, our proposed MC-ASM provides an uncertainty map without the need for hyperparameter tuning and mitigates the performance drop typically observed when applying dropout to low-level tasks. For further texture preservation and better perceptual quality, we employ the wavelet transformation into MambaMIR and explore its variant based on the Generative Adversarial Network, namely MambaMIR-GAN. Comprehensive experiments have been conducted for multiple representative medical image reconstruction tasks, demonstrating that the proposed MambaMIR and MambaMIR-GAN outperform other baseline and state-of-the-art methods in different reconstruction tasks, where MambaMIR achieves the best reconstruction fidelity and MambaMIR-GAN has the best perceptual quality. In addition, our MC-ASM provides uncertainty maps as an additional tool for clinicians, while mitigating the typical performance drop caused by the commonly used dropout.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

用蒙特卡罗任意掩码曼巴增强医学图像重建中的全局灵敏度和不确定性量化

深度学习已被广泛应用于医学影像重建，其中卷积神经网络（CNN）和视觉变换器（ViT）是最主要的范例，各自具有独特的优势和固有的局限性：卷积神经网络具有线性复杂性和局部敏感性，而视觉变换器则具有二次复杂性和全局敏感性。新兴的 Mamba 结合了线性可扩展性和全局敏感性的优势，在学习视觉表征方面表现出了优越性。在本研究中，我们介绍了 MambaMIR，这是一种基于任意扫描屏蔽的 Mamba 模型，采用小波分解技术，用于联合医学图像重建和不确定性估计。新颖的任意扫描屏蔽（ASM）机制 "屏蔽 "了冗余信息，为进一步的不确定性估计引入了随机性。与常用的蒙特卡洛（Monte Carlo，MC）滤波相比，我们提出的 MC-ASM 无需调整超参数就能提供不确定性图，并缓解了在低级任务中应用滤波时通常会出现的性能下降问题。为了进一步保持纹理和更好的感知质量，我们在 MambaMIR 中采用了小波变换，并探索了其基于生成对抗网络的变体，即 MambaMIR-GAN。我们针对多个具有代表性的医学图像重建任务进行了综合实验，结果表明，在不同的重建任务中，所提出的 MambaMIR 和 MambaMIR-GAN 优于其他基线方法和最先进的方法，其中 MambaMIR 实现了最佳的重建保真度，MambaMIR-GAN 具有最佳的感知质量。此外，我们的 MC-ASM 还为临床医生提供了不确定性图作为额外的工具，同时减轻了因常用的 dropout 而导致的典型性能下降。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Medical image analysis 工程技术-工程：生物医学

CiteScore

22.10

自引率

6.40%

发文量

309

审稿时长

6.6 months

期刊介绍： Medical Image Analysis serves as a platform for sharing new research findings in the realm of medical and biological image analysis, with a focus on applications of computer vision, virtual reality, and robotics to biomedical imaging challenges. The journal prioritizes the publication of high-quality, original papers contributing to the fundamental science of processing, analyzing, and utilizing medical and biological images. It welcomes approaches utilizing biomedical image datasets across all spatial scales, from molecular/cellular imaging to tissue/organ imaging.

期刊最新文献

A robust image segmentation and synthesis pipeline for histopathology Editorial Board Low-dose computed tomography perceptual image quality assessment Labeled-to-unlabeled distribution alignment for partially-supervised multi-organ medical image segmentation ATEC23 Challenge: Automated prediction of treatment effectiveness in ovarian cancer using histopathological images