{"title":"Adaptive dynamic inference for few-shot left atrium segmentation","authors":"","doi":"10.1016/j.media.2024.103321","DOIUrl":null,"url":null,"abstract":"<div><p>Accurate segmentation of the left atrium (LA) from late gadolinium-enhanced cardiac magnetic resonance (LGE CMR) images is crucial for aiding the treatment of patients with atrial fibrillation. Few-shot learning holds significant potential for achieving accurate LA segmentation with low demand on high-cost labeled LGE CMR data and fast generalization across different centers. However, accurate LA segmentation with few-shot learning is a challenging task due to the low-intensity contrast between the LA and other neighboring organs in LGE CMR images. To address this issue, we propose an Adaptive Dynamic Inference Network (ADINet) that explicitly models the differences between the foreground and background. Specifically, ADINet leverages dynamic collaborative inference (DCI) and dynamic reverse inference (DRI) to adaptively allocate semantic-aware and spatial-specific convolution weights and indication information. These allocations are conditioned on the support foreground and background knowledge, utilizing pixel-wise correlations, for different spatial positions of query images. The convolution weights adapt to different visual patterns based on spatial positions, enabling effective encoding of differences between foreground and background regions. Meanwhile, the indication information adapts to the background visual pattern to reversely decode foreground LA regions, leveraging their spatial complementarity. To promote the learning of ADINet, we propose hierarchical supervision, which enforces spatial consistency and differences between the background and foreground regions through pixel-wise semantic supervision and pixel-pixel correlation supervision. We demonstrated the performance of ADINet on three LGE CMR datasets from different centers. Compared to state-of-the-art methods with ten available samples, ADINet yielded better segmentation performance in terms of four metrics.</p></div>","PeriodicalId":18328,"journal":{"name":"Medical image analysis","volume":null,"pages":null},"PeriodicalIF":10.7000,"publicationDate":"2024-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Medical image analysis","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1361841524002469","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Citations: 0
Abstract
Accurate segmentation of the left atrium (LA) from late gadolinium-enhanced cardiac magnetic resonance (LGE CMR) images is crucial for aiding the treatment of patients with atrial fibrillation. Few-shot learning holds significant potential for achieving accurate LA segmentation with low demand on high-cost labeled LGE CMR data and fast generalization across different centers. However, accurate LA segmentation with few-shot learning is a challenging task due to the low-intensity contrast between the LA and other neighboring organs in LGE CMR images. To address this issue, we propose an Adaptive Dynamic Inference Network (ADINet) that explicitly models the differences between the foreground and background. Specifically, ADINet leverages dynamic collaborative inference (DCI) and dynamic reverse inference (DRI) to adaptively allocate semantic-aware and spatial-specific convolution weights and indication information. These allocations are conditioned on the support foreground and background knowledge, utilizing pixel-wise correlations, for different spatial positions of query images. The convolution weights adapt to different visual patterns based on spatial positions, enabling effective encoding of differences between foreground and background regions. Meanwhile, the indication information adapts to the background visual pattern to reversely decode foreground LA regions, leveraging their spatial complementarity. To promote the learning of ADINet, we propose hierarchical supervision, which enforces spatial consistency and differences between the background and foreground regions through pixel-wise semantic supervision and pixel-pixel correlation supervision. We demonstrated the performance of ADINet on three LGE CMR datasets from different centers. Compared to state-of-the-art methods with ten available samples, ADINet yielded better segmentation performance in terms of four metrics.
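The sketch below is a minimal, hypothetical illustration of the general flavor described in the abstract: support foreground/background prototypes are extracted by masked average pooling, pixel-wise correlations with query features are computed, and the background correlation is used as a "reverse" cue so that positions resembling the support background suppress themselves and foreground regions are highlighted. It is not the authors' DCI/DRI implementation; all function names, the gating formulation, and tensor shapes are assumptions made for illustration only.

```python
# Illustrative sketch only (assumed names and shapes), not the ADINet code.
import torch
import torch.nn.functional as F


def masked_average_pool(feat: torch.Tensor, mask: torch.Tensor) -> torch.Tensor:
    """Average C-dim features over a masked region. feat: (B,C,H,W), mask: (B,1,H,W) -> (B,C)."""
    mask = F.interpolate(mask, size=feat.shape[-2:], mode="nearest")
    return (feat * mask).sum(dim=(2, 3)) / (mask.sum(dim=(2, 3)) + 1e-6)


def pixelwise_correlation(query_feat: torch.Tensor, prototype: torch.Tensor) -> torch.Tensor:
    """Cosine similarity between every query position and a support prototype -> (B,1,H,W)."""
    q = F.normalize(query_feat, dim=1)
    p = F.normalize(prototype, dim=1)[..., None, None]  # (B,C,1,1)
    return (q * p).sum(dim=1, keepdim=True)


def support_conditioned_gating(query_feat, supp_feat, supp_mask):
    """Toy stand-in for support-conditioned dynamic inference: foreground and
    background correlations produce a spatially-specific gate on query features,
    with the background similarity acting as a reverse (suppressing) signal."""
    fg_proto = masked_average_pool(supp_feat, supp_mask)        # (B,C)
    bg_proto = masked_average_pool(supp_feat, 1.0 - supp_mask)  # (B,C)
    fg_corr = pixelwise_correlation(query_feat, fg_proto)       # (B,1,H,W)
    bg_corr = pixelwise_correlation(query_feat, bg_proto)       # (B,1,H,W)
    gate = torch.sigmoid(fg_corr - bg_corr)                     # position-wise modulation
    return query_feat * gate, fg_corr, bg_corr


if __name__ == "__main__":
    B, C, H, W = 1, 64, 32, 32
    query_feat = torch.randn(B, C, H, W)
    supp_feat = torch.randn(B, C, H, W)
    supp_mask = (torch.rand(B, 1, H, W) > 0.5).float()
    out, fg_corr, bg_corr = support_conditioned_gating(query_feat, supp_feat, supp_mask)
    print(out.shape, fg_corr.shape, bg_corr.shape)
```

In the paper itself, the pixel-wise correlations condition learned, spatially-specific convolution weights and indication information rather than a simple sigmoid gate; the sketch only shows how support foreground/background knowledge can modulate each query position.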
Journal Introduction:
Medical Image Analysis serves as a platform for sharing new research findings in the realm of medical and biological image analysis, with a focus on applications of computer vision, virtual reality, and robotics to biomedical imaging challenges. The journal prioritizes the publication of high-quality, original papers contributing to the fundamental science of processing, analyzing, and utilizing medical and biological images. It welcomes approaches utilizing biomedical image datasets across all spatial scales, from molecular/cellular imaging to tissue/organ imaging.