Complementary information mutual learning for multimodality medical image segmentation

IF 6.3 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Neural Networks Pub Date : 2024-09-06 DOI:10.1016/j.neunet.2024.106670

Chuyun Shen , Wenhao Li , Haoqing Chen , Xiaoling Wang , Fengping Zhu , Yuxin Li , Xiangfeng Wang , Bo Jin

{"title":"Complementary information mutual learning for multimodality medical image segmentation","authors":"Chuyun Shen , Wenhao Li , Haoqing Chen , Xiaoling Wang , Fengping Zhu , Yuxin Li , Xiangfeng Wang , Bo Jin","doi":"10.1016/j.neunet.2024.106670","DOIUrl":null,"url":null,"abstract":"<div><p>Radiologists must utilize medical images of multiple modalities for tumor segmentation and diagnosis due to the limitations of medical imaging technology and the diversity of tumor signals. This has led to the development of multimodal learning in medical image segmentation. However, the redundancy among modalities creates challenges for existing <em>subtraction</em>-based joint learning methods, such as misjudging the importance of modalities, ignoring specific modal information, and increasing cognitive load. These thorny issues ultimately decrease segmentation accuracy and increase the risk of overfitting. This paper presents the <strong>complementary information mutual learning (CIML)</strong> framework, which can mathematically model and address the negative impact of inter-modal redundant information. CIML adopts the idea of <em>addition</em> and removes inter-modal redundant information through inductive bias-driven task decomposition and message passing-based redundancy filtering. CIML first decomposes the multimodal segmentation task into multiple subtasks based on expert prior knowledge, minimizing the information dependence between modalities. Furthermore, CIML introduces a scheme in which each modality can extract information from other modalities additively through message passing. To achieve non-redundancy of extracted information, the redundant filtering is transformed into complementary information learning inspired by the variational information bottleneck. The complementary information learning procedure can be efficiently solved by variational inference and cross-modal spatial attention. Numerical results from the verification task and standard benchmarks indicate that CIML efficiently removes redundant information between modalities, outperforming SOTA methods regarding validation accuracy and segmentation effect. To emphasize, message-passing-based redundancy filtering allows neural network visualization techniques to visualize the knowledge relationship among different modalities, which reflects interpretability.</p></div>","PeriodicalId":49763,"journal":{"name":"Neural Networks","volume":"180 ","pages":"Article 106670"},"PeriodicalIF":6.3000,"publicationDate":"2024-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neural Networks","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S089360802400594X","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Radiologists must utilize medical images of multiple modalities for tumor segmentation and diagnosis due to the limitations of medical imaging technology and the diversity of tumor signals. This has led to the development of multimodal learning in medical image segmentation. However, the redundancy among modalities creates challenges for existing subtraction-based joint learning methods, such as misjudging the importance of modalities, ignoring specific modal information, and increasing cognitive load. These thorny issues ultimately decrease segmentation accuracy and increase the risk of overfitting. This paper presents the complementary information mutual learning (CIML) framework, which can mathematically model and address the negative impact of inter-modal redundant information. CIML adopts the idea of addition and removes inter-modal redundant information through inductive bias-driven task decomposition and message passing-based redundancy filtering. CIML first decomposes the multimodal segmentation task into multiple subtasks based on expert prior knowledge, minimizing the information dependence between modalities. Furthermore, CIML introduces a scheme in which each modality can extract information from other modalities additively through message passing. To achieve non-redundancy of extracted information, the redundant filtering is transformed into complementary information learning inspired by the variational information bottleneck. The complementary information learning procedure can be efficiently solved by variational inference and cross-modal spatial attention. Numerical results from the verification task and standard benchmarks indicate that CIML efficiently removes redundant information between modalities, outperforming SOTA methods regarding validation accuracy and segmentation effect. To emphasize, message-passing-based redundancy filtering allows neural network visualization techniques to visualize the knowledge relationship among different modalities, which reflects interpretability.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

用于多模态医学图像分割的互补信息相互学习

由于医学成像技术的局限性和肿瘤信号的多样性，放射科医生必须利用多种模式的医学影像进行肿瘤分割和诊断。因此，医学图像分割中的多模态学习应运而生。然而，模态间的冗余给现有的基于减法的联合学习方法带来了挑战，如错误判断模态的重要性、忽略特定模态信息、增加认知负荷等。这些棘手的问题最终会降低分割精度，增加过拟合的风险。本文提出了互补信息相互学习（CIML）框架，该框架可以对模态间冗余信息的负面影响进行数学建模并加以解决。CIML 采用加法的思想，通过归纳偏差驱动的任务分解和基于消息传递的冗余过滤来消除模态间的冗余信息。CIML 首先根据专家的先验知识将多模态分割任务分解为多个子任务，从而将模态间的信息依赖性降至最低。此外，CIML 还引入了一种方案，即每种模态都可以通过信息传递从其他模态中以加法方式提取信息。为了实现提取信息的非冗余性，受变异信息瓶颈的启发，冗余过滤被转化为互补信息学习。互补信息学习过程可以通过变异推理和跨模态空间注意力有效解决。来自验证任务和标准基准的数值结果表明，CIML 能有效去除模态间的冗余信息，在验证精度和分割效果方面优于 SOTA 方法。值得强调的是，基于消息传递的冗余过滤允许神经网络可视化技术将不同模态之间的知识关系可视化，这反映了可解释性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Neural Networks 工程技术-计算机：人工智能

CiteScore

13.90

自引率

7.70%

发文量

425

审稿时长

67 days

期刊介绍： Neural Networks is a platform that aims to foster an international community of scholars and practitioners interested in neural networks, deep learning, and other approaches to artificial intelligence and machine learning. Our journal invites submissions covering various aspects of neural networks research, from computational neuroscience and cognitive modeling to mathematical analyses and engineering applications. By providing a forum for interdisciplinary discussions between biology and technology, we aim to encourage the development of biologically-inspired artificial intelligence.