首页 > 最新文献

IEEE Transactions on Medical Imaging最新文献

英文 中文
B2Q-Net: Bidirectional Branch Query Network for Surgical Phase Recognition. B2Q-Net:面向手术相位识别的双向分支查询网络。
IF 10.6 1区 医学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-01-16 DOI: 10.1109/tmi.2026.3654795
Wenjie Zhang,Zhiheng Li,Yue Bi,Xiao Jia,Ran Song,Yipeng Zhang,Wei Zhang
Surgical phase recognition (SPR) is essential for surgical workflow analysis and provides immediate guidance during procedures. Existing methods aggregate frame-level information into a global representation and treat the task as frame-wise classification. However, this pipeline lacks a feedback mechanism for integrating historical information into local temporal modeling. To address this limitation, we propose the Bidirectional Branch Query Network (B2Q-Net), which reformulates the SPR task as the bidirectional query between phase-level features and frame-level features. B2Q-Net incorporates historical information during the initialization of phase queries. This enables bidirectional information flow during iterative refinement of two-level feature maps between phases and frames. Furthermore, we introduce a dual-scale selector (DSS) to generate high-quality phase queries for the current video clip. These phase queries retrieve historical information from the proposed state space query (SSQ) module, which uses learnable tokens as the historical state space to preserve historical information. Extensive evaluations on three datasets demonstrate that B2Q-Net consistently outperforms state-of-the-art methods in recognition accuracy while achieving an inference speed of 106 fps. The B2Q-Net code is available at https://github.com/vsislab/B2Q-Net.
手术阶段识别(SPR)对于手术流程分析至关重要,并在手术过程中提供即时指导。现有方法将帧级信息聚合到全局表示中,并将任务视为逐帧分类。然而,该管道缺乏将历史信息集成到局部时间建模中的反馈机制。为了解决这一限制,我们提出了双向分支查询网络(B2Q-Net),它将SPR任务重新表述为相位级特征和帧级特征之间的双向查询。B2Q-Net在初始化阶段查询期间合并了历史信息。这使得在迭代细化阶段和框架之间的两级特征映射期间双向信息流成为可能。此外,我们引入了双尺度选择器(DSS)来为当前视频片段生成高质量的相位查询。这些阶段查询从建议的状态空间查询(SSQ)模块检索历史信息,该模块使用可学习的令牌作为历史状态空间来保存历史信息。对三个数据集的广泛评估表明,B2Q-Net在识别精度方面始终优于最先进的方法,同时实现106 fps的推理速度。B2Q-Net的代码可在https://github.com/vsislab/B2Q-Net上获得。
{"title":"B2Q-Net: Bidirectional Branch Query Network for Surgical Phase Recognition.","authors":"Wenjie Zhang,Zhiheng Li,Yue Bi,Xiao Jia,Ran Song,Yipeng Zhang,Wei Zhang","doi":"10.1109/tmi.2026.3654795","DOIUrl":"https://doi.org/10.1109/tmi.2026.3654795","url":null,"abstract":"Surgical phase recognition (SPR) is essential for surgical workflow analysis and provides immediate guidance during procedures. Existing methods aggregate frame-level information into a global representation and treat the task as frame-wise classification. However, this pipeline lacks a feedback mechanism for integrating historical information into local temporal modeling. To address this limitation, we propose the Bidirectional Branch Query Network (B2Q-Net), which reformulates the SPR task as the bidirectional query between phase-level features and frame-level features. B2Q-Net incorporates historical information during the initialization of phase queries. This enables bidirectional information flow during iterative refinement of two-level feature maps between phases and frames. Furthermore, we introduce a dual-scale selector (DSS) to generate high-quality phase queries for the current video clip. These phase queries retrieve historical information from the proposed state space query (SSQ) module, which uses learnable tokens as the historical state space to preserve historical information. Extensive evaluations on three datasets demonstrate that B2Q-Net consistently outperforms state-of-the-art methods in recognition accuracy while achieving an inference speed of 106 fps. The B2Q-Net code is available at https://github.com/vsislab/B2Q-Net.","PeriodicalId":13418,"journal":{"name":"IEEE Transactions on Medical Imaging","volume":"37 1","pages":""},"PeriodicalIF":10.6,"publicationDate":"2026-01-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145986341","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Learning Modality-Aware Representations: Adaptive Group-wise Interaction Network for Multimodal MRI Synthesis. 学习模态感知表征:多模态MRI合成的自适应群智能交互网络。
IF 10.6 1区 医学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-01-15 DOI: 10.1109/tmi.2026.3654249
Tao Song,Yicheng Wu,Minhao Hu,Xiangde Luo,Linda Wei,Guotai Wang,Yi Guo,Feng Xu,Shaoting Zhang
Multimodal MR image synthesis aims to generate missing modality images by effectively fusing and mapping from a subset of available MRI modalities. Most existing methods adopt an image-to-image translation paradigm, treating multiple modalities as input channels. However, these approaches often yield sub-optimal results due to the inherent difficulty in achieving precise feature-or semantic-level alignment across modalities. To address these challenges, we propose an Adaptive Group-wise Interaction Network (AGI-Net) that explicitly models both inter-modality and intra-modality relationships for multimodal MR image synthesis. Specifically, feature channels are first partitioned into predefined groups, after which an adaptive rolling mechanism is applied to conventional convolutional kernels to better capture feature and semantic correspondences between different modalities. In parallel, a cross-group attention module is introduced to enable effective feature fusion across groups, thereby enhancing the network's representational capacity. We validate the proposed AGI-Net on the publicly available IXI and BraTS2023 datasets. Experimental results demonstrate that AGI-Net achieves state-of-the-art performance in multimodal MR image synthesis tasks, confirming the effectiveness of its modality-aware interaction design. We release the relevant code at: https://github.com/zunzhumu/Adaptive-Group-wise-Interaction-Network-for-Multimodal-MRI-Synthesis.git.
多模态磁共振图像合成旨在通过有效地融合和映射可用的MRI模态子集来生成缺失的模态图像。大多数现有方法采用图像到图像的翻译范式,将多种模态作为输入通道。然而,这些方法往往产生次优结果,因为在实现跨模式的精确特征或语义级对齐方面存在固有的困难。为了应对这些挑战,我们提出了一种自适应群体智能交互网络(AGI-Net),该网络明确地模拟了多模态MR图像合成的模态间和模态内关系。具体而言,该方法首先将特征通道划分为预定义的组,然后将自适应滚动机制应用于传统卷积核,以更好地捕获不同模态之间的特征和语义对应关系。同时,引入了跨群体关注模块,实现了有效的跨群体特征融合,从而增强了网络的表征能力。我们在公开可用的IXI和BraTS2023数据集上验证了提议的AGI-Net。实验结果表明,AGI-Net在多模态磁共振图像合成任务中达到了最先进的性能,证实了其模态感知交互设计的有效性。我们在https://github.com/zunzhumu/Adaptive-Group-wise-Interaction-Network-for-Multimodal-MRI-Synthesis.git上发布了相关代码。
{"title":"Learning Modality-Aware Representations: Adaptive Group-wise Interaction Network for Multimodal MRI Synthesis.","authors":"Tao Song,Yicheng Wu,Minhao Hu,Xiangde Luo,Linda Wei,Guotai Wang,Yi Guo,Feng Xu,Shaoting Zhang","doi":"10.1109/tmi.2026.3654249","DOIUrl":"https://doi.org/10.1109/tmi.2026.3654249","url":null,"abstract":"Multimodal MR image synthesis aims to generate missing modality images by effectively fusing and mapping from a subset of available MRI modalities. Most existing methods adopt an image-to-image translation paradigm, treating multiple modalities as input channels. However, these approaches often yield sub-optimal results due to the inherent difficulty in achieving precise feature-or semantic-level alignment across modalities. To address these challenges, we propose an Adaptive Group-wise Interaction Network (AGI-Net) that explicitly models both inter-modality and intra-modality relationships for multimodal MR image synthesis. Specifically, feature channels are first partitioned into predefined groups, after which an adaptive rolling mechanism is applied to conventional convolutional kernels to better capture feature and semantic correspondences between different modalities. In parallel, a cross-group attention module is introduced to enable effective feature fusion across groups, thereby enhancing the network's representational capacity. We validate the proposed AGI-Net on the publicly available IXI and BraTS2023 datasets. Experimental results demonstrate that AGI-Net achieves state-of-the-art performance in multimodal MR image synthesis tasks, confirming the effectiveness of its modality-aware interaction design. We release the relevant code at: https://github.com/zunzhumu/Adaptive-Group-wise-Interaction-Network-for-Multimodal-MRI-Synthesis.git.","PeriodicalId":13418,"journal":{"name":"IEEE Transactions on Medical Imaging","volume":"8 1","pages":""},"PeriodicalIF":10.6,"publicationDate":"2026-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145971767","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Domain Adaptive Multiple Instance Self-Training for Intraoperative Anomaly Detection. 术中异常检测的领域自适应多实例自训练。
IF 10.6 1区 医学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-01-15 DOI: 10.1109/tmi.2026.3654087
Ziang Chen,Yiming Ding,Jianchang Zhao,Bo Yi,Jianguo Wei
Intraoperative anomalies cause deviations from the ideal surgical workflow, heightening the risk of consequential errors and complications. Their reliable recognition has traditionally relied on continuous surgeon monitoring, yet automated anomaly detection systems are now indispensable for the safe advancement of assistive and autonomous surgery. However, existing approaches struggle with domain shifts across surgical platforms and unpredictable scenarios in deformable surgical environments. To address this, we propose DA-MIST, a Domain Adaptive Multiple Instance Self-Training framework for weakly supervised anomaly detection. DA-MIST adopts a two-stage training strategy that combines multiple instance learning with self-training, enhanced by a scene-decoupled memory mechanism that disentangles state-irrelevant scene variations from memory banks, preserving only state-discriminative features for robust anomaly identification. Additionally, a state-aware dual-branch attention module integrates Gaussian dynamic and global self-attention for effective temporal reasoning. Evaluated on our newly compiled large-scale endoscopic video dataset encompassing seven representative anomalies, DA-MIST demonstrates strong adaptability across heterogeneous surgical domains, consistently reducing false alarms and enhancing anomaly localization accuracy. Our code and dataset will be available at: https://github.com/iamziang/DA-MIST.
术中异常导致偏离理想的手术流程,增加了相应错误和并发症的风险。传统上,他们的可靠识别依赖于连续的外科医生监测,但自动化异常检测系统现在对于辅助和自主手术的安全进步是不可或缺的。然而,现有的方法在手术平台的领域转移和不可预测的手术环境中挣扎。为了解决这个问题,我们提出了DA-MIST,一个用于弱监督异常检测的领域自适应多实例自训练框架。DA-MIST采用两阶段训练策略,将多实例学习与自训练相结合,并通过场景解耦记忆机制进行增强,该机制将与状态无关的场景变化从记忆库中分离出来,仅保留状态判别特征以进行鲁棒异常识别。此外,一个状态感知的双分支注意模块集成了高斯动态和全局自注意,用于有效的时间推理。在我们新编译的包含七个代表性异常的大规模内窥镜视频数据集上进行评估,DA-MIST在异质手术领域表现出强大的适应性,持续减少误报并提高异常定位的准确性。我们的代码和数据集可以在https://github.com/iamziang/DA-MIST上获得。
{"title":"Domain Adaptive Multiple Instance Self-Training for Intraoperative Anomaly Detection.","authors":"Ziang Chen,Yiming Ding,Jianchang Zhao,Bo Yi,Jianguo Wei","doi":"10.1109/tmi.2026.3654087","DOIUrl":"https://doi.org/10.1109/tmi.2026.3654087","url":null,"abstract":"Intraoperative anomalies cause deviations from the ideal surgical workflow, heightening the risk of consequential errors and complications. Their reliable recognition has traditionally relied on continuous surgeon monitoring, yet automated anomaly detection systems are now indispensable for the safe advancement of assistive and autonomous surgery. However, existing approaches struggle with domain shifts across surgical platforms and unpredictable scenarios in deformable surgical environments. To address this, we propose DA-MIST, a Domain Adaptive Multiple Instance Self-Training framework for weakly supervised anomaly detection. DA-MIST adopts a two-stage training strategy that combines multiple instance learning with self-training, enhanced by a scene-decoupled memory mechanism that disentangles state-irrelevant scene variations from memory banks, preserving only state-discriminative features for robust anomaly identification. Additionally, a state-aware dual-branch attention module integrates Gaussian dynamic and global self-attention for effective temporal reasoning. Evaluated on our newly compiled large-scale endoscopic video dataset encompassing seven representative anomalies, DA-MIST demonstrates strong adaptability across heterogeneous surgical domains, consistently reducing false alarms and enhancing anomaly localization accuracy. Our code and dataset will be available at: https://github.com/iamziang/DA-MIST.","PeriodicalId":13418,"journal":{"name":"IEEE Transactions on Medical Imaging","volume":"17 1","pages":""},"PeriodicalIF":10.6,"publicationDate":"2026-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145971769","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Energy-Threshold Bias Calculator: A Physics-Model Based Adaptive Correction Scheme for Photon-Counting CT 能量阈值偏置计算器:一种基于物理模型的光子计数CT自适应校正方案
IF 10.6 1区 医学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-01-15 DOI: 10.1109/tmi.2026.3654612
Yuting Chen, Yuxiang Xing, Li Zhang, Zhi Deng, Hewei Gao
{"title":"Energy-Threshold Bias Calculator: A Physics-Model Based Adaptive Correction Scheme for Photon-Counting CT","authors":"Yuting Chen, Yuxiang Xing, Li Zhang, Zhi Deng, Hewei Gao","doi":"10.1109/tmi.2026.3654612","DOIUrl":"https://doi.org/10.1109/tmi.2026.3654612","url":null,"abstract":"","PeriodicalId":13418,"journal":{"name":"IEEE Transactions on Medical Imaging","volume":"84 1","pages":""},"PeriodicalIF":10.6,"publicationDate":"2026-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145972015","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Medical Microwave Imaging Using Physics-Guided Deep Learning Part 2: The Inverse Solver 医学微波成像使用物理引导的深度学习第2部分:逆求解器
IF 10.6 1区 医学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-01-14 DOI: 10.1109/tmi.2026.3653974
L. Guo, A. Bialkowski, A. Abbosh
{"title":"Medical Microwave Imaging Using Physics-Guided Deep Learning Part 2: The Inverse Solver","authors":"L. Guo, A. Bialkowski, A. Abbosh","doi":"10.1109/tmi.2026.3653974","DOIUrl":"https://doi.org/10.1109/tmi.2026.3653974","url":null,"abstract":"","PeriodicalId":13418,"journal":{"name":"IEEE Transactions on Medical Imaging","volume":"141 1","pages":""},"PeriodicalIF":10.6,"publicationDate":"2026-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145972159","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
UltraMamba: Mamba-based Multimodal Ultrasound Image Adaptive Fusion for Breast Lesion Segmentation. 基于mamba的多模态超声图像自适应融合乳腺病变分割。
IF 10.6 1区 医学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-01-13 DOI: 10.1109/tmi.2026.3653779
Jiahui Huang,Jiaxin Huang,Mingdu Zhang,Qiong Wang,Xiao-Qing Pei,Ying Hu,Hao Chen,Yan Pang
Multimodal ultrasound imaging, combining B-mode ultrasound, shear wave velocity, and shear wave time, is crucial for diagnosing and treating breast lesions, providing insights into lesion characteristics and tissue properties. However, challenges arise from intermodal feature misalignment and attention shifts due to varied capture methods and an overemphasis on vibrant color data. To tackle these issues, we introduce two innovations: a novel segmentation framework and a comprehensive dataset. The UltraMamba framework utilizes bidirectional alignment between modalities and enhances region-specific information to improve breast lesion segmentation accuracy. Key components include the Cross-Modal Knowledge Interaction module for robust information exchange and the Region-Aware Feature Excitation module to focus on relevant features. We also present the BreLS dataset, the first two-dimensional multimodal ultrasound breast lesion dataset, with paired images from 506 cases, serving as a valuable resource for analysis. UltraMamba shows strong performance on the BreLS dataset, achieving a Dice Similarity Coefficient of 72.16% and an HD95 of 42.02 mm, reflecting improvements of 2.59% in DSC and a 6.78 mm reduction in HD95 compared to the second-best framework, MMCA-NET. These results highlight UltraMamba's potential to enhance segmentation accuracy in clinical settings, facilitating precise treatment planning and, ultimately, leading to improved outcomes. Code: https://github.com/deepang-ai/UltraMamba.
多模态超声成像结合b超、横波速度和横波时间,对乳腺病变的诊断和治疗至关重要,可以深入了解病变特征和组织性质。然而,由于不同的捕获方法和过分强调鲜艳的颜色数据,多式联运特征不对齐和注意力转移带来了挑战。为了解决这些问题,我们引入了两个创新:一个新的分割框架和一个全面的数据集。UltraMamba框架利用模式之间的双向对齐,增强区域特异性信息,以提高乳腺病变分割的准确性。关键组件包括跨模态知识交互模块,用于鲁棒信息交换;区域感知特征激励模块,用于关注相关特征。我们还提出了BreLS数据集,这是第一个二维多模态超声乳腺病变数据集,其中包含来自506例病例的成对图像,作为有价值的分析资源。UltraMamba在brres数据集上表现出色,实现了72.16%的Dice Similarity Coefficient和42.02 mm的HD95,与第二好的框架MMCA-NET相比,DSC提高了2.59%,HD95降低了6.78 mm。这些结果突出了ultramba在临床环境中提高分割准确性的潜力,促进了精确的治疗计划,并最终改善了结果。代码:https://github.com/deepang-ai/UltraMamba。
{"title":"UltraMamba: Mamba-based Multimodal Ultrasound Image Adaptive Fusion for Breast Lesion Segmentation.","authors":"Jiahui Huang,Jiaxin Huang,Mingdu Zhang,Qiong Wang,Xiao-Qing Pei,Ying Hu,Hao Chen,Yan Pang","doi":"10.1109/tmi.2026.3653779","DOIUrl":"https://doi.org/10.1109/tmi.2026.3653779","url":null,"abstract":"Multimodal ultrasound imaging, combining B-mode ultrasound, shear wave velocity, and shear wave time, is crucial for diagnosing and treating breast lesions, providing insights into lesion characteristics and tissue properties. However, challenges arise from intermodal feature misalignment and attention shifts due to varied capture methods and an overemphasis on vibrant color data. To tackle these issues, we introduce two innovations: a novel segmentation framework and a comprehensive dataset. The UltraMamba framework utilizes bidirectional alignment between modalities and enhances region-specific information to improve breast lesion segmentation accuracy. Key components include the Cross-Modal Knowledge Interaction module for robust information exchange and the Region-Aware Feature Excitation module to focus on relevant features. We also present the BreLS dataset, the first two-dimensional multimodal ultrasound breast lesion dataset, with paired images from 506 cases, serving as a valuable resource for analysis. UltraMamba shows strong performance on the BreLS dataset, achieving a Dice Similarity Coefficient of 72.16% and an HD95 of 42.02 mm, reflecting improvements of 2.59% in DSC and a 6.78 mm reduction in HD95 compared to the second-best framework, MMCA-NET. These results highlight UltraMamba's potential to enhance segmentation accuracy in clinical settings, facilitating precise treatment planning and, ultimately, leading to improved outcomes. Code: https://github.com/deepang-ai/UltraMamba.","PeriodicalId":13418,"journal":{"name":"IEEE Transactions on Medical Imaging","volume":"7 1","pages":""},"PeriodicalIF":10.6,"publicationDate":"2026-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145961412","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Adjacent-aware Modality Recovery based on Incomplete Multi-Modal Brain Disease Diagnosis. 基于不完全多模态脑部疾病诊断的邻接感知模态恢复。
IF 10.6 1区 医学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-01-13 DOI: 10.1109/tmi.2026.3654000
Jinrong Cui,Weihao Ye,Shengrong Li,Jie Wen,Qi Zhu
Multi-modal learning is extensively applied to diagnose brain diseases such as epilepsy and Alzheimer's disease. However, incomplete multi-modal data, where some modalities are unavailable or difficult to collect, limits the effectiveness of conventional methods. Additionally, existing approaches often overlook semantic relationships between neighbors with the same-label and latent information in missing modalities. To address these challenges, we propose an adjacent-aware distillation recovery framework designed for incomplete multi-modal learning, with a focus on diagnosing representative brain diseases, i.e. epilepsy and Alzheimer's disease. The key novelty of our framework lies in its joint design of adjacent-aware modality recovery and multi-modal representation learning in a single end-to-end pipeline. Specifically, we introduce a label-guided adjacent-aware recovery module that uses a self-attention mechanism to exploit neighbor semantics and generate distribution-consistent features for high-quality modality reconstruction. The recovered features are then refined through a knowledge distillation pathway into a modality generator, enhancing generalization under severe data incompleteness. For multi-modal representation learning, the recovered modality information is fused with the original incomplete information to enhance feature extraction and representation. Extensive experiments demonstrate the effectiveness of our method in diagnosing epilepsy and Alzheimer's disease.
多模态学习被广泛应用于癫痫和阿尔茨海默病等脑部疾病的诊断。然而,不完整的多模态数据,其中一些模态不可用或难以收集,限制了传统方法的有效性。此外,现有的方法往往忽略了具有相同标签的邻居之间的语义关系和缺失模态中的潜在信息。为了解决这些挑战,我们提出了一个针对不完全多模态学习设计的邻接感知蒸馏恢复框架,重点是诊断代表性脑部疾病,即癫痫和阿尔茨海默病。该框架的关键新颖之处在于它在单个端到端管道中联合设计了邻接感知模态恢复和多模态表示学习。具体来说,我们引入了一个标签引导的邻接感知恢复模块,该模块使用自关注机制来利用邻居语义并生成分布一致的特征,以实现高质量的模态重建。然后通过知识蒸馏途径将恢复的特征提炼成模态生成器,增强了严重数据不完备情况下的泛化能力。在多模态表示学习中,将恢复的模态信息与原始的不完全信息融合,增强特征提取和表示能力。大量的实验证明了我们的方法在诊断癫痫和阿尔茨海默病方面的有效性。
{"title":"Adjacent-aware Modality Recovery based on Incomplete Multi-Modal Brain Disease Diagnosis.","authors":"Jinrong Cui,Weihao Ye,Shengrong Li,Jie Wen,Qi Zhu","doi":"10.1109/tmi.2026.3654000","DOIUrl":"https://doi.org/10.1109/tmi.2026.3654000","url":null,"abstract":"Multi-modal learning is extensively applied to diagnose brain diseases such as epilepsy and Alzheimer's disease. However, incomplete multi-modal data, where some modalities are unavailable or difficult to collect, limits the effectiveness of conventional methods. Additionally, existing approaches often overlook semantic relationships between neighbors with the same-label and latent information in missing modalities. To address these challenges, we propose an adjacent-aware distillation recovery framework designed for incomplete multi-modal learning, with a focus on diagnosing representative brain diseases, i.e. epilepsy and Alzheimer's disease. The key novelty of our framework lies in its joint design of adjacent-aware modality recovery and multi-modal representation learning in a single end-to-end pipeline. Specifically, we introduce a label-guided adjacent-aware recovery module that uses a self-attention mechanism to exploit neighbor semantics and generate distribution-consistent features for high-quality modality reconstruction. The recovered features are then refined through a knowledge distillation pathway into a modality generator, enhancing generalization under severe data incompleteness. For multi-modal representation learning, the recovered modality information is fused with the original incomplete information to enhance feature extraction and representation. Extensive experiments demonstrate the effectiveness of our method in diagnosing epilepsy and Alzheimer's disease.","PeriodicalId":13418,"journal":{"name":"IEEE Transactions on Medical Imaging","volume":"87 1","pages":""},"PeriodicalIF":10.6,"publicationDate":"2026-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145961413","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Adaptive Conditional Contrast-Agnostic Deformable Image Registration with Uncertainty Estimation 基于不确定性估计的自适应条件对比度不可知形变图像配准
IF 10.6 1区 医学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-01-12 DOI: 10.1109/tmi.2026.3652830
Yinsong Wang, Xinzhe Luo, Siyi Du, Chen Qin
{"title":"Adaptive Conditional Contrast-Agnostic Deformable Image Registration with Uncertainty Estimation","authors":"Yinsong Wang, Xinzhe Luo, Siyi Du, Chen Qin","doi":"10.1109/tmi.2026.3652830","DOIUrl":"https://doi.org/10.1109/tmi.2026.3652830","url":null,"abstract":"","PeriodicalId":13418,"journal":{"name":"IEEE Transactions on Medical Imaging","volume":"2 1","pages":""},"PeriodicalIF":10.6,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145955127","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Leveraging Textual Anatomical Knowledge for Class-Imbalanced Semi-Supervised Multi-Organ Segmentation 利用文本解剖知识进行类不平衡半监督多器官分割
IF 10.6 1区 医学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-01-12 DOI: 10.1109/tmi.2026.3651295
Yuliang Gu, Weilun Tsao, Yepeng Liu, Lianming Wu, Thierry Géraud, Bo Du, Yongchao Xu
{"title":"Leveraging Textual Anatomical Knowledge for Class-Imbalanced Semi-Supervised Multi-Organ Segmentation","authors":"Yuliang Gu, Weilun Tsao, Yepeng Liu, Lianming Wu, Thierry Géraud, Bo Du, Yongchao Xu","doi":"10.1109/tmi.2026.3651295","DOIUrl":"https://doi.org/10.1109/tmi.2026.3651295","url":null,"abstract":"","PeriodicalId":13418,"journal":{"name":"IEEE Transactions on Medical Imaging","volume":"84 1","pages":""},"PeriodicalIF":10.6,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145955129","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Spatio-Temporal Representation Decoupling and Enhancement for Federated Instrument Segmentation in Surgical Videos 手术视频联合器械分割的时空表征解耦与增强
IF 10.6 1区 医学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2026-01-12 DOI: 10.1109/tmi.2026.3651254
Zheng Fang, Xiaoming Qi, Chun-Mei Feng, Jialun Pei, Weixin Si, Yueming Jin
{"title":"Spatio-Temporal Representation Decoupling and Enhancement for Federated Instrument Segmentation in Surgical Videos","authors":"Zheng Fang, Xiaoming Qi, Chun-Mei Feng, Jialun Pei, Weixin Si, Yueming Jin","doi":"10.1109/tmi.2026.3651254","DOIUrl":"https://doi.org/10.1109/tmi.2026.3651254","url":null,"abstract":"","PeriodicalId":13418,"journal":{"name":"IEEE Transactions on Medical Imaging","volume":"14 1","pages":"1-1"},"PeriodicalIF":10.6,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145955131","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Transactions on Medical Imaging
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1