Pub Date: 2026-03-09 DOI: 10.1109/tmi.2026.3671423
Ziang Xu, Bin Li, Yang Hu, Chenyu Zhang, James East, Sharib Ali, Jens Rittscher
Title: Self-supervised Monocular Depth and Pose Estimation for Endoscopy with Latent Priors
Journal: IEEE Transactions on Medical Imaging (IF 10.6)
Pub Date: 2026-03-06 DOI: 10.1109/tmi.2026.3671287
Wenwen Zhang, Zhenyu Tang, Hao Zhang, Shaohao Rui, Z Jane Wang, Xiaosong Wang
Federated learning (FL) enables collaborative model training across decentralized medical datasets while preserving data privacy. Its practical adoption remains limited by data heterogeneity, specifically differences in input imaging modality (e.g., CT or MRI) and client task (e.g., segmentation or classification) across participating institutions (clients). Such heterogeneity makes it difficult to jointly learn a unified global model that generalizes across clients with different input modalities and tasks. To address this, we propose FedCMT, a modality-agnostic FL framework that adaptively aggregates heterogeneous client models. FedCMT supports flexible input modalities and diverse local tasks by incorporating group-wise adapters and personalized decoders that capture modality- and task-specific features. To enhance collaboration across clients, FedCMT employs a conflict-averse module that extracts modality-invariant representations and mitigates inter-client feature conflicts. FedCMT also integrates a global-to-local knowledge distillation mechanism to balance global consistency and local specialization. The proposed FedCMT maintains stability while fostering shared knowledge across diverse medical imaging modalities. We evaluate FedCMT on ten CT and MR datasets involving up to eight federated clients performing segmentation or classification tasks. Experimental results show that FedCMT consistently outperforms state-of-the-art FL baselines, yielding an average improvement of 4.76% over state-of-the-art methods and 4.01% over standalone training. These results position FedCMT as a promising, adaptable FL framework for real-world medical image analysis.
Title: Modality-Agnostic Federated Learning with Adaptive Updates for Heterogeneous Medical Image Tasks
Journal: IEEE Transactions on Medical Imaging (IF 10.6)
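The global-to-local knowledge distillation mentioned in the FedCMT abstract can be illustrated with the generic temperature-scaled distillation loss. This is only a minimal sketch of the standard technique, not FedCMT's actual loss, which the abstract does not specify. The function names, the temperature value, and the treatment of the global model as teacher and the local model as student are assumptions made for illustration.

```python
import math


def log_softmax(logits, temperature=1.0):
    """Numerically stable log-softmax of temperature-scaled logits."""
    scaled = [z / temperature for z in logits]
    m = max(scaled)
    log_sum_exp = m + math.log(sum(math.exp(s - m) for s in scaled))
    return [s - log_sum_exp for s in scaled]


def distillation_loss(global_logits, local_logits, temperature=2.0):
    """KL(teacher || student) on softened distributions, scaled by T^2.

    Assumption: the global model acts as teacher and the local model
    as student; FedCMT's real formulation may differ.
    """
    log_p = log_softmax(global_logits, temperature)  # teacher
    log_q = log_softmax(local_logits, temperature)   # student
    kl = sum(math.exp(lp) * (lp - lq) for lp, lq in zip(log_p, log_q))
    return temperature ** 2 * kl


# Hypothetical logits for a three-class problem.
loss = distillation_loss([2.0, 0.5, -1.0], [1.0, 1.0, 0.0])
```

The loss is zero when the two models produce the same softened distribution and grows as the local prediction drifts away from the global one, which is the sense in which such a term trades off global consistency against local specialization.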
Magnetic Particle Imaging (MPI) enables noninvasive temperature imaging without depth limitations. However, its practical in vivo application remains challenging because no effective calibration strategy simultaneously addresses calibration infeasibility and environmental mismatch. In this work, we propose a novel in vivo temperature imaging method based on a dual-mode magnetic particle spectroscopy/magnetic particle imaging (MPS/MPI) system. First, MPS is employed to capture the differences in harmonic phase responses of magnetic nanoparticles (MNPs) under in vivo and in vitro conditions, thereby enabling the construction of calibration functions that are consistent with the in vivo environment. Second, an MLP-based calibration strategy is proposed, which accounts for non-ideal deviations from the approximately linear temperature-phase relationship and integrates multi-parameter information into a unified network, thereby enabling accurate and stable temperature mapping. Comprehensive simulation, in vitro, and in vivo experiments demonstrate that, compared with conventional phantom-based temperature mapping methods, the proposed method reduces the in vivo temperature reconstruction error by approximately 17.24% and achieves an average absolute temperature error below 1.257 °C. These results verify the feasibility of accurate in vivo temperature imaging using MPI and provide essential technical support for temperature-sensitive applications, including magnetic hyperthermia.
Title: Phase-lag Based MPS/MPI Dual-mode Precise in vivo Temperature Imaging Technique
Authors: Siao Lei, Wenxuan Zou, Yanjun Liu, Guanghui Li, Gen Shi, Jiaqian Li, Jie He, Guangxing Zhou, Yang Jing, Yu An, Jie Tian
Journal: IEEE Transactions on Medical Imaging (IF 10.6)
Pub Date: 2026-03-05 DOI: 10.1109/tmi.2026.3670844
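The approximately linear temperature-phase relationship referred to in the MPS/MPI abstract suggests what a simple calibration function could look like: fit a straight line to (temperature, phase) calibration pairs, then invert it to map a measured phase back to temperature. The sketch below shows only that linear idea. It is not the paper's MLP-based strategy, and the function names and calibration numbers are invented for illustration.

```python
def fit_line(x, y):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def phase_to_temperature(phase, slope, intercept):
    """Invert the fitted line phase = slope * T + intercept."""
    return (phase - intercept) / slope


# Synthetic calibration pairs (hypothetical values, not measured data):
# temperature in degrees Celsius, harmonic phase in degrees.
temperatures = [25.0, 30.0, 35.0, 40.0, 45.0]
phases = [-0.5 * t + 60.0 for t in temperatures]

slope, intercept = fit_line(temperatures, phases)
estimated = phase_to_temperature(41.5, slope, intercept)
```

A purely linear map like this ignores the non-ideal deviations the abstract mentions, which is the stated motivation for replacing it with a learned network that also takes additional parameters as input.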
Pub Date: 2026-03-04 DOI: 10.1109/tmi.2026.3670643
C Ross Schmidtlein, Jin Ren, Andrzej Krol, Howard C Gifford, Joseph A O'Donoghue, Lisa Bodei, Yuesheng Xu
Targeted Alpha Therapy (TAT), using alpha-emitting radionuclides (AER) such as 225Ac, shows promise for the treatment of advanced and refractory cancers. Currently, TAT is prescribed on the basis of activity (e.g., MBq, kBq/kg), with no account taken of individual biodistribution or kinetics. The delivery of patient-specific treatment, based on absorbed dose criteria, requires in vivo imaging of the AER biodistribution, which is challenging because imageable photons are scarce. To address this, we present a novel computed quantitative planar (CQP) imaging method that reconstructs a coronal projection of the 3D AER distribution from anterior/posterior scintigraphy coregistered with CT. The model is regularized using maximum a posteriori estimation with sparse ℓ1 tight-framelet transforms and solved via a convergence-guaranteed fixed-point proximity algorithm. To experimentally evaluate our approach, we built a modular slab phantom containing a known distribution of 225Ac vitrified in epoxy. Compared with geometric mean methods, CQP reconstruction showed significantly reduced bias and noise, improved spatial resolution, and better signal-to-noise ratios. The CQP approach is clinically implementable with conventional SPECT/CT systems, without the need for hardware additions or modifications, and can assist dosimetry workflows, especially where 3D SPECT/PET is impractical.
Title: Computed Quantitative Planar Imaging for Targeted Alpha Therapy: Model-Based Sparse Reconstruction Validated with a Novel 225Ac Epoxy Phantom
Journal: IEEE Transactions on Medical Imaging (IF 10.6)
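For context on the geometric mean methods used as the comparison baseline in the CQP abstract, the textbook conjugate-view estimate for a single point source can be sketched as follows. The abstract does not say which geometric mean variant was used, so this is an assumption-laden illustration: a point source in a uniform attenuator, no scatter or background correction, and hypothetical parameter names and values.

```python
import math


def conjugate_view_activity(counts_anterior, counts_posterior,
                            mu_per_cm, thickness_cm, sensitivity):
    """Geometric-mean (conjugate-view) activity estimate.

    Assumes a point source inside a uniform attenuator of total
    thickness `thickness_cm` with linear attenuation coefficient
    `mu_per_cm`. `sensitivity` is the count rate per unit activity
    measured in air.
    """
    geometric_mean = math.sqrt(counts_anterior * counts_posterior)
    attenuation_correction = math.exp(mu_per_cm * thickness_cm / 2.0)
    return geometric_mean * attenuation_correction / sensitivity


def simulate_views(activity, depth_cm, mu_per_cm, thickness_cm, sensitivity):
    """Ideal anterior and posterior count rates for a source at a given depth."""
    anterior = sensitivity * activity * math.exp(-mu_per_cm * depth_cm)
    posterior = sensitivity * activity * math.exp(
        -mu_per_cm * (thickness_cm - depth_cm))
    return anterior, posterior


# Hypothetical numbers: the estimate does not depend on source depth.
ant, post = simulate_views(activity=10.0, depth_cm=7.0, mu_per_cm=0.12,
                           thickness_cm=20.0, sensitivity=5.0)
estimate = conjugate_view_activity(ant, post, 0.12, 20.0, 5.0)
```

The depth independence is what makes the geometric mean attractive for planar quantification, but the estimate has no noise model or regularization, which is where a model-based reconstruction such as the one described above would be expected to differ.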
Pub Date: 2026-03-03 DOI: 10.1109/tmi.2026.3670159
Xuan Gong, Jiaqi Li, Yirui Wang, Haoshen Li, Jiawen Yao, Lianzhen Zhong, Dazhou Guo, Ke Yan, David Doermann, Le Lu, Feiran Jiao, Tsung-Ying Ho, Ling Zhang, Abudili Abuduxuku, Haifeng Wang, Xianghua Ye, Dakai Jin, Qifeng Wang
Esophageal cancer is one of the most lethal cancers, with a 5-year survival rate of only 20%. Patient outcomes can vary significantly even among patients at the same cancer stage who receive similar treatments. Accurate prognostic prediction is therefore highly desirable so that esophageal cancer patients can receive personalized, precise treatment. Nevertheless, very few automated methods fully exploit preoperative contrast-enhanced computed tomography (CE-CT) imaging for assessing esophageal cancer prognosis. In addition to image patterns, important prognostic factors should encompass tumor size and location, as well as lymph node (LN) involvement, including features such as LN number, size, spatial distribution, and proximity to the tumor. Considering these complexities, we propose a novel Tumor and LN Context-Geometry network for the preoperative prediction of esophageal cancer survival in CE-CT images. Specifically, we (1) focus on learning survival patterns of CT texture via co-attention context modeling in the most informative regions, i.e., automatically segmented tumor, LNs, and LN stations; and (2) integrate tumor and LN anatomical and spatial associations into neural geometry modeling for comprehensive learning of metastatic involvement and tumor invasion into adjacent structures. Empirical studies show that the proposed framework improves overall survival prediction performance compared with existing state-of-the-art survival analysis methods, and suggest that incorporating these findings into the existing esophageal cancer staging system would add clinical value.
Title: Preoperative Prediction of Esophageal Cancer Survival in CT via Tumor and Lymph Node Context and Geometry Modeling
Journal: IEEE Transactions on Medical Imaging (IF 10.6)
Title: Large-Scale Multimodality via Dual-Path Cooperative Feature Fusion Strategy for Medical Image Segmentation
Authors: Dayu Tan, Xingcheng Wang, Yansen Su, Junfeng Xia, Chunhou Zheng, Weimin Zhong
Journal: IEEE Transactions on Medical Imaging (IF 10.6)
Pub Date: 2026-02-25 DOI: 10.1109/tmi.2026.3667954
Pub Date: 2026-02-24 DOI: 10.1109/tmi.2026.3667605
Junjie Shi, Zhaobin Sun, Li Yu, Xin Yang, Zengqiang Yan
Title: Addressing Imbalanced Modal Incompleteness in Realistic Multi-Modal Medical Image Segmentation via Hierarchical Gradient Alignment
Journal: IEEE Transactions on Medical Imaging (IF 10.6)
Pub Date: 2026-02-24 DOI: 10.1109/tmi.2026.3667706
Xu Yin, John Q. Gan, Haixian Wang
Title: Cogformer: A unified multi-scale brain representation for visual decoding and reconstruction from fMRI
Journal: IEEE Transactions on Medical Imaging (IF 10.6)