首页 > 最新文献

Proceedings. IEEE International Symposium on Biomedical Imaging最新文献

英文 中文
MINIMALLY USER-GUIDED 3D MICRO-ULTRASOUND PROSTATE SEGMENTATION. 微创用户引导的三维微超声前列腺分割。
Pub Date : 2025-04-01 Epub Date: 2025-05-12 DOI: 10.1109/isbi60581.2025.10981266
Alex Ling Yu Hung, Kai Zhao, Kaifeng Pang, Qi Miao, Zhaozhi Wang, Wayne Brisbane, Demetri Terzopoulos, Kyunghyun Sung

Micro-ultrasound is an emerging imaging tool that complements MRI in detecting prostate cancer by offering high-resolution imaging at lower cost. However, reliable annotations for micro-ultrasound data remain challenging due to the limited availability of experts and a steep learning curve. To address the clear clinical need, we propose a click-based, user-guided volumetric micro-ultrasound prostate segmentation model requiring minimal user intervention and training data. Our model predicts the segmentation of the entire prostate volume after users place a few points on the two boundary image slices of the prostate. Experiments show that the model needs only a small amount of training data to achieve strong segmentation performance, with each of its components contributing to its overall improvement. We demonstrate that the level of expertise of the user scarcely affects performance. This makes prostate segmentation practically feasible for general users.

微超声是一种新兴的成像工具,通过提供低成本的高分辨率成像来补充MRI检测前列腺癌。然而,由于专家的可用性有限和陡峭的学习曲线,对微超声数据的可靠注释仍然具有挑战性。为了满足明确的临床需求,我们提出了一种基于点击的、用户引导的体积微超声前列腺分割模型,需要最少的用户干预和训练数据。在用户在前列腺的两个边界图像切片上放置一些点后,我们的模型预测整个前列腺体积的分割。实验表明,该模型只需要少量的训练数据就可以获得较强的分割性能,其每个组成部分都有助于整体的改进。我们证明了用户的专业水平几乎不影响性能。这使得前列腺分割对一般用户来说实际上是可行的。
{"title":"MINIMALLY USER-GUIDED 3D MICRO-ULTRASOUND PROSTATE SEGMENTATION.","authors":"Alex Ling Yu Hung, Kai Zhao, Kaifeng Pang, Qi Miao, Zhaozhi Wang, Wayne Brisbane, Demetri Terzopoulos, Kyunghyun Sung","doi":"10.1109/isbi60581.2025.10981266","DOIUrl":"10.1109/isbi60581.2025.10981266","url":null,"abstract":"<p><p>Micro-ultrasound is an emerging imaging tool that complements MRI in detecting prostate cancer by offering high-resolution imaging at lower cost. However, reliable annotations for micro-ultrasound data remain challenging due to the limited availability of experts and a steep learning curve. To address the clear clinical need, we propose a click-based, user-guided volumetric micro-ultrasound prostate segmentation model requiring minimal user intervention and training data. Our model predicts the segmentation of the entire prostate volume after users place a few points on the two boundary image slices of the prostate. Experiments show that the model needs only a small amount of training data to achieve strong segmentation performance, with each of its components contributing to its overall improvement. We demonstrate that the level of expertise of the user scarcely affects performance. This makes prostate segmentation practically feasible for general users.</p>","PeriodicalId":74566,"journal":{"name":"Proceedings. IEEE International Symposium on Biomedical Imaging","volume":"2025 ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12104093/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144152957","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MPR-DIFF: A SELF-SUPERVISED DIFFUSION MODEL FOR MULTI-PLANAR REFORMATION IN PROSTATE MICRO-ULTRASOUND IMAGING. Mpr-diff:前列腺微超声多平面重构的自监督扩散模型。
Pub Date : 2025-04-01 Epub Date: 2025-05-12 DOI: 10.1109/isbi60581.2025.10981012
Kaifeng Pang, Qi Miao, Alex Ling Yu Hung, Kai Zhao, Eunsun Oh, Raymi Ramirez, Wayne Brisbane, Kyunghyun Sung

Micro-ultrasound (MicroUS) is a novel imaging technology with the potential to provide a low-cost and high-resolution approach for prostate cancer diagnosis. However, MicroUS is acquired in a non-uniform, fan-shaped sweep, where voxel size varies with distance from the probe and across slice angles. This irregular voxel distribution complicates reformatting into other imaging planes, making it challenging to conduct joint evaluations with other modalities such as MRI and histopathology. Existing interpolation-based reformatting methods lead to poor image resolution and introduce severe artifacts. In this paper, we propose MPR-Diff, a self-supervised diffusion model for super-resolution-based multi-planar reformation in prostate MicroUS imaging. Our method addresses the lack of high-resolution reference in the target plane by extracting simulated training patches from acquired slices. We performed both a quantitative evaluation and an expert reader study, demonstrating that our approach significantly enhances image resolution and reduces artifacts, thereby increasing the potential diagnostic value of MicroUS. Code is available at https://github.com/Calvin-Pang/MPR-Diff.

微超声(MicroUS)是一种新颖的成像技术,具有提供低成本和高分辨率前列腺癌诊断方法的潜力。然而,MicroUS是在非均匀的扇形扫描中获得的,其中体素大小随与探针的距离和横切片角度而变化。这种不规则的体素分布使重新格式化到其他成像平面变得复杂,这使得与MRI和组织病理学等其他模式进行联合评估变得具有挑战性。现有的基于插值的重新格式化方法导致图像分辨率差,并引入严重的伪影。本文提出了一种自监督扩散模型MPR-Diff,用于前列腺显微成像中基于超分辨率的多平面重构。我们的方法通过从获取的切片中提取模拟训练补丁来解决目标平面缺乏高分辨率参考的问题。我们进行了定量评估和专家读者研究,证明我们的方法显着提高了图像分辨率并减少了伪影,从而增加了MicroUS的潜在诊断价值。代码可从https://github.com/Calvin-Pang/MPR-Diff获得。
{"title":"MPR-DIFF: A SELF-SUPERVISED DIFFUSION MODEL FOR MULTI-PLANAR REFORMATION IN PROSTATE MICRO-ULTRASOUND IMAGING.","authors":"Kaifeng Pang, Qi Miao, Alex Ling Yu Hung, Kai Zhao, Eunsun Oh, Raymi Ramirez, Wayne Brisbane, Kyunghyun Sung","doi":"10.1109/isbi60581.2025.10981012","DOIUrl":"10.1109/isbi60581.2025.10981012","url":null,"abstract":"<p><p>Micro-ultrasound (MicroUS) is a novel imaging technology with the potential to provide a low-cost and high-resolution approach for prostate cancer diagnosis. However, MicroUS is acquired in a non-uniform, fan-shaped sweep, where voxel size varies with distance from the probe and across slice angles. This irregular voxel distribution complicates reformatting into other imaging planes, making it challenging to conduct joint evaluations with other modalities such as MRI and histopathology. Existing interpolation-based reformatting methods lead to poor image resolution and introduce severe artifacts. In this paper, we propose <b>MPR-Diff</b>, a self-supervised diffusion model for super-resolution-based multi-planar reformation in prostate MicroUS imaging. Our method addresses the lack of high-resolution reference in the target plane by extracting simulated training patches from acquired slices. We performed both a quantitative evaluation and an expert reader study, demonstrating that our approach significantly enhances image resolution and reduces artifacts, thereby increasing the potential diagnostic value of MicroUS. Code is available at https://github.com/Calvin-Pang/MPR-Diff.</p>","PeriodicalId":74566,"journal":{"name":"Proceedings. IEEE International Symposium on Biomedical Imaging","volume":"2025 ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12105648/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144153005","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
LEARNING ACCURATE RIGID REGISTRATION FOR LONGITUDINAL BRAIN MRI FROM SYNTHETIC DATA. 从合成数据中学习纵向脑mri的精确刚性配准。
Pub Date : 2025-04-01 Epub Date: 2025-05-12 DOI: 10.1109/isbi60581.2025.10980859
Jingru Fu, Adrian V Dalca, Bruce Fischl, Rodrigo Moreno, Malte Hoffmann

Rigid registration aims to determine the translations and rotations necessary to align features in a pair of images. While recent machine learning methods have become state-of-the-art for linear and deformable registration across subjects, they have demonstrated limitations when applied to longitudinal (within-subject) registration, where achieving precise alignment is critical. Building on an existing framework for anatomy-aware, acquisition-agnostic affine registration, we propose a model optimized for longitudinal, rigid brain registration. By training the model with synthetic within-subject pairs augmented with rigid and subtle nonlinear transforms, the model estimates more accurate rigid transforms than previous cross-subject networks and performs robustly on longitudinal registration pairs within and across magnetic resonance imaging (MRI) contrasts.

刚性配准旨在确定对齐一对图像中的特征所需的平移和旋转。虽然最近的机器学习方法已经成为跨学科线性和可变形注册的最先进方法,但它们在应用于纵向(主题内)注册时表现出局限性,在纵向(主题内)注册中,实现精确对齐至关重要。在现有的解剖学感知、获取不可知论仿射配准框架的基础上,我们提出了一个纵向、刚性脑配准优化模型。通过使用合成的主题内对增强刚性和微妙非线性变换来训练模型,该模型比以前的跨主题网络估计更准确的刚性变换,并在磁共振成像(MRI)对比内部和之间的纵向配准对上执行稳健。
{"title":"LEARNING ACCURATE RIGID REGISTRATION FOR LONGITUDINAL BRAIN MRI FROM SYNTHETIC DATA.","authors":"Jingru Fu, Adrian V Dalca, Bruce Fischl, Rodrigo Moreno, Malte Hoffmann","doi":"10.1109/isbi60581.2025.10980859","DOIUrl":"10.1109/isbi60581.2025.10980859","url":null,"abstract":"<p><p>Rigid registration aims to determine the translations and rotations necessary to align features in a pair of images. While recent machine learning methods have become state-of-the-art for linear and deformable registration across subjects, they have demonstrated limitations when applied to longitudinal (within-subject) registration, where achieving precise alignment is critical. Building on an existing framework for anatomy-aware, acquisition-agnostic affine registration, we propose a model optimized for longitudinal, rigid brain registration. By training the model with synthetic within-subject pairs augmented with rigid and subtle nonlinear transforms, the model estimates more accurate rigid transforms than previous cross-subject networks and performs robustly on longitudinal registration pairs within and across magnetic resonance imaging (MRI) contrasts.</p>","PeriodicalId":74566,"journal":{"name":"Proceedings. IEEE International Symposium on Biomedical Imaging","volume":"2025 ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12237398/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144593150","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
OVERALL SURVIVAL PREDICTION OF BRAIN TUMOR PATIENTS WITH MULTIMODAL MRI USING SWIN UNETR. 用swin unetr预测脑肿瘤患者多模态mri的总生存期。
Pub Date : 2025-04-01 Epub Date: 2025-05-12 DOI: 10.1109/isbi60581.2025.10981128
Gihyeon Kim, Fangxu Xing, Hyoun-Joong Kong, Emiliano Santarnecchi, Helen A Shih, Thomas Bortfeld, Georges El Fakhri, Xiaofeng Liu, Jang-Hwan Choi, Jonghye Woo

Accurate prediction of glioblastoma patient survival can significantly aid in personalized treatment planning. While pre-operative multimodal magnetic resonance imaging (MRI) offers complementary information, current methods are constrained by relatively limited data and largely rely on hand-crafted features extracted from segmentation results. To address these issues, in this work, we propose a data-efficient multi-task framework to take advantage of hierarchical segmentation features within advanced Swin UNETR for survival prediction. By integrating multi-scale features, we are able to capture detailed spatial information and global context, while employing the shifted window mechanism to maintain computational efficiency and scalability for 3D volumes. We further alleviate survival data scarcity through segmentation pre-training, while the features are fine-tuned to align with the survival prediction task and refined by statistical F-values. In addition, age information is incorporated alongside the extracted features to enhance survival prediction performance. Through comprehensive evaluations on the BraTS dataset, we demonstrate that our model achieves superior segmentation accuracy and state-of-the-art survival prediction performance, offering a robust solution for clinical prognosis in glioblastoma patients.

准确预测胶质母细胞瘤患者的生存可以显著地帮助制定个性化的治疗计划。虽然术前多模态磁共振成像(MRI)提供了补充信息,但目前的方法受到相对有限的数据的限制,并且很大程度上依赖于从分割结果中提取的手工特征。为了解决这些问题,在这项工作中,我们提出了一个数据高效的多任务框架,以利用高级Swin UNETR中的分层分割特征进行生存预测。通过整合多尺度特征,我们能够捕获详细的空间信息和全局背景,同时采用移位窗口机制来保持3D体的计算效率和可扩展性。我们通过分割预训练进一步缓解生存数据的稀缺性,同时对特征进行微调,使其与生存预测任务保持一致,并通过统计f值进行细化。此外,年龄信息与提取的特征相结合,以提高生存预测性能。通过对BraTS数据集的综合评估,我们证明了我们的模型具有优越的分割精度和最先进的生存预测性能,为胶质母细胞瘤患者的临床预后提供了一个强大的解决方案。
{"title":"OVERALL SURVIVAL PREDICTION OF BRAIN TUMOR PATIENTS WITH MULTIMODAL MRI USING SWIN UNETR.","authors":"Gihyeon Kim, Fangxu Xing, Hyoun-Joong Kong, Emiliano Santarnecchi, Helen A Shih, Thomas Bortfeld, Georges El Fakhri, Xiaofeng Liu, Jang-Hwan Choi, Jonghye Woo","doi":"10.1109/isbi60581.2025.10981128","DOIUrl":"10.1109/isbi60581.2025.10981128","url":null,"abstract":"<p><p>Accurate prediction of glioblastoma patient survival can significantly aid in personalized treatment planning. While pre-operative multimodal magnetic resonance imaging (MRI) offers complementary information, current methods are constrained by relatively limited data and largely rely on hand-crafted features extracted from segmentation results. To address these issues, in this work, we propose a data-efficient multi-task framework to take advantage of hierarchical segmentation features within advanced Swin UNETR for survival prediction. By integrating multi-scale features, we are able to capture detailed spatial information and global context, while employing the shifted window mechanism to maintain computational efficiency and scalability for 3D volumes. We further alleviate survival data scarcity through segmentation pre-training, while the features are fine-tuned to align with the survival prediction task and refined by statistical F-values. In addition, age information is incorporated alongside the extracted features to enhance survival prediction performance. Through comprehensive evaluations on the BraTS dataset, we demonstrate that our model achieves superior segmentation accuracy and state-of-the-art survival prediction performance, offering a robust solution for clinical prognosis in glioblastoma patients.</p>","PeriodicalId":74566,"journal":{"name":"Proceedings. IEEE International Symposium on Biomedical Imaging","volume":"2025 ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12345447/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144850060","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
TPOT: TOPOLOGY PRESERVING OPTIMAL TRANSPORT IN RETINAL FUNDUS IMAGE ENHANCEMENT. 拓扑保持视网膜眼底图像增强中的最佳传输。
Pub Date : 2025-04-01 Epub Date: 2025-05-12 DOI: 10.1109/isbi60581.2025.10981104
Xuanzhao Dong, Wenhui Zhu, Xin Li, Guoxin Sun, Yi Su, Oana M Dumitrascu, Yalin Wang

Retinal fundus photography enhancement is important for diagnosing and monitoring retinal diseases. However, early approaches to retinal image enhancement, such as those based on Generative Adversarial Networks (GANs), often struggle to preserve the complex topological information of blood vessels, resulting in spurious or missing vessel structures. The persistence diagram, which captures topological features based on the persistence of topological structures under different filtrations, provides a promising way to represent the structure information. In this work, we propose a topology-preserving training paradigm that regularizes blood vessel structures by minimizing the differences of persistence diagrams. We call the resulting framework Topology Preserving Optimal Transport (TPOT). Experimental results on a large-scale dataset demonstrate the superiority of the proposed method compared to several state-of-the-art supervised and unsupervised techniques, both in terms of image quality and performance in the downstream blood vessel segmentation task. The code is available at https://github.com/Retinal-Research/TPOT.

视网膜眼底摄影增强对视网膜疾病的诊断和监测具有重要意义。然而,早期的视网膜图像增强方法,如基于生成对抗网络(gan)的方法,往往难以保留血管的复杂拓扑信息,导致血管结构虚假或缺失。持久性图基于拓扑结构在不同过滤条件下的持久性捕获拓扑特征,为表示结构信息提供了一种很有前途的方法。在这项工作中,我们提出了一种保持拓扑的训练范式,通过最小化持久性图的差异来规范血管结构。我们将得到的框架称为拓扑保持最优传输(TPOT)。在大规模数据集上的实验结果表明,与几种最先进的监督和无监督技术相比,所提出的方法在图像质量和下游血管分割任务中的性能方面都具有优越性。代码可在https://github.com/Retinal-Research/TPOT上获得。
{"title":"TPOT: TOPOLOGY PRESERVING OPTIMAL TRANSPORT IN RETINAL FUNDUS IMAGE ENHANCEMENT.","authors":"Xuanzhao Dong, Wenhui Zhu, Xin Li, Guoxin Sun, Yi Su, Oana M Dumitrascu, Yalin Wang","doi":"10.1109/isbi60581.2025.10981104","DOIUrl":"https://doi.org/10.1109/isbi60581.2025.10981104","url":null,"abstract":"<p><p>Retinal fundus photography enhancement is important for diagnosing and monitoring retinal diseases. However, early approaches to retinal image enhancement, such as those based on Generative Adversarial Networks (GANs), often struggle to preserve the complex topological information of blood vessels, resulting in spurious or missing vessel structures. The persistence diagram, which captures topological features based on the persistence of topological structures under different filtrations, provides a promising way to represent the structure information. In this work, we propose a topology-preserving training paradigm that regularizes blood vessel structures by minimizing the differences of persistence diagrams. We call the resulting framework Topology Preserving Optimal Transport (TPOT). Experimental results on a large-scale dataset demonstrate the superiority of the proposed method compared to several state-of-the-art supervised and unsupervised techniques, both in terms of image quality and performance in the downstream blood vessel segmentation task. The code is available at https://github.com/Retinal-Research/TPOT.</p>","PeriodicalId":74566,"journal":{"name":"Proceedings. IEEE International Symposium on Biomedical Imaging","volume":"2025 ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12380521/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144981851","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CLASSIFFICATION OF MILD COGNITIVE IMPAIRMENT BASED ON DYNAMIC FUNCTIONAL CONNECTIVITY USING SPATIO-TEMPORAL TRANSFORMER. 基于时空转换的动态功能连接对轻度认知障碍的分类。
Pub Date : 2025-04-01 Epub Date: 2025-05-12 DOI: 10.1109/isbi60581.2025.10980922
Jing Zhang, Yanjun Lyu, Xiaowei Yu, Lu Zhang, Chao Cao, Tong Chen, Minheng Chen, Yan Zhuang, Tianming Liu, Dajiang Zhu

Dynamic functional connectivity (dFC) using resting-state functional magnetic resonance imaging (rs-fMRI) is an advanced technique for capturing the dynamic changes of neural activities, and can be very useful in the studies of brain diseases such as Alzheimer's disease (AD). Yet, existing studies have not fully leveraged the sequential information embedded within dFC that can potentially provide valuable information when identifying brain conditions. In this paper, we propose a novel framework that jointly learns the embedding of both spatial and temporal information within dFC based on the transformer architecture. Specifically, we first construct dFC networks from rs-fMRI data through a sliding window strategy. Then, we simultaneously employ a temporal block and a spatial block to capture higher-order representations of dynamic spatio-temporal dependencies, via mapping them into an efficient fused feature representation. To further enhance the robustness of this feature representations by reducing the dependency of labeled data, we also introduce a contrastive learning strategy to manipulate different brain states. Experimental results on 345 subjects with 570 scans from the Alzheimer's Disease Neuroimaging Initiative (ADNI) demonstrate the superiority of our proposed method for MCI (Mild Cognitive Impairment, the prodromal stage of AD) prediction, highlighting its potential for early identification of AD. The code is available at: https://github.com/Nancy-Zhang-0/MCI_dFC_STT.

静息状态功能磁共振成像(rs-fMRI)是一种捕捉神经活动动态变化的先进技术,在阿尔茨海默病(AD)等脑疾病的研究中非常有用。然而,现有的研究并没有充分利用dFC中嵌入的顺序信息,这些信息可能在识别大脑状况时提供有价值的信息。在本文中,我们提出了一种基于变压器结构的dFC空间和时间信息嵌入联合学习的新框架。具体来说,我们首先通过滑动窗口策略从rs-fMRI数据构建dFC网络。然后,我们同时使用一个时间块和一个空间块来捕获动态时空依赖关系的高阶表示,通过将它们映射成一个有效的融合特征表示。为了通过减少标记数据的依赖性来进一步增强这种特征表示的鲁棒性,我们还引入了对比学习策略来操纵不同的大脑状态。来自阿尔茨海默病神经影像学倡议(ADNI)的345名受试者的570次扫描的实验结果表明,我们提出的方法在轻度认知障碍(轻度认知障碍,AD的前驱阶段)预测方面具有优势,突出了其早期识别AD的潜力。代码可从https://github.com/Nancy-Zhang-0/MCI_dFC_STT获得。
{"title":"CLASSIFFICATION OF MILD COGNITIVE IMPAIRMENT BASED ON DYNAMIC FUNCTIONAL CONNECTIVITY USING SPATIO-TEMPORAL TRANSFORMER.","authors":"Jing Zhang, Yanjun Lyu, Xiaowei Yu, Lu Zhang, Chao Cao, Tong Chen, Minheng Chen, Yan Zhuang, Tianming Liu, Dajiang Zhu","doi":"10.1109/isbi60581.2025.10980922","DOIUrl":"10.1109/isbi60581.2025.10980922","url":null,"abstract":"<p><p>Dynamic functional connectivity (dFC) using resting-state functional magnetic resonance imaging (rs-fMRI) is an advanced technique for capturing the dynamic changes of neural activities, and can be very useful in the studies of brain diseases such as Alzheimer's disease (AD). Yet, existing studies have not fully leveraged the sequential information embedded within dFC that can potentially provide valuable information when identifying brain conditions. In this paper, we propose a novel framework that jointly learns the embedding of both spatial and temporal information within dFC based on the transformer architecture. Specifically, we first construct dFC networks from rs-fMRI data through a sliding window strategy. Then, we simultaneously employ a temporal block and a spatial block to capture higher-order representations of dynamic spatio-temporal dependencies, via mapping them into an efficient fused feature representation. To further enhance the robustness of this feature representations by reducing the dependency of labeled data, we also introduce a contrastive learning strategy to manipulate different brain states. Experimental results on 345 subjects with 570 scans from the Alzheimer's Disease Neuroimaging Initiative (ADNI) demonstrate the superiority of our proposed method for MCI (Mild Cognitive Impairment, the prodromal stage of AD) prediction, highlighting its potential for early identification of AD. The code is available at: https://github.com/Nancy-Zhang-0/MCI_dFC_STT.</p>","PeriodicalId":74566,"journal":{"name":"Proceedings. IEEE International Symposium on Biomedical Imaging","volume":"2025 ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12715849/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145806684","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
USING STRUCTURAL SIMILARITY AND KOLMOGOROV-ARNOLD NETWORKS FOR ANATOMICAL EMBEDDING OF CORTICAL FOLDING PATTERNS. 利用结构相似性和kolmogorov-arnold网络进行皮层折叠模式的解剖嵌入。
Pub Date : 2025-04-01 Epub Date: 2025-05-12 DOI: 10.1109/isbi60581.2025.10981274
Minheng Chen, Chao Cao, Tong Chen, Yan Zhuang, Jing Zhang, Yanjun Lyu, Xiaowei Yu, Lu Zhang, Tianming Liu, Dajiang Zhu

The 3-hinge gyrus (3HG) is a newly defined folding pattern, which is the conjunction of gyri coming from three directions in cortical folding. Many studies demonstrated that 3HGs can be reliable nodes when constructing brain networks or connectome since they simultaneously possess commonality and individuality across different individual brains and populations. However, 3HGs are identified and validated within individual spaces, making it difficult to directly serve as the brain network nodes due to the absence of cross-subject correspondence. The 3HG correspondences represent the intrinsic regulation of brain organizational architecture, traditional image-based registration methods tend to fail because individual anatomical properties need to be fully respected. To address this challenge, we propose a novel self-supervised framework for anatomical feature embedding of the 3HGs to build the correspondences among different brains. The core component of this framework is to construct a structural similarity-enhanced multi-hop feature encoding strategy based on the recently developed Kolmogorov-Arnold network (KAN) for anatomical feature embedding. Extensive experiments suggest that our approach can effectively establish robust cross-subject correspondences when no one-to-one mapping exists. The code is available at github.com/m1nhengChen/SSE-CortexEmbed.

3-hinge gyrus (3HG)是一种新定义的折叠模式,它是皮层折叠中来自三个方向的脑回的连接。许多研究表明,3hg可以作为构建大脑网络或连接组的可靠节点,因为它们同时具有不同个体大脑和人群的共性和个性。然而,3hg是在单独的空间内被识别和验证的,由于缺乏跨学科的通信,很难直接作为大脑网络节点。3HG对应关系代表了大脑组织结构的内在规律,传统的基于图像的配准方法往往失败,因为需要充分尊重个体的解剖特性。为了解决这一挑战,我们提出了一种新的自监督框架,用于3hg的解剖特征嵌入,以建立不同大脑之间的对应关系。该框架的核心部分是基于最近发展的Kolmogorov-Arnold网络(KAN)构建结构相似性增强的多跳特征编码策略,用于解剖特征嵌入。大量实验表明,当没有一对一映射存在时,我们的方法可以有效地建立稳健的跨主题对应。代码可在github.com/m1nhengChen/SSE-CortexEmbed上获得。
{"title":"USING STRUCTURAL SIMILARITY AND KOLMOGOROV-ARNOLD NETWORKS FOR ANATOMICAL EMBEDDING OF CORTICAL FOLDING PATTERNS.","authors":"Minheng Chen, Chao Cao, Tong Chen, Yan Zhuang, Jing Zhang, Yanjun Lyu, Xiaowei Yu, Lu Zhang, Tianming Liu, Dajiang Zhu","doi":"10.1109/isbi60581.2025.10981274","DOIUrl":"10.1109/isbi60581.2025.10981274","url":null,"abstract":"<p><p>The 3-hinge gyrus (3HG) is a newly defined folding pattern, which is the conjunction of gyri coming from three directions in cortical folding. Many studies demonstrated that 3HGs can be reliable nodes when constructing brain networks or connectome since they simultaneously possess commonality and individuality across different individual brains and populations. However, 3HGs are identified and validated within individual spaces, making it difficult to directly serve as the brain network nodes due to the absence of cross-subject correspondence. The 3HG correspondences represent the intrinsic regulation of brain organizational architecture, traditional image-based registration methods tend to fail because individual anatomical properties need to be fully respected. To address this challenge, we propose a novel self-supervised framework for anatomical feature embedding of the 3HGs to build the correspondences among different brains. The core component of this framework is to construct a structural similarity-enhanced multi-hop feature encoding strategy based on the recently developed Kolmogorov-Arnold network (KAN) for anatomical feature embedding. Extensive experiments suggest that our approach can effectively establish robust cross-subject correspondences when no one-to-one mapping exists. The code is available at github.com/m1nhengChen/SSE-CortexEmbed.</p>","PeriodicalId":74566,"journal":{"name":"Proceedings. IEEE International Symposium on Biomedical Imaging","volume":"2025 ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12711413/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145806710","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
BENCHMARKING TRANSFERABILITY OF SELF-SUPERVISED PRETRAINING FOR MULTI-ORGAN SEGMENTATION ON DIFFERENT MODALITIES. 对不同模式下多器官分割自监督预训练的可移植性进行基准测试。
Pub Date : 2025-04-01 Epub Date: 2025-05-12 DOI: 10.1109/isbi60581.2025.10980778
Jue Jiang, Harini Veeraraghavan

Self-supervised learning (SSL) is an approach to pretrain deep networks with unlabeled datasets by using pretext tasks that use images as "ground truth". Pretext tasks have been shown to impact accuracy of task categories, e.g. segmentation vs. classification. However, versatility of SSL features to downstream tasks involving different modalities has not been studied. We benchmarked impact of SSL tasks such as contrastive predictive coding, token self-distillation, and generative masked image modeling (MIM) with 3D vision transformer performed using 10K 3D-CTs (or 1.89M images) from various disease sites. SSL pretraining was used to assess (a) multi-organ segmentation under data-limited fine tuning, (b) feature reuse and (c) organ localization with multi-head attention. Analysis showed that pretext tasks combining MIM and token self-distillation balanced local and global attention distance, produced higher segmentation accuracy in few-shot and data-limited settings for MRI and CT. Feature reuse was impacted by similarity of pretraining and fine-tuning modality.

自监督学习(Self-supervised learning, SSL)是一种用未标记数据集预训练深度网络的方法,方法是使用使用图像作为“基础事实”的借口任务。借口任务已被证明会影响任务类别的准确性,例如分割与分类。然而,SSL特性对涉及不同模式的下游任务的通用性尚未得到研究。我们对SSL任务(如对比预测编码、令牌自蒸馏和生成掩膜图像建模(MIM))的影响进行了基准测试,并使用来自不同疾病站点的10K 3D- ct(或1.89M图像)进行了3D视觉转换。SSL预训练用于评估(a)数据有限微调下的多器官分割,(b)特征重用和(c)多头关注下的器官定位。分析表明,结合MIM和token自蒸馏的借口任务平衡了局部和全局注意距离,在较少镜头和数据有限的情况下,MRI和CT的分割精度更高。特征重用受到预训练和微调方式相似性的影响。
{"title":"BENCHMARKING TRANSFERABILITY OF SELF-SUPERVISED PRETRAINING FOR MULTI-ORGAN SEGMENTATION ON DIFFERENT MODALITIES.","authors":"Jue Jiang, Harini Veeraraghavan","doi":"10.1109/isbi60581.2025.10980778","DOIUrl":"10.1109/isbi60581.2025.10980778","url":null,"abstract":"<p><p>Self-supervised learning (SSL) is an approach to pretrain deep networks with unlabeled datasets by using pretext tasks that use images as \"ground truth\". Pretext tasks have been shown to impact accuracy of task categories, e.g. segmentation vs. classification. However, versatility of SSL features to downstream tasks involving different modalities has not been studied. We benchmarked impact of SSL tasks such as contrastive predictive coding, token self-distillation, and generative masked image modeling (MIM) with 3D vision transformer performed using 10K 3D-CTs (or 1.89M images) from various disease sites. SSL pretraining was used to assess (a) multi-organ segmentation under data-limited fine tuning, (b) feature reuse and (c) organ localization with multi-head attention. Analysis showed that pretext tasks combining MIM and token self-distillation balanced local and global attention distance, produced higher segmentation accuracy in few-shot and data-limited settings for MRI and CT. Feature reuse was impacted by similarity of pretraining and fine-tuning modality.</p>","PeriodicalId":74566,"journal":{"name":"Proceedings. IEEE International Symposium on Biomedical Imaging","volume":"2025 ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12214241/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144562287","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
UNSUPERVISED CORTICAL SURFACE REGISTRATION NETWORK FOR ALIGNING GYRALNET. 无监督皮质面配准网络对准回网。
Pub Date : 2025-04-01 Epub Date: 2025-05-12 DOI: 10.1109/isbi60581.2025.10981138
Jiale Cheng, Fenqiang Zhao, Dan Hu, Chao Cao, Zhengwang Wu, Xinrui Yuan, Kangfu Han, Lu Zhang, Tianming Liu, Dajiang Zhu, Gang Li

The cortical 3-hinge gyrus (3HG) and its network (GyralNet) play key roles in understanding the regularity and variability of brain structure and function. However, existing cortical surface registration methods overlook these features, resulting in suboptimal alignment across subjects. Currently, no 3HG and GyralNet atlas exist for registration, and generation of the corresponding atlas requires extensive runtime using traditional methods. To enable better registration of these features, we introduce an unsupervised learning framework to jointly develop 3HGs and GyralNet atlas and register the individual cortical features onto the atlas. To incorporate the graph structure of 3HGs and GyralNet into the registration network, we convert them into surface distance maps, facilitating effective integration. To effectively learn large deformations, a multi-level spherical registration network based on spherical U-Net is introduced to perform registration in a coarse-to-fine manner. Experiments demonstrate our approach's ability to generate 3HGs and GyralNet atlas with detailed patterns and effectively improve registration accuracy.

皮质3铰回(3HG)及其网络(GyralNet)在理解大脑结构和功能的规律性和可变性中起着关键作用。然而,现有的皮质表面配准方法忽略了这些特征,导致受试者之间的排列不理想。目前没有用于配准的3HG和GyralNet地图集,使用传统方法生成相应地图集需要大量的运行时间。为了更好地注册这些特征,我们引入了一个无监督学习框架来联合开发3HGs和GyralNet地图集,并将单个皮质特征注册到地图集上。为了将3hg和GyralNet的图形结构整合到配准网络中,我们将它们转换成表面距离图,便于有效整合。为了有效地学习大变形,引入了基于球面U-Net的多级球面配准网络,实现了从粗到精的配准。实验表明,该方法能够生成具有详细模式的3hg和GyralNet地图集,并有效提高配准精度。
{"title":"UNSUPERVISED CORTICAL SURFACE REGISTRATION NETWORK FOR ALIGNING GYRALNET.","authors":"Jiale Cheng, Fenqiang Zhao, Dan Hu, Chao Cao, Zhengwang Wu, Xinrui Yuan, Kangfu Han, Lu Zhang, Tianming Liu, Dajiang Zhu, Gang Li","doi":"10.1109/isbi60581.2025.10981138","DOIUrl":"10.1109/isbi60581.2025.10981138","url":null,"abstract":"<p><p>The cortical 3-hinge gyrus (3HG) and its network (GyralNet) play key roles in understanding the regularity and variability of brain structure and function. However, existing cortical surface registration methods overlook these features, resulting in suboptimal alignment across subjects. Currently, no 3HG and GyralNet atlas exist for registration, and generation of the corresponding atlas requires extensive runtime using traditional methods. To enable better registration of these features, we introduce an unsupervised learning framework to jointly develop 3HGs and GyralNet atlas and register the individual cortical features onto the atlas. To incorporate the graph structure of 3HGs and GyralNet into the registration network, we convert them into surface distance maps, facilitating effective integration. To effectively learn large deformations, a multi-level spherical registration network based on spherical U-Net is introduced to perform registration in a coarse-to-fine manner. Experiments demonstrate our approach's ability to generate 3HGs and GyralNet atlas with detailed patterns and effectively improve registration accuracy.</p>","PeriodicalId":74566,"journal":{"name":"Proceedings. IEEE International Symposium on Biomedical Imaging","volume":"2025 ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12178662/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144334595","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
BRAIN-ADAPTER: ENHANCING NEUROLOGICAL DISORDER ANALYSIS WITH ADAPTER-TUNING MULTIMODAL LARGE LANGUAGE MODELS. 脑适配器:用适配器调谐多模态大语言模型增强神经障碍分析。
Pub Date : 2025-04-01 Epub Date: 2025-05-12 DOI: 10.1109/isbi60581.2025.10980770
Jing Zhang, Xiaowei Yu, Yanjun Lyu, Lu Zhang, Tong Chen, Chao Cao, Yan Zhuang, Minheng Chen, Tianming Liu, Dajiang Zhu

Understanding brain disorders is crucial for accurate clinical diagnosis and treatment. Recent advances in Multimodal Large Language Models (MLLMs) offer a promising approach to interpreting medical images with the support of text descriptions. However, previous research has primarily focused on 2D medical images, leaving richer spatial information of 3D images under-explored, and single-modality-based methods are limited by overlooking the critical clinical information contained in other modalities. To address this issue, this paper proposes Brain-Adapter, a novel approach that incorporates an extra bottleneck layer to learn new knowledge and instill it into the original pre-trained knowledge. The major idea is to incorporate a lightweight bottleneck layer to train fewer parameters while capturing essential information and utilize a Contrastive Language-Image Pre-training (CLIP) strategy to align multimodal data within a unified representation space. Extensive experiments demonstrated the effectiveness of our approach in integrating multimodal data to significantly improve the diagnosis accuracy without high computational costs, highlighting the potential to enhance real-world diagnostic workflows.

了解脑部疾病对准确的临床诊断和治疗至关重要。多模态大语言模型(mllm)的最新进展为在文本描述的支持下解释医学图像提供了一种很有前途的方法。然而,以往的研究主要集中在二维医学图像上,缺乏对三维图像更丰富的空间信息的探索,并且基于单一模态的方法由于忽略了其他模态中包含的关键临床信息而受到限制。为了解决这一问题,本文提出了一种新的方法Brain-Adapter,该方法结合了一个额外的瓶颈层来学习新知识并将其灌输到原始的预训练知识中。主要思想是结合一个轻量级的瓶颈层来训练更少的参数,同时捕获基本信息,并利用对比语言图像预训练(CLIP)策略在统一的表示空间内对齐多模态数据。大量的实验证明了我们的方法在集成多模态数据方面的有效性,在不增加计算成本的情况下显着提高了诊断准确性,突出了增强现实世界诊断工作流程的潜力。
{"title":"BRAIN-ADAPTER: ENHANCING NEUROLOGICAL DISORDER ANALYSIS WITH ADAPTER-TUNING MULTIMODAL LARGE LANGUAGE MODELS.","authors":"Jing Zhang, Xiaowei Yu, Yanjun Lyu, Lu Zhang, Tong Chen, Chao Cao, Yan Zhuang, Minheng Chen, Tianming Liu, Dajiang Zhu","doi":"10.1109/isbi60581.2025.10980770","DOIUrl":"10.1109/isbi60581.2025.10980770","url":null,"abstract":"<p><p>Understanding brain disorders is crucial for accurate clinical diagnosis and treatment. Recent advances in Multimodal Large Language Models (MLLMs) offer a promising approach to interpreting medical images with the support of text descriptions. However, previous research has primarily focused on 2D medical images, leaving richer spatial information of 3D images under-explored, and single-modality-based methods are limited by overlooking the critical clinical information contained in other modalities. To address this issue, this paper proposes Brain-Adapter, a novel approach that incorporates an extra bottleneck layer to learn new knowledge and instill it into the original pre-trained knowledge. The major idea is to incorporate a lightweight bottleneck layer to train fewer parameters while capturing essential information and utilize a Contrastive Language-Image Pre-training (CLIP) strategy to align multimodal data within a unified representation space. Extensive experiments demonstrated the effectiveness of our approach in integrating multimodal data to significantly improve the diagnosis accuracy without high computational costs, highlighting the potential to enhance real-world diagnostic workflows.</p>","PeriodicalId":74566,"journal":{"name":"Proceedings. IEEE International Symposium on Biomedical Imaging","volume":"2025 ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12714489/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145806702","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Proceedings. IEEE International Symposium on Biomedical Imaging
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1