首页 > 最新文献

Medical image computing and computer-assisted intervention : MICCAI ... International Conference on Medical Image Computing and Computer-Assisted Intervention最新文献

英文 中文
Physics Informed Neural Networks for Estimation of Tissue Properties from Multi-echo Configuration State MRI. 基于物理信息的神经网络在多回波构型状态MRI中估计组织特性。
Samuel I Adams-Tew, Henrik Odéen, Dennis L Parker, Cheng-Chieh Cheng, Bruno Madore, Allison Payne, Sarang Joshi

This work investigates the use of configuration state imaging together with deep neural networks to develop quantitative MRI techniques for deployment in an interventional setting. A physics modeling technique for inhomogeneous fields and heterogeneous tissues is presented and used to evaluate the theoretical capability of neural networks to estimate parameter maps from configuration state signal data. All tested normalization strategies achieved similar performance in estimating T 2 and T 2 * . Varying network architecture and data normalization had substantial impacts on estimated flip angle and T 1 , highlighting their importance in developing neural networks to solve these inverse problems. The developed signal modeling technique provides an environment that will enable the development and evaluation of physics-informed machine learning techniques for MR parameter mapping and facilitate the development of quantitative MRI techniques to inform clinical decisions during MR-guided treatments.

这项工作研究了配置状态成像与深度神经网络的使用,以开发用于介入性设置的定量MRI技术。提出了一种非均匀场和非均匀组织的物理建模技术,并用于评估神经网络从组态信号数据估计参数映射的理论能力。所有测试的归一化策略在估计t2和t2 *方面都取得了相似的性能。不同的网络结构和数据归一化对估计的翻转角和t1有实质性的影响,突出了它们在开发神经网络来解决这些逆问题中的重要性。开发的信号建模技术提供了一个环境,可以开发和评估用于MR参数映射的物理信息机器学习技术,并促进定量MRI技术的开发,以便在MR引导治疗期间为临床决策提供信息。
{"title":"Physics Informed Neural Networks for Estimation of Tissue Properties from Multi-echo Configuration State MRI.","authors":"Samuel I Adams-Tew, Henrik Odéen, Dennis L Parker, Cheng-Chieh Cheng, Bruno Madore, Allison Payne, Sarang Joshi","doi":"10.1007/978-3-031-72120-5_47","DOIUrl":"10.1007/978-3-031-72120-5_47","url":null,"abstract":"<p><p>This work investigates the use of configuration state imaging together with deep neural networks to develop quantitative MRI techniques for deployment in an interventional setting. A physics modeling technique for inhomogeneous fields and heterogeneous tissues is presented and used to evaluate the theoretical capability of neural networks to estimate parameter maps from configuration state signal data. All tested normalization strategies achieved similar performance in estimating <math> <msub><mrow><mi>T</mi></mrow> <mrow><mn>2</mn></mrow> </msub> </math> and <math> <msubsup><mrow><mi>T</mi></mrow> <mrow><mn>2</mn></mrow> <mrow><mi>*</mi></mrow> </msubsup> </math> . Varying network architecture and data normalization had substantial impacts on estimated flip angle and <math> <msub><mrow><mi>T</mi></mrow> <mrow><mn>1</mn></mrow> </msub> </math> , highlighting their importance in developing neural networks to solve these inverse problems. The developed signal modeling technique provides an environment that will enable the development and evaluation of physics-informed machine learning techniques for MR parameter mapping and facilitate the development of quantitative MRI techniques to inform clinical decisions during MR-guided treatments.</p>","PeriodicalId":94280,"journal":{"name":"Medical image computing and computer-assisted intervention : MICCAI ... International Conference on Medical Image Computing and Computer-Assisted Intervention","volume":"15011 ","pages":"502-511"},"PeriodicalIF":0.0,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11653200/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142857422","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Volume-optimal persistence homological scaffolds of hemodynamic networks covary with MEG theta-alpha aperiodic dynamics. 血流动力学网络的体积最优持续性同源支架与MEG - α非周期动力学共变。
Nghi Nguyen, Tao Hou, Enrico Amico, Jingyi Zheng, Huajun Huang, Alan D Kaplan, Giovanni Petri, Joaquín Goñi, Yize Zhao, Duy Duong-Tran, Li Shen

Higher-order properties of functional magnetic resonance imaging (fMRI) induced connectivity have been shown to unravel many exclusive topological and dynamical insights beyond pairwise interactions. Nonetheless, whether these fMRI-induced higher-order properties play a role in disentangling other neuroimaging modalities' insights remains largely unexplored and poorly understood. In this work, by analyzing fMRI data from the Human Connectome Project Young Adult dataset using persistent homology, we discovered that the volume-optimal persistence homological scaffolds of fMRI-based functional connectomes exhibited conservative topological reconfigurations from the resting state to attentional task-positive state. Specifically, while reflecting the extent to which each cortical region contributed to functional cycles following different cognitive demands, these reconfigurations were constrained such that the spatial distribution of cavities in the connectome is relatively conserved. Most importantly, such level of contributions covaried with powers of aperiodic activities mostly within the theta-alpha (4-12 Hz) band measured by magnetoencephalography (MEG). This comprehensive result suggests that fMRI-induced hemodynamics and MEG theta-alpha aperiodic activities are governed by the same functional constraints specific to each cortical morpho-structure. Methodologically, our work paves the way toward an innovative computing paradigm in multimodal neuroimaging topological learning. The code for our analyses is provided in https://github.com/ngcaonghi/scaffold_noise.

功能性磁共振成像(fMRI)诱导的连接的高阶特性已经被证明揭示了许多超越两两相互作用的独家拓扑和动力学见解。尽管如此,这些fmri诱导的高阶特性是否在解开其他神经成像模式的见解中发挥作用,在很大程度上仍未被探索和理解。在这项工作中,通过使用持久同源性分析来自人类连接组项目年轻人数据集的fMRI数据,我们发现基于fMRI的功能连接组的体积最优持久同源支架从静置状态到注意任务积极状态表现出保守的拓扑重构。具体来说,虽然反映了每个皮质区域在不同认知需求下对功能周期的贡献程度,但这些重新配置受到限制,使得连接组中空腔的空间分布相对保守。最重要的是,这种贡献水平与脑磁图(MEG)测量的θ - α (4-12 Hz)波段内的非周期活动功率共变。这一综合结果表明,fmri诱导的血流动力学和MEG β - α非周期活动受特定于每种皮质形态结构的相同功能约束。在方法上,我们的工作为多模态神经成像拓扑学习的创新计算范式铺平了道路。我们的分析代码在https://github.com/ngcaonghi/scaffold_noise中提供。
{"title":"Volume-optimal persistence homological scaffolds of hemodynamic networks covary with MEG theta-alpha aperiodic dynamics.","authors":"Nghi Nguyen, Tao Hou, Enrico Amico, Jingyi Zheng, Huajun Huang, Alan D Kaplan, Giovanni Petri, Joaquín Goñi, Yize Zhao, Duy Duong-Tran, Li Shen","doi":"10.1007/978-3-031-72384-1_49","DOIUrl":"10.1007/978-3-031-72384-1_49","url":null,"abstract":"<p><p>Higher-order properties of functional magnetic resonance imaging (fMRI) induced connectivity have been shown to unravel many exclusive topological and dynamical insights beyond pairwise interactions. Nonetheless, whether these fMRI-induced higher-order properties play a role in disentangling other neuroimaging modalities' insights remains largely unexplored and poorly understood. In this work, by analyzing fMRI data from the Human Connectome Project Young Adult dataset using persistent homology, we discovered that the volume-optimal persistence homological scaffolds of fMRI-based functional connectomes exhibited conservative topological reconfigurations from the resting state to attentional task-positive state. Specifically, while reflecting the extent to which each cortical region contributed to functional cycles following different cognitive demands, these reconfigurations were constrained such that the spatial distribution of cavities in the connectome is relatively conserved. Most importantly, such level of contributions covaried with powers of aperiodic activities mostly within the theta-alpha (4-12 Hz) band measured by magnetoencephalography (MEG). This comprehensive result suggests that fMRI-induced hemodynamics and MEG theta-alpha aperiodic activities are governed by the same functional constraints specific to each cortical morpho-structure. Methodologically, our work paves the way toward an innovative computing paradigm in multimodal neuroimaging topological learning. The code for our analyses is provided in https://github.com/ngcaonghi/scaffold_noise.</p>","PeriodicalId":94280,"journal":{"name":"Medical image computing and computer-assisted intervention : MICCAI ... International Conference on Medical Image Computing and Computer-Assisted Intervention","volume":"15003 ","pages":"519-529"},"PeriodicalIF":0.0,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11816146/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143416610","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Evaluating the Quality of Brain MRI Generators. 脑磁共振成像发生器的质量评价。
Jiaqi Wu, Wei Peng, Binxu Li, Yu Zhang, Kilian M Pohl

Deep learning models generating structural brain MRIs have the potential to significantly accelerate discovery of neuroscience studies. However, their use has been limited in part by the way their quality is evaluated. Most evaluations of generative models focus on metrics originally designed for natural images (such as structural similarity index and Fréchet inception distance). As we show in a comparison of 6 state-of-the-art generative models trained and tested on over 3000 MRIs, these metrics are sensitive to the experimental setup and inadequately assess how well brain MRIs capture macrostructural properties of brain regions (a.k.a., anatomical plausibility). This shortcoming of the metrics results in inconclusive findings even when qualitative differences between the outputs of models are evident. We therefore propose a framework for evaluating models generating brain MRIs, which requires uniform processing of the real MRIs, standardizing the implementation of the models, and automatically segmenting the MRIs generated by the models. The segmentations are used for quantifying the plausibility of anatomy displayed in the MRIs. To ensure meaningful quantification, it is crucial that the segmentations are highly reliable. Our framework rigorously checks this reliability, a step often overlooked by prior work. Only 3 of the 6 generative models produced MRIs, of which at least 95% had highly reliable segmentations. More importantly, the assessment of each model by our framework is in line with qualitative assessments, reinforcing the validity of our approach. The code of this framework is available via https://github.com/jiaqiw01/MRIAnatEval.git.

生成大脑结构核磁共振成像的深度学习模型有可能显著加速神经科学研究的发现。然而,它们的使用在一定程度上受到其质量评估方式的限制。大多数生成模型的评估都集中在最初为自然图像设计的度量上(如结构相似指数和fr起始距离)。正如我们在6个最先进的生成模型的比较中所显示的那样,这些指标对实验设置很敏感,并且不能充分评估大脑核磁共振成像捕获大脑区域宏观结构特性(即解剖合理性)的程度。即使在模型输出之间的质量差异很明显时,度量标准的这一缺点也会导致不确定的结果。因此,我们提出了一个评估脑核磁共振成像模型的框架,该框架要求对真实核磁共振成像进行统一处理,规范模型的实现,并对模型生成的核磁共振成像进行自动分割。分割用于量化核磁共振成像显示的解剖结构的合理性。为了确保有意义的量化,至关重要的是分割是高度可靠的。我们的框架严格检查这种可靠性,这一步经常被之前的工作所忽略。6个生成模型中只有3个生成了mri,其中至少95%具有高可靠的分割。更重要的是,我们的框架对每个模型的评估与定性评估是一致的,从而加强了我们方法的有效性。该框架的代码可通过https://github.com/jiaqiw01/MRIAnatEval.git获得。
{"title":"Evaluating the Quality of Brain MRI Generators.","authors":"Jiaqi Wu, Wei Peng, Binxu Li, Yu Zhang, Kilian M Pohl","doi":"10.1007/978-3-031-72117-5_28","DOIUrl":"10.1007/978-3-031-72117-5_28","url":null,"abstract":"<p><p>Deep learning models generating structural brain MRIs have the potential to significantly accelerate discovery of neuroscience studies. However, their use has been limited in part by the way their quality is evaluated. Most evaluations of generative models focus on metrics originally designed for natural images (such as structural similarity index and Fréchet inception distance). As we show in a comparison of 6 state-of-the-art generative models trained and tested on over 3000 MRIs, these metrics are sensitive to the experimental setup and inadequately assess how well brain MRIs capture macrostructural properties of brain regions (a.k.a., anatomical plausibility). This shortcoming of the metrics results in inconclusive findings even when qualitative differences between the outputs of models are evident. We therefore propose a framework for evaluating models generating brain MRIs, which requires uniform processing of the real MRIs, standardizing the implementation of the models, and automatically segmenting the MRIs generated by the models. The segmentations are used for quantifying the plausibility of anatomy displayed in the MRIs. To ensure meaningful quantification, it is crucial that the segmentations are highly reliable. Our framework rigorously checks this reliability, a step often overlooked by prior work. Only 3 of the 6 generative models produced MRIs, of which at least 95% had highly reliable segmentations. More importantly, the assessment of each model by our framework is in line with qualitative assessments, reinforcing the validity of our approach. The code of this framework is available via https://github.com/jiaqiw01/MRIAnatEval.git.</p>","PeriodicalId":94280,"journal":{"name":"Medical image computing and computer-assisted intervention : MICCAI ... International Conference on Medical Image Computing and Computer-Assisted Intervention","volume":"15010 ","pages":"297-307"},"PeriodicalIF":0.0,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12066240/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144056478","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Super-Field MRI Synthesis for Infant Brains Enhanced by Dual Channel Latent Diffusion. 双通道潜在扩散增强婴儿脑超场MRI合成。
Austin Tapp, Can Zhao, Holger R Roth, Jeffrey Tanedo, Syed Muhammad Anwar, Niall J Bourke, Joseph Hajnal, Victoria Nankabirwa, Sean Deoni, Natasha Lepore, Marius George Linguraru

In resource-limited settings, portable ultra-low-field (uLF, i.e., 0.064T) magnetic resonance imaging (MRI) systems expand accessibility of radiological scanning, particularly for low-income areas as well as underserved populations like neonates and infants. However, compared to high-field (HF, e.g., ≥ 1.5T) systems, inferior image quality in uLF scanning poses challenges for research and clinical use. To address this, we introduce Super-Field Network (SFNet), a custom swinUNETRv2 with generative adversarial network components that uses uLF MRIs to generate super-field (SF) images comparable to HF MRIs. We acquired a cohort of infant data (n=30, aged 0-2 years) with paired uLF-HF MRI data from a resource-limited setting with an underrepresented population in research. To enhance the small dataset, we present a novel use of latent diffusion to create dual-channel (uLF-HF) paired MRIs. We compare SFNet with state-of-the-art synthesis methods by HF-SF image similarity perceptual scores and by automated HF and SF segmentations of white matter (WM), gray matter (GM), and cerebrospinal fluid (CSF). The best performance was achieved by SFNet trained on the latent diffusion enhanced dataset yielding state-of-the-art results in Fréchet inception distance at 9.08 ± 1.21, perceptual similarity at 0.11 ± 0.01, and PSNR at 22.64 ± 1.31. True HF and SF segmentations had a strong overlap with Dice similarity coefficients of 0.71 ± 0.1, 0.79 ± 0.2, and 0.73 ± 0.08 for WM, GM, and CSF, respectively, in the developing infant brain with incomplete myelination, and displayed 166%, 107%, and 106% improvement over respective uLF-based segmentation metrics. SF MRI supports health equity by enhancing the clinical use of uLF imaging systems and improving the diagnostic capabilities of low-cost portable MRI systems in resource-limited settings and for underserved populations. Our code is made openly available at https://github.com/AustinTapp/SFnet.

在资源有限的情况下,便携式超低场(uLF,即0.064T)磁共振成像(MRI)系统扩大了放射扫描的可及性,特别是对低收入地区以及新生儿和婴儿等服务不足人群。然而,与高场(HF,例如≥1.5T)系统相比,超低频扫描的图像质量较差,给研究和临床应用带来了挑战。为了解决这个问题,我们引入了超级场网络(SFNet),这是一个自定义的swinUNETRv2,具有生成对抗网络组件,它使用uLF mri生成与HF mri相当的超级场(SF)图像。我们获得了一组婴儿数据(n=30, 0-2岁)和配对的uLF-HF MRI数据,这些数据来自资源有限的环境,研究中代表性不足的人群。为了增强小数据集,我们提出了一种新的使用潜在扩散来创建双通道(uLF-HF)配对mri。我们通过HF-SF图像相似性感知评分和脑白质(WM)、灰质(GM)和脑脊液(CSF)的自动HF和SF分割,将SFNet与最先进的合成方法进行比较。在潜在扩散增强数据集上训练的SFNet获得了最好的性能,获得了最先进的结果,其中fr起始距离为9.08±1.21,感知相似度为0.11±0.01,PSNR为22.64±1.31。在髓鞘发育不完全的婴儿脑中,WM、GM和CSF的真实HF和SF分割与Dice相似系数有很强的重叠,分别为0.71±0.1、0.79±0.2和0.73±0.08,比各自基于ulf的分割指标提高了166%、107%和106%。SF MRI通过加强uLF成像系统的临床使用,提高低成本便携式MRI系统在资源有限的环境和服务不足人群中的诊断能力,支持健康公平。我们的代码可以在https://github.com/AustinTapp/SFnet上公开获得。
{"title":"Super-Field MRI Synthesis for Infant Brains Enhanced by Dual Channel Latent Diffusion.","authors":"Austin Tapp, Can Zhao, Holger R Roth, Jeffrey Tanedo, Syed Muhammad Anwar, Niall J Bourke, Joseph Hajnal, Victoria Nankabirwa, Sean Deoni, Natasha Lepore, Marius George Linguraru","doi":"10.1007/978-3-031-72384-1_42","DOIUrl":"10.1007/978-3-031-72384-1_42","url":null,"abstract":"<p><p>In resource-limited settings, portable ultra-low-field (uLF, i.e., 0.064T) magnetic resonance imaging (MRI) systems expand accessibility of radiological scanning, particularly for low-income areas as well as underserved populations like neonates and infants. However, compared to high-field (HF, e.g., ≥ 1.5T) systems, inferior image quality in uLF scanning poses challenges for research and clinical use. To address this, we introduce Super-Field Network (SFNet), a custom swinUNETRv2 with generative adversarial network components that uses uLF MRIs to generate super-field (SF) images comparable to HF MRIs. We acquired a cohort of infant data (n=30, aged 0-2 years) with paired uLF-HF MRI data from a resource-limited setting with an underrepresented population in research. To enhance the small dataset, we present a novel use of latent diffusion to create dual-channel (uLF-HF) paired MRIs. We compare SFNet with state-of-the-art synthesis methods by HF-SF image similarity perceptual scores and by automated HF and SF segmentations of white matter (WM), gray matter (GM), and cerebrospinal fluid (CSF). The best performance was achieved by SFNet trained on the latent diffusion enhanced dataset yielding state-of-the-art results in Fréchet inception distance at 9.08 ± 1.21, perceptual similarity at 0.11 ± 0.01, and PSNR at 22.64 ± 1.31. True HF and SF segmentations had a strong overlap with Dice similarity coefficients of 0.71 ± 0.1, 0.79 ± 0.2, and 0.73 ± 0.08 for WM, GM, and CSF, respectively, in the developing infant brain with incomplete myelination, and displayed 166%, 107%, and 106% improvement over respective uLF-based segmentation metrics. SF MRI supports health equity by enhancing the clinical use of uLF imaging systems and improving the diagnostic capabilities of low-cost portable MRI systems in resource-limited settings and for underserved populations. Our code is made openly available at https://github.com/AustinTapp/SFnet.</p>","PeriodicalId":94280,"journal":{"name":"Medical image computing and computer-assisted intervention : MICCAI ... International Conference on Medical Image Computing and Computer-Assisted Intervention","volume":"15003 ","pages":"444-454"},"PeriodicalIF":0.0,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12033166/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144053815","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Self-guided Knowledge-Injected Graph Neural Network for Alzheimer's Diseases. 针对阿尔茨海默病的自导式知识注入图神经网络。
Zhepeng Wang, Runxue Bao, Yawen Wu, Guodong Liu, Lei Yang, Liang Zhan, Feng Zheng, Weiwen Jiang, Yanfu Zhang

Graph neural networks (GNNs) are proficient machine learning models in handling irregularly structured data. Nevertheless, their generic formulation falls short when applied to the analysis of brain connectomes in Alzheimer's Disease (AD), necessitating the incorporation of domain-specific knowledge to achieve optimal model performance. The integration of AD-related expertise into GNNs presents a significant challenge. Current methodologies reliant on manual design often demand substantial expertise from external domain specialists to guide the development of novel models, thereby consuming considerable time and resources. To mitigate the need for manual curation, this paper introduces a novel self-guided knowledge-infused multimodal GNN to autonomously integrate domain knowledge into the model development process. We propose to conceptualize existing domain knowledge as natural language, and devise a specialized multimodal GNN framework tailored to leverage this uncurated knowledge to direct the learning of the GNN submodule, thereby enhancing its efficacy and improving prediction interpretability. To assess the effectiveness of our framework, we compile a comprehensive literature dataset comprising recent peer-reviewed publications on AD. By integrating this literature dataset with several real-world AD datasets, our experimental results illustrate the effectiveness of the proposed method in extracting curated knowledge and offering explanations on graphs for domain-specific applications. Furthermore, our approach successfully utilizes the extracted information to enhance the performance of the GNN.

图神经网络(GNN)是处理不规则结构数据的熟练机器学习模型。然而,在应用于分析阿尔茨海默病(AD)的大脑连接组时,它们的通用表述并不完善,需要结合特定领域的知识才能实现最佳模型性能。将老年痴呆症相关专业知识整合到 GNN 中是一项重大挑战。目前依赖人工设计的方法往往需要外部领域专家提供大量专业知识,以指导新型模型的开发,从而耗费大量时间和资源。为了减少对人工策划的需求,本文介绍了一种新型的自引导知识注入多模态 GNN,可自主地将领域知识整合到模型开发过程中。我们建议将现有的领域知识概念化为自然语言,并设计一个专门的多模态 GNN 框架,利用这些未经整理的知识来指导 GNN 子模块的学习,从而增强其功效并提高预测的可解释性。为了评估我们的框架的有效性,我们汇编了一个全面的文献数据集,其中包括最近发表的有关注意力缺失症的同行评议出版物。通过将该文献数据集与几个真实世界的注意力缺失症数据集进行整合,我们的实验结果表明了所提出的方法在为特定领域应用提取策划知识和提供图解方面的有效性。此外,我们的方法还成功地利用了提取的信息来提高 GNN 的性能。
{"title":"Self-guided Knowledge-Injected Graph Neural Network for Alzheimer's Diseases.","authors":"Zhepeng Wang, Runxue Bao, Yawen Wu, Guodong Liu, Lei Yang, Liang Zhan, Feng Zheng, Weiwen Jiang, Yanfu Zhang","doi":"10.1007/978-3-031-72069-7_36","DOIUrl":"10.1007/978-3-031-72069-7_36","url":null,"abstract":"<p><p>Graph neural networks (GNNs) are proficient machine learning models in handling irregularly structured data. Nevertheless, their generic formulation falls short when applied to the analysis of brain connectomes in Alzheimer's Disease (AD), necessitating the incorporation of domain-specific knowledge to achieve optimal model performance. The integration of AD-related expertise into GNNs presents a significant challenge. Current methodologies reliant on manual design often demand substantial expertise from external domain specialists to guide the development of novel models, thereby consuming considerable time and resources. To mitigate the need for manual curation, this paper introduces a novel self-guided knowledge-infused multimodal GNN to autonomously integrate domain knowledge into the model development process. We propose to conceptualize existing domain knowledge as natural language, and devise a specialized multimodal GNN framework tailored to leverage this uncurated knowledge to direct the learning of the GNN submodule, thereby enhancing its efficacy and improving prediction interpretability. To assess the effectiveness of our framework, we compile a comprehensive literature dataset comprising recent peer-reviewed publications on AD. By integrating this literature dataset with several real-world AD datasets, our experimental results illustrate the effectiveness of the proposed method in extracting curated knowledge and offering explanations on graphs for domain-specific applications. Furthermore, our approach successfully utilizes the extracted information to enhance the performance of the GNN.</p>","PeriodicalId":94280,"journal":{"name":"Medical image computing and computer-assisted intervention : MICCAI ... International Conference on Medical Image Computing and Computer-Assisted Intervention","volume":"15002 ","pages":"378-388"},"PeriodicalIF":0.0,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11488260/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142484836","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Gradient Guided Co-Retention Feature Pyramid Network for LDCT Image Denoising. 梯度引导的共保留特征金字塔网络用于LDCT图像去噪。
Li Zhou, Dayang Wang, Yongshun Xu, Shuo Han, Bahareh Morovati, Shuyi Fan, Hengyong Yu

Low-dose computed tomography (LDCT) reduces the risks of radiation exposure but introduces noise and artifacts into CT images. The Feature Pyramid Network (FPN) is a conventional method for extracting multi-scale feature maps from input images. While upper layers in FPN enhance semantic value, details become generalized with reduced spatial resolution at each layer. In this work, we propose a Gradient Guided Co-Retention Feature Pyramid Network (G2CR-FPN) to address the connection between spatial resolution and semantic value beyond feature maps extracted from LDCT images. The network is structured with three essential paths: the bottom-up path utilizes the FPN structure to generate the hierarchical feature maps, representing multi-scale spatial resolutions and semantic values. Meanwhile, the lateral path serves as a skip connection between feature maps with the same spatial resolution, while also functioning feature maps as directional gradients. This path incorporates a gradient approximation, deriving edge-like enhanced feature maps in horizontal and vertical directions. The top-down path incorporates a proposed co-retention block that learns the high-level semantic value embedded in the preceding map of the path. This learning process is guided by the directional gradient approximation of the high-resolution feature map from the bottom-up path. Experimental results on the clinical CT images demonstrated the promising performance of the model. Our code is available at: https://github.com/liz109/G2CR-FPN.

低剂量计算机断层扫描(LDCT)降低了辐射暴露的风险,但在CT图像中引入了噪声和伪影。特征金字塔网络(FPN)是从输入图像中提取多尺度特征映射的一种传统方法。在FPN中,上层的语义值会得到提升,而细节则会随着每层空间分辨率的降低而一般化。在这项工作中,我们提出了一个梯度引导的共同保留特征金字塔网络(G2CR-FPN)来解决从LDCT图像中提取的特征图之外的空间分辨率和语义值之间的联系。该网络由三条基本路径构成:自底向上路径利用FPN结构生成层次化特征图,表示多尺度空间分辨率和语义值;同时,横向路径作为具有相同空间分辨率的特征图之间的跳跃连接,同时也将特征图作为方向梯度。该路径结合了梯度近似,在水平和垂直方向上派生出类似边缘的增强特征图。自顶向下的路径包含了一个建议的协同保留块,该块学习嵌入在路径的前一个映射中的高级语义值。该学习过程由自底向上路径的高分辨率特征映射的方向梯度近似指导。在临床CT图像上的实验结果证明了该模型的良好性能。我们的代码可在:https://github.com/liz109/G2CR-FPN。
{"title":"Gradient Guided Co-Retention Feature Pyramid Network for LDCT Image Denoising.","authors":"Li Zhou, Dayang Wang, Yongshun Xu, Shuo Han, Bahareh Morovati, Shuyi Fan, Hengyong Yu","doi":"10.1007/978-3-031-72390-2_15","DOIUrl":"10.1007/978-3-031-72390-2_15","url":null,"abstract":"<p><p>Low-dose computed tomography (LDCT) reduces the risks of radiation exposure but introduces noise and artifacts into CT images. The Feature Pyramid Network (FPN) is a conventional method for extracting multi-scale feature maps from input images. While upper layers in FPN enhance semantic value, details become generalized with reduced spatial resolution at each layer. In this work, we propose a Gradient Guided Co-Retention Feature Pyramid Network (G2CR-FPN) to address the connection between spatial resolution and semantic value beyond feature maps extracted from LDCT images. The network is structured with three essential paths: the bottom-up path utilizes the FPN structure to generate the hierarchical feature maps, representing multi-scale spatial resolutions and semantic values. Meanwhile, the lateral path serves as a skip connection between feature maps with the same spatial resolution, while also functioning feature maps as directional gradients. This path incorporates a gradient approximation, deriving edge-like enhanced feature maps in horizontal and vertical directions. The top-down path incorporates a proposed co-retention block that learns the high-level semantic value embedded in the preceding map of the path. This learning process is guided by the directional gradient approximation of the high-resolution feature map from the bottom-up path. Experimental results on the clinical CT images demonstrated the promising performance of the model. Our code is available at: https://github.com/liz109/G2CR-FPN.</p>","PeriodicalId":94280,"journal":{"name":"Medical image computing and computer-assisted intervention : MICCAI ... International Conference on Medical Image Computing and Computer-Assisted Intervention","volume":"15012 ","pages":"153-163"},"PeriodicalIF":0.0,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12443485/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145088647","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Disentangled Hybrid Transformer for Identification of Infants with Prenatal Drug Exposure. 解缠混合变压器鉴别婴儿产前药物暴露。
Jiale Cheng, Zhengwang Wu, Xinrui Yuan, Li Wang, Weili Lin, Karen Grewen, Gang Li

Prenatal drug exposure, which occurs during a time of extraordinary and critical brain development, is typically associated with cognitive, behavioral, and physiological deficits during infancy, childhood, and adolescence. Early identifying infants with prenatal drug exposures and associated biomarkers using neuroimages can help inform earlier, more effective, and personalized interventions to greatly improve later cognitive outcomes. To this end, we propose a novel deep learning model called disentangled hybrid volume-surface transformer for identifying individual infants with prenatal drug exposures. Specifically, we design two distinct branches, a volumetric network for learning non-cortical features in 3D image space, and a surface network for learning features on the highly convoluted cortical surface manifold. To better capture long-range dependency and generate highly discriminative representations, image and surface transformers are respectively employed for the volume and surface branches. Then, a disentanglement strategy is further proposed to separate the representations from two branches into complementary variables and common variables, thus removing redundant information and boosting expressive capability. After that, the disentangled representations are concatenated to a classifier to determine if there is an existence of prenatal drug exposures. We have validated our method on 210 infant MRI scans and demonstrated its superior performance, compared to ablated models and state-of-the-art methods.

产前药物暴露发生在大脑发育异常和关键时期,通常与婴儿期、儿童期和青春期的认知、行为和生理缺陷有关。使用神经图像早期识别产前药物暴露和相关生物标志物的婴儿可以帮助提供更早,更有效和个性化的干预措施,以极大地改善后来的认知结果。为此,我们提出了一种新的深度学习模型,称为解纠缠混合体积-表面变压器,用于识别产前药物暴露的个体婴儿。具体来说,我们设计了两个不同的分支,一个是用于学习3D图像空间中非皮质特征的体积网络,另一个是用于学习高度卷积的皮质表面流形上的特征的表面网络。为了更好地捕获远程依赖关系并生成高度判别的表示,分别对体积和表面分支使用了图像和表面变压器。然后,提出了一种解纠缠策略,将两个分支的表示分离为互补变量和公共变量,从而去除冗余信息,提高表达能力。之后,将解纠缠的表示连接到分类器,以确定是否存在产前药物暴露。我们已经在210个婴儿MRI扫描上验证了我们的方法,并证明了与消融模型和最先进的方法相比,它具有优越的性能。
{"title":"Disentangled Hybrid Transformer for Identification of Infants with Prenatal Drug Exposure.","authors":"Jiale Cheng, Zhengwang Wu, Xinrui Yuan, Li Wang, Weili Lin, Karen Grewen, Gang Li","doi":"10.1007/978-3-031-72390-2_7","DOIUrl":"10.1007/978-3-031-72390-2_7","url":null,"abstract":"<p><p>Prenatal drug exposure, which occurs during a time of extraordinary and critical brain development, is typically associated with cognitive, behavioral, and physiological deficits during infancy, childhood, and adolescence. Early identifying infants with prenatal drug exposures and associated biomarkers using neuroimages can help inform earlier, more effective, and personalized interventions to greatly improve later cognitive outcomes. To this end, we propose a novel deep learning model called disentangled hybrid volume-surface transformer for identifying individual infants with prenatal drug exposures. Specifically, we design two distinct branches, a volumetric network for learning non-cortical features in 3D image space, and a surface network for learning features on the highly convoluted cortical surface manifold. To better capture long-range dependency and generate highly discriminative representations, image and surface transformers are respectively employed for the volume and surface branches. Then, a disentanglement strategy is further proposed to separate the representations from two branches into complementary variables and common variables, thus removing redundant information and boosting expressive capability. After that, the disentangled representations are concatenated to a classifier to determine if there is an existence of prenatal drug exposures. We have validated our method on 210 infant MRI scans and demonstrated its superior performance, compared to ablated models and state-of-the-art methods.</p>","PeriodicalId":94280,"journal":{"name":"Medical image computing and computer-assisted intervention : MICCAI ... International Conference on Medical Image Computing and Computer-Assisted Intervention","volume":"15012 ","pages":"67-76"},"PeriodicalIF":0.0,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC13004604/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147501339","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Rethinking Histology Slide Digitization Workflows for Low-Resource Settings. 重新思考低资源环境下的组织学切片数字化工作流程。
Talat Zehra, Joseph Marino, Wendy Wang, Grigoriy Frantsuzov, Saad Nadeem

Histology slide digitization is becoming essential for telepathology (remote consultation), knowledge sharing (education), and using the state-of-the-art artificial intelligence algorithms (augmented/automated end-to-end clinical workflows). However, the cumulative costs of digital multi-slide high-speed brightfield scanners, cloud/on-premises storage, and personnel (IT and technicians) make the current slide digitization workflows out-of-reach for limited-resource settings, further widening the health equity gap; even single-slide manual scanning commercial solutions are costly due to hardware requirements (high-resolution cameras, high-spec PC/workstation, and support for only high-end microscopes). In this work, we present a new cloud slide digitization workflow for creating scanner-quality whole-slide images (WSIs) from uploaded low-quality videos, acquired from cheap and inexpensive microscopes with built-in cameras. Specifically, we present a pipeline to create stitched WSIs while automatically deblurring out-of-focus regions, upsampling input 10X images to 40X resolution, and reducing brightness/contrast and light-source illumination variations. We demonstrate the WSI creation efficacy from our workflow on World Health Organization-declared neglected tropical disease, Cutaneous Leishmaniasis (prevalent only in the poorest regions of the world and only diagnosed by sub-specialist dermatopathologists, rare in poor countries), as well as other common pathologies on core biopsies of breast, liver, duodenum, stomach and lymph node. The code and pretrained models will be accessible via our GitHub (https://github.com/nadeemlab/DeepLIIF), and the cloud platform will be available at https://deepliif.org for uploading microscope videos and downloading/viewing WSIs with shareable links (no sign-in required) for telepathology and knowledge sharing.

组织学切片数字化对于远程病理学(远程会诊)、知识共享(教育)和使用最先进的人工智能算法(增强/自动化端到端临床工作流程)变得至关重要。然而,数字多幻灯片高速明场扫描仪、云/本地存储和人员(IT和技术人员)的累积成本使得目前的幻灯片数字化工作流程在资源有限的环境中遥不可及,进一步扩大了健康公平差距;由于硬件要求(高分辨率相机,高规格PC/工作站,只支持高端显微镜),即使是单片手动扫描商业解决方案也很昂贵。在这项工作中,我们提出了一种新的云幻灯片数字化工作流程,用于从上载的低质量视频中创建扫描仪质量的全幻灯片图像(wsi),这些视频来自内置摄像头的廉价和廉价显微镜。具体来说,我们提出了一个流水线来创建缝合的wsi,同时自动去模糊失焦区域,将输入的10倍图像上采样到40倍分辨率,并减少亮度/对比度和光源照明变化。我们从世界卫生组织宣布被忽视的热带病、皮肤利什曼病(仅在世界上最贫穷的地区流行,仅由亚专科皮肤病理学家诊断,在贫穷国家很少见)以及乳房、肝脏、十二指肠、胃和淋巴结核心活检的其他常见病理的工作流程中证明了WSI的创建效果。代码和预训练模型可通过GitHub (https://github.com/nadeemlab/DeepLIIF)访问,云平台https://deepliif.org可用于上传显微镜视频和下载/查看具有可共享链接的WSIs(无需登录),用于心灵病理学和知识共享。
{"title":"Rethinking Histology Slide Digitization Workflows for Low-Resource Settings.","authors":"Talat Zehra, Joseph Marino, Wendy Wang, Grigoriy Frantsuzov, Saad Nadeem","doi":"10.1007/978-3-031-72083-3_40","DOIUrl":"10.1007/978-3-031-72083-3_40","url":null,"abstract":"<p><p>Histology slide digitization is becoming essential for telepathology (remote consultation), knowledge sharing (education), and using the state-of-the-art artificial intelligence algorithms (augmented/automated end-to-end clinical workflows). However, the cumulative costs of digital multi-slide high-speed brightfield scanners, cloud/on-premises storage, and personnel (IT and technicians) make the current slide digitization workflows out-of-reach for limited-resource settings, further widening the health equity gap; even single-slide manual scanning commercial solutions are costly due to hardware requirements (high-resolution cameras, high-spec PC/workstation, and support for only high-end microscopes). In this work, we present a new cloud slide digitization workflow for creating scanner-quality whole-slide images (WSIs) from uploaded low-quality videos, acquired from cheap and inexpensive microscopes with built-in cameras. Specifically, we present a pipeline to create stitched WSIs while automatically deblurring out-of-focus regions, upsampling input 10X images to 40X resolution, and reducing brightness/contrast and light-source illumination variations. We demonstrate the WSI creation efficacy from our workflow on World Health Organization-declared neglected tropical disease, Cutaneous Leishmaniasis (prevalent only in the poorest regions of the world and only diagnosed by sub-specialist dermatopathologists, rare in poor countries), as well as other common pathologies on core biopsies of breast, liver, duodenum, stomach and lymph node. The code and pretrained models will be accessible via our GitHub (https://github.com/nadeemlab/DeepLIIF), and the cloud platform will be available at https://deepliif.org for uploading microscope videos and downloading/viewing WSIs with shareable links (no sign-in required) for telepathology and knowledge sharing.</p>","PeriodicalId":94280,"journal":{"name":"Medical image computing and computer-assisted intervention : MICCAI ... International Conference on Medical Image Computing and Computer-Assisted Intervention","volume":"15004 ","pages":"427-436"},"PeriodicalIF":0.0,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11786607/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143082977","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Development of Effective Connectome from Infancy to Adolescence. 从婴儿期到青春期有效连接组的发展。
Guoshi Li, Kim-Han Thung, Hoyt Taylor, Zhengwang Wu, Gang Li, Li Wang, Weili Lin, Sahar Ahmad, Pew-Thian Yap

Delineating the normative developmental profile of functional connectome is important for both standardized assessment of individual growth and early detection of diseases. However, functional connectome has been mostly studied using functional connectivity (FC), where undirected connectivity strengths are estimated from statistical correlation of resting-state functional MRI (rs-fMRI) signals. To address this limitation, we applied regression dynamic causal modeling (rDCM) to delineate the developmental trajectories of effective connectivity (EC), the directed causal influence among neuronal populations, in whole-brain networks from infancy to adolescence (0-22 years old) based on high-quality rs-fMRI data from Baby Connectome Project (BCP) and Human Connectome Project Development (HCP-D). Analysis with linear mixed model demonstrates significant age effect on the mean nodal EC which is best fit by a "U" shaped quadratic curve with minimal EC at around 2 years old. Further analysis indicates that five brain regions including the left and right cuneus, left precuneus, left supramarginal gyrus and right inferior temporal gyrus have the most significant age effect on nodal EC (p < 0.05, FDR corrected). Moreover, the frontoparietal control (FPC) network shows the fastest increase from early childhood to adolescence followed by the visual and salience networks. Our findings suggest complex nonlinear developmental profile of EC from infancy to adolescence, which may reflect dynamic structural and functional maturation during this critical growth period.

描述功能连接体的规范性发育特征对于个体生长的标准化评估和疾病的早期发现都很重要。然而,功能连接组的研究主要是使用功能连接(FC),其中无向连接强度是通过静息状态功能MRI (rs-fMRI)信号的统计相关性来估计的。为了解决这一局限性,我们基于婴儿连接组项目(BCP)和人类连接组项目发展(HCP-D)的高质量rs-fMRI数据,应用回归动态因果模型(rDCM)来描述婴儿期到青春期(0-22岁)全脑网络中有效连接(EC)的发展轨迹,即神经元群体之间的直接因果影响。线性混合模型分析表明,年龄对平均节点电导率的影响显著,其最佳拟合曲线为“U”型,2岁左右电导率最小。进一步分析发现,左右楔叶、左楔前叶、左边缘上回、右颞下回等5个脑区对结性EC的年龄影响最为显著(p < 0.05, FDR校正)。此外,额顶叶控制(FPC)网络在儿童早期到青少年时期增长最快,其次是视觉和显著性网络。我们的研究结果表明,从婴儿期到青春期,EC复杂的非线性发育特征可能反映了这一关键生长时期结构和功能的动态成熟。
{"title":"Development of Effective Connectome from Infancy to Adolescence.","authors":"Guoshi Li, Kim-Han Thung, Hoyt Taylor, Zhengwang Wu, Gang Li, Li Wang, Weili Lin, Sahar Ahmad, Pew-Thian Yap","doi":"10.1007/978-3-031-72384-1_13","DOIUrl":"10.1007/978-3-031-72384-1_13","url":null,"abstract":"<p><p>Delineating the normative developmental profile of functional connectome is important for both standardized assessment of individual growth and early detection of diseases. However, functional connectome has been mostly studied using functional connectivity (FC), where undirected connectivity strengths are estimated from statistical correlation of resting-state functional MRI (rs-fMRI) signals. To address this limitation, we applied regression dynamic causal modeling (rDCM) to delineate the developmental trajectories of effective connectivity (EC), the directed causal influence among neuronal populations, in whole-brain networks from infancy to adolescence (0-22 years old) based on high-quality rs-fMRI data from Baby Connectome Project (BCP) and Human Connectome Project Development (HCP-D). Analysis with linear mixed model demonstrates significant age effect on the mean nodal EC which is best fit by a \"U\" shaped quadratic curve with minimal EC at around 2 years old. Further analysis indicates that five brain regions including the left and right cuneus, left precuneus, left supramarginal gyrus and right inferior temporal gyrus have the most significant age effect on nodal EC (<i>p</i> < 0.05, FDR corrected). Moreover, the frontoparietal control (FPC) network shows the fastest increase from early childhood to adolescence followed by the visual and salience networks. Our findings suggest complex nonlinear developmental profile of EC from infancy to adolescence, which may reflect dynamic structural and functional maturation during this critical growth period.</p>","PeriodicalId":94280,"journal":{"name":"Medical image computing and computer-assisted intervention : MICCAI ... International Conference on Medical Image Computing and Computer-Assisted Intervention","volume":"15003 ","pages":"131-140"},"PeriodicalIF":0.0,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11758277/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143049390","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Feature Extraction for Generative Medical Imaging Evaluation: New Evidence Against an Evolving Trend. 特征提取用于生成医学成像评价:新证据反对一个不断发展的趋势。
McKell Woodland, Austin Castelo, Mais Al Taie, Jessica Albuquerque Marques Silva, Mohamed Eltaher, Frank Mohn, Alexander Shieh, Suprateek Kundu, Joshua P Yung, Ankit B Patel, Kristy K Brock

Fréchet Inception Distance (FID) is a widely used metric for assessing synthetic image quality. It relies on an ImageNet-based feature extractor, making its applicability to medical imaging unclear. A recent trend is to adapt FID to medical imaging through feature extractors trained on medical images. Our study challenges this practice by demonstrating that ImageNet-based extractors are more consistent and aligned with human judgment than their RadImageNet counterparts. We evaluated sixteen StyleGAN2 networks across four medical imaging modalities and four data augmentation techniques with Fréchet distances (FDs) computed using eleven ImageNet or RadImageNet-trained feature extractors. Comparison with human judgment via visual Turing tests revealed that ImageNet-based extractors produced rankings consistent with human judgment, with the FD derived from the ImageNet-trained SwAV extractor significantly correlating with expert evaluations. In contrast, RadImageNet-based rankings were volatile and inconsistent with human judgment. Our findings challenge prevailing assumptions, providing novel evidence that medical image-trained feature extractors do not inherently improve FDs and can even compromise their reliability. Our code is available at https://github.com/mckellwoodland/fid-med-eval.

起始距离(FID)是一种广泛用于评价合成图像质量的度量。它依赖于基于imagenet的特征提取器,这使得它对医学成像的适用性不明确。最近的一个趋势是通过对医学图像进行训练的特征提取器使FID适应医学成像。我们的研究通过证明基于imagenet的提取器比基于RadImageNet的提取器更符合人类的判断,从而挑战了这种做法。我们评估了16个StyleGAN2网络跨越4种医学成像模式和4种数据增强技术,使用11个ImageNet或radimagenet训练的特征提取器计算了fr距离(fd)。通过视觉图灵测试与人类判断的比较显示,基于imagenet的提取器产生的排名与人类判断一致,从imagenet训练的SwAV提取器获得的FD与专家评估显着相关。相比之下,基于radimagenet的排名是不稳定的,与人类的判断不一致。我们的研究结果挑战了普遍的假设,提供了新的证据,证明医学图像训练的特征提取器并不能内在地改善fd,甚至可能损害其可靠性。我们的代码可在https://github.com/mckellwoodland/fid-med-eval上获得。
{"title":"Feature Extraction for Generative Medical Imaging Evaluation: New Evidence Against an Evolving Trend.","authors":"McKell Woodland, Austin Castelo, Mais Al Taie, Jessica Albuquerque Marques Silva, Mohamed Eltaher, Frank Mohn, Alexander Shieh, Suprateek Kundu, Joshua P Yung, Ankit B Patel, Kristy K Brock","doi":"10.1007/978-3-031-72390-2_9","DOIUrl":"10.1007/978-3-031-72390-2_9","url":null,"abstract":"<p><p>Fréchet Inception Distance (FID) is a widely used metric for assessing synthetic image quality. It relies on an ImageNet-based feature extractor, making its applicability to medical imaging unclear. A recent trend is to adapt FID to medical imaging through feature extractors trained on medical images. Our study challenges this practice by demonstrating that ImageNet-based extractors are more consistent and aligned with human judgment than their RadImageNet counterparts. We evaluated sixteen StyleGAN2 networks across four medical imaging modalities and four data augmentation techniques with Fréchet distances (FDs) computed using eleven ImageNet or RadImageNet-trained feature extractors. Comparison with human judgment via visual Turing tests revealed that ImageNet-based extractors produced rankings consistent with human judgment, with the FD derived from the ImageNet-trained SwAV extractor significantly correlating with expert evaluations. In contrast, RadImageNet-based rankings were volatile and inconsistent with human judgment. Our findings challenge prevailing assumptions, providing novel evidence that medical image-trained feature extractors do not inherently improve FDs and can even compromise their reliability. Our code is available at https://github.com/mckellwoodland/fid-med-eval.</p>","PeriodicalId":94280,"journal":{"name":"Medical image computing and computer-assisted intervention : MICCAI ... International Conference on Medical Image Computing and Computer-Assisted Intervention","volume":"15012 ","pages":"87-97"},"PeriodicalIF":0.0,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12117514/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144183435","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Medical image computing and computer-assisted intervention : MICCAI ... International Conference on Medical Image Computing and Computer-Assisted Intervention
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1