首页 > 最新文献

Information processing in medical imaging : proceedings of the ... conference最新文献

英文 中文
Pixel-Level Explanation of Multiple Instance Learning Models in Biomedical Single Cell Images 生物医学单细胞图像中多实例学习模型的像素级解释
Pub Date : 2023-03-15 DOI: 10.48550/arXiv.2303.08632
A. Sadafi, Oleksandra Adonkina, Ashkan Khakzar, P. Lienemann, Rudolf Matthias Hehr, D. Rueckert, N. Navab, C. Marr
Explainability is a key requirement for computer-aided diagnosis systems in clinical decision-making. Multiple instance learning with attention pooling provides instance-level explainability, however for many clinical applications a deeper, pixel-level explanation is desirable, but missing so far. In this work, we investigate the use of four attribution methods to explain a multiple instance learning models: GradCAM, Layer-Wise Relevance Propagation (LRP), Information Bottleneck Attribution (IBA), and InputIBA. With this collection of methods, we can derive pixel-level explanations on for the task of diagnosing blood cancer from patients' blood smears. We study two datasets of acute myeloid leukemia with over 100 000 single cell images and observe how each attribution method performs on the multiple instance learning architecture focusing on different properties of the white blood single cells. Additionally, we compare attribution maps with the annotations of a medical expert to see how the model's decision-making differs from the human standard. Our study addresses the challenge of implementing pixel-level explainability in multiple instance learning models and provides insights for clinicians to better understand and trust decisions from computer-aided diagnosis systems.
可解释性是计算机辅助诊断系统在临床决策中的关键要求。具有注意力池的多实例学习提供了实例级的可解释性,然而对于许多临床应用来说,更深入的、像素级的解释是可取的,但迄今为止还没有。在这项工作中,我们研究了使用四种归因方法来解释多实例学习模型:GradCAM,分层相关传播(LRP),信息瓶颈归因(IBA)和InputIBA。通过这些方法的集合,我们可以从患者的血液涂片中获得诊断血癌任务的像素级解释。我们研究了两个包含超过100,000个单细胞图像的急性髓系白血病数据集,并观察了每种归因方法在多实例学习架构上的表现,重点关注白细胞的不同特性。此外,我们将归因图与医学专家的注释进行比较,看看模型的决策与人类标准有何不同。我们的研究解决了在多实例学习模型中实现像素级可解释性的挑战,并为临床医生更好地理解和信任计算机辅助诊断系统的决策提供了见解。
{"title":"Pixel-Level Explanation of Multiple Instance Learning Models in Biomedical Single Cell Images","authors":"A. Sadafi, Oleksandra Adonkina, Ashkan Khakzar, P. Lienemann, Rudolf Matthias Hehr, D. Rueckert, N. Navab, C. Marr","doi":"10.48550/arXiv.2303.08632","DOIUrl":"https://doi.org/10.48550/arXiv.2303.08632","url":null,"abstract":"Explainability is a key requirement for computer-aided diagnosis systems in clinical decision-making. Multiple instance learning with attention pooling provides instance-level explainability, however for many clinical applications a deeper, pixel-level explanation is desirable, but missing so far. In this work, we investigate the use of four attribution methods to explain a multiple instance learning models: GradCAM, Layer-Wise Relevance Propagation (LRP), Information Bottleneck Attribution (IBA), and InputIBA. With this collection of methods, we can derive pixel-level explanations on for the task of diagnosing blood cancer from patients' blood smears. We study two datasets of acute myeloid leukemia with over 100 000 single cell images and observe how each attribution method performs on the multiple instance learning architecture focusing on different properties of the white blood single cells. Additionally, we compare attribution maps with the annotations of a medical expert to see how the model's decision-making differs from the human standard. Our study addresses the challenge of implementing pixel-level explainability in multiple instance learning models and provides insights for clinicians to better understand and trust decisions from computer-aided diagnosis systems.","PeriodicalId":73379,"journal":{"name":"Information processing in medical imaging : proceedings of the ... conference","volume":"23 1","pages":"170-182"},"PeriodicalIF":0.0,"publicationDate":"2023-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78891836","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
HALOS: Hallucination-free Organ Segmentation after Organ Resection Surgery 光晕:器官切除术后无幻觉的器官分割
Pub Date : 2023-03-14 DOI: 10.48550/arXiv.2303.07717
Anne-Marie Rickmann, Murong Xu, Thomas Wolf, Oksana P. Kovalenko, C. Wachinger
The wide range of research in deep learning-based medical image segmentation pushed the boundaries in a multitude of applications. A clinically relevant problem that received less attention is the handling of scans with irregular anatomy, e.g., after organ resection. State-of-the-art segmentation models often lead to organ hallucinations, i.e., false-positive predictions of organs, which cannot be alleviated by oversampling or post-processing. Motivated by the increasing need to develop robust deep learning models, we propose HALOS for abdominal organ segmentation in MR images that handles cases after organ resection surgery. To this end, we combine missing organ classification and multi-organ segmentation tasks into a multi-task model, yielding a classification-assisted segmentation pipeline. The segmentation network learns to incorporate knowledge about organ existence via feature fusion modules. Extensive experiments on a small labeled test set and large-scale UK Biobank data demonstrate the effectiveness of our approach in terms of higher segmentation Dice scores and near-to-zero false positive prediction rate.
基于深度学习的医学图像分割的广泛研究在众多应用中突破了界限。一个较少受到关注的临床相关问题是不规则解剖扫描的处理,例如器官切除后。最先进的分割模型经常导致器官幻觉,即器官的假阳性预测,这不能通过过采样或后处理来缓解。由于越来越需要开发强大的深度学习模型,我们提出了用于MR图像中腹部器官分割的HALOS,用于处理器官切除手术后的病例。为此,我们将缺失器官分类和多器官分割任务结合到一个多任务模型中,产生了一个分类辅助分割管道。该分割网络通过特征融合模块学习合并器官存在的知识。在小型标记测试集和大规模UK Biobank数据上进行的大量实验表明,我们的方法在更高的分割Dice分数和接近于零的假阳性预测率方面是有效的。
{"title":"HALOS: Hallucination-free Organ Segmentation after Organ Resection Surgery","authors":"Anne-Marie Rickmann, Murong Xu, Thomas Wolf, Oksana P. Kovalenko, C. Wachinger","doi":"10.48550/arXiv.2303.07717","DOIUrl":"https://doi.org/10.48550/arXiv.2303.07717","url":null,"abstract":"The wide range of research in deep learning-based medical image segmentation pushed the boundaries in a multitude of applications. A clinically relevant problem that received less attention is the handling of scans with irregular anatomy, e.g., after organ resection. State-of-the-art segmentation models often lead to organ hallucinations, i.e., false-positive predictions of organs, which cannot be alleviated by oversampling or post-processing. Motivated by the increasing need to develop robust deep learning models, we propose HALOS for abdominal organ segmentation in MR images that handles cases after organ resection surgery. To this end, we combine missing organ classification and multi-organ segmentation tasks into a multi-task model, yielding a classification-assisted segmentation pipeline. The segmentation network learns to incorporate knowledge about organ existence via feature fusion modules. Extensive experiments on a small labeled test set and large-scale UK Biobank data demonstrate the effectiveness of our approach in terms of higher segmentation Dice scores and near-to-zero false positive prediction rate.","PeriodicalId":73379,"journal":{"name":"Information processing in medical imaging : proceedings of the ... conference","volume":"12 1","pages":"667-678"},"PeriodicalIF":0.0,"publicationDate":"2023-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76323908","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
NeurEPDiff: Neural Operators to Predict Geodesics in Deformation Spaces 预测变形空间测地线的神经算子
Pub Date : 2023-03-13 DOI: 10.48550/arXiv.2303.07115
Nian Wu, Miaomiao Zhang
This paper presents NeurEPDiff, a novel network to fast predict the geodesics in deformation spaces generated by a well known Euler-Poincar'e differential equation (EPDiff). To achieve this, we develop a neural operator that for the first time learns the evolving trajectory of geodesic deformations parameterized in the tangent space of diffeomorphisms(a.k.a velocity fields). In contrast to previous methods that purely fit the training images, our proposed NeurEPDiff learns a nonlinear mapping function between the time-dependent velocity fields. A composition of integral operators and smooth activation functions is formulated in each layer of NeurEPDiff to effectively approximate such mappings. The fact that NeurEPDiff is able to rapidly provide the numerical solution of EPDiff (given any initial condition) results in a significantly reduced computational cost of geodesic shooting of diffeomorphisms in a high-dimensional image space. Additionally, the properties of discretiztion/resolution-invariant of NeurEPDiff make its performance generalizable to multiple image resolutions after being trained offline. We demonstrate the effectiveness of NeurEPDiff in registering two image datasets: 2D synthetic data and 3D brain resonance imaging (MRI). The registration accuracy and computational efficiency are compared with the state-of-the-art diffeomophic registration algorithms with geodesic shooting.
本文提出了一种新的神经网络NeurEPDiff,用于快速预测由欧拉-庞加莱微分方程(EPDiff)产生的变形空间中的测地线。为了实现这一点,我们开发了一个神经算子,该算子首次学习了在微分同态的切空间中参数化的测地线变形的演化轨迹。A速度场)。与以往单纯拟合训练图像的方法不同,我们提出的NeurEPDiff学习了随时间变化的速度场之间的非线性映射函数。在NeurEPDiff的每一层中建立了积分算子和光滑激活函数的组合,以有效地近似这种映射。NeurEPDiff能够快速提供EPDiff的数值解(给定任何初始条件),从而大大降低了高维图像空间中差分同态的测地拍摄的计算成本。此外,NeurEPDiff的离散化/分辨率不变性特性使其在离线训练后可以推广到多种图像分辨率。我们证明了NeurEPDiff在注册两个图像数据集方面的有效性:2D合成数据和3D脑磁共振成像(MRI)。并将其配准精度和计算效率与目前最先进的测地线射击差分配准算法进行了比较。
{"title":"NeurEPDiff: Neural Operators to Predict Geodesics in Deformation Spaces","authors":"Nian Wu, Miaomiao Zhang","doi":"10.48550/arXiv.2303.07115","DOIUrl":"https://doi.org/10.48550/arXiv.2303.07115","url":null,"abstract":"This paper presents NeurEPDiff, a novel network to fast predict the geodesics in deformation spaces generated by a well known Euler-Poincar'e differential equation (EPDiff). To achieve this, we develop a neural operator that for the first time learns the evolving trajectory of geodesic deformations parameterized in the tangent space of diffeomorphisms(a.k.a velocity fields). In contrast to previous methods that purely fit the training images, our proposed NeurEPDiff learns a nonlinear mapping function between the time-dependent velocity fields. A composition of integral operators and smooth activation functions is formulated in each layer of NeurEPDiff to effectively approximate such mappings. The fact that NeurEPDiff is able to rapidly provide the numerical solution of EPDiff (given any initial condition) results in a significantly reduced computational cost of geodesic shooting of diffeomorphisms in a high-dimensional image space. Additionally, the properties of discretiztion/resolution-invariant of NeurEPDiff make its performance generalizable to multiple image resolutions after being trained offline. We demonstrate the effectiveness of NeurEPDiff in registering two image datasets: 2D synthetic data and 3D brain resonance imaging (MRI). The registration accuracy and computational efficiency are compared with the state-of-the-art diffeomophic registration algorithms with geodesic shooting.","PeriodicalId":73379,"journal":{"name":"Information processing in medical imaging : proceedings of the ... conference","volume":"24 1","pages":"588-600"},"PeriodicalIF":0.0,"publicationDate":"2023-03-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79122410","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
A Surface-normal Based Neural Framework for Colonoscopy Reconstruction 基于表面正常的结肠镜重建神经框架
Pub Date : 2023-03-13 DOI: 10.48550/arXiv.2303.07264
Shuxian Wang, Yubo Zhang, Sarah K. McGill, J. Rosenman, Jan-Michael Frahm, Soumyadip Sengupta, S. Pizer
Reconstructing a 3D surface from colonoscopy video is challenging due to illumination and reflectivity variation in the video frame that can cause defective shape predictions. Aiming to overcome this challenge, we utilize the characteristics of surface normal vectors and develop a two-step neural framework that significantly improves the colonoscopy reconstruction quality. The normal-based depth initialization network trained with self-supervised normal consistency loss provides depth map initialization to the normal-depth refinement module, which utilizes the relationship between illumination and surface normals to refine the frame-wise normal and depth predictions recursively. Our framework's depth accuracy performance on phantom colonoscopy data demonstrates the value of exploiting the surface normals in colonoscopy reconstruction, especially on en face views. Due to its low depth error, the prediction result from our framework will require limited post-processing to be clinically applicable for real-time colonoscopy reconstruction.
由于视频帧中的照明和反射率变化可能导致有缺陷的形状预测,因此从结肠镜检查视频中重建3D表面具有挑战性。为了克服这一挑战,我们利用表面法向量的特点,开发了一个两步神经框架,显著提高了结肠镜重建质量。使用自监督法线一致性损失训练的基于法线的深度初始化网络为法线深度细化模块提供深度图初始化,该模块利用光照和表面法线之间的关系递归地细化逐帧的法线和深度预测。我们的框架在模拟结肠镜数据上的深度精度表现表明了在结肠镜重建中利用表面法线的价值,特别是在正面视图上。由于其低深度误差,我们的框架预测结果需要有限的后处理才能临床应用于实时结肠镜重建。
{"title":"A Surface-normal Based Neural Framework for Colonoscopy Reconstruction","authors":"Shuxian Wang, Yubo Zhang, Sarah K. McGill, J. Rosenman, Jan-Michael Frahm, Soumyadip Sengupta, S. Pizer","doi":"10.48550/arXiv.2303.07264","DOIUrl":"https://doi.org/10.48550/arXiv.2303.07264","url":null,"abstract":"Reconstructing a 3D surface from colonoscopy video is challenging due to illumination and reflectivity variation in the video frame that can cause defective shape predictions. Aiming to overcome this challenge, we utilize the characteristics of surface normal vectors and develop a two-step neural framework that significantly improves the colonoscopy reconstruction quality. The normal-based depth initialization network trained with self-supervised normal consistency loss provides depth map initialization to the normal-depth refinement module, which utilizes the relationship between illumination and surface normals to refine the frame-wise normal and depth predictions recursively. Our framework's depth accuracy performance on phantom colonoscopy data demonstrates the value of exploiting the surface normals in colonoscopy reconstruction, especially on en face views. Due to its low depth error, the prediction result from our framework will require limited post-processing to be clinically applicable for real-time colonoscopy reconstruction.","PeriodicalId":73379,"journal":{"name":"Information processing in medical imaging : proceedings of the ... conference","volume":"30 1","pages":"797-809"},"PeriodicalIF":0.0,"publicationDate":"2023-03-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91340772","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Don't PANIC: Prototypical Additive Neural Network for Interpretable Classification of Alzheimer's Disease 不要惊慌:用于阿尔茨海默病可解释分类的原型加法神经网络
Pub Date : 2023-03-13 DOI: 10.48550/arXiv.2303.07125
Thomas Wolf, Sebastian Pölsterl, C. Wachinger
Alzheimer's disease (AD) has a complex and multifactorial etiology, which requires integrating information about neuroanatomy, genetics, and cerebrospinal fluid biomarkers for accurate diagnosis. Hence, recent deep learning approaches combined image and tabular information to improve diagnostic performance. However, the black-box nature of such neural networks is still a barrier for clinical applications, in which understanding the decision of a heterogeneous model is integral. We propose PANIC, a prototypical additive neural network for interpretable AD classification that integrates 3D image and tabular data. It is interpretable by design and, thus, avoids the need for post-hoc explanations that try to approximate the decision of a network. Our results demonstrate that PANIC achieves state-of-the-art performance in AD classification, while directly providing local and global explanations. Finally, we show that PANIC extracts biologically meaningful signatures of AD, and satisfies a set of desirable desiderata for trustworthy machine learning. Our implementation is available at https://github.com/ai-med/PANIC .
阿尔茨海默病(AD)具有复杂的多因素病因,需要整合神经解剖学、遗传学和脑脊液生物标志物的信息才能准确诊断。因此,最近的深度学习方法结合了图像和表格信息来提高诊断性能。然而,这种神经网络的黑箱性质仍然是临床应用的障碍,在临床应用中,理解异构模型的决策是不可或缺的。我们提出了PANIC,一个典型的用于可解释AD分类的加性神经网络,它集成了3D图像和表格数据。它可以通过设计来解释,因此,避免了试图近似网络决策的事后解释的需要。我们的研究结果表明,PANIC在AD分类中达到了最先进的性能,同时直接提供了局部和全局解释。最后,我们证明了PANIC提取了AD的生物学意义签名,并满足了一组值得信赖的机器学习的理想需求。我们的实现可以在https://github.com/ai-med/PANIC上获得。
{"title":"Don't PANIC: Prototypical Additive Neural Network for Interpretable Classification of Alzheimer's Disease","authors":"Thomas Wolf, Sebastian Pölsterl, C. Wachinger","doi":"10.48550/arXiv.2303.07125","DOIUrl":"https://doi.org/10.48550/arXiv.2303.07125","url":null,"abstract":"Alzheimer's disease (AD) has a complex and multifactorial etiology, which requires integrating information about neuroanatomy, genetics, and cerebrospinal fluid biomarkers for accurate diagnosis. Hence, recent deep learning approaches combined image and tabular information to improve diagnostic performance. However, the black-box nature of such neural networks is still a barrier for clinical applications, in which understanding the decision of a heterogeneous model is integral. We propose PANIC, a prototypical additive neural network for interpretable AD classification that integrates 3D image and tabular data. It is interpretable by design and, thus, avoids the need for post-hoc explanations that try to approximate the decision of a network. Our results demonstrate that PANIC achieves state-of-the-art performance in AD classification, while directly providing local and global explanations. Finally, we show that PANIC extracts biologically meaningful signatures of AD, and satisfies a set of desirable desiderata for trustworthy machine learning. Our implementation is available at https://github.com/ai-med/PANIC .","PeriodicalId":73379,"journal":{"name":"Information processing in medical imaging : proceedings of the ... conference","volume":"28 1","pages":"82-94"},"PeriodicalIF":0.0,"publicationDate":"2023-03-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76432496","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Token Sparsification for Faster Medical Image Segmentation 用于快速医学图像分割的标记稀疏化
Pub Date : 2023-03-11 DOI: 10.48550/arXiv.2303.06522
Lei Zhou, Huidong Liu, Joseph Bae, Junjun He, D. Samaras, P. Prasanna
Can we use sparse tokens for dense prediction, e.g., segmentation? Although token sparsification has been applied to Vision Transformers (ViT) to accelerate classification, it is still unknown how to perform segmentation from sparse tokens. To this end, we reformulate segmentation as a sparse encoding ->token completion ->dense decoding (SCD) pipeline. We first empirically show that naively applying existing approaches from classification token pruning and masked image modeling (MIM) leads to failure and inefficient training caused by inappropriate sampling algorithms and the low quality of the restored dense features. In this paper, we propose Soft-topK Token Pruning (STP) and Multi-layer Token Assembly (MTA) to address these problems. In sparse encoding, STP predicts token importance scores with a lightweight sub-network and samples the topK tokens. The intractable topK gradients are approximated through a continuous perturbed score distribution. In token completion, MTA restores a full token sequence by assembling both sparse output tokens and pruned multi-layer intermediate ones. The last dense decoding stage is compatible with existing segmentation decoders, e.g., UNETR. Experiments show SCD pipelines equipped with STP and MTA are much faster than baselines without token pruning in both training (up to 120% higher throughput and inference up to 60.6% higher throughput) while maintaining segmentation quality.
我们可以使用稀疏标记进行密集预测,例如分割吗?尽管标记稀疏化已被应用于视觉变形器(Vision transformer, ViT)来加速分类,但如何从稀疏标记中进行分割仍然是一个未知的问题。为此,我们将分割重新制定为稀疏编码->令牌补全->密集解码(SCD)管道。我们首先通过经验证明,天真地应用分类令牌修剪和掩蔽图像建模(MIM)等现有方法会导致采样算法不合适以及恢复的密集特征质量低,从而导致训练失败和低效。在本文中,我们提出软顶令牌修剪(STP)和多层令牌组装(MTA)来解决这些问题。在稀疏编码中,STP使用轻量级子网络预测令牌重要性分数,并对topK令牌进行采样。顽固性topK梯度通过连续扰动分数分布近似。在令牌补全中,MTA通过组合稀疏的输出令牌和经过修剪的多层中间令牌来恢复完整的令牌序列。最后一个密集解码阶段与现有的分割解码器兼容,例如UNETR。实验表明,在保持分割质量的同时,配备STP和MTA的SCD管道在训练(高达120%的吞吐量和高达60.6%的吞吐量)上都比没有令牌修剪的基线快得多。
{"title":"Token Sparsification for Faster Medical Image Segmentation","authors":"Lei Zhou, Huidong Liu, Joseph Bae, Junjun He, D. Samaras, P. Prasanna","doi":"10.48550/arXiv.2303.06522","DOIUrl":"https://doi.org/10.48550/arXiv.2303.06522","url":null,"abstract":"Can we use sparse tokens for dense prediction, e.g., segmentation? Although token sparsification has been applied to Vision Transformers (ViT) to accelerate classification, it is still unknown how to perform segmentation from sparse tokens. To this end, we reformulate segmentation as a sparse encoding ->token completion ->dense decoding (SCD) pipeline. We first empirically show that naively applying existing approaches from classification token pruning and masked image modeling (MIM) leads to failure and inefficient training caused by inappropriate sampling algorithms and the low quality of the restored dense features. In this paper, we propose Soft-topK Token Pruning (STP) and Multi-layer Token Assembly (MTA) to address these problems. In sparse encoding, STP predicts token importance scores with a lightweight sub-network and samples the topK tokens. The intractable topK gradients are approximated through a continuous perturbed score distribution. In token completion, MTA restores a full token sequence by assembling both sparse output tokens and pruned multi-layer intermediate ones. The last dense decoding stage is compatible with existing segmentation decoders, e.g., UNETR. Experiments show SCD pipelines equipped with STP and MTA are much faster than baselines without token pruning in both training (up to 120% higher throughput and inference up to 60.6% higher throughput) while maintaining segmentation quality.","PeriodicalId":73379,"journal":{"name":"Information processing in medical imaging : proceedings of the ... conference","volume":"19 1","pages":"743-754"},"PeriodicalIF":0.0,"publicationDate":"2023-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84304658","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Resolving quantitative MRI model degeneracy with machine learning via training data distribution design 通过训练数据分布设计,用机器学习解决定量MRI模型退化问题
Pub Date : 2023-03-09 DOI: 10.48550/arXiv.2303.05464
Michele Guerreri, Sean C. Epstein, H. Azadbakht, Hui Zhang
Quantitative MRI (qMRI) aims to map tissue properties non-invasively via models that relate these unknown quantities to measured MRI signals. Estimating these unknowns, which has traditionally required model fitting - an often iterative procedure, can now be done with one-shot machine learning (ML) approaches. Such parameter estimation may be complicated by intrinsic qMRI signal model degeneracy: different combinations of tissue properties produce the same signal. Despite their many advantages, it remains unclear whether ML approaches can resolve this issue. Growing empirical evidence appears to suggest ML approaches remain susceptible to model degeneracy. Here we demonstrate under the right circumstances ML can address this issue. Inspired by recent works on the impact of training data distributions on ML-based parameter estimation, we propose to resolve model degeneracy by designing training data distributions. We put forward a classification of model degeneracies and identify one particular kind of degeneracies amenable to the proposed attack. The strategy is demonstrated successfully using the Revised NODDI model with standard multi-shell diffusion MRI data as an exemplar. Our results illustrate the importance of training set design which has the potential to allow accurate estimation of tissue properties with ML.
定量MRI (qMRI)旨在通过将这些未知量与测量的MRI信号相关联的模型,非侵入性地绘制组织特性。估计这些未知数,传统上需要模型拟合,这通常是一个迭代的过程,现在可以用一次性机器学习(ML)方法来完成。这种参数估计可能会因qMRI信号模型的内在退化而变得复杂:组织特性的不同组合产生相同的信号。尽管ML方法有很多优点,但它是否能解决这个问题还不清楚。越来越多的经验证据似乎表明ML方法仍然容易受到模型退化的影响。这里我们演示了在适当的情况下ML可以解决这个问题。受最近研究训练数据分布对基于ml的参数估计影响的工作的启发,我们提出通过设计训练数据分布来解决模型退化问题。我们提出了一种模型退化的分类方法,并确定了一种适合这种攻击的模型退化。以标准多壳扩散MRI数据为例,对修正的NODDI模型进行了验证。我们的结果说明了训练集设计的重要性,它有可能允许用ML准确估计组织特性。
{"title":"Resolving quantitative MRI model degeneracy with machine learning via training data distribution design","authors":"Michele Guerreri, Sean C. Epstein, H. Azadbakht, Hui Zhang","doi":"10.48550/arXiv.2303.05464","DOIUrl":"https://doi.org/10.48550/arXiv.2303.05464","url":null,"abstract":"Quantitative MRI (qMRI) aims to map tissue properties non-invasively via models that relate these unknown quantities to measured MRI signals. Estimating these unknowns, which has traditionally required model fitting - an often iterative procedure, can now be done with one-shot machine learning (ML) approaches. Such parameter estimation may be complicated by intrinsic qMRI signal model degeneracy: different combinations of tissue properties produce the same signal. Despite their many advantages, it remains unclear whether ML approaches can resolve this issue. Growing empirical evidence appears to suggest ML approaches remain susceptible to model degeneracy. Here we demonstrate under the right circumstances ML can address this issue. Inspired by recent works on the impact of training data distributions on ML-based parameter estimation, we propose to resolve model degeneracy by designing training data distributions. We put forward a classification of model degeneracies and identify one particular kind of degeneracies amenable to the proposed attack. The strategy is demonstrated successfully using the Revised NODDI model with standard multi-shell diffusion MRI data as an exemplar. Our results illustrate the importance of training set design which has the potential to allow accurate estimation of tissue properties with ML.","PeriodicalId":73379,"journal":{"name":"Information processing in medical imaging : proceedings of the ... conference","volume":"3 1","pages":"3-14"},"PeriodicalIF":0.0,"publicationDate":"2023-03-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86424637","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MetaMorph: Learning Metamorphic Image Transformation With Appearance Changes 变形:学习变形图像的外观变化
Pub Date : 2023-03-08 DOI: 10.48550/arXiv.2303.04849
Jian Wang, Jiarui Xing, J.T. Druzgal, W. Wells, Miaomiao Zhang
This paper presents a novel predictive model, MetaMorph, for metamorphic registration of images with appearance changes (i.e., caused by brain tumors). In contrast to previous learning-based registration methods that have little or no control over appearance-changes, our model introduces a new regularization that can effectively suppress the negative effects of appearance changing areas. In particular, we develop a piecewise regularization on the tangent space of diffeomorphic transformations (also known as initial velocity fields) via learned segmentation maps of abnormal regions. The geometric transformation and appearance changes are treated as joint tasks that are mutually beneficial. Our model MetaMorph is more robust and accurate when searching for an optimal registration solution under the guidance of segmentation, which in turn improves the segmentation performance by providing appropriately augmented training labels. We validate MetaMorph on real 3D human brain tumor magnetic resonance imaging (MRI) scans. Experimental results show that our model outperforms the state-of-the-art learning-based registration models. The proposed MetaMorph has great potential in various image-guided clinical interventions, e.g., real-time image-guided navigation systems for tumor removal surgery.
本文提出了一种新的预测模型MetaMorph,用于具有外观变化(即由脑肿瘤引起)的图像的变质配准。与之前基于学习的注册方法很少或根本不控制外观变化相比,我们的模型引入了一种新的正则化方法,可以有效地抑制外观变化区域的负面影响。特别是,我们通过学习异常区域的分割映射,在微分同构变换的切空间(也称为初始速度场)上开发了分段正则化。几何变换和外观变化被视为互惠互利的联合任务。我们的模型MetaMorph在分割指导下寻找最优配准解时更加鲁棒和准确,从而通过提供适当的增强训练标签来提高分割性能。我们在真实的三维人类脑肿瘤磁共振成像(MRI)扫描上验证了MetaMorph。实验结果表明,我们的模型优于目前最先进的基于学习的配准模型。所提出的MetaMorph在各种图像引导的临床干预中具有很大的潜力,例如用于肿瘤切除手术的实时图像引导导航系统。
{"title":"MetaMorph: Learning Metamorphic Image Transformation With Appearance Changes","authors":"Jian Wang, Jiarui Xing, J.T. Druzgal, W. Wells, Miaomiao Zhang","doi":"10.48550/arXiv.2303.04849","DOIUrl":"https://doi.org/10.48550/arXiv.2303.04849","url":null,"abstract":"This paper presents a novel predictive model, MetaMorph, for metamorphic registration of images with appearance changes (i.e., caused by brain tumors). In contrast to previous learning-based registration methods that have little or no control over appearance-changes, our model introduces a new regularization that can effectively suppress the negative effects of appearance changing areas. In particular, we develop a piecewise regularization on the tangent space of diffeomorphic transformations (also known as initial velocity fields) via learned segmentation maps of abnormal regions. The geometric transformation and appearance changes are treated as joint tasks that are mutually beneficial. Our model MetaMorph is more robust and accurate when searching for an optimal registration solution under the guidance of segmentation, which in turn improves the segmentation performance by providing appropriately augmented training labels. We validate MetaMorph on real 3D human brain tumor magnetic resonance imaging (MRI) scans. Experimental results show that our model outperforms the state-of-the-art learning-based registration models. The proposed MetaMorph has great potential in various image-guided clinical interventions, e.g., real-time image-guided navigation systems for tumor removal surgery.","PeriodicalId":73379,"journal":{"name":"Information processing in medical imaging : proceedings of the ... conference","volume":"67 1","pages":"576-587"},"PeriodicalIF":0.0,"publicationDate":"2023-03-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83589864","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Improved Segmentation of Deep Sulci in Cortical Gray Matter Using a Deep Learning Framework Incorporating Laplace's Equation 基于拉普拉斯方程的深度学习框架改进皮层灰质深沟分割
Pub Date : 2023-03-01 DOI: 10.48550/arXiv.2303.00795
S. Ravikumar, Ranjit Itttyerah, Sydney A. Lim, L. Xie, Sandhitsu R. Das, Pulkit Khandelwal, L. Wisse, M. Bedard, John L. Robinson, Terry K. Schuck, M. Grossman, J. Trojanowski, Eddie B. Lee, M. Tisdall, K. Prabhakaran, J. Detre, D. Irwin, Winifred Trotman, G. Mizsei, Emilio Artacho-P'erula, Maria Mercedes Iniguez de Onzono Martin, Maria del Mar Arroyo Jim'enez, M. Muñoz, Francisco Javier Molina Romero, M. Rabal, Sandra Cebada-S'anchez, J. Gonz'alez, C. Rosa-Prieto, Marta Córcoles Parada, D. Wolk, R. Insausti, Paul Yushkevich
When developing tools for automated cortical segmentation, the ability to produce topologically correct segmentations is important in order to compute geometrically valid morphometry measures. In practice, accurate cortical segmentation is challenged by image artifacts and the highly convoluted anatomy of the cortex itself. To address this, we propose a novel deep learning-based cortical segmentation method in which prior knowledge about the geometry of the cortex is incorporated into the network during the training process. We design a loss function which uses the theory of Laplace's equation applied to the cortex to locally penalize unresolved boundaries between tightly folded sulci. Using an ex vivo MRI dataset of human medial temporal lobe specimens, we demonstrate that our approach outperforms baseline segmentation networks, both quantitatively and qualitatively.
在开发自动皮层分割工具时,为了计算几何上有效的形态测量,生成拓扑正确分割的能力是很重要的。在实践中,准确的皮层分割受到图像伪影和皮层本身高度复杂的解剖结构的挑战。为了解决这个问题,我们提出了一种新的基于深度学习的皮层分割方法,该方法在训练过程中将皮层几何形状的先验知识整合到网络中。我们设计了一个损失函数,该函数使用拉普拉斯方程理论应用于皮层,以局部惩罚紧密折叠沟之间未解决的边界。使用人类内侧颞叶标本的离体MRI数据集,我们证明了我们的方法在定量和定性上都优于基线分割网络。
{"title":"Improved Segmentation of Deep Sulci in Cortical Gray Matter Using a Deep Learning Framework Incorporating Laplace's Equation","authors":"S. Ravikumar, Ranjit Itttyerah, Sydney A. Lim, L. Xie, Sandhitsu R. Das, Pulkit Khandelwal, L. Wisse, M. Bedard, John L. Robinson, Terry K. Schuck, M. Grossman, J. Trojanowski, Eddie B. Lee, M. Tisdall, K. Prabhakaran, J. Detre, D. Irwin, Winifred Trotman, G. Mizsei, Emilio Artacho-P'erula, Maria Mercedes Iniguez de Onzono Martin, Maria del Mar Arroyo Jim'enez, M. Muñoz, Francisco Javier Molina Romero, M. Rabal, Sandra Cebada-S'anchez, J. Gonz'alez, C. Rosa-Prieto, Marta Córcoles Parada, D. Wolk, R. Insausti, Paul Yushkevich","doi":"10.48550/arXiv.2303.00795","DOIUrl":"https://doi.org/10.48550/arXiv.2303.00795","url":null,"abstract":"When developing tools for automated cortical segmentation, the ability to produce topologically correct segmentations is important in order to compute geometrically valid morphometry measures. In practice, accurate cortical segmentation is challenged by image artifacts and the highly convoluted anatomy of the cortex itself. To address this, we propose a novel deep learning-based cortical segmentation method in which prior knowledge about the geometry of the cortex is incorporated into the network during the training process. We design a loss function which uses the theory of Laplace's equation applied to the cortex to locally penalize unresolved boundaries between tightly folded sulci. Using an ex vivo MRI dataset of human medial temporal lobe specimens, we demonstrate that our approach outperforms baseline segmentation networks, both quantitatively and qualitatively.","PeriodicalId":73379,"journal":{"name":"Information processing in medical imaging : proceedings of the ... conference","volume":"146 1","pages":"692-704"},"PeriodicalIF":0.0,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77647104","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
X-TRA: Improving Chest X-ray Tasks with Cross-Modal Retrieval Augmentation X-TRA:改进胸部x线任务与跨模态检索增强
Pub Date : 2023-02-22 DOI: 10.48550/arXiv.2302.11352
Tom van Sonsbeek, M. Worring
An important component of human analysis of medical images and their context is the ability to relate newly seen things to related instances in our memory. In this paper we mimic this ability by using multi-modal retrieval augmentation and apply it to several tasks in chest X-ray analysis. By retrieving similar images and/or radiology reports we expand and regularize the case at hand with additional knowledge, while maintaining factual knowledge consistency. The method consists of two components. First, vision and language modalities are aligned using a pre-trained CLIP model. To enforce that the retrieval focus will be on detailed disease-related content instead of global visual appearance it is fine-tuned using disease class information. Subsequently, we construct a non-parametric retrieval index, which reaches state-of-the-art retrieval levels. We use this index in our downstream tasks to augment image representations through multi-head attention for disease classification and report retrieval. We show that retrieval augmentation gives considerable improvements on these tasks. Our downstream report retrieval even shows to be competitive with dedicated report generation methods, paving the path for this method in medical imaging.
人类分析医学图像及其背景的一个重要组成部分是将新看到的事物与我们记忆中的相关实例联系起来的能力。在本文中,我们通过使用多模态检索增强来模拟这种能力,并将其应用于胸部x射线分析中的几个任务。通过检索相似的图像和/或放射学报告,我们用额外的知识扩展和规范手边的病例,同时保持事实知识的一致性。该方法由两个部分组成。首先,使用预训练的CLIP模型对齐视觉和语言模式。为了确保检索重点将放在与疾病相关的详细内容上,而不是全局视觉外观上,它使用疾病类别信息进行了微调。随后,我们构建了一个非参数检索索引,该索引达到了最先进的检索水平。我们在下游任务中使用该索引,通过多头关注来增强图像表示,用于疾病分类和报告检索。我们表明,检索增强在这些任务上提供了相当大的改进。我们的下游报告检索甚至显示出与专用报告生成方法的竞争力,为该方法在医学成像中铺平了道路。
{"title":"X-TRA: Improving Chest X-ray Tasks with Cross-Modal Retrieval Augmentation","authors":"Tom van Sonsbeek, M. Worring","doi":"10.48550/arXiv.2302.11352","DOIUrl":"https://doi.org/10.48550/arXiv.2302.11352","url":null,"abstract":"An important component of human analysis of medical images and their context is the ability to relate newly seen things to related instances in our memory. In this paper we mimic this ability by using multi-modal retrieval augmentation and apply it to several tasks in chest X-ray analysis. By retrieving similar images and/or radiology reports we expand and regularize the case at hand with additional knowledge, while maintaining factual knowledge consistency. The method consists of two components. First, vision and language modalities are aligned using a pre-trained CLIP model. To enforce that the retrieval focus will be on detailed disease-related content instead of global visual appearance it is fine-tuned using disease class information. Subsequently, we construct a non-parametric retrieval index, which reaches state-of-the-art retrieval levels. We use this index in our downstream tasks to augment image representations through multi-head attention for disease classification and report retrieval. We show that retrieval augmentation gives considerable improvements on these tasks. Our downstream report retrieval even shows to be competitive with dedicated report generation methods, paving the path for this method in medical imaging.","PeriodicalId":73379,"journal":{"name":"Information processing in medical imaging : proceedings of the ... conference","volume":"21 1","pages":"471-482"},"PeriodicalIF":0.0,"publicationDate":"2023-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84151625","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
Information processing in medical imaging : proceedings of the ... conference
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1