Pub Date: 2024-04-24 | DOI: 10.1016/j.compmedimag.2024.102387
Chunsu Park , Jeong-Woon Kang , Doen-Eon Lee , Wookon Son , Sang-Min Lee , Chankue Park , MinWoo Kim
Dual-energy computed tomography (CT) is an excellent substitute for magnetic resonance imaging in identifying bone marrow edema. However, it is rarely used in practice owing to its low contrast. To overcome this problem, we constructed a framework based on deep learning techniques to screen for diseases using axial bone images and to identify the local positions of bone lesions. To address the limited availability of labeled samples, we developed a new generative adversarial network (GAN) that extends expressions beyond conventional augmentation (CA) methods based on geometric transformations. We determined, theoretically and experimentally, that combining the concepts of data augmentation optimized for GAN training (DAG) and the Wasserstein GAN yields considerably more stable generation of synthetic images and effectively aligns their distribution with that of real images, achieving a high degree of similarity. The classification model was trained on both real and synthetic samples. Consequently, the GAN technique improved the F1 score in the diagnostic test by approximately 7.8% compared with CA. The final F1 score was 80.24%, with recall and precision of 84.3% and 88.7%, respectively. The results obtained using the augmented samples outperformed those obtained using real samples alone. In addition, we adopted explainable AI techniques that leverage a class activation map (CAM) and principal component analysis to facilitate visual analysis of the network's results. The framework suggests an attention map and a scatter plot to visually explain the disease predictions of the network.
Title: W-DRAG: A joint framework of WGAN with data random augmentation optimized for generative networks for bone marrow edema detection in dual energy CT
Journal: Computerized Medical Imaging and Graphics (IF 5.7)
Open access PDF: https://www.sciencedirect.com/science/article/pii/S0895611124000648/pdfft?md5=340b576800836a42ff054a8829a2c44e&pid=1-s2.0-S0895611124000648-main.pdf
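The paired-augmentation idea at the heart of DAG-style GAN training can be sketched outside any deep learning framework: the same random geometric transform must be applied to the real and the generated batch, so the discriminator never sees the augmentation itself as a real-versus-fake cue. A minimal NumPy sketch, assuming a flip/rotate transform set (the paper's exact augmentation menu and WGAN loss are not reproduced here):

```python
import numpy as np

def paired_augment(real, fake, rng):
    """Apply the SAME random geometric transform to the real and the
    generated batch (DAG-style), so augmentation cannot leak into the
    discriminator as a real-vs-fake cue. The flip/rotate set here is an
    assumed example, not the paper's exact augmentation menu."""
    k = int(rng.integers(0, 4))        # shared 90-degree rotation count
    flip = rng.random() < 0.5          # shared horizontal flip
    def transform(x):
        x = np.rot90(x, k, axes=(-2, -1))
        return x[..., ::-1].copy() if flip else x.copy()
    return transform(real), transform(fake)

rng = np.random.default_rng(0)
real = np.arange(16, dtype=float).reshape(1, 4, 4)   # toy "real" batch
fake = np.zeros((1, 4, 4))                           # toy "generated" batch
r_aug, f_aug = paired_augment(real, fake, rng)
```

Because one transform is drawn per call and reused for both batches, the augmented pair stays comparable; a per-batch draw would break that invariant.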
Pub Date: 2024-04-19 | DOI: 10.1016/j.compmedimag.2024.102386
Md Navid Akbar , Sebastian F. Ruf , Ashutosh Singh , Razieh Faghihpirayesh , Rachael Garner , Alexis Bennett , Celina Alba , Marianna La Rocca , Tales Imbiriba , Deniz Erdoğmuş , Dominique Duncan
A late post-traumatic seizure (LPTS), a consequence of traumatic brain injury (TBI), can evolve into a lifelong condition known as post-traumatic epilepsy (PTE). The mechanism that triggers epileptogenesis in TBI patients remains elusive, prompting the epilepsy community to devise ways to predict which TBI patients will develop PTE and to identify potential biomarkers. In response to this need, our study collected comprehensive, longitudinal multimodal data from 48 TBI patients across multiple participating institutions. A supervised binary classification task was created, contrasting data from LPTS patients with those without LPTS. To accommodate missing modalities in some subjects, we took a two-pronged approach. First, we extended a graphical-model-based Bayesian estimator to directly classify subjects with incomplete modalities. Second, we explored conventional imputation techniques. The imputed multimodal information was then combined, following several fusion and dimensionality reduction techniques from the literature, and subsequently fitted to a kernel- or tree-based classifier. For this fusion, we proposed two new algorithms: recursive elimination of correlated components (RECC), which filters information based on the correlation between already selected features, and information decomposition and selective fusion (IDSF), which effectively recombines information from decomposed multimodal features. Our cross-validation findings showed that the proposed IDSF algorithm delivers superior performance based on the area under the curve (AUC) score. Ultimately, after rigorous statistical comparisons and interpretable machine learning examination using Shapley values of the most frequently selected features, we recommend the following two magnetic resonance imaging (MRI) abnormalities as potential biomarkers: the left anterior limb of the internal capsule in diffusion MRI (dMRI) and the right middle temporal gyrus in functional MRI (fMRI).
Title: Advancing post-traumatic seizure classification and biomarker identification: Information decomposition based multimodal fusion and explainable machine learning with missing neuroimaging data
Journal: Computerized Medical Imaging and Graphics (IF 5.7)
Pub Date: 2024-04-18 | DOI: 10.1016/j.compmedimag.2024.102385
Ye-Jun Gong , Yue-Ke Li , Rongrong Zhou , Zhan Liang , Yingying Zhang , Tingting Cheng , Zi-Jian Zhang
Due to the high expenses involved, 4D-CT data for some patients may include only five respiratory phases (0%, 20%, 40%, 60%, and 80%). This limitation can affect subsequent radiotherapy planning because lung tumor information is absent for the remaining five respiratory phases (10%, 30%, 50%, 70%, and 90%). This study aims to develop an interpolation method that automatically derives tumor boundary contours for the five omitted phases from the available 5-phase 4D-CT data. Dynamic mode decomposition (DMD) is a data-driven, model-free technique for extracting dynamic information from high-dimensional data; it can reconstruct long-term dynamic patterns from only a limited number of time snapshots. The quasi-periodic motion of a deformable lung tumor under respiration makes it well suited to DMD analysis. However, applying DMD directly to the tumor's respiratory motion is impractical because the tumor is three-dimensional and spans multiple CT slices. To predict the respiratory movement of lung tumors, we therefore developed uniform angular interval (UAI) sampling to generate snapshot vectors of equal length suitable for DMD analysis. The effectiveness of this approach was confirmed by applying the UAI-DMD method to the 4D-CT data of ten patients with lung cancer. The results indicate that the UAI-DMD method effectively approximates the lung tumor's deformable boundary surface and nonlinear motion trajectories. The estimated tumor centroid is within 2 mm of the manually delineated centroid, a smaller error margin than that of traditional B-spline interpolation, which has a 3 mm margin. This methodology can potentially be extended to reconstruct the 20-phase respiratory movement of a lung tumor from dynamic features of 10-phase 4D-CT data, enabling more accurate estimation of the planning target volume (PTV).
Title: A novel approach for estimating lung tumor motion based on dynamic features in 4D-CT
Journal: Computerized Medical Imaging and Graphics (IF 5.7)
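The snapshot-based reconstruction that DMD enables can be illustrated with the standard exact-DMD algorithm (this is textbook DMD, not the UAI sampling pipeline above): fit a low-rank linear operator A with X2 ≈ A·X1 from consecutive snapshot pairs and read the dynamics off its eigenvalues.

```python
import numpy as np

def dmd(X, r):
    """Exact DMD: given a snapshot matrix X (state x time), fit a rank-r
    linear operator A with X[:, 1:] ~= A @ X[:, :-1] and return the
    eigenvalues and modes of A."""
    X1, X2 = X[:, :-1], X[:, 1:]
    U, s, Vh = np.linalg.svd(X1, full_matrices=False)
    U, s, Vh = U[:, :r], s[:r], Vh[:r]
    Atilde = U.conj().T @ X2 @ Vh.conj().T / s     # operator projected onto POD modes
    eigvals, W = np.linalg.eig(Atilde)
    modes = X2 @ Vh.conj().T / s @ W               # DMD modes in full space
    return eigvals, modes

# Synthetic quasi-periodic motion: a pure rotation of 0.5 rad per step,
# standing in for one oscillation cycle of a tumor centroid.
t = np.arange(20)
X = np.vstack([np.cos(0.5 * t), np.sin(0.5 * t)])
lam, _ = dmd(X, r=2)
```

For this exactly linear toy system, the recovered eigenvalues lie on the unit circle at phase ±0.5 rad, i.e. DMD identifies the oscillation frequency from the snapshots alone.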
Pub Date: 2024-04-17 | DOI: 10.1016/j.compmedimag.2024.102383
Jiayi Zhu , Bart Bolsterlee , Brian V.Y. Chow , Yang Song , Erik Meijering
Semi-supervised learning has made significant progress in medical image segmentation. However, existing methods primarily utilize information from a single dimensionality, resulting in sub-optimal performance on challenging magnetic resonance imaging (MRI) data with multiple segmentation objects and anisotropic resolution. To address this issue, we present a Hybrid Dual Mean-Teacher (HD-Teacher) model that combines hybrid, semi-supervised, and multi-task learning to achieve effective semi-supervised segmentation. HD-Teacher employs a 2D and a 3D mean-teacher network to produce segmentation labels and signed distance fields from the hybrid information captured in both dimensionalities. This hybrid mechanism allows HD-Teacher to utilize features from 2D, 3D, or both dimensions as needed. Outputs from the 2D and 3D teacher models are dynamically combined based on confidence scores, forming a single hybrid prediction with estimated uncertainty. We propose a hybrid regularization module that encourages both student models to produce results close to the uncertainty-weighted hybrid prediction, further improving their feature extraction capability. Extensive binary and multi-class segmentation experiments on three MRI datasets demonstrated that the proposed framework can (1) significantly outperform state-of-the-art semi-supervised methods, (2) surpass a fully supervised VNet trained on substantially more annotated data, and (3) perform on par with human raters on muscle and bone segmentation tasks. Code will be available at https://github.com/ThisGame42/Hybrid-Teacher.
Title: Hybrid dual mean-teacher network with double-uncertainty guidance for semi-supervised segmentation of magnetic resonance images
Journal: Computerized Medical Imaging and Graphics (IF 5.7)
Open access PDF: https://www.sciencedirect.com/science/article/pii/S0895611124000600/pdfft?md5=7ce6bdbb1f79301198bf452b8d9fd71f&pid=1-s2.0-S0895611124000600-main.pdf
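The mean-teacher mechanism this model builds on rests on one update rule: the teacher's weights track an exponential moving average (EMA) of the student's weights. A minimal sketch of that standard rule (HD-Teacher's 2D/3D branches and uncertainty weighting are not modeled here):

```python
import numpy as np

def ema_update(teacher, student, alpha=0.99):
    """Mean-teacher rule: teacher weights are an exponential moving
    average of the student's weights after each optimization step."""
    return {name: alpha * w + (1.0 - alpha) * student[name]
            for name, w in teacher.items()}

teacher = {"w": np.zeros(3)}
student = {"w": np.ones(3)}
teacher = ema_update(teacher, student, alpha=0.9)   # teacher drifts toward student
```

The smoothing factor alpha controls how slowly the teacher follows the student; values near 1 give the stable, averaged targets that make teacher predictions useful as pseudo-labels.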
Pub Date: 2024-04-16 | DOI: 10.1016/j.compmedimag.2024.102382
Hamed Aghapanah , Reza Rasti , Saeed Kermani , Faezeh Tabesh , Hossein Yousefi Banaem , Hamidreza Pour Aliakbar , Hamid Sanei , William Paul Segars
Cardiovascular MRI (CMRI) is a non-invasive imaging technique adopted for assessing the structure and function of the blood circulatory system. Precise image segmentation is required to measure cardiac parameters and diagnose abnormalities from CMRI data. Because of anatomical heterogeneity and image variations, cardiac image segmentation is a challenging task. Quantification of cardiac parameters requires high-performance segmentation of the left ventricle (LV), right ventricle (RV), and left ventricular myocardium from the background. Manual segmentation of these regions is possible but time-consuming and error-prone. Many semi- or fully automatic solutions have therefore been proposed recently, among which deep learning-based methods have shown high performance in segmenting regions in CMRI data. In this study, a self-adaptive multi-attention (SMA) module is introduced to adaptively leverage multiple attention mechanisms for better segmentation. The SMA integrates convolution-based position and channel attention mechanisms with a patch-tokenization-based vision transformer (ViT) attention mechanism in a hybrid, end-to-end manner. The CNN- and ViT-based attentions mine short- and long-range dependencies, respectively, for more precise segmentation. The SMA module is applied in an encoder-decoder structure with a ResNet50 backbone, named CardSegNet. Furthermore, a deep supervision method with multiple loss functions is introduced to the CardSegNet optimizer to reduce overfitting and enhance the model's performance. The proposed model is validated on ACDC2017 (n=100), M&Ms (n=321), and a local dataset (n=22) using 10-fold cross-validation, yielding promising segmentation results that demonstrate its superiority over its counterparts.
Title: CardSegNet: An adaptive hybrid CNN-vision transformer model for heart region segmentation in cardiac MRI
Journal: Computerized Medical Imaging and Graphics (IF 5.7)
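The channel-attention branch mentioned in the abstract can be illustrated with a generic squeeze-and-excitation-style gate. This is a stand-in for the idea, not the SMA module itself; `w1` and `w2` are hypothetical learned weights:

```python
import numpy as np

def channel_attention(feat, w1, w2):
    """Squeeze-and-excitation-style channel gate for a (C, H, W) feature
    map: global-average-pool each channel, pass the pooled vector
    through a tiny MLP, and rescale each channel by a sigmoid gate."""
    z = feat.mean(axis=(1, 2))                  # squeeze: (C,)
    h = np.maximum(w1 @ z, 0.0)                 # excitation, ReLU
    gate = 1.0 / (1.0 + np.exp(-(w2 @ h)))      # per-channel weights in (0, 1)
    return feat * gate[:, None, None]

feat = np.ones((2, 2, 2))
w1, w2 = np.eye(2), np.zeros((2, 2))            # hypothetical learned weights
out = channel_attention(feat, w1, w2)           # zero logits give gates of 0.5
```

The gate lets the network amplify informative channels and suppress noisy ones; position and ViT attention play the analogous role along spatial and token dimensions.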
Pub Date: 2024-04-16 | DOI: 10.1016/j.compmedimag.2024.102378
Luyu Tang , Songhui Diao , Chao Li , Miaoxia He , Kun Ru , Wenjian Qin
Current methods for digital pathology images typically employ small image patches to learn local representative features, thereby sidestepping heavy computation and memory limitations. However, such methods do not fully consider the global contextual features of whole-slide images (WSIs). Here, we designed a hybrid model, called TransGNN, that combines a Graph Neural Network (GNN) module and a Transformer module to represent global contextual features. The GNN module builds a WSI graph over the foreground area of a WSI to explicitly capture structural features, while the Transformer module implicitly learns global context through its self-attention mechanism. Hepatocellular carcinoma (HCC) prognostic biomarkers were used to illustrate the importance of global contextual information in cancer histopathological analysis. Our model was validated using 362 WSIs from 355 HCC patients in The Cancer Genome Atlas (TCGA). It showed impressive performance, with a concordance index (C-index) of 0.7308 (95% confidence interval (CI): 0.6283–0.8333) for overall survival prediction, the best among all compared models. Additionally, our model achieved areas under the curve of 0.7904, 0.8087, and 0.8004 for 1-year, 3-year, and 5-year survival prediction, respectively. We further verified the superior performance of our model in HCC risk stratification and its clinical value through Kaplan–Meier curves and univariate and multivariate Cox regression analyses. Our research demonstrated that TransGNN effectively utilized the context information of WSIs and contributed to the clinical prognostic evaluation of HCC.
Title: Global contextual representation via graph-transformer fusion for hepatocellular carcinoma prognosis in whole-slide images
Journal: Computerized Medical Imaging and Graphics (IF 5.7)
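The concordance index reported above (0.7308) is, in its standard Harrell form, the fraction of comparable patient pairs whose predicted risks are ordered consistently with their survival times. A self-contained sketch of that computation:

```python
import numpy as np

def c_index(time, event, risk):
    """Harrell's concordance index: over pairs where the earlier time is
    an observed event, score 1 if the earlier failure received the
    higher predicted risk and 0.5 for tied risks."""
    concordant = comparable = 0.0
    for i in range(len(time)):
        for j in range(len(time)):
            if time[i] < time[j] and event[i]:
                comparable += 1
                if risk[i] > risk[j]:
                    concordant += 1
                elif risk[i] == risk[j]:
                    concordant += 0.5
    return concordant / comparable

time = np.array([2.0, 4.0, 6.0])     # toy follow-up times
event = np.array([1, 1, 1])          # all events observed
risk = np.array([0.9, 0.5, 0.1])     # risks perfectly ordered with outcome
```

A C-index of 0.5 corresponds to random ranking and 1.0 to perfect ranking, which is why 0.7308 indicates useful but imperfect prognostic discrimination.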
Pub Date: 2024-04-12 | DOI: 10.1016/j.compmedimag.2024.102381
Wenhao Zhong , Heye Zhang , Zhifan Gao , William Kongto Hau , Guang Yang , Xiujian Liu , Lin Xu
Vascular structure segmentation in intravascular ultrasound (IVUS) images plays an important role in the pre-procedural evaluation of percutaneous coronary intervention (PCI). However, vascular structure segmentation in IVUS images faces the challenge of structure-dependent distractions, which fall into two cases: structural intrinsic distractions and inter-structural distractions. Traditional machine learning methods often rely solely on low-level features and overlook high-level features, which limits their generalization. Existing semantic segmentation methods integrate low-level and high-level features to enhance generalization performance, but they also introduce additional interference, which hinders the resolution of structural intrinsic distractions. Distraction cue methods attempt to address structural intrinsic distractions by removing interference from the features through a unique decoder, but they tend to overlook inter-structural distractions. In this paper, we propose distraction-aware hierarchical learning (DHL) for vascular structure segmentation in IVUS images. Inspired by distraction cue methods that remove interference in a decoder, the DHL is designed as a hierarchical decoder that gradually removes structure-dependent distractions through three stages: a global perception process, a distraction perception process, and a structural perception process. The first two stages remove structural intrinsic distractions, and the third removes inter-structural distractions. In the global perception process, the DHL searches for the coarse structural region of the vascular structures on each slice of the IVUS sequence. In the distraction perception process, the DHL progressively refines this coarse region to remove structural distractions. In the structural perception process, the DHL detects regions of inter-structural distraction in the fused structure features and then separates them. Extensive experiments on 361 subjects show that the DHL is effective (e.g., the average Dice is greater than 0.95) and superior to ten state-of-the-art IVUS vascular structure segmentation methods.
{"title":"Distraction-aware hierarchical learning for vascular structure segmentation in intravascular ultrasound images","authors":"Wenhao Zhong , Heye Zhang , Zhifan Gao , William Kongto Hau , Guang Yang , Xiujian Liu , Lin Xu","doi":"10.1016/j.compmedimag.2024.102381","DOIUrl":"https://doi.org/10.1016/j.compmedimag.2024.102381","url":null,"abstract":"<div><p>Vascular structure segmentation in intravascular ultrasound (IVUS) images plays an important role in pre-procedural evaluation of percutaneous coronary intervention (PCI). However, vascular structure segmentation in IVUS images has the challenge of structure-dependent distractions. Structure-dependent distractions are categorized into two cases, structural intrinsic distractions and inter-structural distractions. Traditional machine learning methods often rely solely on low-level features, overlooking high-level features. This way limits the generalization of these methods. The existing semantic segmentation methods integrate low-level and high-level features to enhance generalization performance. But these methods also introduce additional interference, which is harmful to solving structural intrinsic distractions. Distraction cue methods attempt to address structural intrinsic distractions by removing interference from the features through a unique decoder. However, they tend to overlook the problem of inter-structural distractions. In this paper, we propose distraction-aware hierarchical learning (DHL) for vascular structure segmentation in IVUS images. Inspired by distraction cue methods for removing interference in a decoder, the DHL is designed as a hierarchical decoder that gradually removes structure-dependent distractions. The DHL includes global perception process, distraction perception process and structural perception process. 
The global perception process and distraction perception process remove structural intrinsic distractions then the structural perception process removes inter-structural distractions. In the global perception process, the DHL searches for the coarse structural region of the vascular structures on the slice of IVUS sequence. In the distraction perception process, the DHL progressively refines the coarse structural region of the vascular structures to remove structural distractions. In the structural perception process, the DHL detects regions of inter-structural distractions in fused structure features then separates them. Extensive experiments on 361 subjects show that the DHL is effective (e.g., the average Dice is greater than 0.95), and superior to ten state-of-the-art IVUS vascular structure segmentation methods.</p></div>","PeriodicalId":50631,"journal":{"name":"Computerized Medical Imaging and Graphics","volume":null,"pages":null},"PeriodicalIF":5.7,"publicationDate":"2024-04-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140618092","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
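The average Dice figure quoted above follows the standard Dice similarity coefficient for binary segmentation masks. As a minimal illustrative sketch (not the authors' code; the function name and the flat-list mask representation are assumptions):

```python
def dice(pred, target):
    """Dice similarity coefficient between two flat binary masks (lists of 0/1).

    Dice = 2 * |pred ∩ target| / (|pred| + |target|); by convention two
    empty masks are treated as a perfect match.
    """
    inter = sum(p * t for p, t in zip(pred, target))
    total = sum(pred) + sum(target)
    return 1.0 if total == 0 else 2.0 * inter / total
```

An average Dice above 0.95, as reported, means the predicted vascular masks overlap almost completely with the ground truth across the 361 subjects.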
Pub Date : 2024-04-12 DOI: 10.1016/j.compmedimag.2024.102380
Xiaoguang Li , Yichao Zhou , Hongxia Yin , Pengfei Zhao , Ruowei Tang , Han Lv , Yating Qin , Li Zhuo , Zhenchang Wang
The absence of the bone wall over the jugular bulb and sigmoid sinus of the temporal bone is one of the important causes of pulsatile tinnitus. Automatic and accurate detection of these abnormal signs in CT slices has important theoretical significance and clinical value. Owing to the shortage of abnormal samples, class imbalance, small inter-class differences, and low interpretability, existing deep-learning methods are greatly challenged. In this paper, we propose a sub-features orthogonal decoupling model that effectively disentangles representation features into class-specific sub-features and class-independent sub-features in a latent space. The former contain the discriminative information, while the latter preserve the information needed for image reconstruction. In addition, the proposed method can generate image samples via category conversion by combining different class-specific sub-features with the class-independent sub-features, achieving a corresponding mapping between deep features and images of specific classes. The proposed model improves the interpretability of the deep model and provides image synthesis methods for downstream tasks. The effectiveness of the method was verified in detecting bone wall absence in the temporal bone jugular bulb and sigmoid sinus.
{"title":"Sub-features orthogonal decoupling: Detecting bone wall absence via a small number of abnormal examples for temporal CT images","authors":"Xiaoguang Li , Yichao Zhou , Hongxia Yin , Pengfei Zhao , Ruowei Tang , Han Lv , Yating Qin , Li Zhuo , Zhenchang Wang","doi":"10.1016/j.compmedimag.2024.102380","DOIUrl":"https://doi.org/10.1016/j.compmedimag.2024.102380","url":null,"abstract":"<div><p>The absence of bone wall located in the jugular bulb and sigmoid sinus of the temporal bone is one of the important reasons for pulsatile tinnitus. Automatic and accurate detection of these abnormal singes in CT slices has important theoretical significance and clinical value. Due to the shortage of abnormal samples, imbalanced samples, small inter-class differences, and low interpretability, existing deep-learning methods are greatly challenged. In this paper, we proposed a sub-features orthogonal decoupling model, which can effectively disentangle the representation features into class-specific sub-features and class-independent sub-features in a latent space. The former contains the discriminative information, while, the latter preserves information for image reconstruction. In addition, the proposed method can generate image samples using category conversion by combining the different class-specific sub-features and the class-independent sub-features, achieving corresponding mapping between deep features and images of specific classes. The proposed model improves the interpretability of the deep model and provides image synthesis methods for downstream tasks. 
The effectiveness of the method was verified in the detection of bone wall absence in the temporal bone jugular bulb and sigmoid sinus.</p></div>","PeriodicalId":50631,"journal":{"name":"Computerized Medical Imaging and Graphics","volume":null,"pages":null},"PeriodicalIF":5.7,"publicationDate":"2024-04-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140552526","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
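A common way to encourage the kind of orthogonal decoupling described above is to penalize the inner product between the class-specific and class-independent sub-feature vectors during training. The paper does not state its exact loss, so the following is only a hedged sketch of that generic idea (function and argument names are illustrative):

```python
def orthogonality_penalty(class_specific, class_independent):
    """Squared inner product of the two sub-feature vectors.

    The penalty is zero exactly when the class-specific and
    class-independent sub-features are orthogonal in the latent space,
    so minimizing it pushes the two representations apart.
    """
    dot = sum(a * b for a, b in zip(class_specific, class_independent))
    return dot * dot
```

In practice such a term is added to the task loss with a small weight, so the encoder is rewarded for routing discriminative information and reconstruction information into separate, mutually orthogonal sub-features.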
Deep learning (DL) has demonstrated an innate capacity to independently learn hierarchical features from complex and multi-dimensional data. A common understanding is that its performance scales with the amount of training data. However, the data must also exhibit variety to enable improved learning. In medical imaging data, semantic redundancy, i.e., the presence of similar or repetitive information, can occur when multiple images present the disease of interest in highly similar ways. Moreover, the augmentation methods commonly used to generate variety in DL training can limit performance when applied indiscriminately to such data. We therefore hypothesize that semantic redundancy tends to lower performance and limit generalizability to unseen data, and we question its impact on classifier performance even with large datasets. We propose an entropy-based sample scoring approach to identify and remove semantically redundant training data, and we demonstrate, using the publicly available NIH chest X-ray dataset, that a model trained on the resulting informative subset of the training data significantly outperforms a model trained on the full training set, in both internal (recall: 0.7164 vs. 0.6597, p<0.05) and external testing (recall: 0.3185 vs. 0.2589, p<0.05). Our findings emphasize the importance of information-oriented training sample selection over the conventional practice of using all available training data.
{"title":"Semantically redundant training data removal and deep model classification performance: A study with chest X-rays","authors":"Sivaramakrishnan Rajaraman, Ghada Zamzmi , Feng Yang , Zhaohui Liang, Zhiyun Xue, Sameer Antani","doi":"10.1016/j.compmedimag.2024.102379","DOIUrl":"https://doi.org/10.1016/j.compmedimag.2024.102379","url":null,"abstract":"<div><p>Deep learning (DL) has demonstrated its innate capacity to independently learn hierarchical features from complex and multi-dimensional data. A common understanding is that its performance scales up with the amount of training data. However, the data must also exhibit variety to enable improved learning. In medical imaging data, semantic redundancy, which is the presence of similar or repetitive information, can occur due to the presence of multiple images that have highly similar presentations for the disease of interest. Also, the common use of augmentation methods to generate variety in DL training could limit performance when indiscriminately applied to such data. We hypothesize that semantic redundancy would therefore tend to lower performance and limit generalizability to unseen data and question its impact on classifier performance even with large data. We propose an entropy-based sample scoring approach to identify and remove semantically redundant training data and demonstrate using the publicly available NIH chest X-ray dataset that the model trained on the resulting informative subset of training data significantly outperforms the model trained on the full training set, during both internal (recall: 0.7164 vs 0.6597, p<0.05) and external testing (recall: 0.3185 vs 0.2589, p<0.05). 
Our findings emphasize the importance of information-oriented training sample selection as opposed to the conventional practice of using all available training data.</p></div>","PeriodicalId":50631,"journal":{"name":"Computerized Medical Imaging and Graphics","volume":null,"pages":null},"PeriodicalIF":5.7,"publicationDate":"2024-04-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0895611124000569/pdfft?md5=6892a4c80999a323e6edf07480aef597&pid=1-s2.0-S0895611124000569-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140545845","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
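The entropy-based scoring idea can be sketched with the standard Shannon entropy of a model's predicted class probabilities: low-entropy samples carry little new information and are candidates for removal. This is a generic sketch of the concept, not the paper's exact scoring pipeline (`entropy_score`, `informative_subset`, and the thresholding rule are illustrative assumptions):

```python
import math

def entropy_score(probs):
    """Shannon entropy (in nats) of a predicted class-probability vector.

    Higher entropy = less redundant with what the model already knows.
    """
    return -sum(p * math.log(p) for p in probs if p > 0)

def informative_subset(samples, scores, threshold):
    """Keep only samples whose entropy score exceeds the threshold;
    low-scoring (redundant-looking) samples are dropped from training."""
    return [s for s, sc in zip(samples, scores) if sc > threshold]
```

A uniform two-class prediction [0.5, 0.5] scores ln 2 ≈ 0.693 (maximally informative), while a confident prediction like [1.0, 0.0] scores 0 and would be filtered out first.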
Pub Date : 2024-03-29 DOI: 10.1016/j.compmedimag.2024.102375
Chia-Feng Juang , Ya-Wen Chuang , Guan-Wen Lin , I-Fang Chung , Ying-Chih Lo
Glomerulus morphology on renal pathology images provides valuable information for diagnosis and outcome prediction. To provide better care, an efficient, standardized, and scalable method is urgently needed to optimize the time-consuming and labor-intensive interpretation process performed by renal pathologists. This paper proposes a deep convolutional neural network (CNN)-based approach to automatically detect and classify glomeruli with different stains in renal pathology images. In the glomerulus detection stage, this paper proposes a flattened Xception with a feature pyramid network (FX-FPN), employed as the backbone in a faster region-based CNN framework to improve detection performance. In the classification stage, this paper considers the classification of five glomerulus morphologies using a flattened Xception classifier. To endow the classifier with higher discriminability, this paper proposes a generative data augmentation approach for patch-based glomerulus morphology augmentation: new glomerulus patches of different morphologies are generated through the cycle-consistent generative adversarial network (CycleGAN). The single detection model achieves an F1 score of up to 0.9524 on H&E and PAS stains. With the original training data, the flattened Xception classifier achieves an average sensitivity and specificity of 0.7077 and 0.9316, respectively; with generative data augmentation, these increase to 0.7623 and 0.9443. Comparisons with different deep CNN models show the effectiveness and superiority of the proposed approach.
{"title":"Deep learning-based glomerulus detection and classification with generative morphology augmentation in renal pathology images","authors":"Chia-Feng Juang , Ya-Wen Chuang , Guan-Wen Lin , I-Fang Chung , Ying-Chih Lo","doi":"10.1016/j.compmedimag.2024.102375","DOIUrl":"10.1016/j.compmedimag.2024.102375","url":null,"abstract":"<div><p>Glomerulus morphology on renal pathology images provides valuable diagnosis and outcome prediction information. To provide better care, an efficient, standardized, and scalable method is urgently needed to optimize the time-consuming and labor-intensive interpretation process by renal pathologists. This paper proposes a deep convolutional neural network (CNN)-based approach to automatically detect and classify glomeruli with different stains in renal pathology images. In the glomerulus detection stage, this paper proposes a flattened Xception with a feature pyramid network (FX-FPN). The FX-FPN is employed as a backbone in the framework of faster region-based CNN to improve glomerulus detection performance. In the classification stage, this paper considers classifications of five glomerulus morphologies using a flattened Xception classifier. To endow the classifier with higher discriminability, this paper proposes a generative data augmentation approach for patch-based glomerulus morphology augmentation. New glomerulus patches of different morphologies are generated for data augmentation through the cycle-consistent generative adversarial network (CycleGAN). The single detection model shows the <span><math><msub><mrow><mi>F</mi></mrow><mrow><mn>1</mn></mrow></msub></math></span> score up to 0.9524 in H&E and PAS stains. The classification result shows that the average sensitivity and specificity are 0.7077 and 0.9316, respectively, by using the flattened Xception with the original training data. The sensitivity and specificity increase to 0.7623 and 0.9443, respectively, by using the generative data augmentation. 
Comparisons with different deep CNN models show the effectiveness and superiority of the proposed approach.</p></div>","PeriodicalId":50631,"journal":{"name":"Computerized Medical Imaging and Graphics","volume":null,"pages":null},"PeriodicalIF":5.7,"publicationDate":"2024-03-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140404349","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
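The sensitivity and specificity figures quoted above follow the standard confusion-matrix definitions. As a minimal sketch (the function name is illustrative, not from the paper):

```python
def sensitivity_specificity(tp, fn, tn, fp):
    """Sensitivity = TP / (TP + FN): recall on the positive class.
    Specificity = TN / (TN + FP): recall on the negative class."""
    return tp / (tp + fn), tn / (tn + fp)
```

For example, 7 true positives with 3 false negatives and 9 true negatives with 1 false positive yield a sensitivity of 0.7 and a specificity of 0.9, mirroring how the per-morphology averages in the abstract are obtained.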