Metal implant segmentation in CT images based on diffusion model.

IF 2.9 3区医学 Q2 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING BMC Medical Imaging Pub Date : 2024-08-06 DOI:10.1186/s12880-024-01379-1

Kai Xie, Liugang Gao, Yutao Zhang, Heng Zhang, Jiawei Sun, Tao Lin, Jianfeng Sui, Xinye Ni

{"title":"Metal implant segmentation in CT images based on diffusion model.","authors":"Kai Xie, Liugang Gao, Yutao Zhang, Heng Zhang, Jiawei Sun, Tao Lin, Jianfeng Sui, Xinye Ni","doi":"10.1186/s12880-024-01379-1","DOIUrl":null,"url":null,"abstract":"Background: Computed tomography (CT) is widely in clinics and is affected by metal implants. Metal segmentation is crucial for metal artifact correction, and the common threshold method often fails to accurately segment metals.Purpose: This study aims to segment metal implants in CT images using a diffusion model and further validate it with clinical artifact images and phantom images of known size.Methods: A retrospective study was conducted on 100 patients who received radiation therapy without metal artifacts, and simulated artifact data were generated using publicly available mask data. The study utilized 11,280 slices for training and verification, and 2,820 slices for testing. Metal mask segmentation was performed using DiffSeg, a diffusion model incorporating conditional dynamic coding and a global frequency parser (GFParser). Conditional dynamic coding fuses the current segmentation mask and prior images at multiple scales, while GFParser helps eliminate high-frequency noise in the mask. Clinical artifact images and phantom images are also used for model validation.Results: Compared with the ground truth, the accuracy of DiffSeg for metal segmentation of simulated data was 97.89% and that of DSC was 95.45%. The mask shape obtained by threshold segmentation covered the ground truth and DSCs were 82.92% and 84.19% for threshold segmentation based on 2500 HU and 3000 HU. Evaluation metrics and visualization results show that DiffSeg performs better than other classical deep learning networks, especially for clinical CT, artifact data, and phantom data.Conclusion: DiffSeg efficiently and robustly segments metal masks in artifact data with conditional dynamic coding and GFParser. Future work will involve embedding the metal segmentation model in metal artifact reduction to improve the reduction effect.","PeriodicalId":9020,"journal":{"name":"BMC Medical Imaging","volume":null,"pages":null},"PeriodicalIF":2.9000,"publicationDate":"2024-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11301972/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Medical Imaging","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12880-024-01379-1","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}

引用次数: 0

Abstract

Background: Computed tomography (CT) is widely in clinics and is affected by metal implants. Metal segmentation is crucial for metal artifact correction, and the common threshold method often fails to accurately segment metals.

Purpose: This study aims to segment metal implants in CT images using a diffusion model and further validate it with clinical artifact images and phantom images of known size.

Methods: A retrospective study was conducted on 100 patients who received radiation therapy without metal artifacts, and simulated artifact data were generated using publicly available mask data. The study utilized 11,280 slices for training and verification, and 2,820 slices for testing. Metal mask segmentation was performed using DiffSeg, a diffusion model incorporating conditional dynamic coding and a global frequency parser (GFParser). Conditional dynamic coding fuses the current segmentation mask and prior images at multiple scales, while GFParser helps eliminate high-frequency noise in the mask. Clinical artifact images and phantom images are also used for model validation.

Results: Compared with the ground truth, the accuracy of DiffSeg for metal segmentation of simulated data was 97.89% and that of DSC was 95.45%. The mask shape obtained by threshold segmentation covered the ground truth and DSCs were 82.92% and 84.19% for threshold segmentation based on 2500 HU and 3000 HU. Evaluation metrics and visualization results show that DiffSeg performs better than other classical deep learning networks, especially for clinical CT, artifact data, and phantom data.

Conclusion: DiffSeg efficiently and robustly segments metal masks in artifact data with conditional dynamic coding and GFParser. Future work will involve embedding the metal segmentation model in metal artifact reduction to improve the reduction effect.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于扩散模型的 CT 图像中的金属植入物分割。

背景：计算机断层扫描（CT）广泛应用于临床，并受到金属植入物的影响。目的：本研究旨在利用扩散模型分割 CT 图像中的金属植入物，并通过临床伪影图像和已知大小的模型图像进一步验证该模型：方法：对 100 名接受放射治疗但无金属伪影的患者进行回顾性研究，并使用公开的掩膜数据生成模拟伪影数据。研究利用 11,280 张切片进行训练和验证，利用 2,820 张切片进行测试。金属掩膜分割使用 DiffSeg 进行，这是一种融合了条件动态编码和全局频率解析器 (GFParser) 的扩散模型。条件动态编码融合了当前分割掩膜和多个尺度的先前图像，而全频解析器则有助于消除掩膜中的高频噪声。临床伪影图像和幻影图像也用于模型验证：与基本事实相比，DiffSeg 对模拟数据进行金属分割的准确率为 97.89%，DSC 为 95.45%。在基于 2500 HU 和 3000 HU 的阈值分割中，通过阈值分割获得的掩膜形状覆盖了地面实况和 DSC，分别为 82.92% 和 84.19%。评估指标和可视化结果表明，DiffSeg 的表现优于其他经典深度学习网络，尤其是在临床 CT、伪影数据和幻影数据方面：DiffSeg 利用条件动态编码和 GFParser 对伪影数据中的金属掩膜进行了高效、稳健的分割。未来的工作将涉及在金属伪影还原中嵌入金属分割模型，以提高还原效果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

BMC Medical Imaging RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING-

CiteScore

4.60

自引率

3.70%

发文量

198

审稿时长

27 weeks

期刊介绍： BMC Medical Imaging is an open access journal publishing original peer-reviewed research articles in the development, evaluation, and use of imaging techniques and image processing tools to diagnose and manage disease.