Jens Maus, Pavel Nikulin, Frank Hofheinz, Jan Petr, Anja Braune, Jörg Kotzerke, Jörg van den Hoff
{"title":"基于深度学习的双边滤波技术,用于呼吸门控 PET 的边缘保护去噪。","authors":"Jens Maus, Pavel Nikulin, Frank Hofheinz, Jan Petr, Anja Braune, Jörg Kotzerke, Jörg van den Hoff","doi":"10.1186/s40658-024-00661-z","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Residual image noise is substantial in positron emission tomography (PET) and one of the factors limiting lesion detection, quantification, and overall image quality. Thus, improving noise reduction remains of considerable interest. This is especially true for respiratory-gated PET investigations. The only broadly used approach for noise reduction in PET imaging has been the application of low-pass filters, usually Gaussians, which however leads to loss of spatial resolution and increased partial volume effects affecting detectability of small lesions and quantitative data evaluation. The bilateral filter (BF) - a locally adaptive image filter - allows to reduce image noise while preserving well defined object edges but manual optimization of the filter parameters for a given PET scan can be tedious and time-consuming, hampering its clinical use. In this work we have investigated to what extent a suitable deep learning based approach can resolve this issue by training a suitable network with the target of reproducing the results of manually adjusted case-specific bilateral filtering.</p><p><strong>Methods: </strong>Altogether, 69 respiratory-gated clinical PET/CT scans with three different tracers ( <math> <mrow><msup><mo>[</mo> <mn>18</mn></msup> <mtext>F</mtext> <mo>]</mo></mrow> </math> FDG, <math> <mrow><msup><mo>[</mo> <mn>18</mn></msup> <mtext>F</mtext> <mo>]</mo></mrow> </math> L-DOPA, <math> <mrow><msup><mo>[</mo> <mn>68</mn></msup> <mtext>Ga</mtext> <mo>]</mo></mrow> </math> DOTATATE) were used for the present investigation. Prior to data processing, the gated data sets were split, resulting in a total of 552 single-gate image volumes. For each of these image volumes, four 3D ROIs were delineated: one ROI for image noise assessment and three ROIs for focal uptake (e.g. tumor lesions) measurements at different target/background contrast levels. An automated procedure was used to perform a brute force search of the two-dimensional BF parameter space for each data set to identify the \"optimal\" filter parameters to generate user-approved ground truth input data consisting of pairs of original and optimally BF filtered images. For reproducing the optimal BF filtering, we employed a modified 3D U-Net CNN incorporating residual learning principle. The network training and evaluation was performed using a 5-fold cross-validation scheme. The influence of filtering on lesion SUV quantification and image noise level was assessed by calculating absolute and fractional differences between the CNN, manual BF, or original (STD) data sets in the previously defined ROIs.</p><p><strong>Results: </strong>The automated procedure used for filter parameter determination chose adequate filter parameters for the majority of the data sets with only 19 patient data sets requiring manual tuning. Evaluation of the focal uptake ROIs revealed that CNN as well as BF based filtering essentially maintain the focal <math><msub><mtext>SUV</mtext> <mtext>max</mtext></msub> </math> values of the unfiltered images with a low mean ± SD difference of <math><mrow><mi>δ</mi> <msubsup><mtext>SUV</mtext> <mtext>max</mtext> <mrow><mtext>CNN</mtext> <mo>,</mo> <mtext>STD</mtext></mrow> </msubsup> </mrow> </math> = (-3.9 ± 5.2)% and <math><mrow><mi>δ</mi> <msubsup><mtext>SUV</mtext> <mtext>max</mtext> <mrow><mtext>BF</mtext> <mo>,</mo> <mtext>STD</mtext></mrow> </msubsup> </mrow> </math> = (-4.4 ± 5.3)%. Regarding relative performance of CNN versus BF, both methods lead to very similar <math><msub><mtext>SUV</mtext> <mtext>max</mtext></msub> </math> values in the vast majority of cases with an overall average difference of <math><mrow><mi>δ</mi> <msubsup><mtext>SUV</mtext> <mtext>max</mtext> <mrow><mtext>CNN</mtext> <mo>,</mo> <mtext>BF</mtext></mrow> </msubsup> </mrow> </math> = (0.5 ± 4.8)%. Evaluation of the noise properties showed that CNN filtering mostly satisfactorily reproduces the noise level and characteristics of BF with <math><mrow><mi>δ</mi> <msup><mtext>Noise</mtext> <mrow><mtext>CNN</mtext> <mo>,</mo> <mtext>BF</mtext></mrow> </msup> </mrow> </math> = (5.6 ± 10.5)%. No significant tracer dependent differences between CNN and BF were observed.</p><p><strong>Conclusions: </strong>Our results show that a neural network based denoising can reproduce the results of a case by case optimized BF in a fully automated way. Apart from rare cases it led to images of practically identical quality regarding noise level, edge preservation, and signal recovery. We believe such a network might proof especially useful in the context of improved motion correction of respiratory-gated PET studies but could also help to establish BF-equivalent edge-preserving CNN filtering in clinical PET since it obviates time consuming manual BF parameter tuning.</p>","PeriodicalId":11559,"journal":{"name":"EJNMMI Physics","volume":"11 1","pages":"58"},"PeriodicalIF":3.0000,"publicationDate":"2024-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11231129/pdf/","citationCount":"0","resultStr":"{\"title\":\"Deep learning based bilateral filtering for edge-preserving denoising of respiratory-gated PET.\",\"authors\":\"Jens Maus, Pavel Nikulin, Frank Hofheinz, Jan Petr, Anja Braune, Jörg Kotzerke, Jörg van den Hoff\",\"doi\":\"10.1186/s40658-024-00661-z\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Residual image noise is substantial in positron emission tomography (PET) and one of the factors limiting lesion detection, quantification, and overall image quality. Thus, improving noise reduction remains of considerable interest. This is especially true for respiratory-gated PET investigations. The only broadly used approach for noise reduction in PET imaging has been the application of low-pass filters, usually Gaussians, which however leads to loss of spatial resolution and increased partial volume effects affecting detectability of small lesions and quantitative data evaluation. The bilateral filter (BF) - a locally adaptive image filter - allows to reduce image noise while preserving well defined object edges but manual optimization of the filter parameters for a given PET scan can be tedious and time-consuming, hampering its clinical use. In this work we have investigated to what extent a suitable deep learning based approach can resolve this issue by training a suitable network with the target of reproducing the results of manually adjusted case-specific bilateral filtering.</p><p><strong>Methods: </strong>Altogether, 69 respiratory-gated clinical PET/CT scans with three different tracers ( <math> <mrow><msup><mo>[</mo> <mn>18</mn></msup> <mtext>F</mtext> <mo>]</mo></mrow> </math> FDG, <math> <mrow><msup><mo>[</mo> <mn>18</mn></msup> <mtext>F</mtext> <mo>]</mo></mrow> </math> L-DOPA, <math> <mrow><msup><mo>[</mo> <mn>68</mn></msup> <mtext>Ga</mtext> <mo>]</mo></mrow> </math> DOTATATE) were used for the present investigation. Prior to data processing, the gated data sets were split, resulting in a total of 552 single-gate image volumes. For each of these image volumes, four 3D ROIs were delineated: one ROI for image noise assessment and three ROIs for focal uptake (e.g. tumor lesions) measurements at different target/background contrast levels. An automated procedure was used to perform a brute force search of the two-dimensional BF parameter space for each data set to identify the \\\"optimal\\\" filter parameters to generate user-approved ground truth input data consisting of pairs of original and optimally BF filtered images. For reproducing the optimal BF filtering, we employed a modified 3D U-Net CNN incorporating residual learning principle. The network training and evaluation was performed using a 5-fold cross-validation scheme. The influence of filtering on lesion SUV quantification and image noise level was assessed by calculating absolute and fractional differences between the CNN, manual BF, or original (STD) data sets in the previously defined ROIs.</p><p><strong>Results: </strong>The automated procedure used for filter parameter determination chose adequate filter parameters for the majority of the data sets with only 19 patient data sets requiring manual tuning. Evaluation of the focal uptake ROIs revealed that CNN as well as BF based filtering essentially maintain the focal <math><msub><mtext>SUV</mtext> <mtext>max</mtext></msub> </math> values of the unfiltered images with a low mean ± SD difference of <math><mrow><mi>δ</mi> <msubsup><mtext>SUV</mtext> <mtext>max</mtext> <mrow><mtext>CNN</mtext> <mo>,</mo> <mtext>STD</mtext></mrow> </msubsup> </mrow> </math> = (-3.9 ± 5.2)% and <math><mrow><mi>δ</mi> <msubsup><mtext>SUV</mtext> <mtext>max</mtext> <mrow><mtext>BF</mtext> <mo>,</mo> <mtext>STD</mtext></mrow> </msubsup> </mrow> </math> = (-4.4 ± 5.3)%. Regarding relative performance of CNN versus BF, both methods lead to very similar <math><msub><mtext>SUV</mtext> <mtext>max</mtext></msub> </math> values in the vast majority of cases with an overall average difference of <math><mrow><mi>δ</mi> <msubsup><mtext>SUV</mtext> <mtext>max</mtext> <mrow><mtext>CNN</mtext> <mo>,</mo> <mtext>BF</mtext></mrow> </msubsup> </mrow> </math> = (0.5 ± 4.8)%. Evaluation of the noise properties showed that CNN filtering mostly satisfactorily reproduces the noise level and characteristics of BF with <math><mrow><mi>δ</mi> <msup><mtext>Noise</mtext> <mrow><mtext>CNN</mtext> <mo>,</mo> <mtext>BF</mtext></mrow> </msup> </mrow> </math> = (5.6 ± 10.5)%. No significant tracer dependent differences between CNN and BF were observed.</p><p><strong>Conclusions: </strong>Our results show that a neural network based denoising can reproduce the results of a case by case optimized BF in a fully automated way. Apart from rare cases it led to images of practically identical quality regarding noise level, edge preservation, and signal recovery. We believe such a network might proof especially useful in the context of improved motion correction of respiratory-gated PET studies but could also help to establish BF-equivalent edge-preserving CNN filtering in clinical PET since it obviates time consuming manual BF parameter tuning.</p>\",\"PeriodicalId\":11559,\"journal\":{\"name\":\"EJNMMI Physics\",\"volume\":\"11 1\",\"pages\":\"58\"},\"PeriodicalIF\":3.0000,\"publicationDate\":\"2024-07-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11231129/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"EJNMMI Physics\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1186/s40658-024-00661-z\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"EJNMMI Physics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s40658-024-00661-z","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}
引用次数: 0
摘要
背景:正电子发射计算机断层扫描(PET)的残留图像噪声很大,是限制病灶检测、定量和整体图像质量的因素之一。因此,改善降噪效果仍是一个相当重要的问题。对于呼吸门控正电子发射计算机断层扫描研究来说尤其如此。PET 成像中唯一广泛使用的降噪方法是应用低通滤波器,通常是高斯滤波器,但这会导致空间分辨率的损失和部分容积效应的增加,影响小病灶的检测能力和定量数据的评估。双边滤波器(BF)是一种局部自适应图像滤波器,可在保留清晰物体边缘的同时降低图像噪声,但针对特定 PET 扫描手动优化滤波器参数的工作既繁琐又耗时,妨碍了其临床应用。在这项工作中,我们研究了基于深度学习的合适方法能在多大程度上解决这一问题,方法是训练一个合适的网络,目标是再现手动调整特定病例双边滤波的结果:本研究共使用了69个呼吸门控临床PET/CT扫描,使用了三种不同的示踪剂([ 18 F ] FDG、[ 18 F ] L-DOPA、[ 68 Ga ] DOTATATE)。在进行数据处理之前,对门控数据集进行了拆分,共得到 552 个单门图像卷。每个图像卷都划分了四个三维 ROI:一个 ROI 用于图像噪声评估,三个 ROI 用于不同目标/背景对比度水平下的病灶摄取(如肿瘤病灶)测量。使用自动程序对每个数据集的二维 BF 参数空间进行暴力搜索,以确定 "最佳 "滤波参数,从而生成用户认可的基本真实输入数据,包括原始图像和最佳 BF 滤波图像对。为了再现最佳 BF 滤波,我们采用了一种结合残差学习原理的改进型 3D U-Net CNN。网络的训练和评估采用 5 倍交叉验证方案。通过计算 CNN、手动 BF 或原始(STD)数据集在先前定义的 ROI 中的绝对差异和分数差异,评估了过滤对病变 SUV 定量和图像噪声水平的影响:结果:用于确定滤波参数的自动程序为大多数数据集选择了适当的滤波参数,只有 19 个患者数据集需要手动调整。对病灶摄取 ROI 的评估显示,基于 CNN 和 BF 的滤波基本上保持了未滤波图像的病灶 SUV 最大值,平均±标准差较低,δ SUV max CNN , STD = (-3.9 ± 5.2) %,δ SUV max BF , STD = (-4.4 ± 5.3) %。关于 CNN 与 BF 的相对性能,这两种方法在绝大多数情况下得出的 SUV 最大值非常相似,总体平均差异为 δ SUV max CNN , BF = (0.5 ± 4.8)%。对噪声特性的评估显示,CNN 滤波能令人满意地再现 BF 的噪声水平和特性,δ Noise CNN , BF = (5.6 ± 10.5)%。CNN 和 BF 之间没有观察到明显的示踪剂依赖性差异:我们的研究结果表明,基于神经网络的去噪技术能够以完全自动化的方式重现逐个优化 BF 的结果。除了极少数情况外,它所得到的图像在噪声水平、边缘保留和信号恢复方面的质量几乎相同。我们相信,这种网络在改进呼吸门控 PET 研究的运动校正方面可能特别有用,但也有助于在临床 PET 中建立与 BF 相当的边缘保留 CNN 滤波,因为它避免了耗时的手动 BF 参数调整。
Deep learning based bilateral filtering for edge-preserving denoising of respiratory-gated PET.
Background: Residual image noise is substantial in positron emission tomography (PET) and one of the factors limiting lesion detection, quantification, and overall image quality. Thus, improving noise reduction remains of considerable interest. This is especially true for respiratory-gated PET investigations. The only broadly used approach for noise reduction in PET imaging has been the application of low-pass filters, usually Gaussians, which however leads to loss of spatial resolution and increased partial volume effects affecting detectability of small lesions and quantitative data evaluation. The bilateral filter (BF) - a locally adaptive image filter - allows to reduce image noise while preserving well defined object edges but manual optimization of the filter parameters for a given PET scan can be tedious and time-consuming, hampering its clinical use. In this work we have investigated to what extent a suitable deep learning based approach can resolve this issue by training a suitable network with the target of reproducing the results of manually adjusted case-specific bilateral filtering.
Methods: Altogether, 69 respiratory-gated clinical PET/CT scans with three different tracers ( FDG, L-DOPA, DOTATATE) were used for the present investigation. Prior to data processing, the gated data sets were split, resulting in a total of 552 single-gate image volumes. For each of these image volumes, four 3D ROIs were delineated: one ROI for image noise assessment and three ROIs for focal uptake (e.g. tumor lesions) measurements at different target/background contrast levels. An automated procedure was used to perform a brute force search of the two-dimensional BF parameter space for each data set to identify the "optimal" filter parameters to generate user-approved ground truth input data consisting of pairs of original and optimally BF filtered images. For reproducing the optimal BF filtering, we employed a modified 3D U-Net CNN incorporating residual learning principle. The network training and evaluation was performed using a 5-fold cross-validation scheme. The influence of filtering on lesion SUV quantification and image noise level was assessed by calculating absolute and fractional differences between the CNN, manual BF, or original (STD) data sets in the previously defined ROIs.
Results: The automated procedure used for filter parameter determination chose adequate filter parameters for the majority of the data sets with only 19 patient data sets requiring manual tuning. Evaluation of the focal uptake ROIs revealed that CNN as well as BF based filtering essentially maintain the focal values of the unfiltered images with a low mean ± SD difference of = (-3.9 ± 5.2)% and = (-4.4 ± 5.3)%. Regarding relative performance of CNN versus BF, both methods lead to very similar values in the vast majority of cases with an overall average difference of = (0.5 ± 4.8)%. Evaluation of the noise properties showed that CNN filtering mostly satisfactorily reproduces the noise level and characteristics of BF with = (5.6 ± 10.5)%. No significant tracer dependent differences between CNN and BF were observed.
Conclusions: Our results show that a neural network based denoising can reproduce the results of a case by case optimized BF in a fully automated way. Apart from rare cases it led to images of practically identical quality regarding noise level, edge preservation, and signal recovery. We believe such a network might proof especially useful in the context of improved motion correction of respiratory-gated PET studies but could also help to establish BF-equivalent edge-preserving CNN filtering in clinical PET since it obviates time consuming manual BF parameter tuning.
期刊介绍:
EJNMMI Physics is an international platform for scientists, users and adopters of nuclear medicine with a particular interest in physics matters. As a companion journal to the European Journal of Nuclear Medicine and Molecular Imaging, this journal has a multi-disciplinary approach and welcomes original materials and studies with a focus on applied physics and mathematics as well as imaging systems engineering and prototyping in nuclear medicine. This includes physics-driven approaches or algorithms supported by physics that foster early clinical adoption of nuclear medicine imaging and therapy.