Generative Adversarial Network With Robust Discriminator Through Multi-Task Learning for Low-Dose CT Denoising

IEEE transactions on medical imaging Pub Date : 2024-08-26 DOI:10.1109/TMI.2024.3449647

Sunggu Kyung;Jongjun Won;Seongyong Pak;Sunwoo Kim;Sangyoon Lee;Kanggil Park;Gil-Sun Hong;Namkug Kim

{"title":"Generative Adversarial Network With Robust Discriminator Through Multi-Task Learning for Low-Dose CT Denoising","authors":"Sunggu Kyung;Jongjun Won;Seongyong Pak;Sunwoo Kim;Sangyoon Lee;Kanggil Park;Gil-Sun Hong;Namkug Kim","doi":"10.1109/TMI.2024.3449647","DOIUrl":null,"url":null,"abstract":"Reducing the dose of radiation in computed tomography (CT) is vital to decreasing secondary cancer risk. However, the use of low-dose CT (LDCT) images is accompanied by increased noise that can negatively impact diagnoses. Although numerous deep learning algorithms have been developed for LDCT denoising, several challenges persist, including the visual incongruence experienced by radiologists, unsatisfactory performances across various metrics, and insufficient exploration of the networks’ robustness in other CT domains. To address such issues, this study proposes three novel accretions. First, we propose a generative adversarial network (GAN) with a robust discriminator through multi-task learning that simultaneously performs three vision tasks: restoration, image-level, and pixel-level decisions. The more multi-tasks that are performed, the better the denoising performance of the generator, which means multi-task learning enables the discriminator to provide more meaningful feedback to the generator. Second, two regulatory mechanisms, restoration consistency (RC) and non-difference suppression (NDS), are introduced to improve the discriminator’s representation capabilities. These mechanisms eliminate irrelevant regions and compare the discriminator’s results from the input and restoration, thus facilitating effective GAN training. Lastly, we incorporate residual fast Fourier transforms with convolution (Res-FFT-Conv) blocks into the generator to utilize both frequency and spatial representations. This approach provides mixed receptive fields by using spatial (or local), spectral (or global), and residual connections. Our model was evaluated using various pixel- and feature-space metrics in two denoising tasks. Additionally, we conducted visual scoring with radiologists. The results indicate superior performance in both quantitative and qualitative measures compared to state-of-the-art denoising techniques.","PeriodicalId":94033,"journal":{"name":"IEEE transactions on medical imaging","volume":"44 1","pages":"499-518"},"PeriodicalIF":0.0000,"publicationDate":"2024-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on medical imaging","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10646533/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Reducing the dose of radiation in computed tomography (CT) is vital to decreasing secondary cancer risk. However, the use of low-dose CT (LDCT) images is accompanied by increased noise that can negatively impact diagnoses. Although numerous deep learning algorithms have been developed for LDCT denoising, several challenges persist, including the visual incongruence experienced by radiologists, unsatisfactory performances across various metrics, and insufficient exploration of the networks’ robustness in other CT domains. To address such issues, this study proposes three novel accretions. First, we propose a generative adversarial network (GAN) with a robust discriminator through multi-task learning that simultaneously performs three vision tasks: restoration, image-level, and pixel-level decisions. The more multi-tasks that are performed, the better the denoising performance of the generator, which means multi-task learning enables the discriminator to provide more meaningful feedback to the generator. Second, two regulatory mechanisms, restoration consistency (RC) and non-difference suppression (NDS), are introduced to improve the discriminator’s representation capabilities. These mechanisms eliminate irrelevant regions and compare the discriminator’s results from the input and restoration, thus facilitating effective GAN training. Lastly, we incorporate residual fast Fourier transforms with convolution (Res-FFT-Conv) blocks into the generator to utilize both frequency and spatial representations. This approach provides mixed receptive fields by using spatial (or local), spectral (or global), and residual connections. Our model was evaluated using various pixel- and feature-space metrics in two denoising tasks. Additionally, we conducted visual scoring with radiologists. The results indicate superior performance in both quantitative and qualitative measures compared to state-of-the-art denoising techniques.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

通过多任务学习为低剂量 CT 去噪提供具有鲁棒判别器的生成对抗网络

减少计算机断层扫描（CT）的辐射剂量对于降低继发性癌症风险至关重要。然而，低剂量 CT（LDCT）图像的使用伴随着噪声的增加，会对诊断产生负面影响。虽然针对 LDCT 去噪已经开发出了许多深度学习算法，但仍存在一些挑战，包括放射科医生体验到的视觉不协调、各种指标的表现不尽如人意，以及对网络在其他 CT 领域的鲁棒性探索不足。为了解决这些问题，本研究提出了三个新的增量。首先，我们提出了一种生成式对抗网络（GAN），该网络通过多任务学习具有鲁棒性判别器，可同时执行三项视觉任务：还原、图像级和像素级决策。执行的多任务越多，生成器的去噪性能就越好，这意味着多任务学习能让判别器为生成器提供更有意义的反馈。其次，为了提高鉴别器的表征能力，引入了两种调节机制，即恢复一致性（RC）和无差异抑制（NDS）。这些机制可以消除无关区域，并比较鉴别器从输入和恢复中得到的结果，从而促进有效的 GAN 训练。最后，我们将残差快速傅立叶变换与卷积（Res-FFT-Conv）块纳入生成器，以利用频率和空间表示。这种方法通过使用空间（或局部）、频谱（或全局）和残差连接来提供混合感受野。我们在两项去噪任务中使用各种像素和特征空间指标对我们的模型进行了评估。此外，我们还与放射科医生进行了视觉评分。结果表明，与最先进的去噪技术相比，我们的模型在定量和定性测量方面都表现出色。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助