Prior Knowledge-Guided Triple-Domain Transformer-GAN for Direct PET Reconstruction From Low-Count Sinograms

IEEE transactions on medical imaging Pub Date : 2024-06-13 DOI:10.1109/TMI.2024.3413832

Jiaqi Cui;Pinxian Zeng;Xinyi Zeng;Yuanyuan Xu;Peng Wang;Jiliu Zhou;Yan Wang;Dinggang Shen

{"title":"Prior Knowledge-Guided Triple-Domain Transformer-GAN for Direct PET Reconstruction From Low-Count Sinograms","authors":"Jiaqi Cui;Pinxian Zeng;Xinyi Zeng;Yuanyuan Xu;Peng Wang;Jiliu Zhou;Yan Wang;Dinggang Shen","doi":"10.1109/TMI.2024.3413832","DOIUrl":null,"url":null,"abstract":"To obtain high-quality positron emission tomography (PET) images while minimizing radiation exposure, numerous methods have been dedicated to acquiring standard-count PET (SPET) from low-count PET (LPET). However, current methods have failed to take full advantage of the different emphasized information from multiple domains, i.e., the sinogram, image, and frequency domains, resulting in the loss of crucial details. Meanwhile, they overlook the unique inner-structure of the sinograms, thereby failing to fully capture its structural characteristics and relationships. To alleviate these problems, in this paper, we proposed a prior knowledge-guided transformer-GAN that unites triple domains of sinogram, image, and frequency to directly reconstruct SPET images from LPET sinograms, namely PK-TriDo. Our PK-TriDo consists of a Sinogram Inner-Structure-based Denoising Transformer (SISD-Former) to denoise the input LPET sinogram, a Frequency-adapted Image Reconstruction Transformer (FaIR-Former) to reconstruct high-quality SPET images from the denoised sinograms guided by the image domain prior knowledge, and an Adversarial Network (AdvNet) to further enhance the reconstruction quality via adversarial training. Specifically tailored for the PET imaging mechanism, we injected a sinogram embedding module that partitions the sinograms by rows and columns to obtain 1D sequences of angles and distances to faithfully preserve the inner-structure of the sinograms. Moreover, to mitigate high-frequency distortions and enhance reconstruction details, we integrated global-local frequency parsers (GLFPs) into FaIR-Former to calibrate the distributions and proportions of different frequency bands, thus compelling the network to preserve high-frequency details. Evaluations on three datasets with different dose levels and imaging scenarios demonstrated that our PK-TriDo outperforms the state-of-the-art methods.","PeriodicalId":94033,"journal":{"name":"IEEE transactions on medical imaging","volume":"43 12","pages":"4174-4189"},"PeriodicalIF":0.0000,"publicationDate":"2024-06-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on medical imaging","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10556760/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

To obtain high-quality positron emission tomography (PET) images while minimizing radiation exposure, numerous methods have been dedicated to acquiring standard-count PET (SPET) from low-count PET (LPET). However, current methods have failed to take full advantage of the different emphasized information from multiple domains, i.e., the sinogram, image, and frequency domains, resulting in the loss of crucial details. Meanwhile, they overlook the unique inner-structure of the sinograms, thereby failing to fully capture its structural characteristics and relationships. To alleviate these problems, in this paper, we proposed a prior knowledge-guided transformer-GAN that unites triple domains of sinogram, image, and frequency to directly reconstruct SPET images from LPET sinograms, namely PK-TriDo. Our PK-TriDo consists of a Sinogram Inner-Structure-based Denoising Transformer (SISD-Former) to denoise the input LPET sinogram, a Frequency-adapted Image Reconstruction Transformer (FaIR-Former) to reconstruct high-quality SPET images from the denoised sinograms guided by the image domain prior knowledge, and an Adversarial Network (AdvNet) to further enhance the reconstruction quality via adversarial training. Specifically tailored for the PET imaging mechanism, we injected a sinogram embedding module that partitions the sinograms by rows and columns to obtain 1D sequences of angles and distances to faithfully preserve the inner-structure of the sinograms. Moreover, to mitigate high-frequency distortions and enhance reconstruction details, we integrated global-local frequency parsers (GLFPs) into FaIR-Former to calibrate the distributions and proportions of different frequency bands, thus compelling the network to preserve high-frequency details. Evaluations on three datasets with different dose levels and imaging scenarios demonstrated that our PK-TriDo outperforms the state-of-the-art methods.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

先验知识指导下的三域变换器-广义正电子发射计算机模型（Triple-Domain Transformer-GAN for Direct PET Reconstruction from Low-Count Sinograms）。

为了获得高质量的正电子发射断层扫描（PET）图像，同时最大限度地减少辐射照射，许多方法都致力于从低计数 PET（LPET）中获取标准计数 PET（SPET）。然而，目前的方法未能充分利用来自多个域（即正弦图、图像和频域）的不同强调信息，导致关键细节丢失。同时，这些方法忽略了正弦曲线独特的内部结构，因而无法全面捕捉其结构特征和关系。为了解决这些问题，我们在本文中提出了一种先验知识指导下的变换器-GAN，即 PK-TriDo，它能将正弦图、图像和频率三重域结合起来，直接从 LPET 正弦图重建 SPET 图像。我们的PK-TriDo由一个基于正弦图内部结构的去噪变换器（SISD-Former）、一个频率适应图像重建变换器（FaIR-Former）和一个对抗网络（Adversarial Network，AdvNet）组成，前者用于对输入的LPET正弦图进行去噪，后者用于在图像域先验知识的指导下从去噪的正弦图重建高质量的SPET图像。针对 PET 成像机制，我们注入了一个正弦图嵌入模块，该模块按行和列对正弦图进行分割，以获得角度和距离的一维序列，从而忠实地保留正弦图的内部结构。此外，为了减轻高频失真并增强重建细节，我们在 FaIR-Former 中集成了全局-局部频率解析器（GLFP），以校准不同频段的分布和比例，从而迫使网络保留高频细节。在三个具有不同剂量水平和成像场景的数据集上进行的评估表明，我们的 PK-TriDo 优于最先进的方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE transactions on medical imaging

自引率

0.00%

发文量