L. M. Bonney, G. M. Kalisvaart, F. H. P. van Velden, K. M. Bradley, A. B. Hassan, W. Grootjans, D. R. McGowan
{"title":"Deep learning image enhancement algorithms in PET/CT imaging: a phantom and sarcoma patient radiomic evaluation","authors":"L. M. Bonney, G. M. Kalisvaart, F. H. P. van Velden, K. M. Bradley, A. B. Hassan, W. Grootjans, D. R. McGowan","doi":"10.1007/s00259-025-07149-7","DOIUrl":null,"url":null,"abstract":"<h3 data-test=\"abstract-sub-heading\">Purpose</h3><p>PET/CT imaging data contains a wealth of quantitative information that can provide valuable contributions to characterising tumours. A growing body of work focuses on the use of deep-learning (DL) techniques for denoising PET data. These models are clinically evaluated prior to use, however, quantitative image assessment provides potential for further evaluation. This work uses radiomic features to compare two manufacturer deep-learning (DL) image enhancement algorithms, one of which has been commercialised, against ‘gold-standard’ image reconstruction techniques in phantom data and a sarcoma patient data set (N=20).</p><h3 data-test=\"abstract-sub-heading\">Methods</h3><p>All studies in the retrospective sarcoma clinical [<span>\\(^{18}\\)</span>F]FDG dataset were acquired on either a GE Discovery 690 or 710 PET/CT scanner with volumes segmented by an experienced nuclear medicine radiologist. The modular heterogeneous imaging phantom used in this work was filled with [<span>\\(^{18}\\)</span>F]FDG, and five repeat acquisitions of the phantom were acquired on a GE Discovery 710 PET/CT scanner. The DL-enhanced images were compared to ‘gold-standard’ images the algorithms were trained to emulate and input images. The difference between image sets was tested for significance in 93 international biomarker standardisation initiative (IBSI) standardised radiomic features.</p><h3 data-test=\"abstract-sub-heading\">Results</h3><p>Comparing DL-enhanced images to the ‘gold-standard’, 4.0% and 9.7% radiomic features measured significantly different (p<sub>critical</sub> < 0.0005) in the phantom and patient data respectively (averaged over the two DL algorithms). Larger differences were observed comparing DL-enhanced images to algorithm input images with 29.8% and 43.0% of radiomic features measuring significantly different in the phantom and patient data respectively (averaged over the two DL algorithms).</p><h3 data-test=\"abstract-sub-heading\">Conclusion</h3><p>DL-enhanced images were found to be similar to images generated using the ‘gold-standard’ target image reconstruction method with more than 80% of radiomic features not significantly different in all comparisons across unseen phantom and sarcoma patient data. This result offers insight into the performance of the DL algorithms, and demonstrate potential applications for DL algorithms in harmonisation for radiomics and for radiomic features in quantitative evaluation of DL algorithms.</p>","PeriodicalId":11909,"journal":{"name":"European Journal of Nuclear Medicine and Molecular Imaging","volume":"16 1","pages":""},"PeriodicalIF":7.6000,"publicationDate":"2025-02-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Journal of Nuclear Medicine and Molecular Imaging","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s00259-025-07149-7","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}
引用次数: 0
Abstract
Purpose
PET/CT imaging data contains a wealth of quantitative information that can provide valuable contributions to characterising tumours. A growing body of work focuses on the use of deep-learning (DL) techniques for denoising PET data. These models are clinically evaluated prior to use, however, quantitative image assessment provides potential for further evaluation. This work uses radiomic features to compare two manufacturer deep-learning (DL) image enhancement algorithms, one of which has been commercialised, against ‘gold-standard’ image reconstruction techniques in phantom data and a sarcoma patient data set (N=20).
Methods
All studies in the retrospective sarcoma clinical [\(^{18}\)F]FDG dataset were acquired on either a GE Discovery 690 or 710 PET/CT scanner with volumes segmented by an experienced nuclear medicine radiologist. The modular heterogeneous imaging phantom used in this work was filled with [\(^{18}\)F]FDG, and five repeat acquisitions of the phantom were acquired on a GE Discovery 710 PET/CT scanner. The DL-enhanced images were compared to ‘gold-standard’ images the algorithms were trained to emulate and input images. The difference between image sets was tested for significance in 93 international biomarker standardisation initiative (IBSI) standardised radiomic features.
Results
Comparing DL-enhanced images to the ‘gold-standard’, 4.0% and 9.7% radiomic features measured significantly different (pcritical < 0.0005) in the phantom and patient data respectively (averaged over the two DL algorithms). Larger differences were observed comparing DL-enhanced images to algorithm input images with 29.8% and 43.0% of radiomic features measuring significantly different in the phantom and patient data respectively (averaged over the two DL algorithms).
Conclusion
DL-enhanced images were found to be similar to images generated using the ‘gold-standard’ target image reconstruction method with more than 80% of radiomic features not significantly different in all comparisons across unseen phantom and sarcoma patient data. This result offers insight into the performance of the DL algorithms, and demonstrate potential applications for DL algorithms in harmonisation for radiomics and for radiomic features in quantitative evaluation of DL algorithms.
目的pet /CT成像数据包含丰富的定量信息,可以为肿瘤的表征提供有价值的贡献。越来越多的工作集中在使用深度学习(DL)技术对PET数据进行去噪。这些模型在使用前进行临床评估,然而,定量图像评估提供了进一步评估的潜力。这项工作使用放射学特征来比较两种制造商深度学习(DL)图像增强算法,其中一种已经商业化,与“金标准”图像重建技术在幽灵数据和肉瘤患者数据集(N=20)中进行比较。方法回顾性肉瘤临床[\(^{18}\) F]FDG数据集中的所有研究都是在GE Discovery 690或710 PET/CT扫描仪上获得的,并由经验丰富的核医学放射科医生对体积进行分割。本研究中使用的模块化异构成像模体填充了[\(^{18}\) F]FDG,并在GE Discovery 710 PET/CT扫描仪上获得了五次重复成像模体。将dl增强的图像与“金标准”图像进行比较,训练算法来模拟和输入图像。在93个国际生物标志物标准化倡议(IBSI)标准化放射学特征中,对图像集之间的差异进行了显著性测试。结果将dl增强图像与“金标准”进行比较,4.0% and 9.7% radiomic features measured significantly different (pcritical < 0.0005) in the phantom and patient data respectively (averaged over the two DL algorithms). Larger differences were observed comparing DL-enhanced images to algorithm input images with 29.8% and 43.0% of radiomic features measuring significantly different in the phantom and patient data respectively (averaged over the two DL algorithms).ConclusionDL-enhanced images were found to be similar to images generated using the ‘gold-standard’ target image reconstruction method with more than 80% of radiomic features not significantly different in all comparisons across unseen phantom and sarcoma patient data. This result offers insight into the performance of the DL algorithms, and demonstrate potential applications for DL algorithms in harmonisation for radiomics and for radiomic features in quantitative evaluation of DL algorithms.
期刊介绍:
The European Journal of Nuclear Medicine and Molecular Imaging serves as a platform for the exchange of clinical and scientific information within nuclear medicine and related professions. It welcomes international submissions from professionals involved in the functional, metabolic, and molecular investigation of diseases. The journal's coverage spans physics, dosimetry, radiation biology, radiochemistry, and pharmacy, providing high-quality peer review by experts in the field. Known for highly cited and downloaded articles, it ensures global visibility for research work and is part of the EJNMMI journal family.