Task-based assessment for neural networks: evaluating undersampled MRI reconstructions based on human observer signal detection.

IF 1.9 Q3 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING Journal of Medical Imaging Pub Date : 2024-07-01 Epub Date: 2024-08-13 DOI:10.1117/1.JMI.11.4.045503

Joshua D Herman, Rachel E Roca, Alexandra G O'Neill, Marcus L Wong, Sajan Goud Lingala, Angel R Pineda

{"title":"Task-based assessment for neural networks: evaluating undersampled MRI reconstructions based on human observer signal detection.","authors":"Joshua D Herman, Rachel E Roca, Alexandra G O'Neill, Marcus L Wong, Sajan Goud Lingala, Angel R Pineda","doi":"10.1117/1.JMI.11.4.045503","DOIUrl":null,"url":null,"abstract":"Purpose: Recent research explores using neural networks to reconstruct undersampled magnetic resonance imaging. Because of the complexity of the artifacts in the reconstructed images, there is a need to develop task-based approaches to image quality. We compared conventional global quantitative metrics to evaluate image quality in undersampled images generated by a neural network with human observer performance in a detection task. The purpose is to study which acceleration (2×, 3×, 4×, 5×) would be chosen with the conventional metrics and compare it to the acceleration chosen by human observer performance.Approach: We used common global metrics for evaluating image quality: the normalized root mean squared error (NRMSE) and structural similarity (SSIM). These metrics are compared with a measure of image quality that incorporates a subtle signal for a specific task to allow for image quality assessment that locally evaluates the effect of undersampling on a signal. We used a U-Net to reconstruct under-sampled images with 2×, 3×, 4×, and 5× one-dimensional undersampling rates. Cross-validation was performed for a 500- and a 4000-image training set with both SSIM and MSE losses. A two-alternative forced choice (2-AFC) observer study was carried out for detecting a subtle signal (small blurred disk) from images with the 4000-image training set.Results: We found that for both loss functions, the human observer performance on the 2-AFC studies led to a choice of a 2× undersampling, but the SSIM and NRMSE led to a choice of a 3× undersampling.Conclusions: For this detection task using a subtle small signal at the edge of detectability, SSIM and NRMSE led to an overestimate of the achievable undersampling using a U-Net before a steep loss of image quality between 2×, 3×, 4×, 5× undersampling rates when compared to the performance of human observers in the detection task.","PeriodicalId":47707,"journal":{"name":"Journal of Medical Imaging","volume":"11 4","pages":"045503"},"PeriodicalIF":1.9000,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11321363/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Medical Imaging","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1117/1.JMI.11.4.045503","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/8/13 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}

引用次数: 0

Abstract

Purpose: Recent research explores using neural networks to reconstruct undersampled magnetic resonance imaging. Because of the complexity of the artifacts in the reconstructed images, there is a need to develop task-based approaches to image quality. We compared conventional global quantitative metrics to evaluate image quality in undersampled images generated by a neural network with human observer performance in a detection task. The purpose is to study which acceleration (2×, 3×, 4×, 5×) would be chosen with the conventional metrics and compare it to the acceleration chosen by human observer performance.

Approach: We used common global metrics for evaluating image quality: the normalized root mean squared error (NRMSE) and structural similarity (SSIM). These metrics are compared with a measure of image quality that incorporates a subtle signal for a specific task to allow for image quality assessment that locally evaluates the effect of undersampling on a signal. We used a U-Net to reconstruct under-sampled images with 2×, 3×, 4×, and 5× one-dimensional undersampling rates. Cross-validation was performed for a 500- and a 4000-image training set with both SSIM and MSE losses. A two-alternative forced choice (2-AFC) observer study was carried out for detecting a subtle signal (small blurred disk) from images with the 4000-image training set.

Results: We found that for both loss functions, the human observer performance on the 2-AFC studies led to a choice of a 2× undersampling, but the SSIM and NRMSE led to a choice of a 3× undersampling.

Conclusions: For this detection task using a subtle small signal at the edge of detectability, SSIM and NRMSE led to an overestimate of the achievable undersampling using a U-Net before a steep loss of image quality between 2×, 3×, 4×, 5× undersampling rates when compared to the performance of human observers in the detection task.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于任务的神经网络评估：基于人类观察者信号检测评估欠采样磁共振成像重建。

研究目的最近的研究探索利用神经网络重建欠采样磁共振成像。由于重建图像中伪影的复杂性，需要开发基于任务的图像质量方法。我们将神经网络生成的欠采样图像中用于评估图像质量的传统全局定量指标与人类观察者在检测任务中的表现进行了比较。目的是研究传统指标会选择哪种加速度（2 倍、3 倍、4 倍、5 倍），并将其与人类观察者表现所选择的加速度进行比较：我们使用常见的全局指标来评估图像质量：归一化均方根误差 (NRMSE) 和结构相似性 (SSIM)。我们将这些指标与一种图像质量度量方法进行了比较，该方法结合了特定任务的微妙信号，可在局部评估欠采样对信号的影响，从而进行图像质量评估。我们使用 U-Net 重构欠采样图像，欠采样率分别为 2 倍、3 倍、4 倍和 5 倍。我们使用 SSIM 和 MSE 损失对 500 和 4000 图像训练集进行了交叉验证。在使用 4000 张图像训练集检测图像中的微弱信号（模糊的小圆盘）时，进行了双备选强制选择（2-AFC）观察者研究：结果：我们发现，对于两种损失函数，人类观察者在 2-AFC 研究中的表现都导致选择 2 倍的欠采样，但 SSIM 和 NRMSE 则导致选择 3 倍的欠采样：结论：与人类观察者在检测任务中的表现相比，在使用处于可检测边缘的微妙小信号的检测任务中，SSIM 和 NRMSE 会导致在 2 倍、3 倍、4 倍和 5 倍下采样率之间图像质量急剧下降之前，使用 U-Net 高估可实现的下采样率。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Journal of Medical Imaging RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING-

CiteScore

4.10

自引率

4.20%

发文量

期刊介绍： JMI covers fundamental and translational research, as well as applications, focused on medical imaging, which continue to yield physical and biomedical advancements in the early detection, diagnostics, and therapy of disease as well as in the understanding of normal. The scope of JMI includes: Imaging physics, Tomographic reconstruction algorithms (such as those in CT and MRI), Image processing and deep learning, Computer-aided diagnosis and quantitative image analysis, Visualization and modeling, Picture archiving and communications systems (PACS), Image perception and observer performance, Technology assessment, Ultrasonic imaging, Image-guided procedures, Digital pathology, Biomedical applications of biomedical imaging. JMI allows for the peer-reviewed communication and archiving of scientific developments, translational and clinical applications, reviews, and recommendations for the field.