首页 > 最新文献

European Radiology Experimental最新文献

英文 中文
Can AI write reports like a radiologist? A blinded evaluation of large language model-generated lumbar spine MRI reports. 人工智能能像放射科医生一样写报告吗?对大型语言模型生成的腰椎MRI报告进行盲法评估。
IF 3.6 Q1 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING Pub Date : 2026-02-23 DOI: 10.1186/s41747-026-00682-6
Moreno Zanardo, Domenico Albano, Valentina Molinari, Renato Fabrizio, Martina Conca, Luigi Asmundo, Francesco Pardo, Francesco Traina, Michele Montechiari, Salvatore Gitto, Luca Maria Sconfienza

Background: To compare the quality and clinical usefulness of large language model (LLM)-generated lumbar spine magnetic resonance imaging (MRI) reports with radiologist-written ones and assess whether medical professionals can distinguish between them.

Materials and methods: This retrospective observational single-center study was approved by the local ethics committee. A total of 125 lumbar spine MRI reports (104 human-written, 21 LLM-generated using ChatGPT-4o) were anonymized, randomized, and blindly evaluated by five medical professionals (one board-certified radiologist, two radiology residents, one general practitioner, one orthopedic surgeon), all with basic familiarity with LLM. Each report was scored on a five-point Likert scale for clinical relevance, clarity, completeness, diagnostic accuracy, and intelligibility, whereas general practitioner and orthopedic surgeon evaluated intelligibility only. Evaluators also classified each report as AI-generated or human-written. Accuracy was defined as the proportion of correctly classified reports in distinguishing LLM-generated from radiologist-written texts. Mann-Whitney U or Student's t-tests were used.

Results: Radiologists' reports consistently received higher median scores across all domains (p < 0.001). No differences were found in the description of the imaging technique (p > 0.175). No clinically false statements were identified in the LLM-generated reports. Identification accuracy varied widely among evaluators: Board-certified radiologist achieved 88.0% accuracy (sensitivity 66.7%, specificity 92.3%), Resident 1 65.6% (14.3%, 76.0%), Resident 2 94.4% (66.7%, 100%), orthopedic surgeon 78.4% (90.5%, 76.0%) and general practitioner 65.6% (81.0%, 62.5%).

Conclusion: Radiologist-written lumbar spine MRI reports outperform LLM-generated reports in quality and structure. However, some AI-generated reports were indistinguishable from human ones, particularly for non-specialized readers. LLMs may support radiologists in structured reporting and improve workflow efficiency, while maintaining diagnostic reliability.

Relevance statement: Large language models can draft lumbar spine MRI reports, but currently lack the quality and consistency of radiologist reports. With radiologist supervision, large language models may improve reporting efficiency while preserving diagnostic reliability and supporting clinical decision-making.

Key points: LLM-generated reports are clinically coherent and stylistically comparable to those written by expert radiologists. Radiologist-written reports scored significantly higher for clinical relevance, findings, and structure. LLM-generated reports were sometimes misclassified as human-written by clinicians.

背景:比较大语言模型(LLM)生成的腰椎磁共振成像(MRI)报告与放射科医生撰写的报告的质量和临床用途,并评估医疗专业人员是否能够区分它们。材料和方法:本回顾性观察性单中心研究经当地伦理委员会批准。共有125份腰椎MRI报告(104份人为撰写,21份使用chatgpt - 40生成的LLM)由5名医学专业人员(1名委员会认证的放射科医生,2名放射科住院医师,1名全科医生,1名骨科医生)匿名、随机和盲目评估,所有人都对LLM有基本的了解。每份报告的临床相关性、清晰度、完整性、诊断准确性和可理解性以5分Likert量表评分,而全科医生和骨科医生仅评估可理解性。评估人员还将每份报告分类为人工智能生成或人工撰写。准确性定义为区分法学硕士生成的报告与放射科医生撰写的文本的正确分类报告的比例。使用Mann-Whitney U或Student's t检验。结果:放射科医生的报告在所有领域中均获得较高的中位数得分(p 0.175)。法学硕士生成的报告中没有发现临床虚假陈述。评估人员的识别准确率差异很大:委员会认证的放射科医师的准确率为88.0%(敏感性66.7%,特异性92.3%),住院医师1的准确率为65.6%(14.3%,76.0%),住院医师2的准确率为94.4%(66.7%,100%),骨科医生78.4%(90.5%,76.0%),全科医生65.6%(81.0%,62.5%)。结论:放射科医师撰写的腰椎MRI报告在质量和结构上优于llm生成的报告。然而,一些人工智能生成的报告与人类报告难以区分,特别是对于非专业读者。法学硕士可以支持放射科医生进行结构化报告,提高工作流程效率,同时保持诊断的可靠性。相关性声明:大型语言模型可以起草腰椎MRI报告,但目前缺乏放射科医生报告的质量和一致性。在放射科医生的监督下,大型语言模型可以提高报告效率,同时保持诊断的可靠性并支持临床决策。重点:法学硕士生成的报告在临床上是连贯的,并且在风格上可与放射科专家撰写的报告相媲美。放射科医师撰写的报告在临床相关性、发现和结构方面得分明显更高。法学硕士生成的报告有时被错误地归类为临床医生撰写的人工报告。
{"title":"Can AI write reports like a radiologist? A blinded evaluation of large language model-generated lumbar spine MRI reports.","authors":"Moreno Zanardo, Domenico Albano, Valentina Molinari, Renato Fabrizio, Martina Conca, Luigi Asmundo, Francesco Pardo, Francesco Traina, Michele Montechiari, Salvatore Gitto, Luca Maria Sconfienza","doi":"10.1186/s41747-026-00682-6","DOIUrl":"10.1186/s41747-026-00682-6","url":null,"abstract":"<p><strong>Background: </strong>To compare the quality and clinical usefulness of large language model (LLM)-generated lumbar spine magnetic resonance imaging (MRI) reports with radiologist-written ones and assess whether medical professionals can distinguish between them.</p><p><strong>Materials and methods: </strong>This retrospective observational single-center study was approved by the local ethics committee. A total of 125 lumbar spine MRI reports (104 human-written, 21 LLM-generated using ChatGPT-4o) were anonymized, randomized, and blindly evaluated by five medical professionals (one board-certified radiologist, two radiology residents, one general practitioner, one orthopedic surgeon), all with basic familiarity with LLM. Each report was scored on a five-point Likert scale for clinical relevance, clarity, completeness, diagnostic accuracy, and intelligibility, whereas general practitioner and orthopedic surgeon evaluated intelligibility only. Evaluators also classified each report as AI-generated or human-written. Accuracy was defined as the proportion of correctly classified reports in distinguishing LLM-generated from radiologist-written texts. Mann-Whitney U or Student's t-tests were used.</p><p><strong>Results: </strong>Radiologists' reports consistently received higher median scores across all domains (p < 0.001). No differences were found in the description of the imaging technique (p > 0.175). No clinically false statements were identified in the LLM-generated reports. Identification accuracy varied widely among evaluators: Board-certified radiologist achieved 88.0% accuracy (sensitivity 66.7%, specificity 92.3%), Resident 1 65.6% (14.3%, 76.0%), Resident 2 94.4% (66.7%, 100%), orthopedic surgeon 78.4% (90.5%, 76.0%) and general practitioner 65.6% (81.0%, 62.5%).</p><p><strong>Conclusion: </strong>Radiologist-written lumbar spine MRI reports outperform LLM-generated reports in quality and structure. However, some AI-generated reports were indistinguishable from human ones, particularly for non-specialized readers. LLMs may support radiologists in structured reporting and improve workflow efficiency, while maintaining diagnostic reliability.</p><p><strong>Relevance statement: </strong>Large language models can draft lumbar spine MRI reports, but currently lack the quality and consistency of radiologist reports. With radiologist supervision, large language models may improve reporting efficiency while preserving diagnostic reliability and supporting clinical decision-making.</p><p><strong>Key points: </strong>LLM-generated reports are clinically coherent and stylistically comparable to those written by expert radiologists. Radiologist-written reports scored significantly higher for clinical relevance, findings, and structure. LLM-generated reports were sometimes misclassified as human-written by clinicians.</p>","PeriodicalId":36926,"journal":{"name":"European Radiology Experimental","volume":"10 1","pages":""},"PeriodicalIF":3.6,"publicationDate":"2026-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12929739/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147272217","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Acquisition, image quality, and PI-RADS agreement of ultrahigh-gradient DWI in prostate 3-T MRI. 前列腺3-T MRI超高梯度DWI的采集、图像质量和PI-RADS一致性。
IF 3.6 Q1 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING Pub Date : 2026-02-23 DOI: 10.1186/s41747-026-00684-4
Leon M Bischoff, Christoph Endler, Philipp Krausewitz, Joerg Ellinger, Niklas Klümper, Alexander Isaak, Narine Mesropyan, Dmitrij Kravchenko, Daniel Kuetting, Alois M Sprinkart, Petra Mürtz, Claus C Pieper, Julian A Luetkens

Objective: New magnetic resonance imaging (MRI) gradient technology enables the acquisition of ultrahigh b-value diffusion-weighted imaging (DWI). We assessed its impact on image quality and Prostate Imaging Reporting and Data System (PI-RADS) scores in prostate MRI.

Materials and methods: Participants with cancer suspicion prospectively underwent 3-T prostate MRI (maximum gradient strength 200 mT/m). Sequences with b-values of 0/800, 1,500, 2,500, 3,500, and 4,500 s/mm² were acquired. Lesion conspicuity was rated from 1 (non-diagnostic) to 5 (excellent). Apparent signal-to-noise ratios (aSNR) and acquisition times were determined. Cumulative link mixed-effects models, repeated measures ANOVA, and Cohen/Fleiss κ statistics were used.

Results: A total of 107 participants, aged 67 ± 8 years (mean ± standard deviation), were included. Compared to DWI(b1500), the DWI(b2500), DWI(b3500), and DWI(b4500) acquisitions were worse regarding both lesion conspicuity (median score, 5 [interquartile interval 4-5] versus 4 [3-4] versus 2 [2-3] versus 2 [1-2], respectively; all p < 0.001) and aSNR (19.0 ± 7.5 versus 12.7 ± 4.8 versus 11.8 ± 4.1 versus 11.4 ± 2.6, respectively; all p < 0.001). Acquisition times increased from DWI(b1500) (107 ± 9 s) to DWI(b4500) (329 ± 26 s). Cohen κ for PI-RADS score agreement was good to moderate (DWI(b2500): 0.87 [confidence interval 0.81, 0.94]; DWI(b3500): 0.75 [0.65, 0.84]; DWI(4500): 0.61 [0.49, 0.72]).

Conclusion: Acquired ultrahigh gradient DWI sequences with ultrahigh b-values in prostate MRI had worse image quality than standard b-values, while PI-RADS agreement between DWI(b1500) and DWI(b2500) was good. However, diagnostic estimates for clinically significant prostate carcinoma remained limited due to a small biopsy sample size (50/107 patients).

Relevance statement: Ultrahigh b-value DWI showed no improved diagnostic performance in comparison to standard b-value DWI regarding the identification of potential prostate cancer. Ultrahigh b-value should not replace standard high b-values (1,500 s/mm²) for imaging workup of patients with suspicion for prostate cancer.

Key points: Acquired ultrahigh b-values (b2500-4500) using ultrahigh gradients of up to 140 T/m were utilized for prostate DWI. Both, overall image quality and diagnostic confidence decreased from good for DWI(b1500) to non-diagnostic for DWI(b4500). PI-RADS agreement between DWI(b1500) and DWI(b2500) was good, while it was only moderate between DWI(b1500) and DWI(b4500).

目的:新型磁共振成像(MRI)梯度技术实现了超高b值弥散加权成像(DWI)的采集。我们评估了其对前列腺MRI图像质量和前列腺成像报告和数据系统(PI-RADS)评分的影响。材料与方法:有癌症嫌疑的受试者前瞻性行3-T前列腺MRI(最大梯度强度200mt /m)。获得b值分别为0/800、1,500、2,500、3,500和4,500 s/mm²的序列。病变显著性评分从1(无诊断性)到5(极好)。测定表观信噪比(aSNR)和采集时间。采用累积关联混合效应模型、重复测量方差分析和Cohen/Fleiss κ统计。结果:共纳入107例受试者,年龄67±8岁(平均±标准差)。与DWI(b1500)相比,DWI(b2500)、DWI(b3500)和DWI(b4500)采集的病变显著性较差(中位数评分分别为5[四分位间隔4-5]、4[3-4]、2[2-3]、2[1-2]);均p结论:获得的前列腺MRI超高b值的超高梯度DWI序列的图像质量较标准b值差,而DWI(b1500)和DWI(b2500)之间的PI-RADS一致性较好。然而,由于活检样本量小(50/107例患者),对具有临床意义的前列腺癌的诊断估计仍然有限。相关声明:与标准b值DWI相比,超高b值DWI在鉴别潜在前列腺癌方面没有提高诊断性能。超高b值不应取代标准高b值(1500 s/mm²)用于前列腺癌疑似患者的影像学检查。重点:前列腺DWI采用高达140 T/m的超高梯度获取超高b值(b2500-4500)。两者的整体图像质量和诊断信心都从DWI的良好(b1500)下降到DWI的不可诊断(b4500)。DWI(b1500)与DWI(b2500)之间的PI-RADS一致性较好,而DWI(b1500)与DWI(b4500)之间的PI-RADS一致性较差。
{"title":"Acquisition, image quality, and PI-RADS agreement of ultrahigh-gradient DWI in prostate 3-T MRI.","authors":"Leon M Bischoff, Christoph Endler, Philipp Krausewitz, Joerg Ellinger, Niklas Klümper, Alexander Isaak, Narine Mesropyan, Dmitrij Kravchenko, Daniel Kuetting, Alois M Sprinkart, Petra Mürtz, Claus C Pieper, Julian A Luetkens","doi":"10.1186/s41747-026-00684-4","DOIUrl":"10.1186/s41747-026-00684-4","url":null,"abstract":"<p><strong>Objective: </strong>New magnetic resonance imaging (MRI) gradient technology enables the acquisition of ultrahigh b-value diffusion-weighted imaging (DWI). We assessed its impact on image quality and Prostate Imaging Reporting and Data System (PI-RADS) scores in prostate MRI.</p><p><strong>Materials and methods: </strong>Participants with cancer suspicion prospectively underwent 3-T prostate MRI (maximum gradient strength 200 mT/m). Sequences with b-values of 0/800, 1,500, 2,500, 3,500, and 4,500 s/mm² were acquired. Lesion conspicuity was rated from 1 (non-diagnostic) to 5 (excellent). Apparent signal-to-noise ratios (aSNR) and acquisition times were determined. Cumulative link mixed-effects models, repeated measures ANOVA, and Cohen/Fleiss κ statistics were used.</p><p><strong>Results: </strong>A total of 107 participants, aged 67 ± 8 years (mean ± standard deviation), were included. Compared to DWI(b1500), the DWI(b2500), DWI(b3500), and DWI(b4500) acquisitions were worse regarding both lesion conspicuity (median score, 5 [interquartile interval 4-5] versus 4 [3-4] versus 2 [2-3] versus 2 [1-2], respectively; all p < 0.001) and aSNR (19.0 ± 7.5 versus 12.7 ± 4.8 versus 11.8 ± 4.1 versus 11.4 ± 2.6, respectively; all p < 0.001). Acquisition times increased from DWI(b1500) (107 ± 9 s) to DWI(b4500) (329 ± 26 s). Cohen κ for PI-RADS score agreement was good to moderate (DWI(b2500): 0.87 [confidence interval 0.81, 0.94]; DWI(b3500): 0.75 [0.65, 0.84]; DWI(4500): 0.61 [0.49, 0.72]).</p><p><strong>Conclusion: </strong>Acquired ultrahigh gradient DWI sequences with ultrahigh b-values in prostate MRI had worse image quality than standard b-values, while PI-RADS agreement between DWI(b1500) and DWI(b2500) was good. However, diagnostic estimates for clinically significant prostate carcinoma remained limited due to a small biopsy sample size (50/107 patients).</p><p><strong>Relevance statement: </strong>Ultrahigh b-value DWI showed no improved diagnostic performance in comparison to standard b-value DWI regarding the identification of potential prostate cancer. Ultrahigh b-value should not replace standard high b-values (1,500 s/mm²) for imaging workup of patients with suspicion for prostate cancer.</p><p><strong>Key points: </strong>Acquired ultrahigh b-values (b2500-4500) using ultrahigh gradients of up to 140 T/m were utilized for prostate DWI. Both, overall image quality and diagnostic confidence decreased from good for DWI(b1500) to non-diagnostic for DWI(b4500). PI-RADS agreement between DWI(b1500) and DWI(b2500) was good, while it was only moderate between DWI(b1500) and DWI(b4500).</p>","PeriodicalId":36926,"journal":{"name":"European Radiology Experimental","volume":"10 1","pages":""},"PeriodicalIF":3.6,"publicationDate":"2026-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12929734/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147272260","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep learning for synthetic PET imaging: a systematic mapping review of techniques, metrics, and clinical relevance. 合成PET成像的深度学习:技术、指标和临床相关性的系统制图回顾。
IF 3.6 Q1 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING Pub Date : 2026-02-09 DOI: 10.1186/s41747-025-00651-5
Maria Vaccaro, Enrico Rosa, Elisa Placidi, Alessia Guarnera, Aurelio Secinaro, Carlo Gandolfo, Maria Carmen Garganese, Antonio Napolitano
<p><strong>Background: </strong>Synthetic positron emission tomography (PET) imaging, enabled by deep learning, represents a promising approach to minimize radiation exposure while preserving diagnostic accuracy. However, variability in methodologies, performance metrics, and clinical applications needs to be assessed. This systematic mapping review examines the current state of research in synthetic PET generation, analyzing their methodological frameworks and evaluating the clinical relevance.</p><p><strong>Materials and methods: </strong>A systematic search in Scopus, PubMed, and Google Scholar (2019-2024) identified peer-reviewed studies on deep learning-based synthetic PET. Review articles, conference abstracts, and inaccessible full texts were excluded. Data extraction covered study characteristics, imaging modalities, architectures, and evaluation metrics. Due to study heterogeneity, the risk of bias was not formally assessed. Results were synthesized through descriptive and quantitative analysis.</p><p><strong>Results: </strong>Of the initial 116 studies retrieved, 34 were included, 25 of them (73.5%) on brain/neuro using magnetic resonance imaging, computed tomography, or low-dose PET data to generate full-dose or tracer-specific PET. Common architectures included convolutional neural networks, generative adversarial networks, and U-Nets. Peak signal-to-noise ratio (PSNR) ranged 22.69-56.87 dB, structural similarity index measure (SSIM) 0.38-1.00 and mean absolute error (MAE) 1.37-72.00%. Whole-body applications were less frequent (9/34, 26.5%) but showed improvements in oncologic imaging, in particular for tumor detection and image quality. Despite promising advancements, challenges remain, including limited data availability, variability in tracer uptake, and the lack of standardized evaluation metrics. The absence of large/multicenter datasets limits the generalizability of findings.</p><p><strong>Conclusions: </strong>This review highlights promising advancements in synthetic PET imaging using deep learning, with several studies demonstrating the potential for high-quality image generation and substantially reduced radiation exposure. These developments are particularly significant in pediatric populations, where minimizing radiation dose is crucial to ensure patient safety and long-term health. Nonetheless, methodological variability and limited clinical validation continue to pose substantial challenges. Future research should prioritize the development of standardized evaluation protocols, the use of larger and more diverse datasets-including pediatric cohorts-and comprehensive real-world clinical validation to support the safe and effective translation of synthetic PET techniques into clinical practice.</p><p><strong>Relevance statement: </strong>Deep learning-based synthetic PET imaging enhances diagnostics while reducing radiation, but requires methodological standardization and clinical validation for broader adoption.</p><p><
背景:合成正电子发射断层扫描(PET)成像,通过深度学习实现,代表了一种有前途的方法,以尽量减少辐射暴露,同时保持诊断的准确性。然而,在方法、性能指标和临床应用方面的可变性需要进行评估。这一系统的地图回顾检查了合成PET生成的研究现状,分析了他们的方法框架并评估了临床相关性。材料和方法:系统检索Scopus、PubMed和谷歌Scholar(2019-2024),确定了基于深度学习的合成PET的同行评审研究。综述文章、会议摘要和无法访问的全文被排除在外。数据提取包括研究特征、成像方式、架构和评估指标。由于研究异质性,未正式评估偏倚风险。通过描述性和定量分析对结果进行综合。结果:在最初的116项研究中,34项被纳入,其中25项(73.5%)使用磁共振成像、计算机断层扫描或低剂量PET数据产生全剂量或示踪剂特异性PET。常见的架构包括卷积神经网络、生成对抗网络和U-Nets。峰值信噪比(PSNR)为22.69 ~ 56.87 dB,结构相似性指数(SSIM)为0.38 ~ 1.00,平均绝对误差(MAE)为1.37 ~ 72.00%。全身应用的频率较低(9/34,26.5%),但显示出肿瘤成像的改善,特别是肿瘤检测和图像质量。尽管取得了有希望的进展,但挑战仍然存在,包括有限的数据可用性、示踪剂摄取的可变性以及缺乏标准化的评估指标。缺乏大型/多中心数据集限制了研究结果的普遍性。结论:本综述强调了使用深度学习的合成PET成像的有希望的进展,有几项研究证明了高质量图像生成和大大减少辐射暴露的潜力。这些进展在儿科人群中尤为重要,因为尽量减少辐射剂量对确保患者安全和长期健康至关重要。尽管如此,方法的可变性和有限的临床验证仍然构成了实质性的挑战。未来的研究应优先考虑制定标准化的评估方案,使用更大、更多样化的数据集(包括儿科队列)和全面的真实世界临床验证,以支持合成PET技术安全有效地转化为临床实践。相关声明:基于深度学习的合成PET成像增强了诊断,同时减少了辐射,但需要方法标准化和临床验证才能更广泛地采用。重点:深度学习可以用更少的辐射暴露创建全剂量PET图像。神经学应用主导合成PET研究,保持必要的诊断细节。挑战包括有限的数据集和示踪剂摄取的可变性,需要进一步的进展。
{"title":"Deep learning for synthetic PET imaging: a systematic mapping review of techniques, metrics, and clinical relevance.","authors":"Maria Vaccaro, Enrico Rosa, Elisa Placidi, Alessia Guarnera, Aurelio Secinaro, Carlo Gandolfo, Maria Carmen Garganese, Antonio Napolitano","doi":"10.1186/s41747-025-00651-5","DOIUrl":"10.1186/s41747-025-00651-5","url":null,"abstract":"&lt;p&gt;&lt;strong&gt;Background: &lt;/strong&gt;Synthetic positron emission tomography (PET) imaging, enabled by deep learning, represents a promising approach to minimize radiation exposure while preserving diagnostic accuracy. However, variability in methodologies, performance metrics, and clinical applications needs to be assessed. This systematic mapping review examines the current state of research in synthetic PET generation, analyzing their methodological frameworks and evaluating the clinical relevance.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Materials and methods: &lt;/strong&gt;A systematic search in Scopus, PubMed, and Google Scholar (2019-2024) identified peer-reviewed studies on deep learning-based synthetic PET. Review articles, conference abstracts, and inaccessible full texts were excluded. Data extraction covered study characteristics, imaging modalities, architectures, and evaluation metrics. Due to study heterogeneity, the risk of bias was not formally assessed. Results were synthesized through descriptive and quantitative analysis.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Results: &lt;/strong&gt;Of the initial 116 studies retrieved, 34 were included, 25 of them (73.5%) on brain/neuro using magnetic resonance imaging, computed tomography, or low-dose PET data to generate full-dose or tracer-specific PET. Common architectures included convolutional neural networks, generative adversarial networks, and U-Nets. Peak signal-to-noise ratio (PSNR) ranged 22.69-56.87 dB, structural similarity index measure (SSIM) 0.38-1.00 and mean absolute error (MAE) 1.37-72.00%. Whole-body applications were less frequent (9/34, 26.5%) but showed improvements in oncologic imaging, in particular for tumor detection and image quality. Despite promising advancements, challenges remain, including limited data availability, variability in tracer uptake, and the lack of standardized evaluation metrics. The absence of large/multicenter datasets limits the generalizability of findings.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Conclusions: &lt;/strong&gt;This review highlights promising advancements in synthetic PET imaging using deep learning, with several studies demonstrating the potential for high-quality image generation and substantially reduced radiation exposure. These developments are particularly significant in pediatric populations, where minimizing radiation dose is crucial to ensure patient safety and long-term health. Nonetheless, methodological variability and limited clinical validation continue to pose substantial challenges. Future research should prioritize the development of standardized evaluation protocols, the use of larger and more diverse datasets-including pediatric cohorts-and comprehensive real-world clinical validation to support the safe and effective translation of synthetic PET techniques into clinical practice.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Relevance statement: &lt;/strong&gt;Deep learning-based synthetic PET imaging enhances diagnostics while reducing radiation, but requires methodological standardization and clinical validation for broader adoption.&lt;/p&gt;&lt;p&gt;&lt;","PeriodicalId":36926,"journal":{"name":"European Radiology Experimental","volume":"10 1","pages":"12"},"PeriodicalIF":3.6,"publicationDate":"2026-02-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12886704/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146150846","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Impact of ischemia duration on MRI-derived perfusion parameters in a mouse kidney transplant model. 缺血时间对小鼠肾移植模型mri衍生灌注参数的影响。
IF 3.6 Q1 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING Pub Date : 2026-02-04 DOI: 10.1186/s41747-025-00675-x
Felix L Herr, Sandra Kloiber-Langhorst, Ming Ming Li, Olaf Dietrich, Robert Erdelkamp, Christoph Walz, Severin Jacobi, Ulrich Wirth, Jens Ricke, Clemens C Cyran, Joachim Andrassy

Objectives: Cold ischemia during kidney transplantation induces ischemia-reperfusion injury with endothelial dysfunction, capillary leak, and impaired perfusion. Its duration critically determines graft outcome. Dynamic contrast-enhanced magnetic resonance imaging (DCE-MRI) enables noninvasive assessment of renal microcirculation and may indicate ischemic injury. We evaluated the impact of ischemia duration on DCE-MRI-derived perfusion parameters in renal transplants in mice.

Materials and methods: Procedures were approved by the local institutional animal care and use committee. A total of 15 C57BL/6 mice underwent kidney transplantation and were assigned to a short or prolonged cold ischemia group. DCE-MRI was performed to assess renal perfusion. Imaging was conducted at a mean of 268 ± 30 days (mean ± standard deviation) after transplantation. Perfusion parameters were calculated using the Patlak model, which provides the plasma volume fraction (vp), reflecting renal blood volume and perfusion, and the volume transfer constant (Ktrans), characterizing the rate of contrast agent extravasation from capillaries into the extravascular extracellular space.

Results: Significant differences were observed in the Ktrans parameter of transplanted kidneys between groups. The median Ktrans (mL/100 mL/min) was significantly higher in the 16-h group (2.87, interquartile range 2.45-3.03) versus the 30-min group (0.91, 0.90-1.42; p = 0.008). Median vp (mL/100 mL/min) was non-significantly lower in the 16-h group (21.89, 17.28-23.22) versus the 30-min group (29.02, 24.99-37.15; p = 0.151).

Conclusion: Cold ischemia with 16-h duration was associated with significantly higher Ktrans values in kidney transplants, reflecting significantly increased vascular permeability. DCE-MRI provides a sensitive tool for detecting ischemia-induced microvascular dysfunction.

Relevance statement: Quantitative DCE-MRI detects microvascular injury after 16-h cold ischemia in kidney transplants in mice, supporting its potential as a noninvasive tool to assess graft integrity and guide interventions aimed at improving long-term transplant outcomes.

Key points: The duration of ischemia critically affects endothelial integrity and perfusion characteristics in a mouse kidney transplant model. Prolonged 16-h ischemia leads to increased vascular permeability, indicating more severe endothelial and microcirculatory injury in transplanted kidneys. DCE-MRI enables sensitive detection of subtle ischemia-related microvascular alterations, supporting its value for noninvasive graft assessment.

目的:肾移植冷缺血引起缺血再灌注损伤,并伴有内皮功能障碍、毛细血管渗漏和灌注损伤。它的持续时间决定了移植的结果。动态对比增强磁共振成像(DCE-MRI)能够对肾脏微循环进行无创评估,并可能提示缺血性损伤。我们评估了缺血持续时间对小鼠肾移植dce - mri衍生灌注参数的影响。材料和方法:程序经当地机构动物护理和使用委员会批准。15只接受肾移植的C57BL/6小鼠分为短时间和长时间冷缺血组。DCE-MRI评估肾灌注。移植后平均268±30天(平均±标准差)进行影像学检查。灌注参数使用Patlak模型计算,该模型提供血浆体积分数(vp),反映肾血容量和灌注,以及体积传递常数(Ktrans),表征造影剂从毛细血管外渗到血管外细胞间隙的速率。结果:两组移植肾Ktrans参数差异有统计学意义。16小时组的中位Ktrans (mL/100 mL/min)显著高于30分钟组(0.91,0.90-1.42;p = 0.008)(2.87,四分位数范围2.45-3.03)。16小时组的中位vp (mL/100 mL/min)(21.89, 17.28-23.22)比30分钟组(29.02,24.99-37.15,p = 0.151)无显著降低。结论:肾移植16 h冷缺血后Ktrans值显著升高,反映血管通透性显著增加。DCE-MRI为检测缺血引起的微血管功能障碍提供了一种灵敏的工具。相关声明:定量DCE-MRI检测小鼠肾移植16小时冷缺血后的微血管损伤,支持其作为评估移植物完整性和指导旨在改善长期移植结果的干预措施的无创工具的潜力。缺血持续时间严重影响小鼠肾移植模型内皮细胞的完整性和灌注特性。延长16小时缺血导致血管通透性增加,表明移植肾内皮和微循环损伤更为严重。DCE-MRI能够灵敏地检测细微的缺血相关微血管改变,支持其在无创移植评估中的价值。
{"title":"Impact of ischemia duration on MRI-derived perfusion parameters in a mouse kidney transplant model.","authors":"Felix L Herr, Sandra Kloiber-Langhorst, Ming Ming Li, Olaf Dietrich, Robert Erdelkamp, Christoph Walz, Severin Jacobi, Ulrich Wirth, Jens Ricke, Clemens C Cyran, Joachim Andrassy","doi":"10.1186/s41747-025-00675-x","DOIUrl":"10.1186/s41747-025-00675-x","url":null,"abstract":"<p><strong>Objectives: </strong>Cold ischemia during kidney transplantation induces ischemia-reperfusion injury with endothelial dysfunction, capillary leak, and impaired perfusion. Its duration critically determines graft outcome. Dynamic contrast-enhanced magnetic resonance imaging (DCE-MRI) enables noninvasive assessment of renal microcirculation and may indicate ischemic injury. We evaluated the impact of ischemia duration on DCE-MRI-derived perfusion parameters in renal transplants in mice.</p><p><strong>Materials and methods: </strong>Procedures were approved by the local institutional animal care and use committee. A total of 15 C57BL/6 mice underwent kidney transplantation and were assigned to a short or prolonged cold ischemia group. DCE-MRI was performed to assess renal perfusion. Imaging was conducted at a mean of 268 ± 30 days (mean ± standard deviation) after transplantation. Perfusion parameters were calculated using the Patlak model, which provides the plasma volume fraction (v<sub>p</sub>), reflecting renal blood volume and perfusion, and the volume transfer constant (K<sup>trans</sup>), characterizing the rate of contrast agent extravasation from capillaries into the extravascular extracellular space.</p><p><strong>Results: </strong>Significant differences were observed in the K<sup>trans</sup> parameter of transplanted kidneys between groups. The median K<sup>trans</sup> (mL/100 mL/min) was significantly higher in the 16-h group (2.87, interquartile range 2.45-3.03) versus the 30-min group (0.91, 0.90-1.42; p = 0.008). Median v<sub>p</sub> (mL/100 mL/min) was non-significantly lower in the 16-h group (21.89, 17.28-23.22) versus the 30-min group (29.02, 24.99-37.15; p = 0.151).</p><p><strong>Conclusion: </strong>Cold ischemia with 16-h duration was associated with significantly higher K<sup>trans</sup> values in kidney transplants, reflecting significantly increased vascular permeability. DCE-MRI provides a sensitive tool for detecting ischemia-induced microvascular dysfunction.</p><p><strong>Relevance statement: </strong>Quantitative DCE-MRI detects microvascular injury after 16-h cold ischemia in kidney transplants in mice, supporting its potential as a noninvasive tool to assess graft integrity and guide interventions aimed at improving long-term transplant outcomes.</p><p><strong>Key points: </strong>The duration of ischemia critically affects endothelial integrity and perfusion characteristics in a mouse kidney transplant model. Prolonged 16-h ischemia leads to increased vascular permeability, indicating more severe endothelial and microcirculatory injury in transplanted kidneys. DCE-MRI enables sensitive detection of subtle ischemia-related microvascular alterations, supporting its value for noninvasive graft assessment.</p>","PeriodicalId":36926,"journal":{"name":"European Radiology Experimental","volume":"10 1","pages":"11"},"PeriodicalIF":3.6,"publicationDate":"2026-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12873054/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146120229","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Visualization of gadolinium transport across the blood-brain barrier along perivascular clearance pathways. 钆沿血管周围清除通路穿越血脑屏障的可视化。
IF 3.6 Q1 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING Pub Date : 2026-02-04 DOI: 10.1186/s41747-025-00672-0
Svea Seehafer, Yvonne Mrosek, Lars-Patrick Schmill, Schekeb Aludin, Olav Jansen, Carl Alexander Gless, Johanna Rümenapp, Naomi Larsen

Objective: We investigated the transport of gadolinium-based contrast agent (GBCA) across the blood-brain barrier (BBB) along the perivascular spaces as part of the glymphatic drainage in patients with iatrogenic BBB disruption following digital subtraction angiography (DSA).

Materials and methods: A retrospective analysis was conducted on patients who underwent DSA for diagnosis and/or treatment of intracranial aneurysms and received a 3-T magnetic resonance imaging (MRI) within the following day. Exclusion criteria included states with a suggested impairment of BBB integrity, such as neurodegenerative diseases or suspected glymphatic impairment. BBB disruption was assessed using a pre- and post-contrast three-dimensional T1-weighted volume-isotropic turbo spin-echo sequence. Patterns of GBCA distributions were described. The localization of GBCA-extravasation was correlated with perivascular spaces visualized on the coregistered T2-weighted sequences. Fisher's exact test and logistic regression were used.

Results: Out of 43 patients, 30 (69.8%) exhibited visible BBB disruption. BBB disruption was significantly more often observed after therapeutic DSA (p = 0.004). GBCA-enhancement patterns indicated a localized pial enhancement in 96.7% of affected patients, with additional parenchymal enhancement along the perivascular spaces in 56.7%. Enhancement was predominantly located in the downstream territories of probed vessels, suggesting a potential association with glymphatic transport. An illustrative case with serial MRI examinations is presented, demonstrating time-dependent GBCA-enhancement patterns.

Conclusion: The study provides in vivo evidence of GBCA transport patterns following iatrogenic BBB disruption, which may correspond to parts of the proposed glymphatic pathways. Our results indicate a sequential progression of contrast enhancement, initially manifesting at the brain surface and subsequently extending along perivascular spaces to the subarachnoid space.

Relevance statement: Understanding BBB disruption and glymphatic transport with MRI imaging methods may improve neurovascular disease management.

Key points: BBB disruption post-DSA may facilitate GBCA transport via glymphatic pathways, offering novel and hypothesis-generating insights into brain clearance mechanisms. GBCA enhancement followed a chronological and spatial pattern, suggesting an organized cerebrospinal-interstitial exchange system relevant for brain clearance. Findings highlight potential implications for BBB integrity in neurovascular health with prospective implications for diagnostic imaging.

目的:我们研究了钆基造影剂(GBCA)在医源性血脑屏障(BBB)破裂的患者中沿血管周围间隙作为淋巴引流的一部分通过血脑屏障(BBB)的运输。材料和方法:回顾性分析经DSA诊断和/或治疗颅内动脉瘤并于次日行3-T磁共振成像(MRI)的患者。排除标准包括提示血脑屏障完整性受损的状态,如神经退行性疾病或可疑的淋巴损伤。采用对比前和对比后的三维t1加权体积各向同性涡轮自旋回波序列评估血脑屏障破坏。描述了GBCA的分布规律。在共登记的t2加权序列上显示,gbca外渗的定位与血管周围空间相关。采用Fisher精确检验和logistic回归。结果:43例患者中,30例(69.8%)出现明显血脑屏障破坏。治疗性DSA后血脑屏障破坏明显增加(p = 0.004)。在96.7%的患者中,gbca增强模式显示局部颅底增强,56.7%的患者伴有沿血管周围间隙的附加实质增强。增强主要位于探测血管的下游区域,提示可能与淋巴运输有关。一个说明性的病例与系列MRI检查提出,证明时间依赖的gbca增强模式。结论:该研究提供了医源性血脑屏障破坏后GBCA转运模式的体内证据,这可能与所提出的部分淋巴通路相对应。我们的研究结果表明,造影剂增强的顺序发展,最初表现在脑表面,随后沿着血管周围空间延伸到蛛网膜下腔。相关性声明:通过MRI成像方法了解血脑屏障破坏和淋巴转运可能改善神经血管疾病的管理。要点:dsa后血脑屏障破坏可能促进GBCA通过淋巴通路运输,为大脑清除机制提供新的和假设生成的见解。GBCA增强遵循时间和空间模式,提示有组织的脑脊髓-间质交换系统与脑清除有关。研究结果强调了血脑屏障完整性在神经血管健康中的潜在意义,并对诊断成像具有前瞻性意义。
{"title":"Visualization of gadolinium transport across the blood-brain barrier along perivascular clearance pathways.","authors":"Svea Seehafer, Yvonne Mrosek, Lars-Patrick Schmill, Schekeb Aludin, Olav Jansen, Carl Alexander Gless, Johanna Rümenapp, Naomi Larsen","doi":"10.1186/s41747-025-00672-0","DOIUrl":"10.1186/s41747-025-00672-0","url":null,"abstract":"<p><strong>Objective: </strong>We investigated the transport of gadolinium-based contrast agent (GBCA) across the blood-brain barrier (BBB) along the perivascular spaces as part of the glymphatic drainage in patients with iatrogenic BBB disruption following digital subtraction angiography (DSA).</p><p><strong>Materials and methods: </strong>A retrospective analysis was conducted on patients who underwent DSA for diagnosis and/or treatment of intracranial aneurysms and received a 3-T magnetic resonance imaging (MRI) within the following day. Exclusion criteria included states with a suggested impairment of BBB integrity, such as neurodegenerative diseases or suspected glymphatic impairment. BBB disruption was assessed using a pre- and post-contrast three-dimensional T1-weighted volume-isotropic turbo spin-echo sequence. Patterns of GBCA distributions were described. The localization of GBCA-extravasation was correlated with perivascular spaces visualized on the coregistered T2-weighted sequences. Fisher's exact test and logistic regression were used.</p><p><strong>Results: </strong>Out of 43 patients, 30 (69.8%) exhibited visible BBB disruption. BBB disruption was significantly more often observed after therapeutic DSA (p = 0.004). GBCA-enhancement patterns indicated a localized pial enhancement in 96.7% of affected patients, with additional parenchymal enhancement along the perivascular spaces in 56.7%. Enhancement was predominantly located in the downstream territories of probed vessels, suggesting a potential association with glymphatic transport. An illustrative case with serial MRI examinations is presented, demonstrating time-dependent GBCA-enhancement patterns.</p><p><strong>Conclusion: </strong>The study provides in vivo evidence of GBCA transport patterns following iatrogenic BBB disruption, which may correspond to parts of the proposed glymphatic pathways. Our results indicate a sequential progression of contrast enhancement, initially manifesting at the brain surface and subsequently extending along perivascular spaces to the subarachnoid space.</p><p><strong>Relevance statement: </strong>Understanding BBB disruption and glymphatic transport with MRI imaging methods may improve neurovascular disease management.</p><p><strong>Key points: </strong>BBB disruption post-DSA may facilitate GBCA transport via glymphatic pathways, offering novel and hypothesis-generating insights into brain clearance mechanisms. GBCA enhancement followed a chronological and spatial pattern, suggesting an organized cerebrospinal-interstitial exchange system relevant for brain clearance. Findings highlight potential implications for BBB integrity in neurovascular health with prospective implications for diagnostic imaging.</p>","PeriodicalId":36926,"journal":{"name":"European Radiology Experimental","volume":"10 1","pages":"10"},"PeriodicalIF":3.6,"publicationDate":"2026-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12873042/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146120243","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep learning reconstruction accelerated reduced field-of-view DWI in rectal cancer: mucosa-submucosa-muscularis visualization and T staging. 深度学习重建加速了直肠癌视野DWI的降低:粘膜-粘膜下-肌肉层可视化和T分期。
IF 3.6 Q1 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING Pub Date : 2026-01-26 DOI: 10.1186/s41747-025-00667-x
Wenjing Peng, Fan Yang, Diliang Li, Rui Zhao, Lijuan Wan, Shuang Chen, Xiangchun Liu, Sicong Wang, Yuanlong Li, Min Li, Yuan Liu, Hongmei Zhang
<p><strong>Objective: </strong>We compared the image quality and diagnostic performance of deep learning reconstruction (DLR) accelerated reduced field-of-view (rFOV<sub>DL</sub>) diffusion-weighted imaging (DWI) with standard-reconstructed full field-of-view (fFOV<sub>STA</sub>) DWI in rectal cancer.</p><p><strong>Materials and methods: </strong>This prospective study enrolled 173 participants with biopsy-confirmed rectal adenocarcinoma from November 2022 to August 2023 undergoing rFOV<sub>DL</sub> and fFOV<sub>STA</sub> DWI scans. Two radiologists evaluated qualitative image quality, objective image quality, and apparent diffusion coefficient (ADC) independently. T and N staging were evaluated in 94 participants undergoing radical surgery. Diagnostic sensitivity, specificity, and accuracy were calculated using histopathologic results as the gold standard. ADC values were analyzed for correlations with histopathologic staging.</p><p><strong>Results: </strong>We observed that rFOV<sub>DL</sub> DWI reduced acquisition time by 30% compared to fFOV<sub>STA</sub> DWI. rFOV<sub>DL</sub> DWI outperformed fFOV<sub>STA</sub> DWI in all qualitative image quality metrics (p ≤ 0.013), especially in mucosa-submucosa-muscularis visualization, spatial resolution, overall image quality, and diagnostic confidence, accompanied by comparable objective image quality (p ≥ 0.054). When applied with T2-weighted imaging, rFOV<sub>DL</sub> DWI significantly enhanced primary T-staging accuracy than fFOV<sub>STA</sub> DWI (p < 0.001), especially for early-stage tumors (T1 or T2). Tumor ADC values of rFOV<sub>DL</sub> DWI were lower than those of fFOV<sub>STA</sub> DWI, yet remained solid inverse correlations with histopathologic T-staging (p < 0.001). Higher inter-reader agreements of locoregional staging and ADC measurements were obtained by rFOV<sub>DL</sub> DWI.</p><p><strong>Conclusion: </strong>rFOV<sub>DL</sub> DWI significantly improved image quality than fFOV<sub>STA</sub> DWI, with a 30% reduced acquisition time. rFOV<sub>DL</sub> DWI facilitated higher primary T-staging accuracy, especially for early-stage rectal cancer (T1-T2).</p><p><strong>Relevance statement: </strong>Reduced acquisition time and improved imaging quality highlighted the clinical feasibility of applying DLR to rFOV DWI. rFOV<sub>DL</sub> DWI could significantly enhance primary T-staging accuracy, especially for early-stage rectal cancer (T1-T2), facilitating more precise treatment management.</p><p><strong>Key points: </strong>Applying deep learning reconstruction (DLR) to reduced field-of-view (rFOV) diffusion-weighted imaging (DWI) improved mucosa-submucosa-muscularis visualization and reduced acquisition time. DLR-based rFOV DWI significantly enhanced primary T-staging accuracy for rectal cancer, especially for early-stage tumors (T1 or T2). DLR-based rFOV DWI facilitated higher inter-reader agreements for locoregional staging and apparent diffusion coefficient measurement in rectal cancer
目的:比较深度学习重建(DLR)加速缩小视场(rFOVDL)扩散加权成像(DWI)与标准重建全视场(fFOVSTA) DWI在直肠癌诊断中的图像质量和诊断性能。材料和方法:这项前瞻性研究招募了173名在2022年11月至2023年8月期间接受rFOVDL和fFOVSTA DWI扫描的活检证实的直肠腺癌患者。两名放射科医生独立评估定性图像质量、客观图像质量和表观扩散系数(ADC)。对94名接受根治性手术的患者进行T和N分期评估。以组织病理学结果为金标准计算诊断敏感性、特异性和准确性。分析ADC值与组织病理分期的相关性。结果:我们观察到rFOVDL DWI比fFOVSTA DWI减少了30%的采集时间。rFOVDL DWI在所有定性图像质量指标上都优于fFOVSTA DWI (p≤0.013),特别是在粘膜-粘膜下-肌肉层可视化、空间分辨率、整体图像质量和诊断置信度方面,并伴有相当的客观图像质量(p≥0.054)。当与t2加权成像应用时,rFOVDL DWI比fFOVSTA DWI显著提高了原发性t分期准确性(p DL DWI低于fFOVSTA DWI),但与组织病理学t分期(p DL DWI)保持可靠的负相关。结论:rFOVDL DWI较fFOVSTA DWI显著改善图像质量,采集时间缩短30%。rFOVDL DWI具有更高的原发性t分期准确性,特别是对于早期直肠癌(T1-T2)。相关性声明:减少了采集时间,提高了成像质量,突出了DLR应用于rFOV DWI的临床可行性。rFOVDL DWI可显著提高原发性t分期准确性,尤其是对早期直肠癌(T1-T2),有助于更精准的治疗管理。重点:将深度学习重建(DLR)应用于缩小视场(rFOV)扩散加权成像(DWI)可以改善粘膜-粘膜下-肌肉层的可视化并缩短采集时间。基于dlr的rFOV DWI可显著提高直肠癌的原发性t分期准确性,尤其是早期肿瘤(T1或T2)。基于dlr的rFOV DWI有助于提高直肠癌局部分期和表观扩散系数测量的读者间一致性。
{"title":"Deep learning reconstruction accelerated reduced field-of-view DWI in rectal cancer: mucosa-submucosa-muscularis visualization and T staging.","authors":"Wenjing Peng, Fan Yang, Diliang Li, Rui Zhao, Lijuan Wan, Shuang Chen, Xiangchun Liu, Sicong Wang, Yuanlong Li, Min Li, Yuan Liu, Hongmei Zhang","doi":"10.1186/s41747-025-00667-x","DOIUrl":"10.1186/s41747-025-00667-x","url":null,"abstract":"&lt;p&gt;&lt;strong&gt;Objective: &lt;/strong&gt;We compared the image quality and diagnostic performance of deep learning reconstruction (DLR) accelerated reduced field-of-view (rFOV&lt;sub&gt;DL&lt;/sub&gt;) diffusion-weighted imaging (DWI) with standard-reconstructed full field-of-view (fFOV&lt;sub&gt;STA&lt;/sub&gt;) DWI in rectal cancer.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Materials and methods: &lt;/strong&gt;This prospective study enrolled 173 participants with biopsy-confirmed rectal adenocarcinoma from November 2022 to August 2023 undergoing rFOV&lt;sub&gt;DL&lt;/sub&gt; and fFOV&lt;sub&gt;STA&lt;/sub&gt; DWI scans. Two radiologists evaluated qualitative image quality, objective image quality, and apparent diffusion coefficient (ADC) independently. T and N staging were evaluated in 94 participants undergoing radical surgery. Diagnostic sensitivity, specificity, and accuracy were calculated using histopathologic results as the gold standard. ADC values were analyzed for correlations with histopathologic staging.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Results: &lt;/strong&gt;We observed that rFOV&lt;sub&gt;DL&lt;/sub&gt; DWI reduced acquisition time by 30% compared to fFOV&lt;sub&gt;STA&lt;/sub&gt; DWI. rFOV&lt;sub&gt;DL&lt;/sub&gt; DWI outperformed fFOV&lt;sub&gt;STA&lt;/sub&gt; DWI in all qualitative image quality metrics (p ≤ 0.013), especially in mucosa-submucosa-muscularis visualization, spatial resolution, overall image quality, and diagnostic confidence, accompanied by comparable objective image quality (p ≥ 0.054). When applied with T2-weighted imaging, rFOV&lt;sub&gt;DL&lt;/sub&gt; DWI significantly enhanced primary T-staging accuracy than fFOV&lt;sub&gt;STA&lt;/sub&gt; DWI (p &lt; 0.001), especially for early-stage tumors (T1 or T2). Tumor ADC values of rFOV&lt;sub&gt;DL&lt;/sub&gt; DWI were lower than those of fFOV&lt;sub&gt;STA&lt;/sub&gt; DWI, yet remained solid inverse correlations with histopathologic T-staging (p &lt; 0.001). Higher inter-reader agreements of locoregional staging and ADC measurements were obtained by rFOV&lt;sub&gt;DL&lt;/sub&gt; DWI.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Conclusion: &lt;/strong&gt;rFOV&lt;sub&gt;DL&lt;/sub&gt; DWI significantly improved image quality than fFOV&lt;sub&gt;STA&lt;/sub&gt; DWI, with a 30% reduced acquisition time. rFOV&lt;sub&gt;DL&lt;/sub&gt; DWI facilitated higher primary T-staging accuracy, especially for early-stage rectal cancer (T1-T2).&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Relevance statement: &lt;/strong&gt;Reduced acquisition time and improved imaging quality highlighted the clinical feasibility of applying DLR to rFOV DWI. rFOV&lt;sub&gt;DL&lt;/sub&gt; DWI could significantly enhance primary T-staging accuracy, especially for early-stage rectal cancer (T1-T2), facilitating more precise treatment management.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Key points: &lt;/strong&gt;Applying deep learning reconstruction (DLR) to reduced field-of-view (rFOV) diffusion-weighted imaging (DWI) improved mucosa-submucosa-muscularis visualization and reduced acquisition time. DLR-based rFOV DWI significantly enhanced primary T-staging accuracy for rectal cancer, especially for early-stage tumors (T1 or T2). DLR-based rFOV DWI facilitated higher inter-reader agreements for locoregional staging and apparent diffusion coefficient measurement in rectal cancer","PeriodicalId":36926,"journal":{"name":"European Radiology Experimental","volume":"10 1","pages":"8"},"PeriodicalIF":3.6,"publicationDate":"2026-01-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12834858/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146054166","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Phantom-based performance comparison of two commercial deep learning CT reconstruction algorithms with super- and normal-resolution settings. 两种商业深度学习CT重建算法在超分辨率和正常分辨率设置下基于幻影的性能比较。
IF 3.6 Q1 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING Pub Date : 2026-01-26 DOI: 10.1186/s41747-025-00670-2
Joël Greffier, Catherine Roy, Djamel Dabli, Jean-Paul Beregi, Maxime Pastor

Objective: We compared a super-resolution deep learning image reconstruction (SR-DLR) algorithm with a normal-resolution (NR)-DLR algorithm according to radiation dose for abdominal computed tomography (CT).

Materials and methods: An image-quality phantom was scanned with an energy-integrating detectors CT unit at three volume CT dose index radiation dose levels (12.7, 5.9, and 3 mGy). Images were reconstructed using a 1,0242 matrix for SR-DLR and a 5122 matrix for NR-DLR, for three DLR levels (level-1, level-2, and level-3). Noise power spectrum (NPS) and task-based transfer function (TTF) for iodine and Solid Water® inserts were computed; TTF values at 50% (f50, mm-1) were used to quantify spatial resolution. The detectability index (d') was computed for two simulated lesions.

Results: Noise magnitude values were lower with SR-DLR than with NR-DLR for level-2 (-27.6 ± 3.8%) and level-3 (-43.5 ± 1.4%), the opposite for level-1. Average NPS spatial frequency was higher with SR-DLR than with NR-DLR for all radiation dose levels for level-1 (55.9 ± 16.7%) and level-2 (20.1 ± 13.9%) and the opposite for level-3, except at 12.7 mGy. For both inserts, f50 was higher with SR-DLR than with NR-DLR at each radiation dose and DLR level. For simulated lesions and all DLR levels, d' values were higher with SR-DLR than with NR-DLR (level-1, 6.0 ± 2.0%; level-2, 45.7 ± 5.0%; level-3, 75.2 ± 7.3%).

Conclusion: Compared to NR-DLR, SR-DLR improved spatial resolution and detectability of simulated abdominal lesions; image noise was reduced with SR-DLR only for level-2 and level-3, while image texture was better for level-1 and level-2.

Relevance statement: Super-resolution DLR with a 1,0242 matrix size improved spatial resolution and detectability of simulated abdominal lesions compared to normal-resolution DLR. Validation in clinical settings is necessary before translation into routine practice.

Key points: The performance of a new deep learning super-resolution image reconstruction algorithm (SR-DLR) was compared to a normal-resolution (NR)-DLR algorithm using an image-quality phantom for an abdominal energy-integrating detector CT protocol. SR-DLR with a 1,0242 matrix improved spatial resolution and detectability of simulated abdominal lesions compared to NR-DLR with a 5122 matrix. Using SR-DLR, therefore, presents numerous prospects for improving abdominal CT images and a high potential for reducing the radiation doses.

目的:比较基于辐射剂量的超分辨率深度学习图像重建(SR-DLR)算法与正常分辨率深度学习图像重建(NR -DLR)算法。材料和方法:使用能量积分检测器CT单元在三种体积CT剂量指数辐射剂量水平(12.7、5.9和3 mGy)下扫描图像质量幻象。对3个DLR级别(1级、2级和3级)的SR-DLR和NR-DLR分别使用10242矩阵和5122矩阵进行图像重建。计算碘和Solid Water®插入物的噪声功率谱(NPS)和基于任务的传递函数(TTF);使用50% (f50, mm-1)的TTF值来量化空间分辨率。计算了两个模拟病变的可检测指数(d')。结果:SR-DLR的噪声值在2级(-27.6±3.8%)和3级(-43.5±1.4%)低于NR-DLR,在1级(-43.5±1.4%)。除12.7 mGy外,SR-DLR组的平均NPS空间频率高于NR-DLR组,1级(55.9±16.7%)和2级(20.1±13.9%)高于SR-DLR组。对于两种插入物,在各辐射剂量和DLR水平下,SR-DLR组的f50均高于NR-DLR组。对于模拟病变和所有DLR级别,SR-DLR的d′值高于NR-DLR(1级,6.0±2.0%;2级,45.7±5.0%;3级,75.2±7.3%)。结论:与NR-DLR相比,SR-DLR提高了模拟腹部病变的空间分辨率和可检出性;SR-DLR仅对2级和3级图像降噪,对1级和2级图像纹理效果较好。相关性声明:与正常分辨率DLR相比,10242矩阵尺寸的超分辨率DLR提高了模拟腹部病变的空间分辨率和可检测性。在转化为常规实践之前,必须在临床环境中进行验证。重点:将一种新的深度学习超分辨率图像重建算法(SR-DLR)的性能与正常分辨率(NR)-DLR算法进行比较,该算法使用图像质量幻象用于腹部能量积分检测器CT协议。与采用5122矩阵的NR-DLR相比,采用10242矩阵的SR-DLR提高了模拟腹部病变的空间分辨率和可检出性。因此,使用SR-DLR在改善腹部CT图像和降低辐射剂量方面具有很大的潜力。
{"title":"Phantom-based performance comparison of two commercial deep learning CT reconstruction algorithms with super- and normal-resolution settings.","authors":"Joël Greffier, Catherine Roy, Djamel Dabli, Jean-Paul Beregi, Maxime Pastor","doi":"10.1186/s41747-025-00670-2","DOIUrl":"10.1186/s41747-025-00670-2","url":null,"abstract":"<p><strong>Objective: </strong>We compared a super-resolution deep learning image reconstruction (SR-DLR) algorithm with a normal-resolution (NR)-DLR algorithm according to radiation dose for abdominal computed tomography (CT).</p><p><strong>Materials and methods: </strong>An image-quality phantom was scanned with an energy-integrating detectors CT unit at three volume CT dose index radiation dose levels (12.7, 5.9, and 3 mGy). Images were reconstructed using a 1,024<sup>2</sup> matrix for SR-DLR and a 512<sup>2</sup> matrix for NR-DLR, for three DLR levels (level-1, level-2, and level-3). Noise power spectrum (NPS) and task-based transfer function (TTF) for iodine and Solid Water<sup>®</sup> inserts were computed; TTF values at 50% (f<sub>50</sub>, mm<sup>-1</sup>) were used to quantify spatial resolution. The detectability index (d') was computed for two simulated lesions.</p><p><strong>Results: </strong>Noise magnitude values were lower with SR-DLR than with NR-DLR for level-2 (-27.6 ± 3.8%) and level-3 (-43.5 ± 1.4%), the opposite for level-1. Average NPS spatial frequency was higher with SR-DLR than with NR-DLR for all radiation dose levels for level-1 (55.9 ± 16.7%) and level-2 (20.1 ± 13.9%) and the opposite for level-3, except at 12.7 mGy. For both inserts, f<sub>50</sub> was higher with SR-DLR than with NR-DLR at each radiation dose and DLR level. For simulated lesions and all DLR levels, d' values were higher with SR-DLR than with NR-DLR (level-1, 6.0 ± 2.0%; level-2, 45.7 ± 5.0%; level-3, 75.2 ± 7.3%).</p><p><strong>Conclusion: </strong>Compared to NR-DLR, SR-DLR improved spatial resolution and detectability of simulated abdominal lesions; image noise was reduced with SR-DLR only for level-2 and level-3, while image texture was better for level-1 and level-2.</p><p><strong>Relevance statement: </strong>Super-resolution DLR with a 1,024<sup>2</sup> matrix size improved spatial resolution and detectability of simulated abdominal lesions compared to normal-resolution DLR. Validation in clinical settings is necessary before translation into routine practice.</p><p><strong>Key points: </strong>The performance of a new deep learning super-resolution image reconstruction algorithm (SR-DLR) was compared to a normal-resolution (NR)-DLR algorithm using an image-quality phantom for an abdominal energy-integrating detector CT protocol. SR-DLR with a 1,024<sup>2</sup> matrix improved spatial resolution and detectability of simulated abdominal lesions compared to NR-DLR with a 512<sup>2</sup> matrix. Using SR-DLR, therefore, presents numerous prospects for improving abdominal CT images and a high potential for reducing the radiation doses.</p>","PeriodicalId":36926,"journal":{"name":"European Radiology Experimental","volume":"10 1","pages":"9"},"PeriodicalIF":3.6,"publicationDate":"2026-01-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12834881/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146047216","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An endovascular porcine model of abdominal aortic aneurysm for interventional radiology research. 猪腹主动脉瘤血管内模型的介入放射学研究。
IF 3.6 Q1 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING Pub Date : 2026-01-26 DOI: 10.1186/s41747-025-00673-z
Marie-Luise Helene Hildegard Ranner-Hafferl, Dilyana Branimirova Mangarova, Jennifer Mein, Jennifer Lilly Heyl, Jana Möckel, Dirk Schnapauff, Timo Alexander Auer, Federico Collettini, Jan Ole Kaufmann, Lisa Christine Adams, Marcus Richard Makowski, Bernd Hamm, Avan Kader, Julia Brangsch

Objective: Abdominal aortic aneurysm (AAA) remains a life-threatening condition with few large-animal disease models. We aimed to develop a fully endovascular porcine AAA model for radiology research, reducing surgical trauma and improving reproducibility versus laparotomy-based models.

Materials and methods: Fourteen female German Landrace swine (n = 14, 30-40 kg) underwent angiography-guided intervention. The animals' infrarenal aorta was dilated by ~30% via balloon catheter, then collagenase (6,000 IU), elastase (500 IU), and 25% calcium chloride (0.5 mL) were locally incubated to weaken the vessel wall. Eight animals were included in the study; group 1 (n = 4) was euthanized at 2 weeks, and group 2 (n = 4) at 4 weeks. Aortic diameter was measured weekly by ultrasound; ex vivo histology, immunofluorescence, and western blot assessed remodeling and inflammation.

Results: Progressive aneurysm expansion was observed, with diameters of 1.32 ± 0.08 cm (mean ± standard deviation) at 1 week post-intervention, 1.59 ± 0.06 cm at 2 weeks, 1.81 ± 0.10 cm at 3 weeks, and 1.94 ± 0.19 cm at 4 weeks (baseline: 0.74 ± 0.08 cm; p < 0.001). Experimental groups' macrophages increased (group 1, 15.12 ± 3.88%; group 2, 16.65 ± 5.27%) compared to control (0.66 ± 0.27%, p = 0.012 and p = 0.021, respectively). Vascular smooth muscle cells were reduced across interventional groups (45.97 ± 17.26% versus control 80.94 ± 14.26%, p = 0.005).

Conclusions: This porcine AAA model replicates human disease features with a fully endovascular workflow, offering a valuable platform for evaluation of novel imaging techniques and interventional therapies.

Relevance statement: This study presents a fully endovascular porcine model of abdominal aortic aneurysm for translational research in interventional radiology and imaging. By enabling aneurysm induction entirely through catheter-based techniques, the model could provide a clinically relevant platform for future evaluation of novel endovascular devices and intraluminal therapeutics.

Key points: This study established a fully endovascular, translational porcine model of abdominal aortic aneurysm. The model exhibited a significant mean aneurysmal dilation of about 161% at 4 weeks and 107% at 2 weeks. Serial ultrasound confirmed consistent aneurysm expansion and reproducible growth patterns in surviving animals. Ex vivo analyses demonstrated inflammation and extracellular-matrix damage, mirroring key features of human abdominal aortic aneurysm pathology. This fully catheter-based workflow provides a practical preclinical platform for evaluating imaging techniques and endovascular therapies.

目的:腹主动脉瘤(AAA)仍然是一种危及生命的疾病,很少有大型动物疾病模型。我们的目标是建立一个全血管内猪AAA模型用于放射学研究,减少手术创伤,提高基于剖腹手术的模型的可重复性。材料和方法:14头德国长白猪(n = 14, 30-40 kg)进行血管造影引导干预。经球囊导管扩张动物肾下主动脉约30%,局部孵育胶原酶(6000 IU)、弹性酶(500 IU)、25%氯化钙(0.5 mL)削弱血管壁。8只动物被纳入研究;第1组(n = 4)于2周实施安乐死,第2组(n = 4)于4周实施安乐死。每周超声测量主动脉直径;离体组织学、免疫荧光和western blot评估重塑和炎症。结果:观察到进行性动脉瘤扩张,干预后1周直径为1.32±0.08 cm(平均±标准差),2周直径为1.59±0.06 cm, 3周直径为1.81±0.10 cm, 4周直径为1.94±0.19 cm(基线:0.74±0.08 cm; p)结论:该猪AAA模型具有完整的血管内工作流程,复制了人类疾病特征,为评估新的成像技术和介入治疗提供了有价值的平台。相关声明:本研究提出了一个全血管内猪腹主动脉瘤模型,用于介入放射学和影像学的转化研究。通过完全通过导管技术诱导动脉瘤,该模型可以为未来评估新型血管内装置和腔内治疗提供临床相关平台。本研究建立了全血管内平移猪腹主动脉瘤模型。模型显示出明显的平均动脉瘤扩张,4周时约为161%,2周时约为107%。连续超声证实存活动物的动脉瘤扩张和可复制的生长模式一致。体外分析显示炎症和细胞外基质损伤,反映了人类腹主动脉瘤病理的关键特征。这种完全基于导管的工作流程为评估成像技术和血管内治疗提供了一个实用的临床前平台。
{"title":"An endovascular porcine model of abdominal aortic aneurysm for interventional radiology research.","authors":"Marie-Luise Helene Hildegard Ranner-Hafferl, Dilyana Branimirova Mangarova, Jennifer Mein, Jennifer Lilly Heyl, Jana Möckel, Dirk Schnapauff, Timo Alexander Auer, Federico Collettini, Jan Ole Kaufmann, Lisa Christine Adams, Marcus Richard Makowski, Bernd Hamm, Avan Kader, Julia Brangsch","doi":"10.1186/s41747-025-00673-z","DOIUrl":"10.1186/s41747-025-00673-z","url":null,"abstract":"<p><strong>Objective: </strong>Abdominal aortic aneurysm (AAA) remains a life-threatening condition with few large-animal disease models. We aimed to develop a fully endovascular porcine AAA model for radiology research, reducing surgical trauma and improving reproducibility versus laparotomy-based models.</p><p><strong>Materials and methods: </strong>Fourteen female German Landrace swine (n = 14, 30-40 kg) underwent angiography-guided intervention. The animals' infrarenal aorta was dilated by ~30% via balloon catheter, then collagenase (6,000 IU), elastase (500 IU), and 25% calcium chloride (0.5 mL) were locally incubated to weaken the vessel wall. Eight animals were included in the study; group 1 (n = 4) was euthanized at 2 weeks, and group 2 (n = 4) at 4 weeks. Aortic diameter was measured weekly by ultrasound; ex vivo histology, immunofluorescence, and western blot assessed remodeling and inflammation.</p><p><strong>Results: </strong>Progressive aneurysm expansion was observed, with diameters of 1.32 ± 0.08 cm (mean ± standard deviation) at 1 week post-intervention, 1.59 ± 0.06 cm at 2 weeks, 1.81 ± 0.10 cm at 3 weeks, and 1.94 ± 0.19 cm at 4 weeks (baseline: 0.74 ± 0.08 cm; p < 0.001). Experimental groups' macrophages increased (group 1, 15.12 ± 3.88%; group 2, 16.65 ± 5.27%) compared to control (0.66 ± 0.27%, p = 0.012 and p = 0.021, respectively). Vascular smooth muscle cells were reduced across interventional groups (45.97 ± 17.26% versus control 80.94 ± 14.26%, p = 0.005).</p><p><strong>Conclusions: </strong>This porcine AAA model replicates human disease features with a fully endovascular workflow, offering a valuable platform for evaluation of novel imaging techniques and interventional therapies.</p><p><strong>Relevance statement: </strong>This study presents a fully endovascular porcine model of abdominal aortic aneurysm for translational research in interventional radiology and imaging. By enabling aneurysm induction entirely through catheter-based techniques, the model could provide a clinically relevant platform for future evaluation of novel endovascular devices and intraluminal therapeutics.</p><p><strong>Key points: </strong>This study established a fully endovascular, translational porcine model of abdominal aortic aneurysm. The model exhibited a significant mean aneurysmal dilation of about 161% at 4 weeks and 107% at 2 weeks. Serial ultrasound confirmed consistent aneurysm expansion and reproducible growth patterns in surviving animals. Ex vivo analyses demonstrated inflammation and extracellular-matrix damage, mirroring key features of human abdominal aortic aneurysm pathology. This fully catheter-based workflow provides a practical preclinical platform for evaluating imaging techniques and endovascular therapies.</p>","PeriodicalId":36926,"journal":{"name":"European Radiology Experimental","volume":"10 1","pages":"6"},"PeriodicalIF":3.6,"publicationDate":"2026-01-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12834893/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146054099","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Reliability of Gemini 2.5 Pro, ChatGPT 4.1, DeepSeek V3, and Claude Opus 4 in generating standardized CMR protocols. Gemini 2.5 Pro、ChatGPT 4.1、DeepSeek V3和Claude Opus 4在生成标准化CMR协议方面的可靠性
IF 3.6 Q1 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING Pub Date : 2026-01-26 DOI: 10.1186/s41747-025-00671-1
Răzvan-Andrei Licu, Giuseppe Muscogiuri, Davide Casartelli, Anca Bacârea, Marian Pop, Andra-Maria Licu, Daniele Sferratore, Alessandro Caruso, Marianna Mirchuk, Piotr Tarkowski, Jakub Byczkowski, Sandro Sironi

Artificial intelligence (AI) and large language models (LLMs) are increasingly integrated into radiology, offering new possibilities for advanced imaging techniques, including cardiovascular magnetic resonance (CMR). This proof-of-concept study assessed four high-performing LLMs (Gemini 2.5 Pro, ChatGPT 4.1, DeepSeek V3, and Claude Opus 4) on their ability to generate CMR protocols for 140 hypothetical cardiac cases. AI-generated protocols were compared against a reference standard established by a consensus between two experienced cardiovascular radiologists, following the Society for Cardiovascular Magnetic Resonance (SCMR) recommendations. Descriptive statistics were used to quantify the concordance of LLM-generated sequences with the SCMR guidelines. Statistical agreement was measured using Cohen and Fleiss κ statistics. Gemini 2.5 Pro achieved the highest concordance, aligning with the SCMR guidelines in 71.5% of all evaluated scenarios. Overall, LLMs showed moderate agreement with the SCMR protocols, with Gemini 2.5 Pro again performing best (Cohen κ = 0.55). Agreement was substantial for mandatory CMR sequences (Fleiss κ ∈ [0.69, 0.74]) and predominantly fair for optional sequences. The tested LLMs demonstrate a potential to generate efficient and pathology-adapted CMR protocols. Under expert supervision, this capability could streamline the imaging workflow and help extend CMR to primary healthcare centers through protocol automation. RELEVANCE STATEMENT: The potential of Gemini 2.5 Pro, ChatGPT 4.1, DeepSeek V3, and Claude Opus 4 to suggest pathology-adapted CMR protocols could improve imaging throughput and help to expand access to advanced cardiac diagnostics in primary healthcare centers. KEY POINTS: The tested large language models show potential for generating CMR protocols. Substantial agreement on mandatory CMR sequences promises more efficient examinations. Automation of CMR protocols could help to improve access to this advanced technique outside major medical institutions.

人工智能(AI)和大型语言模型(llm)越来越多地集成到放射学中,为包括心血管磁共振(CMR)在内的先进成像技术提供了新的可能性。这项概念验证研究评估了四个高性能llm (Gemini 2.5 Pro、ChatGPT 4.1、DeepSeek V3和Claude Opus 4)为140个假设的心脏病例生成CMR协议的能力。根据心血管磁共振学会(SCMR)的建议,将人工智能生成的方案与两位经验丰富的心血管放射科医生达成共识的参考标准进行比较。描述性统计用于量化llm生成的序列与SCMR指南的一致性。采用Cohen和Fleiss κ统计量测定统计一致性。Gemini 2.5 Pro达到了最高的一致性,在所有评估情景中有71.5%与SCMR指南一致。总体而言,llm与SCMR方案表现出中等程度的一致性,Gemini 2.5 Pro再次表现最佳(Cohen κ = 0.55)。强制性CMR序列(Fleiss κ∈[0.69,0.74])与可选序列的一致性基本一致。经过测试的llm证明了产生高效和适应病理的CMR协议的潜力。在专家监督下,该功能可以简化成像工作流程,并通过协议自动化帮助将CMR扩展到初级医疗保健中心。相关声明:Gemini 2.5 Pro、ChatGPT 4.1、DeepSeek V3和Claude Opus 4的潜力表明,病理适应的CMR协议可以提高成像吞吐量,并有助于扩大初级卫生保健中心先进心脏诊断的可及性。重点:经过测试的大型语言模型显示了生成CMR协议的潜力。强制性CMR序列的实质性协议承诺更有效的检查。CMR协议的自动化可以帮助改善在主要医疗机构之外使用这一先进技术的机会。
{"title":"Reliability of Gemini 2.5 Pro, ChatGPT 4.1, DeepSeek V3, and Claude Opus 4 in generating standardized CMR protocols.","authors":"Răzvan-Andrei Licu, Giuseppe Muscogiuri, Davide Casartelli, Anca Bacârea, Marian Pop, Andra-Maria Licu, Daniele Sferratore, Alessandro Caruso, Marianna Mirchuk, Piotr Tarkowski, Jakub Byczkowski, Sandro Sironi","doi":"10.1186/s41747-025-00671-1","DOIUrl":"10.1186/s41747-025-00671-1","url":null,"abstract":"<p><p>Artificial intelligence (AI) and large language models (LLMs) are increasingly integrated into radiology, offering new possibilities for advanced imaging techniques, including cardiovascular magnetic resonance (CMR). This proof-of-concept study assessed four high-performing LLMs (Gemini 2.5 Pro, ChatGPT 4.1, DeepSeek V3, and Claude Opus 4) on their ability to generate CMR protocols for 140 hypothetical cardiac cases. AI-generated protocols were compared against a reference standard established by a consensus between two experienced cardiovascular radiologists, following the Society for Cardiovascular Magnetic Resonance (SCMR) recommendations. Descriptive statistics were used to quantify the concordance of LLM-generated sequences with the SCMR guidelines. Statistical agreement was measured using Cohen and Fleiss κ statistics. Gemini 2.5 Pro achieved the highest concordance, aligning with the SCMR guidelines in 71.5% of all evaluated scenarios. Overall, LLMs showed moderate agreement with the SCMR protocols, with Gemini 2.5 Pro again performing best (Cohen κ = 0.55). Agreement was substantial for mandatory CMR sequences (Fleiss κ ∈ [0.69, 0.74]) and predominantly fair for optional sequences. The tested LLMs demonstrate a potential to generate efficient and pathology-adapted CMR protocols. Under expert supervision, this capability could streamline the imaging workflow and help extend CMR to primary healthcare centers through protocol automation. RELEVANCE STATEMENT: The potential of Gemini 2.5 Pro, ChatGPT 4.1, DeepSeek V3, and Claude Opus 4 to suggest pathology-adapted CMR protocols could improve imaging throughput and help to expand access to advanced cardiac diagnostics in primary healthcare centers. KEY POINTS: The tested large language models show potential for generating CMR protocols. Substantial agreement on mandatory CMR sequences promises more efficient examinations. Automation of CMR protocols could help to improve access to this advanced technique outside major medical institutions.</p>","PeriodicalId":36926,"journal":{"name":"European Radiology Experimental","volume":"10 1","pages":"7"},"PeriodicalIF":3.6,"publicationDate":"2026-01-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12834875/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146054121","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comparison of postprocessing metrics in multimetabolic APT-weighted CEST and 2-deoxy-D-glucose-CEST-MRI for differentiating breast cancer subtypes in a murine model. 小鼠模型中多代谢apt加权CEST和2-脱氧-d -葡萄糖-CEST- mri鉴别乳腺癌亚型的后处理指标的比较
IF 3.6 Q1 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING Pub Date : 2026-01-19 DOI: 10.1186/s41747-025-00665-z
Daniela Prinz, Silvester J Bartsch, Joachim Friske, Martin Krššák, Daniela Laimer-Gruber, Thomas H Helbich, Katja Pinker

Background: Chemical exchange saturation transfer (CEST)-magnetic resonance imaging (MRI), particularly amide proton transfer-weighted (APTw)-CEST and 2-deoxy-D-glucose-CEST, holds promise for noninvasive molecular breast cancer (BC) characterization. However, quantification remains challenging due to field inhomogeneities, overlapping exchange pools, and the limited robustness of conventional metrics such as the magnetization transfer ratio asymmetry (MTRasym). This study evaluates four CEST postprocessing metrics-MTRasym, Lorentzian amplitudes, MTR relaxation exchange (MTRREX), and apparent exchange-dependent relaxation (AREX)-for their diagnostic performance in differentiating BC subtypes using endogenous APTw-CEST and exogenous 2-deoxy-D-glucose-CEST in a murine BC xenograft model of Luminal A, human epidermal growth factor receptor 2 (HER2)+, and triple-negative tumors.

Materials and methods: Metabolic CEST-MRI was performed in vitro on protein and 2-deoxy-D-glucose phantoms and in vivo in a murine BC model. Imaging was conducted at 9.4 T with 120 frequency offsets from +6 to -6 ppm. MTRREX and AREX were derived via Lorentzian fitting using tailored five-pool models. Statistical comparisons across subtypes were performed per metric.

Results: In APTw-CEST, MTRREX and AREX significantly distinguished Luminal A from HER2+ (p ≤ 0.027) and Luminal A from triple-negative (p ≤ 0.006) tumors. Lorentzian amplitudes differentiated Luminal A from triple-negative (p = 0.019), while MTRasym showed no separation. In 2-deoxy-D-glucose-CEST, only AREX distinguished Luminal A from HER2+ tumors (p = 0.017).

Conclusion: Advanced metrics, particularly MTRREX and AREX, improve metabolic CEST-MRI for BC subtyping in a murine preclinical model, while MTRasym is inadequate for this purpose.

Relevance statement: Our findings underscore the importance of applying advanced postprocessing metrics to metabolic CEST-MRI for improved noninvasive BC characterization in a murine preclinical model.

Key points: Advanced multimetabolic APTw-CEST and 2-deoxy-D-glucose-CEST postprocessing metrics allowed adequate preclinical murine BC subtyping. AREX showed potential for 2-deoxy-D-glucose-CEST in tumor characterization; however, APTw-CEST remains superior. MTRasym failed to distinguish between tumor subtypes in CEST-MRI.

背景:化学交换饱和转移(CEST)-磁共振成像(MRI),特别是酰胺质子转移加权(APTw)-CEST和2-脱氧-d -葡萄糖-CEST,是非侵入性分子乳腺癌(BC)表征的希望。然而,由于磁场不均匀性、交换池重叠以及磁化转移比不对称(MTRasym)等传统指标的鲁棒性有限,量化仍然具有挑战性。本研究利用内源性ap2 -CEST和外源性2-deoxy- d -葡萄糖-CEST,在Luminal a、人表皮生长因子受体2 (HER2)+和三阴性肿瘤的小鼠BC异种移植模型中,评估了四种CEST后处理指标——mtrasym、Lorentzian振幅、MTR松弛交换(MTRREX)和明显交换依赖性松弛(AREX)——在区分BC亚型中的诊断性能。材料和方法:体外对蛋白质和2-脱氧-d -葡萄糖模型以及小鼠BC模型进行代谢CEST-MRI。成像在9.4 T下进行,120个频率偏移从+6到-6 ppm。mtrex和AREX通过Lorentzian拟合得到,使用定制的五池模型。每个度量执行跨子类型的统计比较。结果:在ap2 - cest中,mtrex和AREX可显著区分HER2+肿瘤中的Luminal A (p≤0.027)和三阴性肿瘤中的Luminal A (p≤0.006)。Lorentzian振幅将Luminal A与三阴性区分开来(p = 0.019),而MTRasym则没有区分。在2-脱氧-d -葡萄糖- cest中,只有AREX能够区分Luminal A和HER2+肿瘤(p = 0.017)。结论:在小鼠临床前模型中,先进的指标,特别是MTRREX和AREX,可以改善代谢CEST-MRI对BC亚型的诊断,而MTRasym在这方面是不够的。相关声明:我们的研究结果强调了在小鼠临床前模型中,将先进的后处理指标应用于代谢CEST-MRI以改善无创BC表征的重要性。重点:先进的多代谢ap2 - cest和2-脱氧-d -葡萄糖- cest后处理指标允许充分的临床前小鼠BC亚型。AREX显示2-脱氧-d -葡萄糖- cest在肿瘤表征中的潜力;然而,APTw-CEST仍然是优越的。MTRasym不能在CEST-MRI中区分肿瘤亚型。
{"title":"Comparison of postprocessing metrics in multimetabolic APT-weighted CEST and 2-deoxy-D-glucose-CEST-MRI for differentiating breast cancer subtypes in a murine model.","authors":"Daniela Prinz, Silvester J Bartsch, Joachim Friske, Martin Krššák, Daniela Laimer-Gruber, Thomas H Helbich, Katja Pinker","doi":"10.1186/s41747-025-00665-z","DOIUrl":"10.1186/s41747-025-00665-z","url":null,"abstract":"<p><strong>Background: </strong>Chemical exchange saturation transfer (CEST)-magnetic resonance imaging (MRI), particularly amide proton transfer-weighted (APTw)-CEST and 2-deoxy-D-glucose-CEST, holds promise for noninvasive molecular breast cancer (BC) characterization. However, quantification remains challenging due to field inhomogeneities, overlapping exchange pools, and the limited robustness of conventional metrics such as the magnetization transfer ratio asymmetry (MTR<sub>asym</sub>). This study evaluates four CEST postprocessing metrics-MTR<sub>asym</sub>, Lorentzian amplitudes, MTR relaxation exchange (MTR<sub>REX</sub>), and apparent exchange-dependent relaxation (AREX)-for their diagnostic performance in differentiating BC subtypes using endogenous APTw-CEST and exogenous 2-deoxy-D-glucose-CEST in a murine BC xenograft model of Luminal A, human epidermal growth factor receptor 2 (HER2)+, and triple-negative tumors.</p><p><strong>Materials and methods: </strong>Metabolic CEST-MRI was performed in vitro on protein and 2-deoxy-D-glucose phantoms and in vivo in a murine BC model. Imaging was conducted at 9.4 T with 120 frequency offsets from +6 to -6 ppm. MTR<sub>REX</sub> and AREX were derived via Lorentzian fitting using tailored five-pool models. Statistical comparisons across subtypes were performed per metric.</p><p><strong>Results: </strong>In APTw-CEST, MTR<sub>REX</sub> and AREX significantly distinguished Luminal A from HER2+ (p ≤ 0.027) and Luminal A from triple-negative (p ≤ 0.006) tumors. Lorentzian amplitudes differentiated Luminal A from triple-negative (p = 0.019), while MTR<sub>asym</sub> showed no separation. In 2-deoxy-D-glucose-CEST, only AREX distinguished Luminal A from HER2+ tumors (p = 0.017).</p><p><strong>Conclusion: </strong>Advanced metrics, particularly MTR<sub>REX</sub> and AREX, improve metabolic CEST-MRI for BC subtyping in a murine preclinical model, while MTR<sub>asym</sub> is inadequate for this purpose.</p><p><strong>Relevance statement: </strong>Our findings underscore the importance of applying advanced postprocessing metrics to metabolic CEST-MRI for improved noninvasive BC characterization in a murine preclinical model.</p><p><strong>Key points: </strong>Advanced multimetabolic APTw-CEST and 2-deoxy-D-glucose-CEST postprocessing metrics allowed adequate preclinical murine BC subtyping. AREX showed potential for 2-deoxy-D-glucose-CEST in tumor characterization; however, APTw-CEST remains superior. MTR<sub>asym</sub> failed to distinguish between tumor subtypes in CEST-MRI.</p>","PeriodicalId":36926,"journal":{"name":"European Radiology Experimental","volume":"10 1","pages":"5"},"PeriodicalIF":3.6,"publicationDate":"2026-01-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12816453/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145998893","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
European Radiology Experimental
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1