Pub Date : 2024-12-01Epub Date: 2024-06-10DOI: 10.1097/RLI.0000000000001092
Reyhaneh Nosrati, Fatih Calakli, Onur Afacan, Kristina Pelkola, Reid Nichols, Pauline Connaughton, M Alejandra Bedoya, Andy Tsai, Sarah Bixby, Simon K Warfield
Objectives: The T1-weighted GRE (gradient recalled echo) sequence with the Dixon technique for water/fat separation is an essential component of abdominal MRI (magnetic resonance imaging), useful in detecting tumors and characterizing hemorrhage/fat content. Unfortunately, the current implementation of this sequence suffers from several problems: (1) low resolution to maintain high pixel bandwidth and minimize chemical shift; (2) image blurring due to respiratory motion; (3) water/fat swapping due to the natural ambiguity between fat and water peaks; and (4) off-resonance fat blurring due to the multipeak nature of the fat spectrum. The goal of this study was to evaluate the image quality of water/fat separation using a high-resolution 3-point Dixon golden angle radial acquisition with retrospective motion compensation and multipeak fat modeling in children undergoing abdominal MRI.
Materials and methods: Twenty-two pediatric patients (4.2 ± 2.3 years) underwent abdominal MRI on a 3 T scanner with routine abdominal protocol and with a 3-point Dixon radial-VIBE (volumetric interpolated breath-hold examination) sequence. Field maps were calculated using 3D graph-cut optimization followed by fat and water calculation from k-space data by iteratively solving an optimization problem. A 6-peak fat model was used to model chemical shifts in k-space. Residual respiratory motion was corrected through soft-gating by weighting each projection based on the estimated respiratory motion from the center of the k-space. Reconstructed images were reviewed by 3 pediatric radiologists on a PACS (picture archiving and communication systems) workstation. Subjective image quality and water/fat swapping artifact were scored by each pediatric radiologist using a 5-point Likert scale. The VoL (variance of Laplacian) of the reconstructed images was used to objectively quantify image sharpness.
Results: Based on the overall Likert scores, the images generated using the described method were significantly superior to those reconstructed by the conventional 2-point Dixon technique ( P < 0.05). Water/fat swapping artifact was observed in 14 of 22 patients using 2-point Dixon, and this artifact was not present when using the proposed method. Image sharpness was significantly improved using the proposed framework.
Conclusions: In smaller patients, a high-quality water/fat separation with sharp visualization of fine details is critical for diagnostic accuracy. High-resolution golden angle radial-VIBE 3-point Dixon acquisition with 6-peak fat model and soft-gated motion correction offers improved image quality at the expense of an additional ~1-minute acquisition time. Thus, this technique offers the potential to replace the conventional 2-point Dixon technique.
{"title":"Free-Breathing High-Resolution, Swap-Free, and Motion-Corrected Water/Fat Separation in Pediatric Abdominal MRI.","authors":"Reyhaneh Nosrati, Fatih Calakli, Onur Afacan, Kristina Pelkola, Reid Nichols, Pauline Connaughton, M Alejandra Bedoya, Andy Tsai, Sarah Bixby, Simon K Warfield","doi":"10.1097/RLI.0000000000001092","DOIUrl":"10.1097/RLI.0000000000001092","url":null,"abstract":"<p><strong>Objectives: </strong>The T1-weighted GRE (gradient recalled echo) sequence with the Dixon technique for water/fat separation is an essential component of abdominal MRI (magnetic resonance imaging), useful in detecting tumors and characterizing hemorrhage/fat content. Unfortunately, the current implementation of this sequence suffers from several problems: (1) low resolution to maintain high pixel bandwidth and minimize chemical shift; (2) image blurring due to respiratory motion; (3) water/fat swapping due to the natural ambiguity between fat and water peaks; and (4) off-resonance fat blurring due to the multipeak nature of the fat spectrum. The goal of this study was to evaluate the image quality of water/fat separation using a high-resolution 3-point Dixon golden angle radial acquisition with retrospective motion compensation and multipeak fat modeling in children undergoing abdominal MRI.</p><p><strong>Materials and methods: </strong>Twenty-two pediatric patients (4.2 ± 2.3 years) underwent abdominal MRI on a 3 T scanner with routine abdominal protocol and with a 3-point Dixon radial-VIBE (volumetric interpolated breath-hold examination) sequence. Field maps were calculated using 3D graph-cut optimization followed by fat and water calculation from k-space data by iteratively solving an optimization problem. A 6-peak fat model was used to model chemical shifts in k-space. Residual respiratory motion was corrected through soft-gating by weighting each projection based on the estimated respiratory motion from the center of the k-space. Reconstructed images were reviewed by 3 pediatric radiologists on a PACS (picture archiving and communication systems) workstation. Subjective image quality and water/fat swapping artifact were scored by each pediatric radiologist using a 5-point Likert scale. The VoL (variance of Laplacian) of the reconstructed images was used to objectively quantify image sharpness.</p><p><strong>Results: </strong>Based on the overall Likert scores, the images generated using the described method were significantly superior to those reconstructed by the conventional 2-point Dixon technique ( P < 0.05). Water/fat swapping artifact was observed in 14 of 22 patients using 2-point Dixon, and this artifact was not present when using the proposed method. Image sharpness was significantly improved using the proposed framework.</p><p><strong>Conclusions: </strong>In smaller patients, a high-quality water/fat separation with sharp visualization of fine details is critical for diagnostic accuracy. High-resolution golden angle radial-VIBE 3-point Dixon acquisition with 6-peak fat model and soft-gated motion correction offers improved image quality at the expense of an additional ~1-minute acquisition time. Thus, this technique offers the potential to replace the conventional 2-point Dixon technique.</p>","PeriodicalId":14486,"journal":{"name":"Investigative Radiology","volume":" ","pages":"805-812"},"PeriodicalIF":4.4,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11560742/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141300608","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-12-01Epub Date: 2024-06-11DOI: 10.1097/RLI.0000000000001093
Ricardo Donners, Jan Vosshenrich, Martin Segeroth, Magdalena Seng, Matthias Fenchel, Marcel Dominik Nickel, Michael Bach, Florian Schmaranzer, Inga Todorski, Markus M Obmann, Dorothee Harder, Hanns-Christian Breit
Objectives: The aim of this study was to compare deep learning reconstructed (DLR) 0.55 T magnetic resonance imaging (MRI) quality, identification, and grading of structural anomalies and reader confidence levels with conventional 3 T knee MRI in patients with knee pain following trauma.
Materials and methods: This prospective study of 26 symptomatic patients (5 women) includes 52 paired DLR 0.55 T and conventional 3 T MRI examinations obtained in 1 setting. A novel, commercially available DLR algorithm was employed for 0.55 T image reconstruction. Four board-certified radiologists reviewed all images independently and graded image quality, noted structural anomalies and their respective reporting confidence levels for the presence or absence, as well as grading of bone, cartilage, meniscus, ligament, and tendon lesions. Image quality and reader confidence levels were compared ( P < 0.05, significant), and MRI findings were correlated between 0.55 T and 3 T MRI using Cohen kappa (κ).
Results: In reader's consensus, good image quality was found for DLR 0.55 T MRI and 3 T MRI (3.8 vs 4.1/5 points, P = 0.06). There was near-perfect agreement between 0.55 T DLR and 3 T MRI regarding the identification of structural anomalies for all readers (each κ ≥ 0.80). Substantial to near-perfection agreement between 0.55 T and 3 T MRI was reported for grading of cartilage (κ = 0.65-0.86) and meniscus lesions (κ = 0.71-1.0). High confidence levels were found for all readers for DLR 0.55 T and 3 T MRI, with 3 readers showing higher confidence levels for reporting cartilage lesions on 3 T MRI.
Conclusions: In conclusion, new-generation 0.55 T DLR MRI provides good image quality, comparable to conventional 3 T MRI, and allows for reliable identification of internal derangement of the knee with high reader confidence.
研究目的本研究旨在比较深度学习重建(DLR)0.55 T 磁共振成像(MRI)的质量、结构异常的识别和分级以及读者对创伤后膝关节疼痛患者传统 3 T 膝关节 MRI 的置信度:这项前瞻性研究共对 26 名有症状的患者(5 名女性)进行了检查,其中包括 52 次在 1 个环境中获得的配对 DLR 0.55 T 和常规 3 T MRI 检查。在 0.55 T 图像重建中采用了一种新型的、市场上可买到的 DLR 算法。四位经委员会认证的放射科医生独立审查了所有图像,并对图像质量进行了分级,指出了结构异常和各自对是否存在结构异常的报告置信度,并对骨、软骨、半月板、韧带和肌腱病变进行了分级。对图像质量和读者信心水平进行比较(P < 0.05,差异显著),并使用 Cohen kappa (κ)对 0.55 T 和 3 T MRI 结果进行相关性分析:结果:读者一致认为,DLR 0.55 T MRI 和 3 T MRI 的图像质量良好(3.8 vs 4.1/5 points,P = 0.06)。在结构异常的识别方面,0.55 T DLR 和 3 T MRI 几乎与所有读者完全一致(各 κ ≥ 0.80)。在软骨(κ = 0.65-0.86)和半月板病变(κ = 0.71-1.0)的分级方面,0.55 T 和 3 T MRI 的结果基本接近完美一致。所有读者对 DLR 0.55 T 和 3 T MRI 的置信度都很高,其中 3 位读者对 3 T MRI 报告软骨损伤的置信度更高:总之,新一代 0.55 T DLR MRI 具有良好的图像质量,可与传统的 3 T MRI 相媲美,并能可靠地识别膝关节内部病变,读者的置信度较高。
{"title":"Deep Learning Reconstructed New-Generation 0.55 T MRI of the Knee-A Prospective Comparison With Conventional 3 T MRI.","authors":"Ricardo Donners, Jan Vosshenrich, Martin Segeroth, Magdalena Seng, Matthias Fenchel, Marcel Dominik Nickel, Michael Bach, Florian Schmaranzer, Inga Todorski, Markus M Obmann, Dorothee Harder, Hanns-Christian Breit","doi":"10.1097/RLI.0000000000001093","DOIUrl":"10.1097/RLI.0000000000001093","url":null,"abstract":"<p><strong>Objectives: </strong>The aim of this study was to compare deep learning reconstructed (DLR) 0.55 T magnetic resonance imaging (MRI) quality, identification, and grading of structural anomalies and reader confidence levels with conventional 3 T knee MRI in patients with knee pain following trauma.</p><p><strong>Materials and methods: </strong>This prospective study of 26 symptomatic patients (5 women) includes 52 paired DLR 0.55 T and conventional 3 T MRI examinations obtained in 1 setting. A novel, commercially available DLR algorithm was employed for 0.55 T image reconstruction. Four board-certified radiologists reviewed all images independently and graded image quality, noted structural anomalies and their respective reporting confidence levels for the presence or absence, as well as grading of bone, cartilage, meniscus, ligament, and tendon lesions. Image quality and reader confidence levels were compared ( P < 0.05, significant), and MRI findings were correlated between 0.55 T and 3 T MRI using Cohen kappa (κ).</p><p><strong>Results: </strong>In reader's consensus, good image quality was found for DLR 0.55 T MRI and 3 T MRI (3.8 vs 4.1/5 points, P = 0.06). There was near-perfect agreement between 0.55 T DLR and 3 T MRI regarding the identification of structural anomalies for all readers (each κ ≥ 0.80). Substantial to near-perfection agreement between 0.55 T and 3 T MRI was reported for grading of cartilage (κ = 0.65-0.86) and meniscus lesions (κ = 0.71-1.0). High confidence levels were found for all readers for DLR 0.55 T and 3 T MRI, with 3 readers showing higher confidence levels for reporting cartilage lesions on 3 T MRI.</p><p><strong>Conclusions: </strong>In conclusion, new-generation 0.55 T DLR MRI provides good image quality, comparable to conventional 3 T MRI, and allows for reliable identification of internal derangement of the knee with high reader confidence.</p>","PeriodicalId":14486,"journal":{"name":"Investigative Radiology","volume":" ","pages":"823-830"},"PeriodicalIF":7.0,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141300607","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Objectives: Clinical experience regarding the use of dedicated photon-counting breast CT (PC-BCT) for diagnosis of breast microcalcifications is scarce. This study systematically compares the detection and classification of breast microcalcifications using a dedicated breast photon-counting CT, especially designed for examining the breast, in comparison with digital breast tomosynthesis (DBT).
Materials and methods: This is a prospective intraindividual study on women with DBT screening-detected BI-RADS-4/-5 microcalcifications who underwent PC-BCT before biopsy. PC-BCT images were reconstructed with a noninterpolated spatial resolution of 0.15 × 0.15 × 0.15 mm (reconstruction mode 1 [RM-1]) and with 0.3 × 0.3 × 0.3 mm (reconstruction mode 2 [RM-2]), plus thin-slab maximum intensity projection (MIP) reconstructions. Two radiologists independently rated the detection of microcalcifications in direct comparison with DBT on a 5-point scale. The distribution and morphology of microcalcifications were then rated according to BI-RADS. The size of the smallest discernible microcalcification particle was measured. For PC-BCT, the average glandular dose was determined by Monte Carlo simulations; for DBT, the information provided by the DBT system was used.
Results: Between September 2022 and July 2023, 22 participants (mean age, 61; range, 42-85 years) with microcalcifications (16 malignant; 6 benign) were included. In 2/22 with microcalcifications in the posterior region, microcalcifications were not detectable on PC-BCT, likely because they were not included in the PC-BCT volume. In the remaining 20 participants, microcalcifications were detectable. With high between-reader agreement (κ > 0.8), conspicuity of microcalcifications was rated similar for DBT and MIPs of RM-1 (mean, 4.83 ± 0.38 vs 4.86 ± 0.35) ( P = 0.66), but was significantly lower ( P < 0.05) for the remaining PC-BCT reconstructions: 2.11 ± 0.92 (RM-2), 2.64 ± 0.80 (MIPs of RM-2), and 3.50 ± 1.23 (RM-1). Identical distribution qualifiers were assigned for PC-BCT and DBT in 18/20 participants, with excellent agreement (κ = 0.91), whereas identical morphologic qualifiers were assigned in only 5/20, with poor agreement (κ = 0.44). The median size of smallest discernible microcalcification particle was 0.2 versus 0.6 versus 1.1 mm in DBT versus RM-1 versus RM-2 ( P < 0.001), likely due to blooming effects. Average glandular dose was 7.04 mGy (PC-BCT) versus 6.88 mGy (DBT) ( P = 0.67).
Conclusions: PC-BCT allows reliable detection of in-breast microcalcifications as long as they are not located in the posterior part of the breast and allows assessment of their distribution, but not of their individual morphology.
{"title":"Dedicated Photon-Counting CT for Detection and Classification of Microcalcifications: An Intraindividual Comparison With Digital Breast Tomosynthesis.","authors":"Luisa Charlotte Huck, Maike Bode, Eloisa Zanderigo, Caroline Wilpert, Vanessa Raaff, Ebba Dethlefsen, Evelyn Wenkel, Christiane Katharina Kuhl","doi":"10.1097/RLI.0000000000001097","DOIUrl":"10.1097/RLI.0000000000001097","url":null,"abstract":"<p><strong>Objectives: </strong>Clinical experience regarding the use of dedicated photon-counting breast CT (PC-BCT) for diagnosis of breast microcalcifications is scarce. This study systematically compares the detection and classification of breast microcalcifications using a dedicated breast photon-counting CT, especially designed for examining the breast, in comparison with digital breast tomosynthesis (DBT).</p><p><strong>Materials and methods: </strong>This is a prospective intraindividual study on women with DBT screening-detected BI-RADS-4/-5 microcalcifications who underwent PC-BCT before biopsy. PC-BCT images were reconstructed with a noninterpolated spatial resolution of 0.15 × 0.15 × 0.15 mm (reconstruction mode 1 [RM-1]) and with 0.3 × 0.3 × 0.3 mm (reconstruction mode 2 [RM-2]), plus thin-slab maximum intensity projection (MIP) reconstructions. Two radiologists independently rated the detection of microcalcifications in direct comparison with DBT on a 5-point scale. The distribution and morphology of microcalcifications were then rated according to BI-RADS. The size of the smallest discernible microcalcification particle was measured. For PC-BCT, the average glandular dose was determined by Monte Carlo simulations; for DBT, the information provided by the DBT system was used.</p><p><strong>Results: </strong>Between September 2022 and July 2023, 22 participants (mean age, 61; range, 42-85 years) with microcalcifications (16 malignant; 6 benign) were included. In 2/22 with microcalcifications in the posterior region, microcalcifications were not detectable on PC-BCT, likely because they were not included in the PC-BCT volume. In the remaining 20 participants, microcalcifications were detectable. With high between-reader agreement (κ > 0.8), conspicuity of microcalcifications was rated similar for DBT and MIPs of RM-1 (mean, 4.83 ± 0.38 vs 4.86 ± 0.35) ( P = 0.66), but was significantly lower ( P < 0.05) for the remaining PC-BCT reconstructions: 2.11 ± 0.92 (RM-2), 2.64 ± 0.80 (MIPs of RM-2), and 3.50 ± 1.23 (RM-1). Identical distribution qualifiers were assigned for PC-BCT and DBT in 18/20 participants, with excellent agreement (κ = 0.91), whereas identical morphologic qualifiers were assigned in only 5/20, with poor agreement (κ = 0.44). The median size of smallest discernible microcalcification particle was 0.2 versus 0.6 versus 1.1 mm in DBT versus RM-1 versus RM-2 ( P < 0.001), likely due to blooming effects. Average glandular dose was 7.04 mGy (PC-BCT) versus 6.88 mGy (DBT) ( P = 0.67).</p><p><strong>Conclusions: </strong>PC-BCT allows reliable detection of in-breast microcalcifications as long as they are not located in the posterior part of the breast and allows assessment of their distribution, but not of their individual morphology.</p>","PeriodicalId":14486,"journal":{"name":"Investigative Radiology","volume":" ","pages":"838-844"},"PeriodicalIF":7.0,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141456905","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-12-01Epub Date: 2024-06-24DOI: 10.1097/RLI.0000000000001098
Birte M Hofmann, Kai Riecke, Stefan Klein, Mark A Klemens, Petra Palkowitsch, Johannes F Kahn, Helena Posch, Matthias Berse, Wolfgang Ebert
Objectives: To investigate the signal-enhancement properties of the tetrameric gadolinium-based contrast agent (GBCA) gadoquatrane in relation to the administered dose and compare its properties to those of a standard dose of gadobutrol, as a representative of the currently established macrocyclic GBCAs for magnetic resonance imaging.
Materials and methods: In this randomized, single-blind, 4 × 4 crossover study, 43 healthy adults (19-50 years of age) received 3 single IV injections of gadoquatrane (0.01, 0.03, and 0.06 mmol gadolinium/kg body weight) and 1 injection of gadobutrol (0.1 mmol gadolinium/kg body weight) in randomized sequence with 1-week washout periods between administrations. The relative signal enhancement (RSE) was determined in predefined areas of interest in magnetic resonance image sets of the head-neck region. RSE-vs-dose curves (dose-response curves) were established by linear regression, and comparator-equivalent doses were determined by Bayesian inverse regression analysis. Further, 3 blood samples were taken after each injection for pharmacokinetic analyses, and safety data were assessed.
Results: The RSE increased with gadoquatrane dose. A linear function adequately fitted this relationship. In line with the more than 2-fold higher r1 relaxivity of gadoquatrane per gadolinium ion, gadobutrol-equivalent RSE was achieved with gadoquatrane at less than half the gadolinium dose and less than one eighth of the molecule dose.Administration of gadoquatrane and gadobutrol resulted in very similar dose-normalized gadolinium concentrations in plasma, indicating that the pharmacokinetic profiles are essentially the same. Both contrast agents were well tolerated. Adverse events were rare and not dependent on the dose administered.
Conclusions: Gadoquatrane has the potential to be an effective GBCA that can be used at substantially lower doses in clinical routine than the currently established macrocyclic GBCAs.
{"title":"Relationship of Dose and Signal Enhancement Properties of Gadoquatrane, a New Tetrameric, Macrocyclic Gadolinium-Based Contrast Agent, Compared With Gadobutrol: A Randomized Crossover Study in Healthy Adults.","authors":"Birte M Hofmann, Kai Riecke, Stefan Klein, Mark A Klemens, Petra Palkowitsch, Johannes F Kahn, Helena Posch, Matthias Berse, Wolfgang Ebert","doi":"10.1097/RLI.0000000000001098","DOIUrl":"10.1097/RLI.0000000000001098","url":null,"abstract":"<p><strong>Objectives: </strong>To investigate the signal-enhancement properties of the tetrameric gadolinium-based contrast agent (GBCA) gadoquatrane in relation to the administered dose and compare its properties to those of a standard dose of gadobutrol, as a representative of the currently established macrocyclic GBCAs for magnetic resonance imaging.</p><p><strong>Materials and methods: </strong>In this randomized, single-blind, 4 × 4 crossover study, 43 healthy adults (19-50 years of age) received 3 single IV injections of gadoquatrane (0.01, 0.03, and 0.06 mmol gadolinium/kg body weight) and 1 injection of gadobutrol (0.1 mmol gadolinium/kg body weight) in randomized sequence with 1-week washout periods between administrations. The relative signal enhancement (RSE) was determined in predefined areas of interest in magnetic resonance image sets of the head-neck region. RSE-vs-dose curves (dose-response curves) were established by linear regression, and comparator-equivalent doses were determined by Bayesian inverse regression analysis. Further, 3 blood samples were taken after each injection for pharmacokinetic analyses, and safety data were assessed.</p><p><strong>Results: </strong>The RSE increased with gadoquatrane dose. A linear function adequately fitted this relationship. In line with the more than 2-fold higher r1 relaxivity of gadoquatrane per gadolinium ion, gadobutrol-equivalent RSE was achieved with gadoquatrane at less than half the gadolinium dose and less than one eighth of the molecule dose.Administration of gadoquatrane and gadobutrol resulted in very similar dose-normalized gadolinium concentrations in plasma, indicating that the pharmacokinetic profiles are essentially the same. Both contrast agents were well tolerated. Adverse events were rare and not dependent on the dose administered.</p><p><strong>Conclusions: </strong>Gadoquatrane has the potential to be an effective GBCA that can be used at substantially lower doses in clinical routine than the currently established macrocyclic GBCAs.</p>","PeriodicalId":14486,"journal":{"name":"Investigative Radiology","volume":" ","pages":"845-853"},"PeriodicalIF":7.0,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141431950","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-12-01Epub Date: 2024-06-25DOI: 10.1097/RLI.0000000000001094
Nikolaj Bøgh, Lotte B Bertelsen, Camilla W Rasmussen, Sabrina K Bech, Anna K Keller, Mia G Madsen, Frederik Harving, Thomas H Thorsen, Ida K Mieritz, Esben Ss Hansen, Alkwin Wanders, Christoffer Laustsen
Objectives: Fibrosis is the final common pathway for chronic kidney disease and the best predictor for disease progression. Besides invasive biopsies, biomarkers for its detection are lacking. To address this, we used hyperpolarized 13 C-pyruvate MRI to detect the metabolic changes associated with fibrogenic activity of myofibroblasts.
Materials and methods: Hyperpolarized 13 C-pyruvate MRI was performed in 2 pig models of kidney fibrosis (unilateral ureteral obstruction and ischemia-reperfusion injury). The imaging data were correlated with histology, biochemical, and genetic measures of metabolism and fibrosis. The porcine experiments were supplemented with cell-line experiments to inform the origins of metabolic changes in fibrogenesis. Lastly, healthy and fibrotic human kidneys were analyzed for the metabolic alterations accessible with hyperpolarized 13 C-pyruvate MRI.
Results: In the 2 large animal models of kidney fibrosis, metabolic imaging revealed alterations in amino acid metabolism and glycolysis. Conversion from hyperpolarized 13 C-pyruvate to 13 C-alanine decreased, whereas conversion to 13 C-lactate increased. These changes were shown to reflect profibrotic activity in cultured epithelial cells, macrophages, and fibroblasts, which are important precursors of myofibroblasts. Importantly, metabolic MRI using hyperpolarized 13 C-pyruvate was able to detect these changes earlier than fibrosis-sensitive structural imaging. Lastly, we found that the same metabolic profile is present in fibrotic tissue from human kidneys. This affirms the translational potential of metabolic MRI as an early indicator of fibrogenesis associated metabolism.
Conclusions: Our findings demonstrate the promise of hyperpolarized 13 C-pyruvate MRI for noninvasive detection of fibrosis development, which could enable earlier diagnosis and intervention for patients at risk of kidney fibrosis.
目标:纤维化是慢性肾脏病的最终常见途径,也是疾病进展的最佳预测指标。除侵入性活检外,目前还缺乏检测纤维化的生物标志物。为解决这一问题,我们利用超极化 13 C 丙酮酸核磁共振成像检测与肌成纤维细胞纤维化活性相关的代谢变化。材料与方法:在 2 个猪肾脏纤维化模型(单侧输尿管梗阻和缺血再灌注损伤)中进行了超极化 13 C 丙酮酸核磁共振成像。成像数据与代谢和纤维化的组织学、生化和遗传测量结果相关联。猪实验辅以细胞系实验,以了解纤维化过程中代谢变化的起源。最后,利用超极化 13 C 丙酮酸核磁共振成像分析了健康肾脏和纤维化人肾的代谢变化:结果:在两种大型肾脏纤维化动物模型中,代谢成像显示了氨基酸代谢和糖酵解的改变。从超极化 13 C 丙酮酸到 13 C 丙氨酸的转化率下降,而到 13 C 乳酸的转化率上升。研究表明,这些变化反映了培养的上皮细胞、巨噬细胞和成纤维细胞(它们是肌成纤维细胞的重要前体)的坏死活性。重要的是,使用超极化 13 C 丙酮酸的代谢磁共振成像能够比纤维化敏感结构成像更早地检测到这些变化。最后,我们发现人类肾脏纤维化组织中也存在相同的代谢特征。这肯定了代谢磁共振成像作为纤维化相关代谢的早期指标的转化潜力:我们的研究结果表明,超极化 13 C 丙酮酸核磁共振成像有望用于无创检测肾脏纤维化的发展情况,从而能够对有肾脏纤维化风险的患者进行早期诊断和干预。
{"title":"Metabolic MRI With Hyperpolarized 13 C-Pyruvate for Early Detection of Fibrogenic Kidney Metabolism.","authors":"Nikolaj Bøgh, Lotte B Bertelsen, Camilla W Rasmussen, Sabrina K Bech, Anna K Keller, Mia G Madsen, Frederik Harving, Thomas H Thorsen, Ida K Mieritz, Esben Ss Hansen, Alkwin Wanders, Christoffer Laustsen","doi":"10.1097/RLI.0000000000001094","DOIUrl":"10.1097/RLI.0000000000001094","url":null,"abstract":"<p><strong>Objectives: </strong>Fibrosis is the final common pathway for chronic kidney disease and the best predictor for disease progression. Besides invasive biopsies, biomarkers for its detection are lacking. To address this, we used hyperpolarized 13 C-pyruvate MRI to detect the metabolic changes associated with fibrogenic activity of myofibroblasts.</p><p><strong>Materials and methods: </strong>Hyperpolarized 13 C-pyruvate MRI was performed in 2 pig models of kidney fibrosis (unilateral ureteral obstruction and ischemia-reperfusion injury). The imaging data were correlated with histology, biochemical, and genetic measures of metabolism and fibrosis. The porcine experiments were supplemented with cell-line experiments to inform the origins of metabolic changes in fibrogenesis. Lastly, healthy and fibrotic human kidneys were analyzed for the metabolic alterations accessible with hyperpolarized 13 C-pyruvate MRI.</p><p><strong>Results: </strong>In the 2 large animal models of kidney fibrosis, metabolic imaging revealed alterations in amino acid metabolism and glycolysis. Conversion from hyperpolarized 13 C-pyruvate to 13 C-alanine decreased, whereas conversion to 13 C-lactate increased. These changes were shown to reflect profibrotic activity in cultured epithelial cells, macrophages, and fibroblasts, which are important precursors of myofibroblasts. Importantly, metabolic MRI using hyperpolarized 13 C-pyruvate was able to detect these changes earlier than fibrosis-sensitive structural imaging. Lastly, we found that the same metabolic profile is present in fibrotic tissue from human kidneys. This affirms the translational potential of metabolic MRI as an early indicator of fibrogenesis associated metabolism.</p><p><strong>Conclusions: </strong>Our findings demonstrate the promise of hyperpolarized 13 C-pyruvate MRI for noninvasive detection of fibrosis development, which could enable earlier diagnosis and intervention for patients at risk of kidney fibrosis.</p>","PeriodicalId":14486,"journal":{"name":"Investigative Radiology","volume":" ","pages":"813-822"},"PeriodicalIF":7.0,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141446134","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-12-01Epub Date: 2024-07-05DOI: 10.1097/RLI.0000000000001105
Matthias Wetzl, Theresa Heilingbrunner, Felix Heindl, Evelyn Wenkel, Michael Uder, Sabine Ohlmeyer
Objectives: To evaluate the detectability of non-contrast-enhanced and contrast-enhanced spiral breast computed tomography ([non]-CE-SBCT) compared with mammography. Secondary objectives are to determine detectability depending on breast density and to evaluate appearance of breast malignancies according to BI-RADS descriptors.
Methods: This retrospective institutional review board-approved study included 90 women with 105 biopsy-proven malignant breast lesions. Breast density, BI-RADS descriptors, and detectability were evaluated by 2 independent readers. Diagnostic confidence was rated on a 4-point Likert scale.
Results: For readers 1 and 2, detectability was 83.8% and 80.0% for mammography, 99.1% and 99.1% for CE-SBCT ( P < 0.05), and 66.7% and 61.9% for non-CE-SBCT ( P < 0.05). With both readers, detectability in CE-SBCT was high for density A/B/C/D (both 100%/100%/100%/87.5%). Detectability of readers declined with increasing density for mammography (density A = 100%, B = 89.1% and 95.1%, C = 73.1%, D = 50.0% and 71.4%; P < 0.05) and for non-CE-SBCT (density A = 87.5% and 90.7%, B = 65.5% and 69.1%, C = 54.8% and 60.0%, D = 37.5%; P < 0.05). Mass lesions were detected with CT as often as with mammography, whereas architectural distortions and microcalcifications were detected less often with SBCT. Diagnostic confidence was very high or high in 97.2% for CE-SBCT, in 74.1% for non-CE-SBCT, and in 81.4% for mammography.
Conclusions: Detectability and diagnostic confidence were very high in CE-SBCT, regardless of breast density. The detectability of non-CE-SBCT was lower than that of mammography and declined with increasing breast density.
{"title":"Detectability of Breast Cancer in Dedicated Breast CT Compared With Mammography Dependent on Breast Density.","authors":"Matthias Wetzl, Theresa Heilingbrunner, Felix Heindl, Evelyn Wenkel, Michael Uder, Sabine Ohlmeyer","doi":"10.1097/RLI.0000000000001105","DOIUrl":"10.1097/RLI.0000000000001105","url":null,"abstract":"<p><strong>Objectives: </strong>To evaluate the detectability of non-contrast-enhanced and contrast-enhanced spiral breast computed tomography ([non]-CE-SBCT) compared with mammography. Secondary objectives are to determine detectability depending on breast density and to evaluate appearance of breast malignancies according to BI-RADS descriptors.</p><p><strong>Methods: </strong>This retrospective institutional review board-approved study included 90 women with 105 biopsy-proven malignant breast lesions. Breast density, BI-RADS descriptors, and detectability were evaluated by 2 independent readers. Diagnostic confidence was rated on a 4-point Likert scale.</p><p><strong>Results: </strong>For readers 1 and 2, detectability was 83.8% and 80.0% for mammography, 99.1% and 99.1% for CE-SBCT ( P < 0.05), and 66.7% and 61.9% for non-CE-SBCT ( P < 0.05). With both readers, detectability in CE-SBCT was high for density A/B/C/D (both 100%/100%/100%/87.5%). Detectability of readers declined with increasing density for mammography (density A = 100%, B = 89.1% and 95.1%, C = 73.1%, D = 50.0% and 71.4%; P < 0.05) and for non-CE-SBCT (density A = 87.5% and 90.7%, B = 65.5% and 69.1%, C = 54.8% and 60.0%, D = 37.5%; P < 0.05). Mass lesions were detected with CT as often as with mammography, whereas architectural distortions and microcalcifications were detected less often with SBCT. Diagnostic confidence was very high or high in 97.2% for CE-SBCT, in 74.1% for non-CE-SBCT, and in 81.4% for mammography.</p><p><strong>Conclusions: </strong>Detectability and diagnostic confidence were very high in CE-SBCT, regardless of breast density. The detectability of non-CE-SBCT was lower than that of mammography and declined with increasing breast density.</p>","PeriodicalId":14486,"journal":{"name":"Investigative Radiology","volume":" ","pages":"861-865"},"PeriodicalIF":7.0,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141468004","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-12-01Epub Date: 2024-07-04DOI: 10.1097/RLI.0000000000001095
Adrian Alexander Marth, Constantin von Deuster, Stefan Sommer, Georg Constantin Feuerriegel, Sophia Samira Goller, Reto Sutter, Daniel Nanz
Objectives: The aim of this study was to compare the image quality of 7 T turbo spin echo (TSE) knee images acquired with varying factors of parallel-imaging acceleration reconstructed with deep learning (DL)-based and conventional algorithms.
Materials and methods: This was a prospective single-center study. Twenty-three healthy volunteers underwent 7 T knee magnetic resonance imaging. Two-, 3-, and 4-fold accelerated high-resolution fat-signal-suppressing proton density (PD-fs) and T1-weighted coronal 2D TSE acquisitions with an encoded voxel volume of 0.31 × 0.31 × 1.5 mm 3 were acquired. Each set of raw data was reconstructed with a DL-based and a conventional Generalized Autocalibrating Partially Parallel Acquisition (GRAPPA) algorithm. Three readers rated image contrast, sharpness, artifacts, noise, and overall quality. Friedman analysis of variance and the Wilcoxon signed rank test were used for comparison of image quality criteria.
Results: The mean age of the participants was 32.0 ± 8.1 years (15 male, 8 female). Acquisition times at 4-fold acceleration were 4 minutes 15 seconds (PD-fs, Supplemental Video is available at http://links.lww.com/RLI/A938 ) and 3 minutes 9 seconds (T1, Supplemental Video available at http://links.lww.com/RLI/A939 ). At 4-fold acceleration, image contrast, sharpness, noise, and overall quality of images reconstructed with the DL-based algorithm were significantly better rated than the corresponding GRAPPA reconstructions ( P < 0.001). Four-fold accelerated DL-reconstructed images scored significantly better than 2- to 3-fold GRAPPA-reconstructed images with regards to image contrast, sharpness, noise, and overall quality ( P ≤ 0.031). Image contrast of PD-fs images at 2-fold acceleration ( P = 0.087), image noise of T1-weighted images at 2-fold acceleration ( P = 0.180), and image artifacts for both sequences at 2- and 3-fold acceleration ( P ≥ 0.102) of GRAPPA reconstructions were not rated differently than those of 4-fold accelerated DL-reconstructed images. Furthermore, no significant difference was observed for all image quality measures among 2-fold, 3-fold, and 4-fold accelerated DL reconstructions ( P ≥ 0.082).
Conclusions: This study explored the technical potential of DL-based image reconstruction in accelerated 2D TSE acquisitions of the knee at 7 T. DL reconstruction significantly improved a variety of image quality measures of high-resolution TSE images acquired with a 4-fold parallel-imaging acceleration compared with a conventional reconstruction algorithm.
{"title":"Accelerated High-Resolution Deep Learning Reconstruction Turbo Spin Echo MRI of the Knee at 7 T.","authors":"Adrian Alexander Marth, Constantin von Deuster, Stefan Sommer, Georg Constantin Feuerriegel, Sophia Samira Goller, Reto Sutter, Daniel Nanz","doi":"10.1097/RLI.0000000000001095","DOIUrl":"10.1097/RLI.0000000000001095","url":null,"abstract":"<p><strong>Objectives: </strong>The aim of this study was to compare the image quality of 7 T turbo spin echo (TSE) knee images acquired with varying factors of parallel-imaging acceleration reconstructed with deep learning (DL)-based and conventional algorithms.</p><p><strong>Materials and methods: </strong>This was a prospective single-center study. Twenty-three healthy volunteers underwent 7 T knee magnetic resonance imaging. Two-, 3-, and 4-fold accelerated high-resolution fat-signal-suppressing proton density (PD-fs) and T1-weighted coronal 2D TSE acquisitions with an encoded voxel volume of 0.31 × 0.31 × 1.5 mm 3 were acquired. Each set of raw data was reconstructed with a DL-based and a conventional Generalized Autocalibrating Partially Parallel Acquisition (GRAPPA) algorithm. Three readers rated image contrast, sharpness, artifacts, noise, and overall quality. Friedman analysis of variance and the Wilcoxon signed rank test were used for comparison of image quality criteria.</p><p><strong>Results: </strong>The mean age of the participants was 32.0 ± 8.1 years (15 male, 8 female). Acquisition times at 4-fold acceleration were 4 minutes 15 seconds (PD-fs, Supplemental Video is available at http://links.lww.com/RLI/A938 ) and 3 minutes 9 seconds (T1, Supplemental Video available at http://links.lww.com/RLI/A939 ). At 4-fold acceleration, image contrast, sharpness, noise, and overall quality of images reconstructed with the DL-based algorithm were significantly better rated than the corresponding GRAPPA reconstructions ( P < 0.001). Four-fold accelerated DL-reconstructed images scored significantly better than 2- to 3-fold GRAPPA-reconstructed images with regards to image contrast, sharpness, noise, and overall quality ( P ≤ 0.031). Image contrast of PD-fs images at 2-fold acceleration ( P = 0.087), image noise of T1-weighted images at 2-fold acceleration ( P = 0.180), and image artifacts for both sequences at 2- and 3-fold acceleration ( P ≥ 0.102) of GRAPPA reconstructions were not rated differently than those of 4-fold accelerated DL-reconstructed images. Furthermore, no significant difference was observed for all image quality measures among 2-fold, 3-fold, and 4-fold accelerated DL reconstructions ( P ≥ 0.082).</p><p><strong>Conclusions: </strong>This study explored the technical potential of DL-based image reconstruction in accelerated 2D TSE acquisitions of the knee at 7 T. DL reconstruction significantly improved a variety of image quality measures of high-resolution TSE images acquired with a 4-fold parallel-imaging acceleration compared with a conventional reconstruction algorithm.</p>","PeriodicalId":14486,"journal":{"name":"Investigative Radiology","volume":" ","pages":"831-837"},"PeriodicalIF":7.0,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141498073","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-12-01Epub Date: 2024-07-29DOI: 10.1097/RLI.0000000000001102
Christian Roest, Derya Yakar, Dorjan Ivan Rener Sitar, Joeran S Bosma, Dennis B Rouw, Stefan Johannes Fransen, Henkjan Huisman, Thomas C Kwee
Objectives: Deep learning (DL) studies for the detection of clinically significant prostate cancer (csPCa) on magnetic resonance imaging (MRI) often overlook potentially relevant clinical parameters such as prostate-specific antigen, prostate volume, and age. This study explored the integration of clinical parameters and MRI-based DL to enhance diagnostic accuracy for csPCa on MRI.
Materials and methods: We retrospectively analyzed 932 biparametric prostate MRI examinations performed for suspected csPCa (ISUP ≥2) at 2 institutions. Each MRI scan was automatically analyzed by a previously developed DL model to detect and segment csPCa lesions. Three sets of features were extracted: DL lesion suspicion levels, clinical parameters (prostate-specific antigen, prostate volume, age), and MRI-based lesion volumes for all DL-detected lesions. Six multimodal artificial intelligence (AI) classifiers were trained for each combination of feature sets, employing both early (feature-level) and late (decision-level) information fusion methods. The diagnostic performance of each model was tested internally on 20% of center 1 data and externally on center 2 data (n = 529). Receiver operating characteristic comparisons determined the optimal feature combination and information fusion method and assessed the benefit of multimodal versus unimodal analysis. The optimal model performance was compared with a radiologist using PI-RADS.
Results: Internally, the multimodal AI integrating DL suspicion levels with clinical features via early fusion achieved the highest performance. Externally, it surpassed baselines using clinical parameters (0.77 vs 0.67 area under the curve [AUC], P < 0.001) and DL suspicion levels alone (AUC: 0.77 vs 0.70, P = 0.006). Early fusion outperformed late fusion in external data (0.77 vs 0.73 AUC, P = 0.005). No significant performance gaps were observed between multimodal AI and radiologist assessments (internal: 0.87 vs 0.88 AUC; external: 0.77 vs 0.75 AUC, both P > 0.05).
Conclusions: Multimodal AI (combining DL suspicion levels and clinical parameters) outperforms clinical and MRI-only AI for csPCa detection. Early information fusion enhanced AI robustness in our multicenter setting. Incorporating lesion volumes did not enhance diagnostic efficacy.
{"title":"Multimodal AI Combining Clinical and Imaging Inputs Improves Prostate Cancer Detection.","authors":"Christian Roest, Derya Yakar, Dorjan Ivan Rener Sitar, Joeran S Bosma, Dennis B Rouw, Stefan Johannes Fransen, Henkjan Huisman, Thomas C Kwee","doi":"10.1097/RLI.0000000000001102","DOIUrl":"10.1097/RLI.0000000000001102","url":null,"abstract":"<p><strong>Objectives: </strong>Deep learning (DL) studies for the detection of clinically significant prostate cancer (csPCa) on magnetic resonance imaging (MRI) often overlook potentially relevant clinical parameters such as prostate-specific antigen, prostate volume, and age. This study explored the integration of clinical parameters and MRI-based DL to enhance diagnostic accuracy for csPCa on MRI.</p><p><strong>Materials and methods: </strong>We retrospectively analyzed 932 biparametric prostate MRI examinations performed for suspected csPCa (ISUP ≥2) at 2 institutions. Each MRI scan was automatically analyzed by a previously developed DL model to detect and segment csPCa lesions. Three sets of features were extracted: DL lesion suspicion levels, clinical parameters (prostate-specific antigen, prostate volume, age), and MRI-based lesion volumes for all DL-detected lesions. Six multimodal artificial intelligence (AI) classifiers were trained for each combination of feature sets, employing both early (feature-level) and late (decision-level) information fusion methods. The diagnostic performance of each model was tested internally on 20% of center 1 data and externally on center 2 data (n = 529). Receiver operating characteristic comparisons determined the optimal feature combination and information fusion method and assessed the benefit of multimodal versus unimodal analysis. The optimal model performance was compared with a radiologist using PI-RADS.</p><p><strong>Results: </strong>Internally, the multimodal AI integrating DL suspicion levels with clinical features via early fusion achieved the highest performance. Externally, it surpassed baselines using clinical parameters (0.77 vs 0.67 area under the curve [AUC], P < 0.001) and DL suspicion levels alone (AUC: 0.77 vs 0.70, P = 0.006). Early fusion outperformed late fusion in external data (0.77 vs 0.73 AUC, P = 0.005). No significant performance gaps were observed between multimodal AI and radiologist assessments (internal: 0.87 vs 0.88 AUC; external: 0.77 vs 0.75 AUC, both P > 0.05).</p><p><strong>Conclusions: </strong>Multimodal AI (combining DL suspicion levels and clinical parameters) outperforms clinical and MRI-only AI for csPCa detection. Early information fusion enhanced AI robustness in our multicenter setting. Incorporating lesion volumes did not enhance diagnostic efficacy.</p>","PeriodicalId":14486,"journal":{"name":"Investigative Radiology","volume":" ","pages":"854-860"},"PeriodicalIF":7.0,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141792489","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-12DOI: 10.1097/RLI.0000000000001125
Lennart Walger, Tobias Bauer, David Kügler, Matthias H Schmitz, Fabiane Schuch, Christophe Arendt, Tobias Baumgartner, Johannes Birkenheier, Valeri Borger, Christoph Endler, Franziska Grau, Christian Immanuel, Markus Kölle, Patrick Kupczyk, Asadeh Lakghomi, Sarah Mackert, Elisabeth Neuhaus, Julia Nordsiek, Anna-Maria Odenthal, Karmele Olaciregui Dague, Laura Ostermann, Jan Pukropski, Attila Racz, Klaus von der Ropp, Frederic Carsten Schmeel, Felix Schrader, Aileen Sitter, Alexander Unruh-Pinheiro, Marilia Voigt, Martin Vychopen, Philip von Wedel, Randi von Wrede, Ulrike Attenberger, Hartmut Vatter, Alexandra Philipsen, Albert Becker, Martin Reuter, Elke Hattingen, Josemir W Sander, Alexander Radbruch, Rainer Surges, Theodor Rüber
Objectives: Artificial intelligence (AI) is thought to improve lesion detection. However, a lack of knowledge about human performance prevents a comparative evaluation of AI and an accurate assessment of its impact on clinical decision-making. The objective of this work is to quantitatively evaluate the ability of humans to detect focal cortical dysplasia (FCD), compare it to state-of-the-art AI, and determine how it may aid diagnostics.
Materials and methods: We prospectively recorded the performance of readers in detecting FCDs using single points and 3-dimensional bounding boxes. We acquired predictions of 3 AI models for the same dataset and compared these to readers. Finally, we analyzed pairwise combinations of readers and models.
Results: Twenty-eight readers, including 20 nonexpert and 5 expert physicians, reviewed 180 cases: 146 subjects with FCD (median age: 25, interquartile range: 18) and 34 healthy control subjects (median age: 43, interquartile range: 19). Nonexpert readers detected 47% (95% confidence interval [CI]: 46, 49) of FCDs, whereas experts detected 68% (95% CI: 65, 71). The 3 AI models detected 32%, 51%, and 72% of FCDs, respectively. The latter, however, also predicted more than 13 false-positive clusters per subject on average. Human performance was improved in the presence of a transmantle sign ( P < 0.001) and cortical thickening ( P < 0.001). In contrast, AI models were sensitive to abnormal gyration ( P < 0.01) or gray-white matter blurring ( P < 0.01). Compared with single experts, expert-expert pairs detected 13% (95% CI: 9, 18) more FCDs ( P < 0.001). All AI models increased expert detection rates by up to 19% (95% CI: 15, 24) ( P < 0.001). Nonexpert+AI pairs could still outperform single experts by up to 13% (95% CI: 10, 17).
Conclusions: This study pioneers the comparative evaluation of humans and AI for FCD lesion detection. It shows that AI and human predictions differ, especially for certain MRI features of FCD, and, thus, how AI may complement the diagnostic workup.
{"title":"A Quantitative Comparison Between Human and Artificial Intelligence in the Detection of Focal Cortical Dysplasia.","authors":"Lennart Walger, Tobias Bauer, David Kügler, Matthias H Schmitz, Fabiane Schuch, Christophe Arendt, Tobias Baumgartner, Johannes Birkenheier, Valeri Borger, Christoph Endler, Franziska Grau, Christian Immanuel, Markus Kölle, Patrick Kupczyk, Asadeh Lakghomi, Sarah Mackert, Elisabeth Neuhaus, Julia Nordsiek, Anna-Maria Odenthal, Karmele Olaciregui Dague, Laura Ostermann, Jan Pukropski, Attila Racz, Klaus von der Ropp, Frederic Carsten Schmeel, Felix Schrader, Aileen Sitter, Alexander Unruh-Pinheiro, Marilia Voigt, Martin Vychopen, Philip von Wedel, Randi von Wrede, Ulrike Attenberger, Hartmut Vatter, Alexandra Philipsen, Albert Becker, Martin Reuter, Elke Hattingen, Josemir W Sander, Alexander Radbruch, Rainer Surges, Theodor Rüber","doi":"10.1097/RLI.0000000000001125","DOIUrl":"10.1097/RLI.0000000000001125","url":null,"abstract":"<p><strong>Objectives: </strong>Artificial intelligence (AI) is thought to improve lesion detection. However, a lack of knowledge about human performance prevents a comparative evaluation of AI and an accurate assessment of its impact on clinical decision-making. The objective of this work is to quantitatively evaluate the ability of humans to detect focal cortical dysplasia (FCD), compare it to state-of-the-art AI, and determine how it may aid diagnostics.</p><p><strong>Materials and methods: </strong>We prospectively recorded the performance of readers in detecting FCDs using single points and 3-dimensional bounding boxes. We acquired predictions of 3 AI models for the same dataset and compared these to readers. Finally, we analyzed pairwise combinations of readers and models.</p><p><strong>Results: </strong>Twenty-eight readers, including 20 nonexpert and 5 expert physicians, reviewed 180 cases: 146 subjects with FCD (median age: 25, interquartile range: 18) and 34 healthy control subjects (median age: 43, interquartile range: 19). Nonexpert readers detected 47% (95% confidence interval [CI]: 46, 49) of FCDs, whereas experts detected 68% (95% CI: 65, 71). The 3 AI models detected 32%, 51%, and 72% of FCDs, respectively. The latter, however, also predicted more than 13 false-positive clusters per subject on average. Human performance was improved in the presence of a transmantle sign ( P < 0.001) and cortical thickening ( P < 0.001). In contrast, AI models were sensitive to abnormal gyration ( P < 0.01) or gray-white matter blurring ( P < 0.01). Compared with single experts, expert-expert pairs detected 13% (95% CI: 9, 18) more FCDs ( P < 0.001). All AI models increased expert detection rates by up to 19% (95% CI: 15, 24) ( P < 0.001). Nonexpert+AI pairs could still outperform single experts by up to 13% (95% CI: 10, 17).</p><p><strong>Conclusions: </strong>This study pioneers the comparative evaluation of humans and AI for FCD lesion detection. It shows that AI and human predictions differ, especially for certain MRI features of FCD, and, thus, how AI may complement the diagnostic workup.</p>","PeriodicalId":14486,"journal":{"name":"Investigative Radiology","volume":" ","pages":""},"PeriodicalIF":7.0,"publicationDate":"2024-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142500556","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-08DOI: 10.1097/RLI.0000000000001134
Bjarne Kerber, Thomas Flohr, Silvia Ulrich, Mona Lichtblau, Thomas Frauenfelder, Sabine Franckenberg
Objectives: The aim of this study was to evaluate the feasibility and efficacy of chronic pulmonary thromboembolism assessment using photon-counting detector computed tomography (PCD-CT) iodine maps of the lung parenchyma.
Materials and methods: This institutional review board-approved retrospective study included 83 subjects (49.4% male, aged 62.4 ± 13.4 years; 50.6% female, aged 59.9 ± 17.1 years) who underwent clinically indicated PCD-CT scan to rule out chronic thromboembolic pulmonary hypertension (CTEPH). Two blinded readers used iodine maps and corresponding sharp-kernel CT reconstructions in the lung window to evaluate perfusion defects and identify patients with chronic pulmonary thromboembolism (CTEPH, CTEPH overlap with other causes of pulmonary hypertension [PH], chronic thromboembolic disease [CTED]). No other clinical or imaging information was given. Discordance was resolved in a subsequent consensus read. The clinical diagnosis was reviewed in an interdisciplinary clinical setting. The accuracy, sensitivity, and specificity of radiologic evaluation and clinical diagnosis were calculated.
Results: Of the 83 subjects included, 32 were diagnosed with CTEPH, CTEPH overlap, or CTED, 35 were diagnosed with PH caused by other pathologic mechanisms, 10 had no PH, and 6 had suffered previous acute pulmonary embolism, which resolved. The interreader agreement was good (Cohen κ = 0.74). The consensus reached high accuracy (0.88), sensitivity (0.94), and specificity (0.84), as well as good agreement with interdisciplinary clinical diagnosis (Cohen κ = 0.75). No cases with confirmed CTEPH as the primary cause of PH or CTED were missed. Patients with pulmonary arterial hypertension were most frequently rated false-positive. The mean effective dose (±standard deviation) was 1.3 (±0.76) mSv.
Conclusions: Accurate, sensitive, and specific diagnosis of pulmonary chronic thromboembolism at low radiation dose is possible using iodine maps reconstructed from PCD-CT scans.
{"title":"Photon-Counting CT Iodine Maps for Diagnosing Chronic Pulmonary Thromboembolism: A Pilot Study.","authors":"Bjarne Kerber, Thomas Flohr, Silvia Ulrich, Mona Lichtblau, Thomas Frauenfelder, Sabine Franckenberg","doi":"10.1097/RLI.0000000000001134","DOIUrl":"https://doi.org/10.1097/RLI.0000000000001134","url":null,"abstract":"<p><strong>Objectives: </strong>The aim of this study was to evaluate the feasibility and efficacy of chronic pulmonary thromboembolism assessment using photon-counting detector computed tomography (PCD-CT) iodine maps of the lung parenchyma.</p><p><strong>Materials and methods: </strong>This institutional review board-approved retrospective study included 83 subjects (49.4% male, aged 62.4 ± 13.4 years; 50.6% female, aged 59.9 ± 17.1 years) who underwent clinically indicated PCD-CT scan to rule out chronic thromboembolic pulmonary hypertension (CTEPH). Two blinded readers used iodine maps and corresponding sharp-kernel CT reconstructions in the lung window to evaluate perfusion defects and identify patients with chronic pulmonary thromboembolism (CTEPH, CTEPH overlap with other causes of pulmonary hypertension [PH], chronic thromboembolic disease [CTED]). No other clinical or imaging information was given. Discordance was resolved in a subsequent consensus read. The clinical diagnosis was reviewed in an interdisciplinary clinical setting. The accuracy, sensitivity, and specificity of radiologic evaluation and clinical diagnosis were calculated.</p><p><strong>Results: </strong>Of the 83 subjects included, 32 were diagnosed with CTEPH, CTEPH overlap, or CTED, 35 were diagnosed with PH caused by other pathologic mechanisms, 10 had no PH, and 6 had suffered previous acute pulmonary embolism, which resolved. The interreader agreement was good (Cohen κ = 0.74). The consensus reached high accuracy (0.88), sensitivity (0.94), and specificity (0.84), as well as good agreement with interdisciplinary clinical diagnosis (Cohen κ = 0.75). No cases with confirmed CTEPH as the primary cause of PH or CTED were missed. Patients with pulmonary arterial hypertension were most frequently rated false-positive. The mean effective dose (±standard deviation) was 1.3 (±0.76) mSv.</p><p><strong>Conclusions: </strong>Accurate, sensitive, and specific diagnosis of pulmonary chronic thromboembolism at low radiation dose is possible using iodine maps reconstructed from PCD-CT scans.</p>","PeriodicalId":14486,"journal":{"name":"Investigative Radiology","volume":" ","pages":""},"PeriodicalIF":7.0,"publicationDate":"2024-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142604398","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}