Pub Date : 2024-08-31DOI: 10.1016/j.ejro.2024.100594
Elizabeth P.V. Le , Mark Y.Z. Wong , Leonardo Rundo , Jason M. Tarkin , Nicholas R. Evans , Jonathan R. Weir-McCall , Mohammed M. Chowdhury , Patrick A. Coughlin , Holly Pavey , Fulvio Zaccagna , Chris Wall , Rouchelle Sriranjan , Andrej Corovic , Yuan Huang , Elizabeth A. Warburton , Evis Sala , Michael Roberts , Carola-Bibiane Schönlieb , James H.F. Rudd
Purpose
To assess radiomics and deep learning (DL) methods in identifying symptomatic Carotid Artery Disease (CAD) from carotid CT angiography (CTA) images. We further compare the performance of these novel methods to the conventional calcium score.
Methods
Carotid CT angiography (CTA) images from symptomatic patients (ischaemic stroke/transient ischaemic attack within the last 3 months) and asymptomatic patients were analysed. Carotid arteries were classified into culprit, non-culprit and asymptomatic. The calcium score was assessed using the Agatston method. 93 radiomic features were extracted from regions-of-interest drawn on 14 consecutive CTA slices. For DL, convolutional neural networks (CNNs) with and without transfer learning were trained directly on CTA slices. Predictive performance was assessed over 5-fold cross validated AUC scores. SHAP and GRAD-CAM algorithms were used for explainability.
Results
132 carotid arteries were analysed (41 culprit, 41 non-culprit, and 50 asymptomatic). For asymptomatic vs symptomatic arteries, radiomics attained a mean AUC of 0.96(± 0.02), followed by DL 0.86(± 0.06) and then calcium 0.79(± 0.08). For culprit vs non-culprit arteries, radiomics achieved a mean AUC of 0.75(± 0.09), followed by DL 0.67(± 0.10) and then calcium 0.60(± 0.02). For multi-class classification, the mean AUCs were 0.95(± 0.07), 0.79(± 0.05), and 0.71(± 0.07) for radiomics, DL and calcium, respectively. Explainability revealed consistent patterns in the most important radiomic features.
Conclusions
Our study highlights the potential of novel image analysis techniques in extracting quantitative information beyond calcification in the identification of CAD. Though further work is required, the transition of these novel techniques into clinical practice may eventually facilitate better stroke risk stratification.
{"title":"Using machine learning to predict carotid artery symptoms from CT angiography: A radiomics and deep learning approach","authors":"Elizabeth P.V. Le , Mark Y.Z. Wong , Leonardo Rundo , Jason M. Tarkin , Nicholas R. Evans , Jonathan R. Weir-McCall , Mohammed M. Chowdhury , Patrick A. Coughlin , Holly Pavey , Fulvio Zaccagna , Chris Wall , Rouchelle Sriranjan , Andrej Corovic , Yuan Huang , Elizabeth A. Warburton , Evis Sala , Michael Roberts , Carola-Bibiane Schönlieb , James H.F. Rudd","doi":"10.1016/j.ejro.2024.100594","DOIUrl":"10.1016/j.ejro.2024.100594","url":null,"abstract":"<div><h3>Purpose</h3><p>To assess radiomics and deep learning (DL) methods in identifying symptomatic Carotid Artery Disease (CAD) from carotid CT angiography (CTA) images. We further compare the performance of these novel methods to the conventional calcium score.</p></div><div><h3>Methods</h3><p>Carotid CT angiography (CTA) images from symptomatic patients (ischaemic stroke/transient ischaemic attack within the last 3 months) and asymptomatic patients were analysed. Carotid arteries were classified into culprit, non-culprit and asymptomatic. The calcium score was assessed using the Agatston method. 93 radiomic features were extracted from regions-of-interest drawn on 14 consecutive CTA slices. For DL, convolutional neural networks (CNNs) with and without transfer learning were trained directly on CTA slices. Predictive performance was assessed over 5-fold cross validated AUC scores. SHAP and GRAD-CAM algorithms were used for explainability.</p></div><div><h3>Results</h3><p>132 carotid arteries were analysed (41 culprit, 41 non-culprit, and 50 asymptomatic). For asymptomatic vs symptomatic arteries, radiomics attained a mean AUC of 0.96(± 0.02), followed by DL 0.86(± 0.06) and then calcium 0.79(± 0.08). For culprit vs non-culprit arteries, radiomics achieved a mean AUC of 0.75(± 0.09), followed by DL 0.67(± 0.10) and then calcium 0.60(± 0.02). For multi-class classification, the mean AUCs were 0.95(± 0.07), 0.79(± 0.05), and 0.71(± 0.07) for radiomics, DL and calcium, respectively. Explainability revealed consistent patterns in the most important radiomic features.</p></div><div><h3>Conclusions</h3><p>Our study highlights the potential of novel image analysis techniques in extracting quantitative information beyond calcification in the identification of CAD. Though further work is required, the transition of these novel techniques into clinical practice may eventually facilitate better stroke risk stratification.</p></div>","PeriodicalId":38076,"journal":{"name":"European Journal of Radiology Open","volume":"13 ","pages":"Article 100594"},"PeriodicalIF":1.8,"publicationDate":"2024-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2352047724000492/pdfft?md5=bb89145f8b5e821a8f445782d782898c&pid=1-s2.0-S2352047724000492-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142096934","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-31DOI: 10.1016/j.ejro.2024.100597
Liang Hu , Jiang-Feng Pan , Zheng Han, Xiu-Mei Xia
Purpose
Sex-based differences in lumbar spine's fat content in adults are minimal, but significant variations exist in diffusion-weighted imaging (DWI) signal characteristics. This study aimed to investigate fat content’s impact on DWI performance in lumbar spine and potential sex differences.
Methods
A retrospective analysis was conducted on upper abdominal MRI examinations in asymptomatic adult. The lumbar 1 vertebral apparent diffusion coefficient (ADC) values and fat fraction were measured. Using DWI images (b = 800 s/mm2), the lumbar 1 vertebral signal was categorized into high and iso-low signal groups. A univariate and multivariate analysis was conducted to investigate the influence of fat fraction on DWI performance. Finally, the participants were divided into three groups to analyze sex differences in the effect of fat content on DWI performance.
Results
202 subjects, 99 men were included. Fat content significantly influenced lumbar spine DWI signal in both sexes (p < 0.05). The effect on ADC values was significant only in women (p < 0.001). Women demonstrated a significantly higher proportion of high DWI signal than men in the low (p = 0.002) and middle (p = 0.012) fat content groups. Additionally, women had higher ADC values in the low fat group (p = 0.004) but lower values in the high fat group (p = 0.004).
Conclusion
Fat content significantly impacts the DWI signal of lumbar spine, with a slight sex difference observed. These sex differences suggest that DWI signals may provide valuable information about the bone marrow beyond fat content.
{"title":"Impact of fat content on lumbar spine DWI performance: A sex-based comparative study","authors":"Liang Hu , Jiang-Feng Pan , Zheng Han, Xiu-Mei Xia","doi":"10.1016/j.ejro.2024.100597","DOIUrl":"10.1016/j.ejro.2024.100597","url":null,"abstract":"<div><h3>Purpose</h3><p>Sex-based differences in lumbar spine's fat content in adults are minimal, but significant variations exist in diffusion-weighted imaging (DWI) signal characteristics. This study aimed to investigate fat content’s impact on DWI performance in lumbar spine and potential sex differences.</p></div><div><h3>Methods</h3><p>A retrospective analysis was conducted on upper abdominal MRI examinations in asymptomatic adult. The lumbar 1 vertebral apparent diffusion coefficient (ADC) values and fat fraction were measured. Using DWI images (b = 800 s/mm<sup>2</sup>), the lumbar 1 vertebral signal was categorized into high and iso-low signal groups. A univariate and multivariate analysis was conducted to investigate the influence of fat fraction on DWI performance. Finally, the participants were divided into three groups to analyze sex differences in the effect of fat content on DWI performance.</p></div><div><h3>Results</h3><p>202 subjects, 99 men were included. Fat content significantly influenced lumbar spine DWI signal in both sexes (<em>p</em> < 0.05). The effect on ADC values was significant only in women (<em>p</em> < 0.001). Women demonstrated a significantly higher proportion of high DWI signal than men in the low (<em>p</em> = 0.002) and middle (<em>p</em> = 0.012) fat content groups. Additionally, women had higher ADC values in the low fat group (<em>p</em> = 0.004) but lower values in the high fat group (<em>p</em> = 0.004).</p></div><div><h3>Conclusion</h3><p>Fat content significantly impacts the DWI signal of lumbar spine, with a slight sex difference observed. These sex differences suggest that DWI signals may provide valuable information about the bone marrow beyond fat content.</p></div>","PeriodicalId":38076,"journal":{"name":"European Journal of Radiology Open","volume":"13 ","pages":"Article 100597"},"PeriodicalIF":1.8,"publicationDate":"2024-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2352047724000522/pdfft?md5=85475f718a6604cd257db158d1c5e727&pid=1-s2.0-S2352047724000522-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142096933","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-31DOI: 10.1016/j.ejro.2024.100599
Hong Zhu , Deyan Kong , Jiale Qian , Xiaomeng Shi , Jing Fan
Purpose
To compare image quality and detection accuracy of renal stones between deep learning image reconstruction (DLIR) and Adaptive Statistical Iterative Reconstruction-Veo (ASIR-V) reconstructed virtual non-contrast (VNC) images and true non-contrast (TNC) images in spectral CT Urography (CTU).
Methods
A retrospective analysis was conducted on images of 70 patients who underwent abdominal-pelvic CTU in TNC phase using non-contrast scan and contrast-enhanced corticomedullary phase (CP) and excretory phase (EP) using spectral scan. The TNC scan was reconstructed using ASIR-V70 % (TNC-AR70), contrast-enhanced scans were reconstructed using AR70, DLIR medium-level (DM), and high-level (DH) to obtain CP-VNC-AR70/DM/DH and EP-VNC-AR70/DM/DH image groups, respectively. CT value, image quality and kidney stones quantification accuracy were measured and compared among groups. The subjective evaluation was independently assessed by two senior radiologists using the 5-point Likert scale for image quality and lesion visibility.
Results
DH images were superior to AR70 and DM images in objective image quality evaluation. There was no statistical difference in the liver and spleen (both P > 0.05), or within 6HU in renal and fat in CT value between VNC and TNC images. EP-VNC-DH had the lowest image noise, highest SNR, and CNR, and VNC-AR70 images had better noise and SNR performance than TNC-AR70 images (all p < 0.05). EP-VNC-DH had the highest subjective image quality, and CP-VNC-DH performed the best in lesion visibility. In stone CT value and volume measurements, there was no statistical difference between VNC and TNC (P > 0.05).
Conclusion
The DLIR-reconstructed VNC images in CTU provide better image quality than the ASIR-V reconstructed TNC images and similar quantification accuracy for kidney stones for potential dose savings.
The study highlights that deep learning image reconstruction (DLIR)-reconstructed virtual non-contrast (VNC) images in spectral CT Urography (CTU) offer improved image quality compared to traditional true non-contrast (TNC) images, while maintaining similar accuracy in kidney stone detection, suggesting potential dose savings in clinical practice.
{"title":"The impact of deep learning image reconstruction of spectral CTU virtual non contrast images for patients with renal stones","authors":"Hong Zhu , Deyan Kong , Jiale Qian , Xiaomeng Shi , Jing Fan","doi":"10.1016/j.ejro.2024.100599","DOIUrl":"10.1016/j.ejro.2024.100599","url":null,"abstract":"<div><h3>Purpose</h3><p>To compare image quality and detection accuracy of renal stones between deep learning image reconstruction (DLIR) and Adaptive Statistical Iterative Reconstruction-Veo (ASIR-V) reconstructed virtual non-contrast (VNC) images and true non-contrast (TNC) images in spectral CT Urography (CTU).</p></div><div><h3>Methods</h3><p>A retrospective analysis was conducted on images of 70 patients who underwent abdominal-pelvic CTU in TNC phase using non-contrast scan and contrast-enhanced corticomedullary phase (CP) and excretory phase (EP) using spectral scan. The TNC scan was reconstructed using ASIR-V70 % (TNC-AR70), contrast-enhanced scans were reconstructed using AR70, DLIR medium-level (DM), and high-level (DH) to obtain CP-VNC-AR70/DM/DH and EP-VNC-AR70/DM/DH image groups, respectively. CT value, image quality and kidney stones quantification accuracy were measured and compared among groups. The subjective evaluation was independently assessed by two senior radiologists using the 5-point Likert scale for image quality and lesion visibility.</p></div><div><h3>Results</h3><p>DH images were superior to AR70 and DM images in objective image quality evaluation. There was no statistical difference in the liver and spleen (both P > 0.05), or within 6HU in renal and fat in CT value between VNC and TNC images. EP-VNC-DH had the lowest image noise, highest SNR, and CNR, and VNC-AR70 images had better noise and SNR performance than TNC-AR70 images (all p < 0.05). EP-VNC-DH had the highest subjective image quality, and CP-VNC-DH performed the best in lesion visibility. In stone CT value and volume measurements, there was no statistical difference between VNC and TNC (P > 0.05).</p></div><div><h3>Conclusion</h3><p>The DLIR-reconstructed VNC images in CTU provide better image quality than the ASIR-V reconstructed TNC images and similar quantification accuracy for kidney stones for potential dose savings.</p><p>The study highlights that deep learning image reconstruction (DLIR)-reconstructed virtual non-contrast (VNC) images in spectral CT Urography (CTU) offer improved image quality compared to traditional true non-contrast (TNC) images, while maintaining similar accuracy in kidney stone detection, suggesting potential dose savings in clinical practice.</p></div>","PeriodicalId":38076,"journal":{"name":"European Journal of Radiology Open","volume":"13 ","pages":"Article 100599"},"PeriodicalIF":1.8,"publicationDate":"2024-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2352047724000546/pdfft?md5=26733059cc2a262840a0fc61adcdcfbb&pid=1-s2.0-S2352047724000546-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142096932","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-30DOI: 10.1016/j.ejro.2024.100596
Anja A. Joye , Marta Bogowicz , Janine Gote-Schniering , Thomas Frauenfelder , Matthias Guckenberger , Britta Maurer , Stephanie Tanadini-Lang , Hubert S. Gabryś
Purpose
The purpose of this study was to evaluate the efficacy of radiomics derived from slice-reduced CT (srCT) scans versus full-chest CT (fcCT) for diagnosing and staging of interstitial lung disease (ILD) in systemic sclerosis (SSc), considering the potential to reduce radiation exposure.
Material and methods
The fcCT corresponded to a standard high-resolution full-chest CT whereas the srCT consisted of nine axial slices. 1451 radiomic features in two dimensions from srCT and 1375 features in three dimensions from fcCT scans were extracted from 166 SSc patients. The study included first- and second-order features from original and wavelet-transformed images. We assessed the predictive performance of quantitative CT (qCT)-based logistic regression (LR) models relying on preselected features and machine learning workflows involving LR and extra-trees classifiers with data-driven feature selection. The area under the receiver operating characteristic curve (AUC) was used to estimate model performance.
Results
The best models for diagnosis and staging ILD achieved AUC=0.85±0.08 and AUC=0.82±0.08 with srCT, and AUC=0.83±0.06 and AUC=0.76±0.08 with fcCT, respectively. srCT-based models showed slightly superior performance over fcCT-based models, particularly in 2D-radiomic analyses when interpolation resolution closely matched the original in-plane resolution. For diagnosis, the LR outperformed qCT-models, whereas for staging, the best results were obtained with a qCT-based model.
Conclusions
Radiomics from srCT is an effective and preferable alternative to fcCT for diagnosing and staging SSc-ILD. This approach not only enhances predictive accuracy but also minimizes radiation exposure risks, offering a promising avenue for improved treatment decision support in SSc-ILD management.
{"title":"Radiomics on slice-reduced versus full-chest computed tomography for diagnosis and staging of interstitial lung disease in systemic sclerosis: A comparative analysis","authors":"Anja A. Joye , Marta Bogowicz , Janine Gote-Schniering , Thomas Frauenfelder , Matthias Guckenberger , Britta Maurer , Stephanie Tanadini-Lang , Hubert S. Gabryś","doi":"10.1016/j.ejro.2024.100596","DOIUrl":"10.1016/j.ejro.2024.100596","url":null,"abstract":"<div><h3>Purpose</h3><p>The purpose of this study was to evaluate the efficacy of radiomics derived from slice-reduced CT (srCT) scans versus full-chest CT (fcCT) for diagnosing and staging of interstitial lung disease (ILD) in systemic sclerosis (SSc), considering the potential to reduce radiation exposure.</p></div><div><h3>Material and methods</h3><p>The fcCT corresponded to a standard high-resolution full-chest CT whereas the srCT consisted of nine axial slices. 1451 radiomic features in two dimensions from srCT and 1375 features in three dimensions from fcCT scans were extracted from 166 SSc patients. The study included first- and second-order features from original and wavelet-transformed images. We assessed the predictive performance of quantitative CT (qCT)-based logistic regression (LR) models relying on preselected features and machine learning workflows involving LR and extra-trees classifiers with data-driven feature selection. The area under the receiver operating characteristic curve (AUC) was used to estimate model performance.</p></div><div><h3>Results</h3><p>The best models for diagnosis and staging ILD achieved AUC=0.85±0.08 and AUC=0.82±0.08 with srCT, and AUC=0.83±0.06 and AUC=0.76±0.08 with fcCT, respectively. srCT-based models showed slightly superior performance over fcCT-based models, particularly in 2D-radiomic analyses when interpolation resolution closely matched the original in-plane resolution. For diagnosis, the LR outperformed qCT-models, whereas for staging, the best results were obtained with a qCT-based model.</p></div><div><h3>Conclusions</h3><p>Radiomics from srCT is an effective and preferable alternative to fcCT for diagnosing and staging SSc-ILD. This approach not only enhances predictive accuracy but also minimizes radiation exposure risks, offering a promising avenue for improved treatment decision support in SSc-ILD management.</p></div>","PeriodicalId":38076,"journal":{"name":"European Journal of Radiology Open","volume":"13 ","pages":"Article 100596"},"PeriodicalIF":1.8,"publicationDate":"2024-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2352047724000510/pdfft?md5=d4f83a55c0d66a111429abfa78cf9995&pid=1-s2.0-S2352047724000510-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142096931","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-12DOI: 10.1016/j.ejro.2024.100595
Piero Boraschi , Valentina Mazzantini , Francescamaria Donati , Barbara Coco , Barbara Vianello , Andrea Pinna , Riccardo Morganti , Piero Colombatto , Maurizia Rossana Brunetto , Emanuele Neri
Purpose
To analyze the role of qualitative and quantitative 3 T MR imaging assessment as a non-invasive method for the evaluation of disease severity in patients with primary sclerosing cholangitis (PSC).
Methods
A series of 26 patients, with histological diagnosis of PSC undergoing 3 T MRI and hepatological evaluation, was retrospectively enrolled. All MR examinations included diffusion-weighted imaging (DWI), T2-weighted (T2w) and T1-weighted (T1w) sequences, before and after administration of Gd-EOB-DTPA with the acquisition of both dynamic and hepato-biliary phase (HBP). Qualitative analysis was performed by assessment of liver parenchyma and biliary tract changes, also including biliary excretion of gadoxetic acid on HBP. Quantitative evaluation was conducted on liver parenchyma by measurement of apparent diffusion coefficient (ADC) and relative enhancement (RE) on 3-minute delayed phase and on HBP. Results of blood tests (ALT, ALP, GGT, total and direct bilirubin, albumin, and platelets) and transient elastography-derived liver stiffness measurements (TE-LSM) were collected and correlated with qualitative and quantitative MRI findings.
Results
Among qualitative and quantitative findings, fibrosis visual assessment and RE had the best performance in estimating disease severity, showing a statistically significant correlation with both biomarkers of cholestasis and TE-LSM. Statistical analysis also revealed a significant correlation of gadoxetic acid biliary excretion with ALT and direct bilirubin, as well as of ADC with total bilirubin.
Conclusion
Qualitative and quantitative 3 T MR evaluation is a promising non-invasive method for the assessment of disease severity in patients with PSC.
目的分析定性和定量 3 T MR 成像评估作为一种非侵入性方法在原发性硬化性胆管炎(PSC)患者疾病严重程度评估中的作用。所有磁共振检查包括弥散加权成像(DWI)、T2加权(T2w)和T1加权(T1w)序列,在使用Gd-EOB-DTPA前后均采集动态和肝胆相(HBP)。定性分析是通过评估肝脏实质和胆道的变化来进行的,还包括钆醋酸在 HBP 上的胆汁排泄。通过测量 3 分钟延迟期和 HBP 的表观弥散系数(ADC)和相对增强(RE),对肝实质进行定量评估。结果在定性和定量结果中,纤维化目测评估和 RE 在估计疾病严重程度方面表现最佳,与胆汁淤积的生物标记物和 TE-LSM 均有统计学意义的相关性。统计分析还显示,钆喷酸胆汁排泄量与谷丙转氨酶和直接胆红素以及ADC与总胆红素存在显著相关性。
{"title":"Primary sclerosing cholangitis: Is qualitative and quantitative 3 T MR imaging useful for the evaluation of disease severity?","authors":"Piero Boraschi , Valentina Mazzantini , Francescamaria Donati , Barbara Coco , Barbara Vianello , Andrea Pinna , Riccardo Morganti , Piero Colombatto , Maurizia Rossana Brunetto , Emanuele Neri","doi":"10.1016/j.ejro.2024.100595","DOIUrl":"10.1016/j.ejro.2024.100595","url":null,"abstract":"<div><h3>Purpose</h3><p>To analyze the role of qualitative and quantitative 3 T MR imaging assessment as a non-invasive method for the evaluation of disease severity in patients with primary sclerosing cholangitis (PSC).</p></div><div><h3>Methods</h3><p>A series of 26 patients, with histological diagnosis of PSC undergoing 3 T MRI and hepatological evaluation, was retrospectively enrolled. All MR examinations included diffusion-weighted imaging (DWI), T2-weighted (T2w) and T1-weighted (T1w) sequences, before and after administration of Gd-EOB-DTPA with the acquisition of both dynamic and hepato-biliary phase (HBP). Qualitative analysis was performed by assessment of liver parenchyma and biliary tract changes, also including biliary excretion of gadoxetic acid on HBP. Quantitative evaluation was conducted on liver parenchyma by measurement of apparent diffusion coefficient (ADC) and relative enhancement (RE) on 3-minute delayed phase and on HBP. Results of blood tests (ALT, ALP, GGT, total and direct bilirubin, albumin, and platelets) and transient elastography-derived liver stiffness measurements (TE-LSM) were collected and correlated with qualitative and quantitative MRI findings.</p></div><div><h3>Results</h3><p>Among qualitative and quantitative findings, fibrosis visual assessment and RE had the best performance in estimating disease severity, showing a statistically significant correlation with both biomarkers of cholestasis and TE-LSM. Statistical analysis also revealed a significant correlation of gadoxetic acid biliary excretion with ALT and direct bilirubin, as well as of ADC with total bilirubin.</p></div><div><h3>Conclusion</h3><p>Qualitative and quantitative 3 T MR evaluation is a promising non-invasive method for the assessment of disease severity in patients with PSC.</p></div>","PeriodicalId":38076,"journal":{"name":"European Journal of Radiology Open","volume":"13 ","pages":"Article 100595"},"PeriodicalIF":1.8,"publicationDate":"2024-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2352047724000509/pdfft?md5=0e01005ca24f98d2928c6cb7a9ad3e17&pid=1-s2.0-S2352047724000509-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141954097","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Artificial intelligence (AI) has been proven useful for the assessment of tubes and lines on chest radiographs of general patients. However, validation on intensive care unit (ICU) patients remains imperative.
Methods
This retrospective case-control study evaluated the performance of deep learning (DL) models for tubes and lines classification on both an external public dataset and a local dataset comprising 303 films randomly sampled from the ICU database. The endotracheal tubes (ETTs), central venous catheters (CVCs), and nasogastric tubes (NGTs) were classified into “Normal,” “Abnormal,” or “Borderline” positions by DL models with and without rule-based modification. Their performance was evaluated using an experienced radiologist as the standard reference.
Results
The algorithm showed decreased performance on the local ICU dataset, compared to that of the external dataset, decreasing from the Area Under the Curve of Receiver (AUC) of 0.967 (95 % CI 0.965–0.973) to the AUC of 0.70 (95 % CI 0.68–0.77). Significant improvement in the ETT classification task was observed after modifications were made to the model to allow the use of the spatial relationship between line tips and reference anatomy with the improvement of the AUC, increasing from 0.71 (95 % CI 0.70 – 0.75) to 0.86 (95 % CI 0.83 – 0.94)
Conclusions
The externally trained model exhibited limited generalizability on the local ICU dataset. Therefore, evaluating the performance of externally trained AI before integrating it into critical care routine is crucial. Rule-based algorithm may be used in combination with DL to improve results.
{"title":"Deep learning for tubes and lines detection in critical illness: Generalizability and comparison with residents","authors":"Pootipong Wongveerasin, Trongtum Tongdee, Pairash Saiviroonporn","doi":"10.1016/j.ejro.2024.100593","DOIUrl":"10.1016/j.ejro.2024.100593","url":null,"abstract":"<div><h3>Background</h3><p>Artificial intelligence (AI) has been proven useful for the assessment of tubes and lines on chest radiographs of general patients. However, validation on intensive care unit (ICU) patients remains imperative.</p></div><div><h3>Methods</h3><p>This retrospective case-control study evaluated the performance of deep learning (DL) models for tubes and lines classification on both an external public dataset and a local dataset comprising 303 films randomly sampled from the ICU database. The endotracheal tubes (ETTs), central venous catheters (CVCs), and nasogastric tubes (NGTs) were classified into “Normal,” “Abnormal,” or “Borderline” positions by DL models with and without rule-based modification. Their performance was evaluated using an experienced radiologist as the standard reference.</p></div><div><h3>Results</h3><p>The algorithm showed decreased performance on the local ICU dataset, compared to that of the external dataset, decreasing from the Area Under the Curve of Receiver (AUC) of 0.967 (95 % CI 0.965–0.973) to the AUC of 0.70 (95 % CI 0.68–0.77). Significant improvement in the ETT classification task was observed after modifications were made to the model to allow the use of the spatial relationship between line tips and reference anatomy with the improvement of the AUC, increasing from 0.71 (95 % CI 0.70 – 0.75) to 0.86 (95 % CI 0.83 – 0.94)</p></div><div><h3>Conclusions</h3><p>The externally trained model exhibited limited generalizability on the local ICU dataset. Therefore, evaluating the performance of externally trained AI before integrating it into critical care routine is crucial. Rule-based algorithm may be used in combination with DL to improve results.</p></div>","PeriodicalId":38076,"journal":{"name":"European Journal of Radiology Open","volume":"13 ","pages":"Article 100593"},"PeriodicalIF":1.8,"publicationDate":"2024-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2352047724000480/pdfft?md5=e3984dd26f8e8aa3a7cf367184907496&pid=1-s2.0-S2352047724000480-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141962138","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-26DOI: 10.1016/j.ejro.2024.100589
Sadhana Kalidindi , Janani Baradwaj
The rapid evolution of artificial intelligence (AI) in healthcare, particularly in radiology, underscores a transformative era marked by a potential for enhanced diagnostic precision, increased patient engagement, and streamlined clinical workflows. Amongst the key developments at the heart of this transformation are Large Language Models like the Generative Pre-trained Transformer 4 (GPT-4), whose integration into radiological practices could potentially herald a significant leap by assisting in the generation and summarization of radiology reports, aiding in differential diagnoses, and recommending evidence-based treatments. This review delves into the multifaceted potential applications of Large Language Models within radiology, using GPT-4 as an example, from improving diagnostic accuracy and reporting efficiency to translating complex medical findings into patient-friendly summaries. The review acknowledges the ethical, privacy, and technical challenges inherent in deploying AI technologies, emphasizing the importance of careful oversight, validation, and adherence to regulatory standards. Through a balanced discourse on the potential and pitfalls of GPT-4 in radiology, the article aims to provide a comprehensive overview of how these models have the potential to reshape the future of radiological services, fostering improvements in patient care, educational methodologies, and clinical research.
{"title":"Advancing radiology with GPT-4: Innovations in clinical applications, patient engagement, research, and learning","authors":"Sadhana Kalidindi , Janani Baradwaj","doi":"10.1016/j.ejro.2024.100589","DOIUrl":"10.1016/j.ejro.2024.100589","url":null,"abstract":"<div><p>The rapid evolution of artificial intelligence (AI) in healthcare, particularly in radiology, underscores a transformative era marked by a potential for enhanced diagnostic precision, increased patient engagement, and streamlined clinical workflows. Amongst the key developments at the heart of this transformation are Large Language Models like the Generative Pre-trained Transformer 4 (GPT-4), whose integration into radiological practices could potentially herald a significant leap by assisting in the generation and summarization of radiology reports, aiding in differential diagnoses, and recommending evidence-based treatments. This review delves into the multifaceted potential applications of Large Language Models within radiology, using GPT-4 as an example, from improving diagnostic accuracy and reporting efficiency to translating complex medical findings into patient-friendly summaries. The review acknowledges the ethical, privacy, and technical challenges inherent in deploying AI technologies, emphasizing the importance of careful oversight, validation, and adherence to regulatory standards. Through a balanced discourse on the potential and pitfalls of GPT-4 in radiology, the article aims to provide a comprehensive overview of how these models have the potential to reshape the future of radiological services, fostering improvements in patient care, educational methodologies, and clinical research.</p></div>","PeriodicalId":38076,"journal":{"name":"European Journal of Radiology Open","volume":"13 ","pages":"Article 100589"},"PeriodicalIF":1.8,"publicationDate":"2024-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2352047724000443/pdfft?md5=c7fde8cd6249665c5a25cde285154c5f&pid=1-s2.0-S2352047724000443-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141949951","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-19DOI: 10.1016/j.ejro.2024.100592
Hong-Jian Luo , Jia-Liang Ren , Li mei Guo , Jin liang Niu , Xiao-Li Song
Background
Human epidermal growth factor receptor 2 (HER2) is a tumor biomarker with significant prognostic and therapeutic implications for invasive ductal breast carcinoma (IDC).
Objective
This study aimed to explore the effectiveness of a multisequence magnetic resonance imaging (MRI)-based machine learning radiomics model in classifying the expression status of HER2, including HER2-positive, HER2-low, and HER2 completely negative (HER2-zero), among patients with IDC.
Methods
A total of 402 female patients with IDC confirmed through surgical pathology were enrolled and subsequently divided into a training group (n = 250, center I) and a validation group (n = 152, center II). Radiomics features were extracted from the preoperative MRI. A simulated annealing algorithm was used for key feature selection. Two classification tasks were performed: task 1, the classification of HER2-positive vs. HER2-negative (HER2-low and HER2-zero), and task 2, the classification of HER2-low vs. HER2-zero. Logistic regression, random forest (RF), and support vector machine were conducted to establish radiomics models. The performance of the models was evaluated using the area under the curve (AUC) of the operating characteristics (ROC).
Results
In total, 4506 radiomics features were extracted from multisequence MRI. A radiomics model for prediction of expression state of HER2 was successfully developed. Among the three classification algorithms, RF achieved the highest performance in classifying HER2-positive from HER2-negative and HER2-low from HER2-zero, with AUC values of 0.777 and 0.731, respectively.
Conclusions
Machine learning-based MRI radiomics may aid in the non-invasive prediction of the different expression status of HER2 in IDC.
{"title":"MRI-based machine learning radiomics for prediction of HER2 expression status in breast invasive ductal carcinoma","authors":"Hong-Jian Luo , Jia-Liang Ren , Li mei Guo , Jin liang Niu , Xiao-Li Song","doi":"10.1016/j.ejro.2024.100592","DOIUrl":"10.1016/j.ejro.2024.100592","url":null,"abstract":"<div><h3>Background</h3><p>Human epidermal growth factor receptor 2 (HER2) is a tumor biomarker with significant prognostic and therapeutic implications for invasive ductal breast carcinoma (IDC).</p></div><div><h3>Objective</h3><p>This study aimed to explore the effectiveness of a multisequence magnetic resonance imaging (MRI)-based machine learning radiomics model in classifying the expression status of HER2, including HER2-positive, HER2-low, and HER2 completely negative (HER2-zero), among patients with IDC.</p></div><div><h3>Methods</h3><p>A total of 402 female patients with IDC confirmed through surgical pathology were enrolled and subsequently divided into a training group (n = 250, center I) and a validation group (n = 152, center II). Radiomics features were extracted from the preoperative MRI. A simulated annealing algorithm was used for key feature selection. Two classification tasks were performed: task 1, the classification of HER2-positive vs. HER2-negative (HER2-low and HER2-zero), and task 2, the classification of HER2-low vs. HER2-zero. Logistic regression, random forest (RF), and support vector machine were conducted to establish radiomics models. The performance of the models was evaluated using the area under the curve (AUC) of the operating characteristics (ROC).</p></div><div><h3>Results</h3><p>In total, 4506 radiomics features were extracted from multisequence MRI. A radiomics model for prediction of expression state of HER2 was successfully developed. Among the three classification algorithms, RF achieved the highest performance in classifying HER2-positive from HER2-negative and HER2-low from HER2-zero, with AUC values of 0.777 and 0.731, respectively.</p></div><div><h3>Conclusions</h3><p>Machine learning-based MRI radiomics may aid in the non-invasive prediction of the different expression status of HER2 in IDC.</p></div>","PeriodicalId":38076,"journal":{"name":"European Journal of Radiology Open","volume":"13 ","pages":"Article 100592"},"PeriodicalIF":1.8,"publicationDate":"2024-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2352047724000479/pdfft?md5=6c50536b046e7c7b8b20145494d78138&pid=1-s2.0-S2352047724000479-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141728658","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-18DOI: 10.1016/j.ejro.2024.100591
Katarzyna Bokwa-Dąbrowska , Dan Mocanu , Alex Alexiev , Katarina Nilsson Helander , Pawel Szaro
Introduction
Peroneus brevis split rupture poses a diagnostic challenge, often requiring magnetic resonance imaging (MRI), yet splits are missed in initial radiological reports. However, the frequency of reported peroneus brevis split rupture in clinical MRI examinations is unknown.
Aim
This study aimed to investigate underreporting frequency of peroneus brevis split rupture in patients with lateral ankle pain.
Methods
We re-evaluated 143 consecutive MRI examinations of the ankle joint, conducted in 2021 in our region, for patients experiencing ankle pain persisting for more than 8 months. Two musculoskeletal radiologists, with 12 and 8 years of experience respectively, assessed the presence of peroneus brevis split rupture. Patients with recent ankle trauma, fractures, postoperative changes, or MRI artifacts were excluded. The radiologists evaluated each MRI for incomplete or complete peroneus brevis split rupture. The consensus between the raters was used as the reference standard. Additionally, raters reviewed the original clinical radiological reports to determine if the presence of peroneus brevis split rupture was noted. Agreement between raters' assessments, consensus, and initial reports was evaluated using Gwet’s AC1 coefficients.
Results
Initial radiological reports indicated 23 cases (52.3 %) of peroneus brevis split rupture, meaning 21 cases (47.7 %) were underreported. The Gwet’s AC1 coefficients showed that the agreement between raters and initial reports was 0.401 (standard error 0.070), 95 % CI (0.261, 0.541), p<.001, while the agreement between raters in the study was 0.716 (standard error 0.082), 95 % CI (0.551, 0.881), p<.001.
Conclusion
Peroneus brevis split rupture is underreported on MRI scans of patients with lateral ankle pain.
{"title":"Peroneus brevis split rupture is underreported on magnetic resonance imaging of the ankle in patients with chronic lateral ankle pain","authors":"Katarzyna Bokwa-Dąbrowska , Dan Mocanu , Alex Alexiev , Katarina Nilsson Helander , Pawel Szaro","doi":"10.1016/j.ejro.2024.100591","DOIUrl":"10.1016/j.ejro.2024.100591","url":null,"abstract":"<div><h3>Introduction</h3><p>Peroneus brevis split rupture poses a diagnostic challenge, often requiring magnetic resonance imaging (MRI), yet splits are missed in initial radiological reports. However, the frequency of reported peroneus brevis split rupture in clinical MRI examinations is unknown.</p></div><div><h3>Aim</h3><p>This study aimed to investigate underreporting frequency of peroneus brevis split rupture in patients with lateral ankle pain.</p></div><div><h3>Methods</h3><p>We re-evaluated 143 consecutive MRI examinations of the ankle joint, conducted in 2021 in our region, for patients experiencing ankle pain persisting for more than 8 months. Two musculoskeletal radiologists, with 12 and 8 years of experience respectively, assessed the presence of peroneus brevis split rupture. Patients with recent ankle trauma, fractures, postoperative changes, or MRI artifacts were excluded. The radiologists evaluated each MRI for incomplete or complete peroneus brevis split rupture. The consensus between the raters was used as the reference standard. Additionally, raters reviewed the original clinical radiological reports to determine if the presence of peroneus brevis split rupture was noted. Agreement between raters' assessments, consensus, and initial reports was evaluated using Gwet’s AC1 coefficients.</p></div><div><h3>Results</h3><p>Initial radiological reports indicated 23 cases (52.3 %) of peroneus brevis split rupture, meaning 21 cases (47.7 %) were underreported. The Gwet’s AC1 coefficients showed that the agreement between raters and initial reports was 0.401 (standard error 0.070), 95 % CI (0.261, 0.541), p<.001, while the agreement between raters in the study was 0.716 (standard error 0.082), 95 % CI (0.551, 0.881), p<.001.</p></div><div><h3>Conclusion</h3><p>Peroneus brevis split rupture is underreported on MRI scans of patients with lateral ankle pain.</p></div>","PeriodicalId":38076,"journal":{"name":"European Journal of Radiology Open","volume":"13 ","pages":"Article 100591"},"PeriodicalIF":1.8,"publicationDate":"2024-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2352047724000467/pdfft?md5=81c4cc1b6e990ec15e6ae90efef8ecf3&pid=1-s2.0-S2352047724000467-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141728657","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-07-13DOI: 10.1016/j.ejro.2024.100590
Gustav Lodeiro , Katarzyna Bokwa-Dąbrowska , Andreia Miron , Pawel Szaro
Background
Diffusion-weighted imaging (DWI) is widely used in neuroradiology or abdominal imaging but not yet implemented in the diagnosis of musculoskeletal tumors.
Aim
This study aimed to evaluate how including diffusion imaging in the MRI protocol for patients with musculoskeletal tumors affects the agreement between radiologists and non-radiologist.
Methods
Thirty-nine patients with musculoskeletal tumors (Ewing sarcoma, osteosarcoma, and benign tumors) consulted at our institution were included. Three raters with different experience levels evaluated examinations blinded to all clinical data. The final diagnosis was determined by consensus. MRI examinations were split into 1) conventional sequences and 2) conventional sequences combined with DWI. We evaluated the presence or absence of diffusion restriction, solid nature, necrosis, deep localization, and diameter >4 cm as known radiological markers of malignancy. Agreement between raters was evaluated using Gwet’s AC1 coefficients and interpreted according to Landis and Koch.
Results
The lowest agreement was for diffusion restriction in both groups of raters. Agreement among all raters ranged from 0.51 to 0.945, indicating moderate to almost perfect agreement, and 0.772–0.965 among only radiologists indicating substantial to almost perfect agreement.
Conclusion
The agreement in evaluating diffusion-weighted MRI sequences was lower than that for conventional MRI sequences, both among radiologists and non-radiologist and among radiologists alone. This indicates that assessing diffusion imaging is more challenging, and experience may impact the agreement.
{"title":"Impact of diffusion-weighted imaging on agreement between radiologists and non-radiologist in musculoskeletal tumor imaging using magnetic resonance","authors":"Gustav Lodeiro , Katarzyna Bokwa-Dąbrowska , Andreia Miron , Pawel Szaro","doi":"10.1016/j.ejro.2024.100590","DOIUrl":"https://doi.org/10.1016/j.ejro.2024.100590","url":null,"abstract":"<div><h3>Background</h3><p>Diffusion-weighted imaging (DWI) is widely used in neuroradiology or abdominal imaging but not yet implemented in the diagnosis of musculoskeletal tumors.</p></div><div><h3>Aim</h3><p>This study aimed to evaluate how including diffusion imaging in the MRI protocol for patients with musculoskeletal tumors affects the agreement between radiologists and non-radiologist.</p></div><div><h3>Methods</h3><p>Thirty-nine patients with musculoskeletal tumors (Ewing sarcoma, osteosarcoma, and benign tumors) consulted at our institution were included. Three raters with different experience levels evaluated examinations blinded to all clinical data. The final diagnosis was determined by consensus. MRI examinations were split into 1) conventional sequences and 2) conventional sequences combined with DWI. We evaluated the presence or absence of diffusion restriction, solid nature, necrosis, deep localization, and diameter >4 cm as known radiological markers of malignancy. Agreement between raters was evaluated using Gwet’s AC1 coefficients and interpreted according to Landis and Koch.</p></div><div><h3>Results</h3><p>The lowest agreement was for diffusion restriction in both groups of raters. Agreement among all raters ranged from 0.51 to 0.945, indicating moderate to almost perfect agreement, and 0.772–0.965 among only radiologists indicating substantial to almost perfect agreement.</p></div><div><h3>Conclusion</h3><p>The agreement in evaluating diffusion-weighted MRI sequences was lower than that for conventional MRI sequences, both among radiologists and non-radiologist and among radiologists alone. This indicates that assessing diffusion imaging is more challenging, and experience may impact the agreement.</p></div>","PeriodicalId":38076,"journal":{"name":"European Journal of Radiology Open","volume":"13 ","pages":"Article 100590"},"PeriodicalIF":1.8,"publicationDate":"2024-07-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2352047724000455/pdfft?md5=b3dd9896e7d44191226ac654c59dcb54&pid=1-s2.0-S2352047724000455-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141607342","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}