Pub Date: 2023-09-15 | eCollection Date: 2023-01-01 | DOI: 10.3389/fradi.2023.1223294
Harrison C Gottlich, Panagiotis Korfiatis, Adriana V Gregory, Timothy L Kline
Introduction: Methods that automatically flag poor-performing predictions are urgently needed, both to safely implement machine learning workflows in clinical practice and to identify difficult cases during model training.
Methods: Disagreement between the fivefold cross-validation sub-models was quantified using Dice scores computed between folds and summarized as a surrogate for model confidence. The summarized interfold Dice scores were compared with thresholds informed by human interobserver values to determine whether the final ensemble model's predictions should be manually reviewed.
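As an illustration of the approach described in this Methods section, the sketch below computes pairwise Dice scores between fold predictions and flags a case for manual review when their median falls below an interobserver-informed threshold. It is a minimal sketch assuming binary masks; the function names and the 0.85 threshold are illustrative, not taken from the paper.

```python
import itertools
import numpy as np

def dice(a: np.ndarray, b: np.ndarray) -> float:
    """Dice similarity coefficient between two binary masks."""
    intersection = np.logical_and(a, b).sum()
    denom = a.sum() + b.sum()
    return 2.0 * intersection / denom if denom > 0 else 1.0

def median_interfold_dice(fold_masks: list) -> float:
    """Median pairwise Dice across the cross-validation sub-model predictions."""
    pairwise = [dice(a, b) for a, b in itertools.combinations(fold_masks, 2)]
    return float(np.median(pairwise))

def needs_review(fold_masks: list, threshold: float = 0.85) -> bool:
    """Flag a case for manual review when sub-model agreement falls below a
    threshold informed by human interobserver Dice (0.85 here is illustrative)."""
    return median_interfold_dice(fold_masks) < threshold
```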
Results: On all tasks, the method efficiently flagged poorly segmented images without consulting a reference standard. Using the median interfold Dice for comparison, substantial Dice score improvements after excluding flagged images were noted for the in-domain CT (0.85 ± 0.20 to 0.91 ± 0.08, 8/50 images flagged) and MR (0.76 ± 0.27 to 0.85 ± 0.09, 8/50 images flagged) tasks. Most impressively, there were dramatic Dice score improvements in the simulated out-of-distribution task, in which a model trained on a radical nephrectomy dataset with mixed contrast phases predicted on an all cortico-medullary phase partial nephrectomy dataset (0.67 ± 0.36 to 0.89 ± 0.10, 122/300 images flagged).
Discussion: Comparing interfold sub-model disagreement against human interobserver values is an effective and efficient way to assess automated predictions when a reference standard is not available. This functionality provides a safeguard for patient care that is necessary for the safe implementation of automated medical image segmentation workflows.
{"title":"AI in the Loop: functionalizing fold performance disagreement to monitor automated medical image segmentation workflows.","authors":"Harrison C Gottlich, Panagiotis Korfiatis, Adriana V Gregory, Timothy L Kline","doi":"10.3389/fradi.2023.1223294","DOIUrl":"10.3389/fradi.2023.1223294","url":null,"abstract":"<p><strong>Introduction: </strong>Methods that automatically flag poor performing predictions are drastically needed to safely implement machine learning workflows into clinical practice as well as to identify difficult cases during model training.</p><p><strong>Methods: </strong>Disagreement between the fivefold cross-validation sub-models was quantified using dice scores between folds and summarized as a surrogate for model confidence. The summarized Interfold Dices were compared with thresholds informed by human interobserver values to determine whether final ensemble model performance should be manually reviewed.</p><p><strong>Results: </strong>The method on all tasks efficiently flagged poor segmented images without consulting a reference standard. Using the median Interfold Dice for comparison, substantial dice score improvements after excluding flagged images was noted for the in-domain CT (0.85 ± 0.20 to 0.91 ± 0.08, 8/50 images flagged) and MR (0.76 ± 0.27 to 0.85 ± 0.09, 8/50 images flagged). Most impressively, there were dramatic dice score improvements in the simulated out-of-distribution task where the model was trained on a radical nephrectomy dataset with different contrast phases predicting a partial nephrectomy all cortico-medullary phase dataset (0.67 ± 0.36 to 0.89 ± 0.10, 122/300 images flagged).</p><p><strong>Discussion: </strong>Comparing interfold sub-model disagreement against human interobserver values is an effective and efficient way to assess automated predictions when a reference standard is not available. This functionality provides a necessary safeguard to patient care important to safely implement automated medical image segmentation workflows.</p>","PeriodicalId":73101,"journal":{"name":"Frontiers in radiology","volume":"3 ","pages":"1223294"},"PeriodicalIF":0.0,"publicationDate":"2023-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10540615/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41175918","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2023-09-11 | eCollection Date: 2023-01-01 | DOI: 10.3389/fradi.2023.1238566
Zihao Tang, Sheng Chen, Arkiev D'Souza, Dongnan Liu, Fernando Calamante, Michael Barnett, Weidong Cai, Chenyu Wang, Mariano Cabezas
Diffusion-weighted Imaging (DWI) is a non-invasive imaging technique based on Magnetic Resonance Imaging (MRI) principles that measures water diffusivity and reveals details of the underlying brain microstructure. By fitting a tensor model to quantify the directionality of water diffusion, a Diffusion Tensor Image (DTI) can be derived, and scalar measures, such as fractional anisotropy (FA), can then be estimated from the DTI to summarise quantitative microstructural information for clinical studies. In particular, FA has been shown to be a useful research metric for identifying tissue abnormalities in neurological disease (e.g., decreased anisotropy as a proxy for tissue damage). However, time constraints in clinical practice lead to low angular resolution diffusion imaging (LARDI) acquisitions, which can cause inaccurate FA estimates compared with those generated from high angular resolution diffusion imaging (HARDI) acquisitions. In this work, we propose the High Angular DTI Estimation Network (HADTI-Net) to estimate an enhanced DTI model from LARDI acquired with a set of minimal and evenly distributed diffusion gradient directions. Extensive experiments have been conducted to show the reliability and generalisation of HADTI-Net in generating high angular DTI estimates from any minimal set of evenly distributed diffusion gradient directions and to explore the feasibility of applying a data-driven method to this task. The code repository of this work and other related works can be found at https://mri-synthesis.github.io/.
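For readers unfamiliar with the FA metric referenced above, the sketch below computes fractional anisotropy from the eigenvalues of an already-fitted diffusion tensor using the standard definition. It is independent of HADTI-Net, and the example tensor values are illustrative.

```python
import numpy as np

def fractional_anisotropy(tensor: np.ndarray) -> float:
    """Compute FA from a 3x3 diffusion tensor via its eigenvalues."""
    l1, l2, l3 = np.linalg.eigvalsh(tensor)      # sorted ascending
    num = (l1 - l2) ** 2 + (l2 - l3) ** 2 + (l3 - l1) ** 2
    den = l1 ** 2 + l2 ** 2 + l3 ** 2
    return float(np.sqrt(0.5 * num / den)) if den > 0 else 0.0

# Example: a prolate tensor (strong diffusion along one axis) yields high FA.
D = np.diag([1.7e-3, 0.3e-3, 0.3e-3])           # units of mm^2/s, illustrative
print(fractional_anisotropy(D))                  # ~0.80
```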
{"title":"High angular diffusion tensor imaging estimation from minimal evenly distributed diffusion gradient directions.","authors":"Zihao Tang, Sheng Chen, Arkiev D'Souza, Dongnan Liu, Fernando Calamante, Michael Barnett, Weidong Cai, Chenyu Wang, Mariano Cabezas","doi":"10.3389/fradi.2023.1238566","DOIUrl":"https://doi.org/10.3389/fradi.2023.1238566","url":null,"abstract":"<p><p>Diffusion-weighted Imaging (DWI) is a non-invasive imaging technique based on Magnetic Resonance Imaging (MRI) principles to measure water diffusivity and reveal details of the underlying brain micro-structure. By fitting a tensor model to quantify the directionality of water diffusion a Diffusion Tensor Image (DTI) can be derived and scalar measures, such as fractional anisotropy (FA), can then be estimated from the DTI to summarise quantitative microstructural information for clinical studies. In particular, FA has been shown to be a useful research metric to identify tissue abnormalities in neurological disease (e.g. decreased anisotropy as a proxy for tissue damage). However, time constraints in clinical practice lead to low angular resolution diffusion imaging (LARDI) acquisitions that can cause inaccurate FA value estimates when compared to those generated from high angular resolution diffusion imaging (HARDI) acquisitions. In this work, we propose High Angular DTI Estimation Network (HADTI-Net) to estimate an enhanced DTI model from LARDI with a set of minimal and evenly distributed diffusion gradient directions. Extensive experiments have been conducted to show the reliability and generalisation of HADTI-Net to generate high angular DTI estimation from any minimal evenly distributed diffusion gradient directions and to explore the feasibility of applying a data-driven method for this task. The code repository of this work and other related works can be found at https://mri-synthesis.github.io/.</p>","PeriodicalId":73101,"journal":{"name":"Frontiers in radiology","volume":"3 ","pages":"1238566"},"PeriodicalIF":0.0,"publicationDate":"2023-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10520249/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41163244","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2023-09-07 | eCollection Date: 2023-01-01 | DOI: 10.3389/fradi.2023.1211859
Yannick Suter, Michelle Notter, Raphael Meier, Tina Loosli, Philippe Schucht, Roland Wiest, Mauricio Reyes, Urspeter Knecht
Automated tumor segmentation tools for glioblastoma show promising performance. To apply these tools to automated response assessment, consistency in longitudinal segmentation and tumor measurement is critical. This study aimed to determine whether BraTumIA and HD-GLIO are suited for this task. We evaluated the two segmentation tools with respect to automated response assessment on the single-center retrospective LUMIERE dataset, comprising 80 patients and a total of 502 post-operative time points. Volumetry and automated bi-dimensional measurements were compared with expert measurements following the Response Assessment in Neuro-Oncology (RANO) guidelines. The longitudinal trend agreement between the expert and the automated methods was evaluated, and the RANO progression thresholds were tested against the expert-derived time-to-progression (TTP). The correlation between TTP and overall survival (OS) was used to check the progression thresholds. We also evaluated the automated detection of non-measurable lesions and their influence. The tumor volume trend agreement between segmentation volumes and the expert bi-dimensional measurements was high (HD-GLIO: 81.1%, BraTumIA: 79.7%). BraTumIA achieved the closest match to the expert TTP using the recommended RANO progression threshold. HD-GLIO-derived tumor volumes reached the highest correlation between TTP and OS (0.55). Both tools failed to maintain an accurate lesion count over time. Manual false-positive removal and restricting the analysis to a maximum number of measurable lesions had no beneficial effect. Expert supervision and manual corrections are still necessary when applying the tested automated segmentation tools for automated response assessment. The longitudinal consistency of current segmentation tools needs further improvement. Validation of volumetric and bi-dimensional progression thresholds with multi-center studies is required to move toward volumetry-based response assessment.
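To make the threshold-based progression check concrete, here is a minimal sketch that flags progression when the sum of bi-dimensional (perpendicular-diameter) products rises at least 25% above the nadir, the commonly cited RANO cutoff for enhancing disease. The function name and example values are illustrative and not drawn from this study.

```python
def rano_progression(products_over_time: list, threshold: float = 0.25) -> bool:
    """Flag progression when the current sum of perpendicular-diameter products
    exceeds the nadir (smallest prior value) by the given relative threshold.
    Requires at least one prior time point before the current measurement."""
    nadir = min(products_over_time[:-1])
    current = products_over_time[-1]
    return nadir > 0 and (current - nadir) / nadir >= threshold

# Example: 2D products (mm^2) at successive follow-ups; 900 -> 1200 is a 33% increase.
print(rano_progression([1100.0, 900.0, 1200.0]))  # True
```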
{"title":"Evaluating automated longitudinal tumor measurements for glioblastoma response assessment.","authors":"Yannick Suter, Michelle Notter, Raphael Meier, Tina Loosli, Philippe Schucht, Roland Wiest, Mauricio Reyes, Urspeter Knecht","doi":"10.3389/fradi.2023.1211859","DOIUrl":"https://doi.org/10.3389/fradi.2023.1211859","url":null,"abstract":"<p><p>Automated tumor segmentation tools for glioblastoma show promising performance. To apply these tools for automated response assessment, longitudinal segmentation, and tumor measurement, consistency is critical. This study aimed to determine whether BraTumIA and HD-GLIO are suited for this task. We evaluated two segmentation tools with respect to automated response assessment on the single-center retrospective LUMIERE dataset with 80 patients and a total of 502 post-operative time points. Volumetry and automated bi-dimensional measurements were compared with expert measurements following the Response Assessment in Neuro-Oncology (RANO) guidelines. The longitudinal trend agreement between the expert and methods was evaluated, and the RANO progression thresholds were tested against the expert-derived time-to-progression (TTP). The TTP and overall survival (OS) correlation was used to check the progression thresholds. We evaluated the automated detection and influence of non-measurable lesions. The tumor volume trend agreement calculated between segmentation volumes and the expert bi-dimensional measurements was high (HD-GLIO: 81.1%, BraTumIA: 79.7%). BraTumIA achieved the closest match to the expert TTP using the recommended RANO progression threshold. HD-GLIO-derived tumor volumes reached the highest correlation between TTP and OS (0.55). Both tools failed at an accurate lesion count across time. Manual false-positive removal and restricting to a maximum number of measurable lesions had no beneficial effect. Expert supervision and manual corrections are still necessary when applying the tested automated segmentation tools for automated response assessment. The longitudinal consistency of current segmentation tools needs further improvement. Validation of volumetric and bi-dimensional progression thresholds with multi-center studies is required to move toward volumetry-based response assessment.</p>","PeriodicalId":73101,"journal":{"name":"Frontiers in radiology","volume":"3 ","pages":"1211859"},"PeriodicalIF":0.0,"publicationDate":"2023-09-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10513769/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41177631","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2023-09-05 | eCollection Date: 2023-01-01 | DOI: 10.3389/fradi.2023.1225215
Wenhui Zhang, Surajit Ray
With the increasing integration of functional imaging techniques like Positron Emission Tomography (PET) into radiotherapy (RT) practices, a paradigm shift in cancer treatment methodologies is underway. A fundamental step in RT planning is the accurate segmentation of tumours based on clinical diagnosis. Furthermore, novel tumour control methods, such as intensity modulated radiation therapy (IMRT) dose painting, demand the precise delineation of multiple intensity value contours to ensure optimal tumour dose distribution. Recently, convolutional neural networks (CNNs) have made significant strides in 3D image segmentation tasks, most of which produce their output maps at the voxel level. However, because of information loss in subsequent downsampling layers, they frequently fail to precisely identify object boundaries. Moreover, in the context of dose painting strategies, there is an imperative need for reliable and precise image segmentation techniques to delineate high recurrence-risk contours. To address these challenges, we introduce a 3D coarse-to-fine framework integrating a CNN with a kernel smoothing-based probability volume contour approach (KsPC). This integrated approach generates contour-based segmentation volumes, mimicking expert-level precision and providing accurate probability contours crucial for optimizing dose painting/IMRT strategies. Our final model, named KsPC-Net, leverages a CNN backbone to automatically learn the parameters of the kernel smoothing process, thereby obviating the need for user-supplied tuning parameters. The 3D KsPC-Net exploits the strength of KsPC to simultaneously identify object boundaries and generate corresponding probability volume contours, and it can be trained within an end-to-end framework. The proposed model demonstrates promising performance, surpassing state-of-the-art models when tested on the MICCAI 2021 challenge dataset (HECKTOR).
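As a rough illustration of deriving contours from a probability volume (not the authors' KsPC, which learns its kernel parameters through the CNN backbone), the sketch below smooths a voxel-wise probability map with a fixed Gaussian kernel and extracts nested iso-probability masks of the kind usable for dose painting; the sigma and levels are illustrative.

```python
import numpy as np
from scipy.ndimage import gaussian_filter

def probability_volume_contours(prob_volume: np.ndarray,
                                sigma: float = 1.5,
                                levels=(0.5, 0.7, 0.9)) -> dict:
    """Smooth a voxel-wise probability volume and return one binary mask per
    iso-probability level (nested contours usable for dose painting)."""
    smoothed = gaussian_filter(prob_volume.astype(np.float32), sigma=sigma)
    return {level: smoothed >= level for level in levels}

# Example with a random volume standing in for a CNN softmax output.
rng = np.random.default_rng(0)
masks = probability_volume_contours(rng.random((64, 64, 64)))
print({lvl: int(m.sum()) for lvl, m in masks.items()})
```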
{"title":"From coarse to fine: a deep 3D probability volume contours framework for tumour segmentation and dose painting in PET images.","authors":"Wenhui Zhang, Surajit Ray","doi":"10.3389/fradi.2023.1225215","DOIUrl":"10.3389/fradi.2023.1225215","url":null,"abstract":"<p><p>With the increasing integration of functional imaging techniques like Positron Emission Tomography (PET) into radiotherapy (RT) practices, a paradigm shift in cancer treatment methodologies is underway. A fundamental step in RT planning is the accurate segmentation of tumours based on clinical diagnosis. Furthermore, novel tumour control methods, such as intensity modulated radiation therapy (IMRT) dose painting, demand the precise delineation of multiple intensity value contours to ensure optimal tumour dose distribution. Recently, convolutional neural networks (CNNs) have made significant strides in 3D image segmentation tasks, most of which present the output map at a voxel-wise level. However, because of information loss in subsequent downsampling layers, they frequently fail to precisely identify precise object boundaries. Moreover, in the context of dose painting strategies, there is an imperative need for reliable and precise image segmentation techniques to delineate high recurrence-risk contours. To address these challenges, we introduce a 3D coarse-to-fine framework, integrating a CNN with a kernel smoothing-based probability volume contour approach (KsPC). This integrated approach generates contour-based segmentation volumes, mimicking expert-level precision and providing accurate probability contours crucial for optimizing dose painting/IMRT strategies. Our final model, named KsPC-Net, leverages a CNN backbone to automatically learn parameters in the kernel smoothing process, thereby obviating the need for user-supplied tuning parameters. The 3D KsPC-Net exploits the strength of KsPC to simultaneously identify object boundaries and generate corresponding probability volume contours, which can be trained within an end-to-end framework. The proposed model has demonstrated promising performance, surpassing state-of-the-art models when tested against the MICCAI 2021 challenge dataset (HECKTOR).</p>","PeriodicalId":73101,"journal":{"name":"Frontiers in radiology","volume":"3 ","pages":"1225215"},"PeriodicalIF":0.0,"publicationDate":"2023-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10512384/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41155957","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2023-09-04 | eCollection Date: 2023-01-01 | DOI: 10.3389/fradi.2023.1168901
Chaowei Wu, Nan Wang, Srinivas Gaddam, Lixia Wang, Hui Han, Kyunghyun Sung, Anthony G Christodoulou, Yibin Xie, Stephen Pandol, Debiao Li
Introduction: Dynamic contrast-enhanced (DCE) MRI has important clinical value for early detection, accurate staging, and therapeutic monitoring of cancers. However, conventional multi-phasic abdominal DCE-MRI has limited temporal resolution and provides qualitative or semi-quantitative assessments of tissue vascularity. In this study, the feasibility of retrospectively quantifying multi-phasic abdominal DCE-MRI by using pharmacokinetics-informed deep learning to improve temporal resolution was investigated.
Method: Forty-five subjects, consisting of healthy controls and patients with pancreatic ductal adenocarcinoma (PDAC) or chronic pancreatitis (CP), were imaged with a 2-s temporal-resolution quantitative DCE sequence, from which 30-s temporal-resolution multi-phasic DCE-MRI was synthesized according to the clinical protocol. A pharmacokinetics-informed neural network was trained to improve the temporal resolution of the multi-phasic DCE data before quantification of pharmacokinetic parameters. Through ten-fold cross-validation, the agreement between pharmacokinetic parameters estimated from the synthesized multi-phasic DCE data after deep learning inference and reference parameters from the corresponding quantitative DCE-MRI images was assessed. The ability of the deep learning-estimated parameters to differentiate abnormal from normal tissues was assessed as well.
Results: The pharmacokinetic parameters estimated after deep learning showed a high level of agreement with the reference values. In the cross-validation, all three pharmacokinetic parameters (transfer constant Ktrans, fractional extravascular extracellular volume ve, and rate constant kep) achieved intraclass correlation coefficients and R² values between 0.84 and 0.94, and low coefficients of variation relative to the reference values (10.1%, 12.3%, and 5.6%, respectively). Significant differences were found between healthy pancreas, PDAC tumor and non-tumor tissue, and CP pancreas.
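For context on how these parameters relate, the sketch below evaluates the standard Tofts model, in which the tissue concentration is the arterial input convolved with an exponential and kep = Ktrans/ve. The abstract does not specify the exact pharmacokinetic model used, so the model choice is an assumption, and the input function and parameter values are illustrative.

```python
import numpy as np

def tofts_tissue_concentration(t: np.ndarray, cp: np.ndarray,
                               ktrans: float, ve: float) -> np.ndarray:
    """Standard Tofts model: Ct(t) = Ktrans * integral of Cp(tau)*exp(-kep*(t-tau)) dtau,
    with kep = Ktrans / ve, discretised here with a simple Riemann sum."""
    kep = ktrans / ve
    dt = t[1] - t[0]                               # assumes uniform sampling
    ct = np.zeros_like(t)
    for i in range(len(t)):
        tau = t[: i + 1]
        ct[i] = ktrans * np.sum(cp[: i + 1] * np.exp(-kep * (t[i] - tau))) * dt
    return ct

# Example: a crude boxcar arterial input function sampled every 2 s (illustrative).
t = np.arange(0, 300, 2.0)                         # seconds
cp = np.where((t > 10) & (t < 40), 5.0, 0.0)       # mM
ct = tofts_tissue_concentration(t, cp, ktrans=0.25 / 60, ve=0.3)  # Ktrans in 1/s
print(ct.max())
```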
Discussion: Retrospective quantification (RoQ) of clinical multi-phasic DCE-MRI is possible with deep learning. This technique has the potential to derive quantitative pharmacokinetic parameters from clinical multi-phasic DCE data for a more objective and precise assessment of cancer.
{"title":"Retrospective quantification of clinical abdominal DCE-MRI using pharmacokinetics-informed deep learning: a proof-of-concept study.","authors":"Chaowei Wu, Nan Wang, Srinivas Gaddam, Lixia Wang, Hui Han, Kyunghyun Sung, Anthony G Christodoulou, Yibin Xie, Stephen Pandol, Debiao Li","doi":"10.3389/fradi.2023.1168901","DOIUrl":"https://doi.org/10.3389/fradi.2023.1168901","url":null,"abstract":"<p><strong>Introduction: </strong>Dynamic contrast-enhanced (DCE) MRI has important clinical value for early detection, accurate staging, and therapeutic monitoring of cancers. However, conventional multi-phasic abdominal DCE-MRI has limited temporal resolution and provides qualitative or semi-quantitative assessments of tissue vascularity. In this study, the feasibility of retrospectively quantifying multi-phasic abdominal DCE-MRI by using pharmacokinetics-informed deep learning to improve temporal resolution was investigated.</p><p><strong>Method: </strong>Forty-five subjects consisting of healthy controls, pancreatic ductal adenocarcinoma (PDAC), and chronic pancreatitis (CP) were imaged with a 2-s temporal-resolution quantitative DCE sequence, from which 30-s temporal-resolution multi-phasic DCE-MRI was synthesized based on clinical protocol. A pharmacokinetics-informed neural network was trained to improve the temporal resolution of the multi-phasic DCE before the quantification of pharmacokinetic parameters. Through ten-fold cross-validation, the agreement between pharmacokinetic parameters estimated from synthesized multi-phasic DCE after deep learning inference was assessed against reference parameters from the corresponding quantitative DCE-MRI images. The ability of the deep learning estimated parameters to differentiate abnormal from normal tissues was assessed as well.</p><p><strong>Results: </strong>The pharmacokinetic parameters estimated after deep learning have a high level of agreement with the reference values. In the cross-validation, all three pharmacokinetic parameters (transfer constant <math><msup><mi>K</mi><mrow><mrow><mi>trans</mi></mrow></mrow></msup></math>, fractional extravascular extracellular volume <math><msub><mi>v</mi><mi>e</mi></msub></math>, and rate constant <math><msub><mi>k</mi><mrow><mrow><mi>ep</mi></mrow></mrow></msub></math>) achieved intraclass correlation coefficient and <i>R</i><sup>2</sup> between 0.84-0.94, and low coefficients of variation (10.1%, 12.3%, and 5.6%, respectively) relative to the reference values. Significant differences were found between healthy pancreas, PDAC tumor and non-tumor, and CP pancreas.</p><p><strong>Discussion: </strong>Retrospective quantification (RoQ) of clinical multi-phasic DCE-MRI is possible by deep learning. 
This technique has the potential to derive quantitative pharmacokinetic parameters from clinical multi-phasic DCE data for a more objective and precise assessment of cancer.</p>","PeriodicalId":73101,"journal":{"name":"Frontiers in radiology","volume":"3 ","pages":"1168901"},"PeriodicalIF":0.0,"publicationDate":"2023-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10507354/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41168695","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2023-08-24 | eCollection Date: 2023-01-01 | DOI: 10.3389/fradi.2023.1240544
Bino A Varghese, Brandon K K Fields, Darryl H Hwang, Vinay A Duddalwar, George R Matcuk, Steven Y Cen
To date, studies investigating radiomics-based predictive models have tended to err on the side of data-driven or exploratory analysis of many thousands of extracted features. In particular, spatial assessments of texture have proven especially adept at capturing intratumoral heterogeneity in oncologic imaging, which in turn may correspond with tumor biology and behavior. These spatial assessments can be broadly classified as spatial filters, which detect areas of rapid grayscale change in order to enhance edges and/or textures within an image, or neighborhood-based methods, which quantify gray-level differences of neighboring pixels/voxels within a set distance. Given the high dimensionality of radiomics datasets, dimensionality reduction methods have been proposed in an attempt to optimize model performance in machine learning studies; however, these approaches should only be applied to the training data in order to avoid information leakage and model overfitting. While the area under the receiver operating characteristic curve is perhaps the most commonly reported assessment of model performance, it is prone to overestimation when the output classes are unbalanced. In such cases, confusion matrices may additionally be reported, in which diagnostic cut points on the model-predicted probability may hold more clinical significance for clinical colleagues, by analogy with related forms of diagnostic testing.
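As a concrete example of the neighborhood-based texture measures described above, the sketch below builds a simple gray-level co-occurrence matrix for a horizontal pixel offset and reports its contrast. The quantization scheme and parameters are illustrative, not a reference radiomics implementation.

```python
import numpy as np

def glcm_contrast(image: np.ndarray, distance: int = 1, levels: int = 8) -> float:
    """Neighborhood-based texture: build a gray-level co-occurrence matrix for a
    horizontal offset and return its contrast (mean squared gray-level difference)."""
    # Quantise to a small number of gray levels, as is typical in radiomics pipelines.
    if image.max() > 0:
        quantised = np.floor(image / image.max() * (levels - 1)).astype(int)
    else:
        quantised = np.zeros_like(image, dtype=int)
    glcm = np.zeros((levels, levels), dtype=float)
    left, right = quantised[:, :-distance], quantised[:, distance:]
    for i, j in zip(left.ravel(), right.ravel()):
        glcm[i, j] += 1
    glcm /= glcm.sum()
    idx_i, idx_j = np.indices(glcm.shape)
    return float(np.sum(glcm * (idx_i - idx_j) ** 2))

# A homogeneous region yields lower contrast than a heterogeneous one.
rng = np.random.default_rng(0)
print(glcm_contrast(np.full((32, 32), 100.0)),
      glcm_contrast(rng.integers(0, 255, (32, 32)).astype(float)))
```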
{"title":"Spatial assessments in texture analysis: what the radiologist needs to know.","authors":"Bino A Varghese, Brandon K K Fields, Darryl H Hwang, Vinay A Duddalwar, George R Matcuk, Steven Y Cen","doi":"10.3389/fradi.2023.1240544","DOIUrl":"10.3389/fradi.2023.1240544","url":null,"abstract":"<p><p>To date, studies investigating radiomics-based predictive models have tended to err on the side of data-driven or exploratory analysis of many thousands of extracted features. In particular, spatial assessments of texture have proven to be especially adept at assessing for features of intratumoral heterogeneity in oncologic imaging, which likewise may correspond with tumor biology and behavior. These spatial assessments can be generally classified as spatial filters, which detect areas of rapid change within the grayscale in order to enhance edges and/or textures within an image, or neighborhood-based methods, which quantify gray-level differences of neighboring pixels/voxels within a set distance. Given the high dimensionality of radiomics datasets, data dimensionality reduction methods have been proposed in an attempt to optimize model performance in machine learning studies; however, it should be noted that these approaches should only be applied to training data in order to avoid information leakage and model overfitting. While area under the curve of the receiver operating characteristic is perhaps the most commonly reported assessment of model performance, it is prone to overestimation when output classifications are unbalanced. In such cases, confusion matrices may be additionally reported, whereby diagnostic cut points for model predicted probability may hold more clinical significance to clinical colleagues with respect to related forms of diagnostic testing.</p>","PeriodicalId":73101,"journal":{"name":"Frontiers in radiology","volume":"3 ","pages":"1240544"},"PeriodicalIF":0.0,"publicationDate":"2023-08-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10484588/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10225205","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2023-08-08 | eCollection Date: 2023-01-01 | DOI: 10.3389/fradi.2023.1241651
Joseph M Rich, Lokesh N Bhardwaj, Aman Shah, Krish Gangal, Mohitha S Rapaka, Assad A Oberai, Brandon K K Fields, George R Matcuk, Vinay A Duddalwar
Introduction: Image segmentation is an important process for quantifying characteristics of malignant bone lesions, but this task is challenging and laborious for radiologists. Deep learning has shown promise in automating image segmentation in radiology, including for malignant bone lesions. The purpose of this review is to investigate deep learning-based image segmentation methods for malignant bone lesions on Computed Tomography (CT), Magnetic Resonance Imaging (MRI), and Positron-Emission Tomography/CT (PET/CT).
Method: A literature search for deep learning-based image segmentation of malignant bony lesions on CT and MRI was conducted in the PubMed, Embase, Web of Science, and Scopus electronic databases following the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines. A total of 41 original articles published between February 2017 and March 2023 were included in the review.
Results: The majority of papers studied MRI, followed by CT, PET/CT, and PET/MRI. There was a relatively even distribution of papers studying primary vs. secondary malignancies, as well as of papers utilizing 3-dimensional vs. 2-dimensional data. Many papers utilized custom-built models as a modification or variation of U-Net. The most common evaluation metric was the Dice similarity coefficient (DSC). Most models achieved a DSC above 0.6, with medians for all imaging modalities between 0.85 and 0.9.
Discussion: Deep learning methods show promising ability to segment malignant osseous lesions on CT, MRI, and PET/CT. Commonly applied strategies to improve performance include data augmentation, utilization of large public datasets, preprocessing (including denoising and cropping), and U-Net architecture modification. Future directions include overcoming dataset and annotation homogeneity and generalizing for clinical applicability.
{"title":"Deep learning image segmentation approaches for malignant bone lesions: a systematic review and meta-analysis.","authors":"Joseph M Rich, Lokesh N Bhardwaj, Aman Shah, Krish Gangal, Mohitha S Rapaka, Assad A Oberai, Brandon K K Fields, George R Matcuk, Vinay A Duddalwar","doi":"10.3389/fradi.2023.1241651","DOIUrl":"10.3389/fradi.2023.1241651","url":null,"abstract":"<p><strong>Introduction: </strong>Image segmentation is an important process for quantifying characteristics of malignant bone lesions, but this task is challenging and laborious for radiologists. Deep learning has shown promise in automating image segmentation in radiology, including for malignant bone lesions. The purpose of this review is to investigate deep learning-based image segmentation methods for malignant bone lesions on Computed Tomography (CT), Magnetic Resonance Imaging (MRI), and Positron-Emission Tomography/CT (PET/CT).</p><p><strong>Method: </strong>The literature search of deep learning-based image segmentation of malignant bony lesions on CT and MRI was conducted in PubMed, Embase, Web of Science, and Scopus electronic databases following the guidelines of Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA). A total of 41 original articles published between February 2017 and March 2023 were included in the review.</p><p><strong>Results: </strong>The majority of papers studied MRI, followed by CT, PET/CT, and PET/MRI. There was relatively even distribution of papers studying primary vs. secondary malignancies, as well as utilizing 3-dimensional vs. 2-dimensional data. Many papers utilize custom built models as a modification or variation of U-Net. The most common metric for evaluation was the dice similarity coefficient (DSC). Most models achieved a DSC above 0.6, with medians for all imaging modalities between 0.85-0.9.</p><p><strong>Discussion: </strong>Deep learning methods show promising ability to segment malignant osseous lesions on CT, MRI, and PET/CT. Some strategies which are commonly applied to help improve performance include data augmentation, utilization of large public datasets, preprocessing including denoising and cropping, and U-Net architecture modification. Future directions include overcoming dataset and annotation homogeneity and generalizing for clinical applicability.</p>","PeriodicalId":73101,"journal":{"name":"Frontiers in radiology","volume":"3 ","pages":"1241651"},"PeriodicalIF":0.0,"publicationDate":"2023-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10442705/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10069334","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2023-06-22 | eCollection Date: 2023-01-01 | DOI: 10.3389/fradi.2023.1088068
Ricardo Bigolin Lanfredi, Joyce D Schroeder, Tolga Tasdizen
Convolutional neural networks (CNNs) have been successfully applied to chest x-ray (CXR) images. Moreover, annotated bounding boxes have been shown to improve the interpretability of a CNN in terms of localizing abnormalities. However, only a few relatively small CXR datasets containing bounding boxes are available, and collecting them is very costly. Conveniently, eye-tracking (ET) data can be collected during a radiologist's clinical workflow. We use ET data recorded from radiologists while dictating CXR reports to train CNNs. We extract snippets from the ET data by associating them with the dictation of keywords and use them to supervise the localization of specific abnormalities. We show that this method can improve a model's interpretability without impacting its image-level classification performance.
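One plausible way to turn such ET snippets into localization supervision (an illustration, not necessarily the authors' pipeline) is to accumulate the fixations recorded while a keyword was being dictated into a smoothed heatmap, as sketched below; the function name, sigma, and example values are assumptions.

```python
import numpy as np
from scipy.ndimage import gaussian_filter

def gaze_heatmap(fixations, window, image_shape, sigma=25.0):
    """Build a localization heatmap from eye-tracking fixations (x, y, t) whose
    timestamps fall inside the dictation window of a keyword (t_start, t_end)."""
    t_start, t_end = window
    heat = np.zeros(image_shape, dtype=np.float32)
    for x, y, t in fixations:
        if t_start <= t <= t_end:
            heat[int(y), int(x)] += 1.0
    heat = gaussian_filter(heat, sigma=sigma)      # spread fixations into a soft map
    return heat / heat.max() if heat.max() > 0 else heat

# Example: two fixations during the dictation of a keyword between 3.0 s and 4.5 s.
hm = gaze_heatmap([(120, 200, 3.2), (130, 210, 4.0), (400, 50, 7.9)], (3.0, 4.5), (512, 512))
print(hm.shape, hm.max())
```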
{"title":"Localization supervision of chest x-ray classifiers using label-specific eye-tracking annotation.","authors":"Ricardo Bigolin Lanfredi, Joyce D Schroeder, Tolga Tasdizen","doi":"10.3389/fradi.2023.1088068","DOIUrl":"10.3389/fradi.2023.1088068","url":null,"abstract":"<p><p>Convolutional neural networks (CNNs) have been successfully applied to chest x-ray (CXR) images. Moreover, annotated bounding boxes have been shown to improve the interpretability of a CNN in terms of localizing abnormalities. However, only a few relatively small CXR datasets containing bounding boxes are available, and collecting them is very costly. Opportunely, eye-tracking (ET) data can be collected during the clinical workflow of a radiologist. We use ET data recorded from radiologists while dictating CXR reports to train CNNs. We extract snippets from the ET data by associating them with the dictation of keywords and use them to supervise the localization of specific abnormalities. We show that this method can improve a model's interpretability without impacting its image-level classification.</p>","PeriodicalId":73101,"journal":{"name":"Frontiers in radiology","volume":"3 ","pages":"1088068"},"PeriodicalIF":0.0,"publicationDate":"2023-06-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10365091/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9930026","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2023-06-02 | eCollection Date: 2023-01-01 | DOI: 10.3389/fradi.2023.1144004
Yitong Yang, Zahraw Shah, Athira J Jacob, Jackson Hair, Teodora Chitiboi, Tiziano Passerini, Jerome Yerly, Lorenzo Di Sopra, Davide Piccini, Zahra Hosseini, Puneet Sharma, Anurag Sahu, Matthias Stuber, John N Oshinski
Introduction: Deep learning (DL)-based segmentation has gained popularity for routine cardiac magnetic resonance (CMR) image analysis and, in particular, delineation of left ventricular (LV) borders for LV volume determination. Free-breathing, self-navigated, whole-heart CMR exams provide high-resolution, isotropic coverage of the heart for assessment of cardiac anatomy, including LV volume. The combination of whole-heart free-breathing CMR and DL-based LV segmentation has the potential to streamline the acquisition and analysis of clinical CMR exams. The purpose of this study was to compare the performance of a DL-based automatic LV segmentation network trained primarily on computed tomography (CT) images in two whole-heart CMR reconstruction methods: (1) an in-line respiratory motion-corrected (Mcorr) reconstruction and (2) an off-line, compressed sensing-based, multi-volume respiratory motion-resolved (Mres) reconstruction. Given that Mres images were shown in previous studies to have greater image quality than Mcorr images, we hypothesized that the LV volumes segmented from Mres images are closer to the manually traced expert left ventricular endocardial border than those segmented from Mcorr images.
Method: This retrospective study included 15 patients who underwent clinically indicated 1.5 T CMR exams with a prototype ECG-gated 3D radial phyllotaxis balanced steady-state free precession (bSSFP) sequence. For each reconstruction method, the absolute volume difference (AVD) between the automatically and manually segmented LV volumes was used as the primary measure to investigate whether 3D DL-based LV segmentation generalized better on Mcorr or Mres 3D whole-heart images. Additionally, we assessed the 3D Dice similarity coefficient between the manual and automatic LV masks of each reconstructed 3D whole-heart image and the sharpness of the LV myocardium-blood pool interface. A two-tailed paired Student's t-test (alpha = 0.05) was used to test significance.
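A minimal sketch of the primary analysis described here, comparing per-patient absolute volume differences between the two reconstructions with a two-tailed paired t-test, is shown below. The volume values are illustrative stand-ins, not the study's data.

```python
import numpy as np
from scipy import stats

def absolute_volume_difference(auto_ml: np.ndarray, manual_ml: np.ndarray) -> np.ndarray:
    """Per-patient absolute difference between automatic and manual LV volumes (ml)."""
    return np.abs(auto_ml - manual_ml)

# Illustrative manual and automatic volumes for the two reconstructions.
manual = np.array([150.0, 140.0, 160.0, 155.0, 145.0])
avd_mres = absolute_volume_difference(np.array([148.0, 143.0, 158.0, 150.0, 149.0]), manual)
avd_mcorr = absolute_volume_difference(np.array([135.0, 120.0, 170.0, 140.0, 160.0]), manual)

# Two-tailed paired t-test on the per-patient AVDs (alpha = 0.05).
t_stat, p_value = stats.ttest_rel(avd_mres, avd_mcorr)
print(avd_mres.mean(), avd_mcorr.mean(), p_value)
```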
Results & discussion: The AVD in the respiratory Mres reconstruction was lower than the AVD in the respiratory Mcorr reconstruction: 7.73 ± 6.54 ml vs. 20.0 ± 22.4 ml, respectively (n = 15, p-value = 0.03). The 3D Dice coefficient between the DL-segmented masks and the manually segmented masks was higher for Mres images than for Mcorr images: 0.90 ± 0.02 vs. 0.87 ± 0.03 respectively, with a p-value = 0.02. Sharpness on Mres images was higher than on Mcorr images: 0.15 ± 0.05 vs. 0.12 ± 0.04, respectively, with a p-value of 0.014 (n = 15).
Conclusion: We conclude that the DL-based 3D automatic LV segmentation network trained on CT images and fine-tuned on MR images generalized better on Mres images than on Mcorr images for quantifying LV volumes.
{"title":"Deep learning-based left ventricular segmentation demonstrates improved performance on respiratory motion-resolved whole-heart reconstructions.","authors":"Yitong Yang, Zahraw Shah, Athira J Jacob, Jackson Hair, Teodora Chitiboi, Tiziano Passerini, Jerome Yerly, Lorenzo Di Sopra, Davide Piccini, Zahra Hosseini, Puneet Sharma, Anurag Sahu, Matthias Stuber, John N Oshinski","doi":"10.3389/fradi.2023.1144004","DOIUrl":"10.3389/fradi.2023.1144004","url":null,"abstract":"<p><strong>Introduction: </strong>Deep learning (DL)-based segmentation has gained popularity for routine cardiac magnetic resonance (CMR) image analysis and in particular, delineation of left ventricular (LV) borders for LV volume determination. Free-breathing, self-navigated, whole-heart CMR exams provide high-resolution, isotropic coverage of the heart for assessment of cardiac anatomy including LV volume. The combination of whole-heart free-breathing CMR and DL-based LV segmentation has the potential to streamline the acquisition and analysis of clinical CMR exams. The purpose of this study was to compare the performance of a DL-based automatic LV segmentation network trained primarily on computed tomography (CT) images in two whole-heart CMR reconstruction methods: (1) an in-line respiratory motion-corrected (Mcorr) reconstruction and (2) an off-line, compressed sensing-based, multi-volume respiratory motion-resolved (Mres) reconstruction. Given that Mres images were shown to have greater image quality in previous studies than Mcorr images, we <i>hypothesized</i> that the LV volumes segmented from Mres images are closer to the manual expert-traced left ventricular endocardial border than the Mcorr images.</p><p><strong>Method: </strong>This retrospective study used 15 patients who underwent clinically indicated 1.5 T CMR exams with a prototype ECG-gated 3D radial phyllotaxis balanced steady state free precession (bSSFP) sequence. For each reconstruction method, the absolute volume difference (AVD) of the automatically and manually segmented LV volumes was used as the primary quantity to investigate whether 3D DL-based LV segmentation generalized better on Mcorr or Mres 3D whole-heart images. Additionally, we assessed the 3D Dice similarity coefficient between the manual and automatic LV masks of each reconstructed 3D whole-heart image and the sharpness of the LV myocardium-blood pool interface. A two-tail paired Student's <i>t</i>-test (alpha = 0.05) was used to test the significance in this study.</p><p><strong>Results & discussion: </strong>The AVD in the respiratory Mres reconstruction was lower than the AVD in the respiratory Mcorr reconstruction: 7.73 ± 6.54 ml vs. 20.0 ± 22.4 ml, respectively (<i>n</i> = 15, <i>p</i>-value = 0.03). The 3D Dice coefficient between the DL-segmented masks and the manually segmented masks was higher for Mres images than for Mcorr images: 0.90 ± 0.02 vs. 0.87 ± 0.03 respectively, with a <i>p</i>-value = 0.02. Sharpness on Mres images was higher than on Mcorr images: 0.15 ± 0.05 vs. 
0.12 ± 0.04, respectively, with a <i>p</i>-value of 0.014 (<i>n</i> = 15).</p><p><strong>Conclusion: </strong>We conclude that the DL-based 3D automatic LV segmentation network trained on CT images and fine-tuned on MR images generalized better on Mres images than on Mcorr images for quantifying LV volumes.</p>","PeriodicalId":73101,"journal":{"name":"Frontiers in radiology","volume":"3 ","pages":"1144004"},"PeriodicalIF":0.0,"publicationDate":"2023-06-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10365088/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10234001","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 2023-05-22 | eCollection Date: 2023-01-01 | DOI: 10.3389/fradi.2023.1180699
Anjali Agrawal
Millennial radiology is marked by technical disruptions. Advances in internet, digital communications, and computing technology have paved the way for digitalized workflow orchestration in busy radiology departments. The COVID pandemic brought teleradiology to the forefront, highlighting its importance in maintaining continuity of radiological services and making it an integral component of radiology practice. Increasing computing power and integrated multimodal data are driving the incorporation of artificial intelligence at various stages of the radiology imaging and reporting cycle. These developments have transformed, and will continue to transform, the career landscape in radiology, offering more options for radiologists with varied interests and career goals. The ability to work from anywhere at any time needs to be balanced with other aspects of life. Robust communication, internal and external collaboration, self-discipline, and self-motivation are key to achieving the desired balance while practicing radiology the unconventional way.
{"title":"Digital transformation of career landscapes in radiology: personal and professional implications.","authors":"Anjali Agrawal","doi":"10.3389/fradi.2023.1180699","DOIUrl":"10.3389/fradi.2023.1180699","url":null,"abstract":"<p><p>Millennial radiology is marked by technical disruptions. Advances in internet, digital communications and computing technology, paved way for digitalized workflow orchestration of busy radiology departments. The COVID pandemic brought teleradiology to the forefront, highlighting its importance in maintaining continuity of radiological services, making it an integral component of the radiology practice. Increasing computing power and integrated multimodal data are driving incorporation of artificial intelligence at various stages of the radiology image and reporting cycle. These have and will continue to transform the career landscape in radiology, with more options for radiologists with varied interests and career goals. The ability to work from anywhere and anytime needs to be balanced with other aspects of life. Robust communication, internal and external collaboration, self-discipline, and self-motivation are key to achieving the desired balance while practicing radiology the unconventional way.</p>","PeriodicalId":73101,"journal":{"name":"Frontiers in radiology","volume":"3 ","pages":"1180699"},"PeriodicalIF":0.0,"publicationDate":"2023-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10364979/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10233998","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}