Computational analysis of variability and uncertainty in the clinical reference on magnetic resonance imaging radiomics: modelling and performance.

IF 6 4区计算机科学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Visual Computing for Industry Biomedicine and Art Pub Date : 2024-11-19 DOI:10.1186/s42492-024-00180-9

Cindy Xue, Jing Yuan, Gladys G Lo, Darren M C Poon, Winnie Cw Chu

{"title":"Computational analysis of variability and uncertainty in the clinical reference on magnetic resonance imaging radiomics: modelling and performance.","authors":"Cindy Xue, Jing Yuan, Gladys G Lo, Darren M C Poon, Winnie Cw Chu","doi":"10.1186/s42492-024-00180-9","DOIUrl":null,"url":null,"abstract":"<p><p>To conduct a computational investigation to explore the influence of clinical reference uncertainty on magnetic resonance imaging (MRI) radiomics feature selection, modelling, and performance. This study used two sets of publicly available prostate cancer MRI = radiomics data (Dataset 1: n = 260; Dataset 2: n = 100) with Gleason score clinical references. Each dataset was divided into training and holdout testing datasets at a ratio of 7:3 and analysed independently. The clinical references of the training set were permuted at different levels (increments of 5%) and repeated 20 times. Four feature selection algorithms and two classifiers were used to construct the models. Cross-validation was employed for training, while a separate hold-out testing set was used for evaluation. The Jaccard similarity coefficient was used to evaluate feature selection, while the area under the curve (AUC) and accuracy were used to assess model performance. An analysis of variance test with Bonferroni correction was conducted to compare the metrics of each model. The consistency of the feature selection performance decreased substantially with the clinical reference permutation. AUCs of the trained models with permutation particularly after 20% were significantly lower (Dataset 1 (with ≥ 20% permutation): 0.67, and Dataset 2 (≥ 20% permutation): 0.74), compared to the AUC of models without permutation (Dataset 1: 0.94, Dataset 2: 0.97). The performances of the models were also associated with larger uncertainties and an increasing number of permuted clinical references. Clinical reference uncertainty can substantially influence MRI radiomic feature selection and modelling. The high accuracy of clinical references should be helpful in building reliable and robust radiomic models. Careful interpretation of the model performance is necessary, particularly for high-dimensional data.</p>","PeriodicalId":29931,"journal":{"name":"Visual Computing for Industry Biomedicine and Art","volume":"7 1","pages":"28"},"PeriodicalIF":6.0000,"publicationDate":"2024-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11573982/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Visual Computing for Industry Biomedicine and Art","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1186/s42492-024-00180-9","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}

引用次数: 0

Abstract

To conduct a computational investigation to explore the influence of clinical reference uncertainty on magnetic resonance imaging (MRI) radiomics feature selection, modelling, and performance. This study used two sets of publicly available prostate cancer MRI = radiomics data (Dataset 1: n = 260; Dataset 2: n = 100) with Gleason score clinical references. Each dataset was divided into training and holdout testing datasets at a ratio of 7:3 and analysed independently. The clinical references of the training set were permuted at different levels (increments of 5%) and repeated 20 times. Four feature selection algorithms and two classifiers were used to construct the models. Cross-validation was employed for training, while a separate hold-out testing set was used for evaluation. The Jaccard similarity coefficient was used to evaluate feature selection, while the area under the curve (AUC) and accuracy were used to assess model performance. An analysis of variance test with Bonferroni correction was conducted to compare the metrics of each model. The consistency of the feature selection performance decreased substantially with the clinical reference permutation. AUCs of the trained models with permutation particularly after 20% were significantly lower (Dataset 1 (with ≥ 20% permutation): 0.67, and Dataset 2 (≥ 20% permutation): 0.74), compared to the AUC of models without permutation (Dataset 1: 0.94, Dataset 2: 0.97). The performances of the models were also associated with larger uncertainties and an increasing number of permuted clinical references. Clinical reference uncertainty can substantially influence MRI radiomic feature selection and modelling. The high accuracy of clinical references should be helpful in building reliable and robust radiomic models. Careful interpretation of the model performance is necessary, particularly for high-dimensional data.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

磁共振成像放射组学临床参考文献中变异性和不确定性的计算分析：建模与性能。

进行计算研究，探索临床参考不确定性对磁共振成像（MRI）放射组学特征选择、建模和性能的影响。本研究使用了两组公开的前列腺癌磁共振成像 = 放射组学数据（数据集 1：n = 260；数据集 2：n = 100），其中包含格里森评分临床参考值。每个数据集按 7:3 的比例分为训练数据集和暂停测试数据集，并进行独立分析。训练集的临床参考资料按不同级别（增量为 5%）进行置换，并重复 20 次。在构建模型时使用了四种特征选择算法和两种分类器。训练时采用交叉验证，评估时则使用单独的保留测试集。Jaccard 相似系数用于评估特征选择，而曲线下面积（AUC）和准确率则用于评估模型性能。对每个模型的指标进行了带 Bonferroni 校正的方差分析测试比较。特征选择性能的一致性随着临床参照排列的增加而大大降低。经过训练的模型的AUCs，尤其是20%以后的包被率明显降低（数据集1（包被率≥20%）：0.67；数据集2（包被率≥20%）：0.67）：0.67，数据集 2（≥ 20% 变异）：0.74）：数据集 1：0.94；数据集 2：0.97）。模型的性能还与不确定性的增大和置换临床参考文献数量的增加有关。临床参考文献的不确定性会严重影响磁共振成像放射学特征选择和建模。临床参考文献的高准确性应有助于建立可靠、稳健的放射学模型。有必要对模型性能进行仔细解读，尤其是高维数据。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Visual Computing for Industry Biomedicine and Art Multiple-

CiteScore

5.60

自引率

0.00%

发文量