Towards automatical tumor segmentation in radiomics: a comparative analysis of various methods and radiologists for both region extraction and downstream diagnosis.
Ying Yu, Gang-Feng Li, Wei-Xiong Tan, Xiao-Yan Qu, Tao Zhang, Xing-Yi Hou, Yuan-Bo Zhu, Zhi-Ying Ma, Lu Yang, Ya Gao, Mei Yu, Cui Yue, Zhen Zhou, Yang Yang, Lin-Feng Yan, Guang-Bin Cui
{"title":"Towards automatical tumor segmentation in radiomics: a comparative analysis of various methods and radiologists for both region extraction and downstream diagnosis.","authors":"Ying Yu, Gang-Feng Li, Wei-Xiong Tan, Xiao-Yan Qu, Tao Zhang, Xing-Yi Hou, Yuan-Bo Zhu, Zhi-Ying Ma, Lu Yang, Ya Gao, Mei Yu, Cui Yue, Zhen Zhou, Yang Yang, Lin-Feng Yan, Guang-Bin Cui","doi":"10.1186/s12880-025-01596-2","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>By discussing the difference, stability and classification ability of tumor contour extracted by artificial intelligence and doctors, can a more stable method of tumor contour extraction be obtained?</p><p><strong>Methods: </strong>We propose a novel framework for the automatic segmentation of lung tumor contours and the differential diagnosis of downstream tasks. This framework integrates four key modules: tumor segmentation, extraction of radiomic features, feature selection, and the development of diagnostic models for clinical applications. Using this framework, we conducted a study involving a cohort of 1,429 patients suspected of lung cancer. Four automatic segmentation methods (RNN, UNET, WFCM, and SNAKE) were evaluated against manual segmentation performed by three radiologists with varying levels of expertise. We further studied the consistency of radiomic features extracted from these methods and evaluates their diagnostic performance across three downstream tasks: benign vs. malignant classification, lung adenocarcinoma infiltration, and lung nodule density classification.</p><p><strong>Results: </strong>The Dice coefficient of RNN is the highest among the four automatic segmentation methods (0.803 > 0.751, 0.576, 0.560), and all P < 0.05. In the consistency comparison of the seven contour-extracted radiomic features, that the features extracted by RNN and S1 (the senior radiologist) showed the highest similarity which was higher than the other automatic segmentation methods and doctors with low seniority. In all three downstream tasks, the radiomic features extracted from RNN segmentation contours showed the highest diagnostic discrimination. In the classification of benign and malignant nodules, the RNN method performed slightly better than the S1 method, with an AUC of 0.840 ± 0.01 and 0.824 ± 0.015, respectively, and significantly better than the other five methods. Similarly, the RNN method had an AUC value of 0.946 in lung adenocarcinoma infiltration, and a kappa value of 0.729 in lung nodule density classification, both of which were better than the other six methods.</p><p><strong>Conclusions: </strong>Our findings suggest that AI-driven tumor segmentation methods can enhance clinical decision-making by providing reliable and reproducible results, ultimately emphasizing the auxiliary role of automated tumor contouring in clinical practice. The findings will have important implications for the application of radiomics in clinical practice.</p>","PeriodicalId":9020,"journal":{"name":"BMC Medical Imaging","volume":"25 1","pages":"63"},"PeriodicalIF":2.9000,"publicationDate":"2025-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11863488/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Medical Imaging","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12880-025-01596-2","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}
引用次数: 0
Abstract
Objective: By discussing the difference, stability and classification ability of tumor contour extracted by artificial intelligence and doctors, can a more stable method of tumor contour extraction be obtained?
Methods: We propose a novel framework for the automatic segmentation of lung tumor contours and the differential diagnosis of downstream tasks. This framework integrates four key modules: tumor segmentation, extraction of radiomic features, feature selection, and the development of diagnostic models for clinical applications. Using this framework, we conducted a study involving a cohort of 1,429 patients suspected of lung cancer. Four automatic segmentation methods (RNN, UNET, WFCM, and SNAKE) were evaluated against manual segmentation performed by three radiologists with varying levels of expertise. We further studied the consistency of radiomic features extracted from these methods and evaluates their diagnostic performance across three downstream tasks: benign vs. malignant classification, lung adenocarcinoma infiltration, and lung nodule density classification.
Results: The Dice coefficient of RNN is the highest among the four automatic segmentation methods (0.803 > 0.751, 0.576, 0.560), and all P < 0.05. In the consistency comparison of the seven contour-extracted radiomic features, that the features extracted by RNN and S1 (the senior radiologist) showed the highest similarity which was higher than the other automatic segmentation methods and doctors with low seniority. In all three downstream tasks, the radiomic features extracted from RNN segmentation contours showed the highest diagnostic discrimination. In the classification of benign and malignant nodules, the RNN method performed slightly better than the S1 method, with an AUC of 0.840 ± 0.01 and 0.824 ± 0.015, respectively, and significantly better than the other five methods. Similarly, the RNN method had an AUC value of 0.946 in lung adenocarcinoma infiltration, and a kappa value of 0.729 in lung nodule density classification, both of which were better than the other six methods.
Conclusions: Our findings suggest that AI-driven tumor segmentation methods can enhance clinical decision-making by providing reliable and reproducible results, ultimately emphasizing the auxiliary role of automated tumor contouring in clinical practice. The findings will have important implications for the application of radiomics in clinical practice.
期刊介绍:
BMC Medical Imaging is an open access journal publishing original peer-reviewed research articles in the development, evaluation, and use of imaging techniques and image processing tools to diagnose and manage disease.