{"title":"Multi-modal feature integration for thyroid nodule prediction: Combining clinical data with ultrasound-based deep features","authors":"Jing Li, Qiang Guo, Xingli Tan","doi":"10.1016/j.jrras.2024.101217","DOIUrl":null,"url":null,"abstract":"<div><h3>Objective</h3><div>This study presents an advanced machine learning (ML)-based framework for accurate risk stratification of thyroid nodules by integrating clinical, radiological, and deep imaging features.</div></div><div><h3>Methods</h3><div>We analyzed data from 580 patients with thyroid nodules, categorized from TIRADS 2 to 5. Clinical and radiological features (e.g., nodule size, TIRADS category, echogenicity) were combined with deep imaging features extracted from ultrasound scans using EfficientNet-B0. Predictive models were developed using XGBoost, Random Forest, and Support Vector Machine (SVM), resulting in clinical-only, imaging-only, and hybrid models. Additionally, a stacking-based meta-model integrated predictions from all three models to enhance performance. Model evaluation metrics included accuracy, F1-score, and AUC-ROC, with hyperparameter tuning applied to optimize outcomes.</div></div><div><h3>Results</h3><div>The clinical models showed strong predictive ability, with XGBoost achieving 81% accuracy, 0.85 AUC-ROC, and an F1-score of 0.82. Imaging models demonstrated the value of deep feature extraction, reaching up to 79% accuracy and 0.83 AUC-ROC. The hybrid model improved predictions further, with XGBoost achieving 85% accuracy and 0.87 AUC-ROC. The stacking ensemble model provided the best performance, achieving 87% accuracy, an F1-score of 0.87, and 0.90 AUC-ROC, demonstrating the benefits of multi-modal feature integration.</div></div><div><h3>Conclusion</h3><div>Our study shows that combining clinical, radiological, and deep imaging features significantly enhances the prediction of thyroid nodule malignancy. The stacking-based framework offers a scalable, reproducible tool to support more accurate clinical decision-making, reduce unnecessary biopsies, and improve diagnostic precision.</div></div>","PeriodicalId":16920,"journal":{"name":"Journal of Radiation Research and Applied Sciences","volume":"18 1","pages":"Article 101217"},"PeriodicalIF":1.7000,"publicationDate":"2024-11-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Radiation Research and Applied Sciences","FirstCategoryId":"103","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1687850724004011","RegionNum":4,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Objective
This study presents an advanced machine learning (ML)-based framework for accurate risk stratification of thyroid nodules by integrating clinical, radiological, and deep imaging features.
Methods
We analyzed data from 580 patients with thyroid nodules, categorized from TIRADS 2 to 5. Clinical and radiological features (e.g., nodule size, TIRADS category, echogenicity) were combined with deep imaging features extracted from ultrasound scans using EfficientNet-B0. Predictive models were developed using XGBoost, Random Forest, and Support Vector Machine (SVM), resulting in clinical-only, imaging-only, and hybrid models. Additionally, a stacking-based meta-model integrated predictions from all three models to enhance performance. Model evaluation metrics included accuracy, F1-score, and AUC-ROC, with hyperparameter tuning applied to optimize outcomes.
Results
The clinical models showed strong predictive ability, with XGBoost achieving 81% accuracy, 0.85 AUC-ROC, and an F1-score of 0.82. Imaging models demonstrated the value of deep feature extraction, reaching up to 79% accuracy and 0.83 AUC-ROC. The hybrid model improved predictions further, with XGBoost achieving 85% accuracy and 0.87 AUC-ROC. The stacking ensemble model provided the best performance, achieving 87% accuracy, an F1-score of 0.87, and 0.90 AUC-ROC, demonstrating the benefits of multi-modal feature integration.
Conclusion
Our study shows that combining clinical, radiological, and deep imaging features significantly enhances the prediction of thyroid nodule malignancy. The stacking-based framework offers a scalable, reproducible tool to support more accurate clinical decision-making, reduce unnecessary biopsies, and improve diagnostic precision.
期刊介绍:
Journal of Radiation Research and Applied Sciences provides a high quality medium for the publication of substantial, original and scientific and technological papers on the development and applications of nuclear, radiation and isotopes in biology, medicine, drugs, biochemistry, microbiology, agriculture, entomology, food technology, chemistry, physics, solid states, engineering, environmental and applied sciences.