Enhancing thyroid nodule classification: A comprehensive analysis of feature selection in thermography

IF 3.4 3区 物理与天体物理 Q2 INSTRUMENTS & INSTRUMENTATION Infrared Physics & Technology Pub Date : 2025-03-01 Epub Date: 2025-01-30 DOI:10.1016/j.infrared.2025.105730
Mahnaz Etehadtavakol , Mojtaba Sirati-Amsheh , Golnaz Moallem , Eddie Yin Kwee Ng
{"title":"Enhancing thyroid nodule classification: A comprehensive analysis of feature selection in thermography","authors":"Mahnaz Etehadtavakol ,&nbsp;Mojtaba Sirati-Amsheh ,&nbsp;Golnaz Moallem ,&nbsp;Eddie Yin Kwee Ng","doi":"10.1016/j.infrared.2025.105730","DOIUrl":null,"url":null,"abstract":"<div><div>Early detection of thyroid malignancies is crucial, yet traditional diagnostic methods are often costly and carry inherent risks. Thermography presents a non-invasive alternative, but existing studies frequently lack comprehensive methodological frameworks for broader applications. In the realm of machine learning and classification, feature selection is pivotal for enhancing model performance by reducing overfitting, shortening training times, minimizing dimensionality, improving interpretability, and focusing on the most relevant features.</div><div>This study aims to identify the most informative features and evaluate the efficacy of various feature selection techniques—both unsupervised and supervised (filter, wrapper, and embedded)—in improving the classification accuracy of thyroid nodules using thermography images. Multiple machine learning models, including Support Vector Machines, Random Forest, Decision Tree, AdaBoost, and XGBoost, were assessed as classifiers utilizing group k-fold cross-validation.</div><div>Among the feature selection methods, LASSO (supervised embedding-based feature selection) showed the best performance, achieving 86% accuracy with an AUC of 0.91 for the random forest model and 86 % accuracy with an AUC of 0.92 for the XGBoost model. This research underscores the critical role of feature selection in the classification of thyroid nodules using thermography, providing valuable insights for advancing non-invasive diagnostic methodologies in thyroid assessment.</div></div>","PeriodicalId":13549,"journal":{"name":"Infrared Physics & Technology","volume":"145 ","pages":"Article 105730"},"PeriodicalIF":3.4000,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Infrared Physics & Technology","FirstCategoryId":"101","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1350449525000234","RegionNum":3,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/30 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"INSTRUMENTS & INSTRUMENTATION","Score":null,"Total":0}
引用次数: 0

Abstract

Early detection of thyroid malignancies is crucial, yet traditional diagnostic methods are often costly and carry inherent risks. Thermography presents a non-invasive alternative, but existing studies frequently lack comprehensive methodological frameworks for broader applications. In the realm of machine learning and classification, feature selection is pivotal for enhancing model performance by reducing overfitting, shortening training times, minimizing dimensionality, improving interpretability, and focusing on the most relevant features.
This study aims to identify the most informative features and evaluate the efficacy of various feature selection techniques—both unsupervised and supervised (filter, wrapper, and embedded)—in improving the classification accuracy of thyroid nodules using thermography images. Multiple machine learning models, including Support Vector Machines, Random Forest, Decision Tree, AdaBoost, and XGBoost, were assessed as classifiers utilizing group k-fold cross-validation.
Among the feature selection methods, LASSO (supervised embedding-based feature selection) showed the best performance, achieving 86% accuracy with an AUC of 0.91 for the random forest model and 86 % accuracy with an AUC of 0.92 for the XGBoost model. This research underscores the critical role of feature selection in the classification of thyroid nodules using thermography, providing valuable insights for advancing non-invasive diagnostic methodologies in thyroid assessment.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
增强甲状腺结节分类:热成像特征选择的综合分析
早期检测甲状腺恶性肿瘤是至关重要的,但传统的诊断方法往往是昂贵的,并携带固有的风险。热成像提供了一种非侵入性的替代方法,但现有的研究往往缺乏广泛应用的综合方法框架。在机器学习和分类领域,特征选择是通过减少过拟合、缩短训练时间、最小化维数、提高可解释性和关注最相关特征来提高模型性能的关键。本研究旨在识别最具信息量的特征,并评估各种特征选择技术(包括无监督和有监督的(过滤、包装和嵌入))在使用热成像图像提高甲状腺结节分类准确性方面的有效性。多种机器学习模型,包括支持向量机、随机森林、决策树、AdaBoost和XGBoost,被评估为分类器,使用k-fold交叉验证。在特征选择方法中,LASSO(基于监督嵌入的特征选择)表现出最好的性能,随机森林模型的准确率为86%,AUC为0.91,XGBoost模型的准确率为86%,AUC为0.92。本研究强调了特征选择在甲状腺结节热成像分类中的关键作用,为推进甲状腺评估的非侵入性诊断方法提供了有价值的见解。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
5.70
自引率
12.10%
发文量
400
审稿时长
67 days
期刊介绍: The Journal covers the entire field of infrared physics and technology: theory, experiment, application, devices and instrumentation. Infrared'' is defined as covering the near, mid and far infrared (terahertz) regions from 0.75um (750nm) to 1mm (300GHz.) Submissions in the 300GHz to 100GHz region may be accepted at the editors discretion if their content is relevant to shorter wavelengths. Submissions must be primarily concerned with and directly relevant to this spectral region. Its core topics can be summarized as the generation, propagation and detection, of infrared radiation; the associated optics, materials and devices; and its use in all fields of science, industry, engineering and medicine. Infrared techniques occur in many different fields, notably spectroscopy and interferometry; material characterization and processing; atmospheric physics, astronomy and space research. Scientific aspects include lasers, quantum optics, quantum electronics, image processing and semiconductor physics. Some important applications are medical diagnostics and treatment, industrial inspection and environmental monitoring.
期刊最新文献
Detecting heterogeneities in multispectral transmission images via variable threshold terraced compression and visual display methods A physics-driven heat conduction framework for edge detection in thermal infrared images Kaiser-windowed linear frequency modulation infrared thermographic detection of low-velocity impact damage in CFRP Yb:YGG, crystal growth, concentration dependent, spectroscopy and the luminescence mechanisms Distribution of potential in the p-region of HgCdTe n-on-p large-format focal plane arrays
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1