Deep transfer learning hybrid techniques for precision in breast cancer tumor histopathology classification.

IF 3.4 3区 医学 Q1 MEDICAL INFORMATICS Health Information Science and Systems Pub Date : 2025-02-11 eCollection Date: 2025-12-01 DOI:10.1007/s13755-025-00337-7
Muniraj Gupta, Nidhi Verma, Naveen Sharma, Satyendra Narayan Singh, R K Brojen Singh, Saurabh Kumar Sharma
{"title":"Deep transfer learning hybrid techniques for precision in breast cancer tumor histopathology classification.","authors":"Muniraj Gupta, Nidhi Verma, Naveen Sharma, Satyendra Narayan Singh, R K Brojen Singh, Saurabh Kumar Sharma","doi":"10.1007/s13755-025-00337-7","DOIUrl":null,"url":null,"abstract":"<p><p>The breast cancer is one of the most prevalent causes of cancer-related death globally. Preliminary diagnosis of breast cancer increases the patient's chances of survival. Breast cancer classification is a challenging problem due to dense tissue structures, subtle variations, cellular heterogeneity, artifacts, and variability. In this paper, we propose three hybrid deep-transfer learning models for breast cancer classification using histopathology images. These models use Xception model as a base model, and we add seven more layers to fine-tune the base model. We also performed an extensive comparative analysis of five prominent machine-learning classifiers, namely Random Forest Classifier (RFC), Logistic Regression (LR), Support Vector Classifier (SVC), K-Nearest Neighbors (KNN), and Ada-boost. We incorporate the best performing two classifiers, namely RFC and SVC, in the fine-tuned Xception model, and accordingly, they are named as Xception Random Forest (XRF) and Xception Support Vector (XSV), respectively. The fine-tuned Xception model with softmax classifier is termed as Multi-layer Xception Classifier (MXC). These three models are evaluated on the two publically available datasets: BreakHis and Breast Histopathology Images Database (BHID). Our all three models perform better than the state-of-the-art methods. The XRF provides the best performance at the 40 × magnification level on the BreakHis dataset, with an accuracy (ACC) of 94.44%, F1 score (F1) of 94.44%, area under the receiver operating characteristic curve (AUC) of 95.12%, Matthew's correlation coefficient (MCC) of 88.98%, kappa (K) of 88.88%, and classification success index (CSI) of 89.23%. The MXC provides the best performance on the BHID dataset, with an ACC of 88.50%, F1 of 88.50%, AUC of 95.12%, MCC of 77.03%, K of 77.00%, and CSI of 79.13%. Further, to validate our models, we performed fivefold cross-validation on both datasets and obtained similar results.</p>","PeriodicalId":46312,"journal":{"name":"Health Information Science and Systems","volume":"13 1","pages":"20"},"PeriodicalIF":3.4000,"publicationDate":"2025-02-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11813847/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Health Information Science and Systems","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s13755-025-00337-7","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/12/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}
引用次数: 0

Abstract

The breast cancer is one of the most prevalent causes of cancer-related death globally. Preliminary diagnosis of breast cancer increases the patient's chances of survival. Breast cancer classification is a challenging problem due to dense tissue structures, subtle variations, cellular heterogeneity, artifacts, and variability. In this paper, we propose three hybrid deep-transfer learning models for breast cancer classification using histopathology images. These models use Xception model as a base model, and we add seven more layers to fine-tune the base model. We also performed an extensive comparative analysis of five prominent machine-learning classifiers, namely Random Forest Classifier (RFC), Logistic Regression (LR), Support Vector Classifier (SVC), K-Nearest Neighbors (KNN), and Ada-boost. We incorporate the best performing two classifiers, namely RFC and SVC, in the fine-tuned Xception model, and accordingly, they are named as Xception Random Forest (XRF) and Xception Support Vector (XSV), respectively. The fine-tuned Xception model with softmax classifier is termed as Multi-layer Xception Classifier (MXC). These three models are evaluated on the two publically available datasets: BreakHis and Breast Histopathology Images Database (BHID). Our all three models perform better than the state-of-the-art methods. The XRF provides the best performance at the 40 × magnification level on the BreakHis dataset, with an accuracy (ACC) of 94.44%, F1 score (F1) of 94.44%, area under the receiver operating characteristic curve (AUC) of 95.12%, Matthew's correlation coefficient (MCC) of 88.98%, kappa (K) of 88.88%, and classification success index (CSI) of 89.23%. The MXC provides the best performance on the BHID dataset, with an ACC of 88.50%, F1 of 88.50%, AUC of 95.12%, MCC of 77.03%, K of 77.00%, and CSI of 79.13%. Further, to validate our models, we performed fivefold cross-validation on both datasets and obtained similar results.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
深度迁移学习混合技术在乳腺癌肿瘤组织病理分类中的应用。
乳腺癌是全球癌症相关死亡的最普遍原因之一。乳腺癌的初步诊断增加了患者的生存机会。由于致密的组织结构、细微的变异、细胞异质性、伪影和可变性,乳腺癌的分类是一个具有挑战性的问题。在本文中,我们提出了三种混合深度迁移学习模型,用于使用组织病理学图像进行乳腺癌分类。这些模型使用Xception模型作为基本模型,我们添加了7个层来微调基本模型。我们还对五种著名的机器学习分类器进行了广泛的比较分析,即随机森林分类器(RFC)、逻辑回归(LR)、支持向量分类器(SVC)、k -近邻(KNN)和Ada-boost。我们将性能最好的两个分类器,即RFC和SVC,结合在微调的异常模型中,因此,它们分别被命名为异常随机森林(XRF)和异常支持向量(XSV)。使用softmax分类器的微调异常模型称为多层异常分类器(MXC)。这三种模型在两个公开可用的数据集上进行评估:BreakHis和乳腺组织病理学图像数据库(BHID)。我们的三个模型都比最先进的方法表现得更好。在BreakHis数据集上,XRF在40倍放大水平下表现最佳,准确率(ACC)为94.44%,F1评分(F1)为94.44%,受检者工作特征曲线下面积(AUC)为95.12%,马修相关系数(MCC)为88.98%,kappa (K)为88.88%,分类成功指数(CSI)为89.23%。MXC在BHID数据集上表现最佳,ACC为88.50%,F1为88.50%,AUC为95.12%,MCC为77.03%,K为77.00%,CSI为79.13%。此外,为了验证我们的模型,我们对两个数据集进行了五倍交叉验证,并获得了相似的结果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
11.30
自引率
5.00%
发文量
30
期刊介绍: Health Information Science and Systems is a multidisciplinary journal that integrates artificial intelligence/computer science/information technology with health science and services, embracing information science research coupled with topics related to the modeling, design, development, integration and management of health information systems, smart health, artificial intelligence in medicine, and computer aided diagnosis, medical expert systems. The scope includes: i.) smart health, artificial Intelligence in medicine, computer aided diagnosis, medical image processing, medical expert systems ii.) medical big data, medical/health/biomedicine information resources such as patient medical records, devices and equipments, software and tools to capture, store, retrieve, process, analyze, optimize the use of information in the health domain, iii.) data management, data mining, and knowledge discovery, all of which play a key role in decision making, management of public health, examination of standards, privacy and security issues, iv.) development of new architectures and applications for health information systems.
期刊最新文献
UMEval: a unified framework for explainable medical term semantic evaluation with large language models. Detecting stress from videos via intra-subject and inter-subject learning. ConsTCM: aligning fundus images with constitution differentiation in multimodal language model for Traditional Chinese Medicine. Enhancing LLM-based medical decision-making by test-time knowledge acquisition. DALI-Syn: integrating chemical LLMs with multi-level attention architectures for synergistic drug combination prediction.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1