Improved Breast Cancer Diagnosis through Transfer Learning on Hematoxylin and Eosin Stained Histology Images

Fahad Ahmed, Reem Abdel-Salam, Leon Hamnett, Mary Adewunmi, Temitope Ayano
{"title":"Improved Breast Cancer Diagnosis through Transfer Learning on Hematoxylin and Eosin Stained Histology Images","authors":"Fahad Ahmed, Reem Abdel-Salam, Leon Hamnett, Mary Adewunmi, Temitope Ayano","doi":"arxiv-2309.08745","DOIUrl":null,"url":null,"abstract":"Breast cancer is one of the leading causes of death for women worldwide.\nEarly screening is essential for early identification, but the chance of\nsurvival declines as the cancer progresses into advanced stages. For this\nstudy, the most recent BRACS dataset of histological (H\\&E) stained images was\nused to classify breast cancer tumours, which contains both the whole-slide\nimages (WSI) and region-of-interest (ROI) images, however, for our study we\nhave considered ROI images. We have experimented using different pre-trained\ndeep learning models, such as Xception, EfficientNet, ResNet50, and\nInceptionResNet, pre-trained on the ImageNet weights. We pre-processed the\nBRACS ROI along with image augmentation, upsampling, and dataset split\nstrategies. For the default dataset split, the best results were obtained by\nResNet50 achieving 66\\% f1-score. For the custom dataset split, the best\nresults were obtained by performing upsampling and image augmentation which\nresults in 96.2\\% f1-score. Our second approach also reduced the number of\nfalse positive and false negative classifications to less than 3\\% for each\nclass. We believe that our study significantly impacts the early diagnosis and\nidentification of breast cancer tumors and their subtypes, especially atypical\nand malignant tumors, thus improving patient outcomes and reducing patient\nmortality rates. Overall, this study has primarily focused on identifying seven\n(7) breast cancer tumor subtypes, and we believe that the experimental models\ncan be fine-tuned further to generalize over previous breast cancer histology\ndatasets as well.","PeriodicalId":501321,"journal":{"name":"arXiv - QuanBio - Cell Behavior","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2023-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - QuanBio - Cell Behavior","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2309.08745","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Breast cancer is one of the leading causes of death for women worldwide. Early screening is essential for early identification, but the chance of survival declines as the cancer progresses into advanced stages. For this study, the most recent BRACS dataset of histological (H\&E) stained images was used to classify breast cancer tumours, which contains both the whole-slide images (WSI) and region-of-interest (ROI) images, however, for our study we have considered ROI images. We have experimented using different pre-trained deep learning models, such as Xception, EfficientNet, ResNet50, and InceptionResNet, pre-trained on the ImageNet weights. We pre-processed the BRACS ROI along with image augmentation, upsampling, and dataset split strategies. For the default dataset split, the best results were obtained by ResNet50 achieving 66\% f1-score. For the custom dataset split, the best results were obtained by performing upsampling and image augmentation which results in 96.2\% f1-score. Our second approach also reduced the number of false positive and false negative classifications to less than 3\% for each class. We believe that our study significantly impacts the early diagnosis and identification of breast cancer tumors and their subtypes, especially atypical and malignant tumors, thus improving patient outcomes and reducing patient mortality rates. Overall, this study has primarily focused on identifying seven (7) breast cancer tumor subtypes, and we believe that the experimental models can be fine-tuned further to generalize over previous breast cancer histology datasets as well.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
苏木精和伊红染色组织学图像迁移学习提高乳腺癌诊断
乳腺癌是全世界妇女死亡的主要原因之一。早期筛查对于早期识别至关重要,但随着癌症进展到晚期,存活的机会就会下降。在本研究中,最新的BRACS组织学(H\&E)染色图像数据集用于对乳腺癌肿瘤进行分类,其中包括全幻灯片图像(WSI)和感兴趣区域(ROI)图像,然而,在我们的研究中,我们考虑了ROI图像。我们尝试使用不同的预训练深度学习模型,如Xception、EfficientNet、ResNet50和inceptionresnet,在ImageNet权重上进行预训练。我们预处理了bracs ROI以及图像增强、上采样和数据集分割策略。对于默认的数据集分割,resnet50获得了最好的结果,达到66\% f1-score。对于自定义数据集分割,通过执行上采样和图像增强获得最佳结果,其结果为96.2% f1-score。我们的第二种方法还将每个类别的误阳性和误阴性分类数量减少到低于3%。我们认为,我们的研究对乳腺癌肿瘤及其亚型的早期诊断和鉴别,特别是非典型和恶性肿瘤的早期诊断和鉴别,从而改善患者的预后,降低患者的死亡率。总的来说,这项研究主要集中在确定7种乳腺癌肿瘤亚型,我们相信实验模型可以进一步微调,以推广以前的乳腺癌组织学数据集。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Persistent pseudopod splitting is an effective chemotaxis strategy in shallow gradients Geometric Effects in Large Scale Intracellular Flows Motion Ordering in Cellular Polar-polar and Polar-nonpolar Interactions Modelling how lamellipodia-driven cells maintain persistent migration and interact with external barriers Synchronized Memory-Dependent Intracellular Oscillations for a Cell-Bulk ODE-PDE Model in $\mathbb{R}^2$
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1