Deep learning-based classification of breast cancer molecular subtypes from H&E whole-slide images

Q2 Medicine Journal of Pathology Informatics Pub Date : 2025-01-01 Epub Date: 2024-11-17 DOI:10.1016/j.jpi.2024.100410

Masoud Tafavvoghi , Anders Sildnes , Mehrdad Rakaee , Nikita Shvetsov , Lars Ailo Bongo , Lill-Tove Rasmussen Busund , Kajsa Møllersen

{"title":"Deep learning-based classification of breast cancer molecular subtypes from H&E whole-slide images","authors":"Masoud Tafavvoghi , Anders Sildnes , Mehrdad Rakaee , Nikita Shvetsov , Lars Ailo Bongo , Lill-Tove Rasmussen Busund , Kajsa Møllersen","doi":"10.1016/j.jpi.2024.100410","DOIUrl":null,"url":null,"abstract":"<div><div>Classifying breast cancer molecular subtypes is crucial for tailoring treatment strategies. While immunohistochemistry (IHC) and gene expression profiling are standard methods for molecular subtyping, IHC can be subjective, and gene profiling is costly and not widely accessible in many regions. Previous approaches have highlighted the potential application of deep learning models on hematoxylin and eosin (H&E)-stained whole-slide images (WSIs) for molecular subtyping, but these efforts vary in their methods, datasets, and reported performance. In this work, we investigated whether H&E-stained WSIs could be solely leveraged to predict breast cancer molecular subtypes (luminal A, B, HER2-enriched, and Basal). We used 1433 WSIs of breast cancer in a two-step pipeline: first, classifying tumor and non-tumor tiles to use only the tumor regions for molecular subtyping; and second, employing a One-vs-Rest (OvR) strategy to train four binary OvR classifiers and aggregating their results using an eXtreme Gradient Boosting model. The pipeline was tested on 221 hold-out WSIs, achieving an F1 score of 0.95 for tumor vs non-tumor classification and a macro F1 score of 0.73 for molecular subtyping. Our findings suggest that, with further validation, supervised deep learning models could serve as supportive tools for molecular subtyping in breast cancer. Our codes are made available to facilitate ongoing research and development.</div></div>","PeriodicalId":37769,"journal":{"name":"Journal of Pathology Informatics","volume":"16 ","pages":"Article 100410"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11667687/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Pathology Informatics","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S215335392400049X","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/11/17 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"Medicine","Score":null,"Total":0}

引用次数: 0

Abstract

Classifying breast cancer molecular subtypes is crucial for tailoring treatment strategies. While immunohistochemistry (IHC) and gene expression profiling are standard methods for molecular subtyping, IHC can be subjective, and gene profiling is costly and not widely accessible in many regions. Previous approaches have highlighted the potential application of deep learning models on hematoxylin and eosin (H&E)-stained whole-slide images (WSIs) for molecular subtyping, but these efforts vary in their methods, datasets, and reported performance. In this work, we investigated whether H&E-stained WSIs could be solely leveraged to predict breast cancer molecular subtypes (luminal A, B, HER2-enriched, and Basal). We used 1433 WSIs of breast cancer in a two-step pipeline: first, classifying tumor and non-tumor tiles to use only the tumor regions for molecular subtyping; and second, employing a One-vs-Rest (OvR) strategy to train four binary OvR classifiers and aggregating their results using an eXtreme Gradient Boosting model. The pipeline was tested on 221 hold-out WSIs, achieving an F1 score of 0.95 for tumor vs non-tumor classification and a macro F1 score of 0.73 for molecular subtyping. Our findings suggest that, with further validation, supervised deep learning models could serve as supportive tools for molecular subtyping in breast cancer. Our codes are made available to facilitate ongoing research and development.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于深度学习的H&E全片图像乳腺癌分子亚型分类。

对乳腺癌分子亚型进行分类对于制定治疗策略至关重要。虽然免疫组织化学（IHC）和基因表达谱是分子分型的标准方法，但IHC可能是主观的，而且基因谱昂贵，而且在许多地区无法广泛获得。先前的方法强调了深度学习模型在苏木精和伊红（H&E）染色的全片图像（wsi）上用于分子亚型的潜在应用，但这些努力在方法、数据集和报告的性能方面各不相同。在这项工作中，我们研究了h&e染色的wsi是否可以单独用于预测乳腺癌的分子亚型（管腔A型、B型、her2富集型和基底型）。我们使用了1433例乳腺癌WSIs，分为两步：首先，对肿瘤和非肿瘤瓦片进行分类，仅使用肿瘤区域进行分子分型；其次，采用One-vs-Rest （OvR）策略训练4个二元OvR分类器，并使用极端梯度增强模型对其结果进行聚合。该管道在221例hold-out wsi中进行了测试，肿瘤与非肿瘤分类的F1得分为0.95，分子分型的宏观F1得分为0.73。我们的研究结果表明，经过进一步验证，监督深度学习模型可以作为乳腺癌分子分型的辅助工具。提供我们的代码是为了促进正在进行的研究和开发。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Journal of Pathology Informatics Medicine-Pathology and Forensic Medicine

CiteScore

3.70

自引率

0.00%

发文量

审稿时长

18 weeks

期刊介绍： The Journal of Pathology Informatics (JPI) is an open access peer-reviewed journal dedicated to the advancement of pathology informatics. This is the official journal of the Association for Pathology Informatics (API). The journal aims to publish broadly about pathology informatics and freely disseminate all articles worldwide. This journal is of interest to pathologists, informaticians, academics, researchers, health IT specialists, information officers, IT staff, vendors, and anyone with an interest in informatics. We encourage submissions from anyone with an interest in the field of pathology informatics. We publish all types of papers related to pathology informatics including original research articles, technical notes, reviews, viewpoints, commentaries, editorials, symposia, meeting abstracts, book reviews, and correspondence to the editors. All submissions are subject to rigorous peer review by the well-regarded editorial board and by expert referees in appropriate specialties.