An interpretable deep learning model for detecting BRCA pathogenic variants of breast cancer from hematoxylin and eosin-stained pathological images.

IF 4.6 Q2 MATERIALS SCIENCE, BIOMATERIALS ACS Applied Bio Materials Pub Date : 2024-10-28 eCollection Date: 2024-01-01 DOI:10.7717/peerj.18098

Yi Li, Xiaomin Xiong, Xiaohua Liu, Yihan Wu, Xiaoju Li, Bo Liu, Bo Lin, Yu Li, Bo Xu

{"title":"An interpretable deep learning model for detecting BRCA pathogenic variants of breast cancer from hematoxylin and eosin-stained pathological images.","authors":"Yi Li, Xiaomin Xiong, Xiaohua Liu, Yihan Wu, Xiaoju Li, Bo Liu, Bo Lin, Yu Li, Bo Xu","doi":"10.7717/peerj.18098","DOIUrl":null,"url":null,"abstract":"Background: Determining the status of breast cancer susceptibility genes (BRCA) is crucial for guiding breast cancer treatment. Nevertheless, the need for BRCA genetic testing among breast cancer patients remains unmet due to high costs and limited resources. This study aimed to develop a Bi-directional Self-Attention Multiple Instance Learning (BiAMIL) algorithm to detect BRCA status from hematoxylin and eosin (H&E) pathological images.Methods: A total of 319 histopathological slides from 254 breast cancer patients were included, comprising two dependent cohorts. Following image pre-processing, 633,484 tumor tiles from the training dataset were employed to train the self-developed deep-learning model. The performance of the network was evaluated in the internal and external test sets.Results: BiAMIL achieved AUC values of 0.819 (95% CI [0.673-0.965]) in the internal test set, and 0.817 (95% CI [0.712-0.923]) in the external test set. To explore the relationship between BRCA status and interpretable morphological features in pathological images, we utilized Class Activation Mapping (CAM) technique and cluster analysis to investigate the connections between BRCA gene mutation status and tissue and cell features. Significantly, we observed that tumor-infiltrating lymphocytes and the morphological characteristics of tumor cells appeared to be potential features associated with BRCA status.Conclusions: An interpretable deep neural network model based on the attention mechanism was developed to predict the BRCA status in breast cancer. Keywords: Breast cancer, BRCA, deep learning, self-attention, interpretability.","PeriodicalId":2,"journal":{"name":"ACS Applied Bio Materials","volume":null,"pages":null},"PeriodicalIF":4.6000,"publicationDate":"2024-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11526788/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Applied Bio Materials","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.7717/peerj.18098","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"MATERIALS SCIENCE, BIOMATERIALS","Score":null,"Total":0}

引用次数: 0

Abstract

Background: Determining the status of breast cancer susceptibility genes (BRCA) is crucial for guiding breast cancer treatment. Nevertheless, the need for BRCA genetic testing among breast cancer patients remains unmet due to high costs and limited resources. This study aimed to develop a Bi-directional Self-Attention Multiple Instance Learning (BiAMIL) algorithm to detect BRCA status from hematoxylin and eosin (H&E) pathological images.

Methods: A total of 319 histopathological slides from 254 breast cancer patients were included, comprising two dependent cohorts. Following image pre-processing, 633,484 tumor tiles from the training dataset were employed to train the self-developed deep-learning model. The performance of the network was evaluated in the internal and external test sets.

Results: BiAMIL achieved AUC values of 0.819 (95% CI [0.673-0.965]) in the internal test set, and 0.817 (95% CI [0.712-0.923]) in the external test set. To explore the relationship between BRCA status and interpretable morphological features in pathological images, we utilized Class Activation Mapping (CAM) technique and cluster analysis to investigate the connections between BRCA gene mutation status and tissue and cell features. Significantly, we observed that tumor-infiltrating lymphocytes and the morphological characteristics of tumor cells appeared to be potential features associated with BRCA status.

Conclusions: An interpretable deep neural network model based on the attention mechanism was developed to predict the BRCA status in breast cancer. Keywords: Breast cancer, BRCA, deep learning, self-attention, interpretability.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

从苏木精和伊红染色病理图像中检测乳腺癌 BRCA 致病变体的可解释深度学习模型。

背景：确定乳腺癌易感基因（BRCA）的状态对于指导乳腺癌治疗至关重要。然而，由于成本高昂和资源有限，乳腺癌患者对 BRCA 基因检测的需求仍未得到满足。本研究旨在开发一种双向自注意多实例学习（BiAMIL）算法，从苏木精和伊红（H&E）病理图像中检测 BRCA 状态：共纳入了 254 名乳腺癌患者的 319 张组织病理切片，包括两个从属队列。经过图像预处理后，训练数据集中的 633,484 块肿瘤切片被用于训练自主开发的深度学习模型。在内部和外部测试集中对该网络的性能进行了评估：BiAMIL在内部测试集中的AUC值为0.819（95% CI [0.673-0.965]），在外部测试集中的AUC值为0.817（95% CI [0.712-0.923]）。为了探索 BRCA 状态与病理图像中可解释的形态特征之间的关系，我们利用类激活图谱（CAM）技术和聚类分析来研究 BRCA 基因突变状态与组织和细胞特征之间的联系。值得注意的是，我们观察到肿瘤浸润淋巴细胞和肿瘤细胞的形态特征似乎是与 BRCA 状态相关的潜在特征：结论：基于注意力机制，我们建立了一个可解释的深度神经网络模型来预测乳腺癌的 BRCA 状态。关键词乳腺癌 BRCA 深度学习自我注意可解释性

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊