An interpretable deep learning model for detecting BRCA pathogenic variants of breast cancer from hematoxylin and eosin-stained pathological images.

IF 2.4 3区 生物学 Q2 MULTIDISCIPLINARY SCIENCES PeerJ Pub Date : 2024-10-28 eCollection Date: 2024-01-01 DOI:10.7717/peerj.18098
Yi Li, Xiaomin Xiong, Xiaohua Liu, Yihan Wu, Xiaoju Li, Bo Liu, Bo Lin, Yu Li, Bo Xu
{"title":"An interpretable deep learning model for detecting <i>BRCA</i> pathogenic variants of breast cancer from hematoxylin and eosin-stained pathological images.","authors":"Yi Li, Xiaomin Xiong, Xiaohua Liu, Yihan Wu, Xiaoju Li, Bo Liu, Bo Lin, Yu Li, Bo Xu","doi":"10.7717/peerj.18098","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Determining the status of breast cancer susceptibility genes (<i>BRCA</i>) is crucial for guiding breast cancer treatment. Nevertheless, the need for <i>BRCA</i> genetic testing among breast cancer patients remains unmet due to high costs and limited resources. This study aimed to develop a Bi-directional Self-Attention Multiple Instance Learning (BiAMIL) algorithm to detect <i>BRCA</i> status from hematoxylin and eosin (H&E) pathological images.</p><p><strong>Methods: </strong>A total of 319 histopathological slides from 254 breast cancer patients were included, comprising two dependent cohorts. Following image pre-processing, 633,484 tumor tiles from the training dataset were employed to train the self-developed deep-learning model. The performance of the network was evaluated in the internal and external test sets.</p><p><strong>Results: </strong>BiAMIL achieved AUC values of 0.819 (95% CI [0.673-0.965]) in the internal test set, and 0.817 (95% CI [0.712-0.923]) in the external test set. To explore the relationship between <i>BRCA</i> status and interpretable morphological features in pathological images, we utilized Class Activation Mapping (CAM) technique and cluster analysis to investigate the connections between <i>BRCA</i> gene mutation status and tissue and cell features. Significantly, we observed that tumor-infiltrating lymphocytes and the morphological characteristics of tumor cells appeared to be potential features associated with <i>BRCA</i> status.</p><p><strong>Conclusions: </strong>An interpretable deep neural network model based on the attention mechanism was developed to predict the <i>BRCA</i> status in breast cancer. Keywords: Breast cancer, <i>BRCA</i>, deep learning, self-attention, interpretability.</p>","PeriodicalId":19799,"journal":{"name":"PeerJ","volume":"12 ","pages":"e18098"},"PeriodicalIF":2.4000,"publicationDate":"2024-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11526788/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"PeerJ","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.7717/peerj.18098","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Determining the status of breast cancer susceptibility genes (BRCA) is crucial for guiding breast cancer treatment. Nevertheless, the need for BRCA genetic testing among breast cancer patients remains unmet due to high costs and limited resources. This study aimed to develop a Bi-directional Self-Attention Multiple Instance Learning (BiAMIL) algorithm to detect BRCA status from hematoxylin and eosin (H&E) pathological images.

Methods: A total of 319 histopathological slides from 254 breast cancer patients were included, comprising two dependent cohorts. Following image pre-processing, 633,484 tumor tiles from the training dataset were employed to train the self-developed deep-learning model. The performance of the network was evaluated in the internal and external test sets.

Results: BiAMIL achieved AUC values of 0.819 (95% CI [0.673-0.965]) in the internal test set, and 0.817 (95% CI [0.712-0.923]) in the external test set. To explore the relationship between BRCA status and interpretable morphological features in pathological images, we utilized Class Activation Mapping (CAM) technique and cluster analysis to investigate the connections between BRCA gene mutation status and tissue and cell features. Significantly, we observed that tumor-infiltrating lymphocytes and the morphological characteristics of tumor cells appeared to be potential features associated with BRCA status.

Conclusions: An interpretable deep neural network model based on the attention mechanism was developed to predict the BRCA status in breast cancer. Keywords: Breast cancer, BRCA, deep learning, self-attention, interpretability.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
从苏木精和伊红染色病理图像中检测乳腺癌 BRCA 致病变体的可解释深度学习模型。
背景:确定乳腺癌易感基因(BRCA)的状态对于指导乳腺癌治疗至关重要。然而,由于成本高昂和资源有限,乳腺癌患者对 BRCA 基因检测的需求仍未得到满足。本研究旨在开发一种双向自注意多实例学习(BiAMIL)算法,从苏木精和伊红(H&E)病理图像中检测 BRCA 状态:共纳入了 254 名乳腺癌患者的 319 张组织病理切片,包括两个从属队列。经过图像预处理后,训练数据集中的 633,484 块肿瘤切片被用于训练自主开发的深度学习模型。在内部和外部测试集中对该网络的性能进行了评估:BiAMIL在内部测试集中的AUC值为0.819(95% CI [0.673-0.965]),在外部测试集中的AUC值为0.817(95% CI [0.712-0.923])。为了探索 BRCA 状态与病理图像中可解释的形态特征之间的关系,我们利用类激活图谱(CAM)技术和聚类分析来研究 BRCA 基因突变状态与组织和细胞特征之间的联系。值得注意的是,我们观察到肿瘤浸润淋巴细胞和肿瘤细胞的形态特征似乎是与 BRCA 状态相关的潜在特征:结论:基于注意力机制,我们建立了一个可解释的深度神经网络模型来预测乳腺癌的 BRCA 状态。关键词乳腺癌 BRCA 深度学习 自我注意 可解释性
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
PeerJ
PeerJ MULTIDISCIPLINARY SCIENCES-
CiteScore
4.70
自引率
3.70%
发文量
1665
审稿时长
10 weeks
期刊介绍: PeerJ is an open access peer-reviewed scientific journal covering research in the biological and medical sciences. At PeerJ, authors take out a lifetime publication plan (for as little as $99) which allows them to publish articles in the journal for free, forever. PeerJ has 5 Nobel Prize Winners on the Board; they have won several industry and media awards; and they are widely recognized as being one of the most interesting recent developments in academic publishing.
期刊最新文献
Effective degradation of zearalenone by multiple microbial isolates. Kinesiophobia and alexithymia in knee osteoarthritis: association with radiological severity. Validity and reliability of Insomnia Severity Index among older adults in Indonesia. Predictive value of D4Z4 methylation levels for phenotypic heterogeneity and disease progression in Facioscapulohumeral Muscular Dystrophy with borderline D4Z4 repeat units: a retrospective cohort study. Comparison of primers for the pathogenicity factors vvhA and rpoS in Vibrio vulnificus environmental isolates from the Texas Coastal Bend region of the Gulf of Mexico.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1