Deep learning-based analysis of EGFR mutation prevalence in lung adenocarcinoma H&E whole slide images

IF 3.4 2区 医学 Q1 PATHOLOGY Journal of Pathology Clinical Research Pub Date : 2024-10-02 DOI:10.1002/2056-4538.70004
Jun Hyeong Park, June Hyuck Lim, Seonhwa Kim, Chul-Ho Kim, Jeong-Seok Choi, Jun Hyeok Lim, Lucia Kim, Jae Won Chang, Dongil Park, Myung-won Lee, Sup Kim, Il-Seok Park, Seung Hoon Han, Eun Shin, Jin Roh, Jaesung Heo
{"title":"Deep learning-based analysis of EGFR mutation prevalence in lung adenocarcinoma H&E whole slide images","authors":"Jun Hyeong Park,&nbsp;June Hyuck Lim,&nbsp;Seonhwa Kim,&nbsp;Chul-Ho Kim,&nbsp;Jeong-Seok Choi,&nbsp;Jun Hyeok Lim,&nbsp;Lucia Kim,&nbsp;Jae Won Chang,&nbsp;Dongil Park,&nbsp;Myung-won Lee,&nbsp;Sup Kim,&nbsp;Il-Seok Park,&nbsp;Seung Hoon Han,&nbsp;Eun Shin,&nbsp;Jin Roh,&nbsp;Jaesung Heo","doi":"10.1002/2056-4538.70004","DOIUrl":null,"url":null,"abstract":"<p><i>EGFR</i> mutations are a major prognostic factor in lung adenocarcinoma. However, current detection methods require sufficient samples and are costly. Deep learning is promising for mutation prediction in histopathological image analysis but has limitations in that it does not sufficiently reflect tumor heterogeneity and lacks interpretability. In this study, we developed a deep learning model to predict the presence of <i>EGFR</i> mutations by analyzing histopathological patterns in whole slide images (WSIs). We also introduced the <i>EGFR</i> mutation prevalence (EMP) score, which quantifies <i>EGFR</i> prevalence in WSIs based on patch-level predictions, and evaluated its interpretability and utility. Our model estimates the probability of EGFR prevalence in each patch by partitioning the WSI based on multiple-instance learning and predicts the presence of <i>EGFR</i> mutations at the slide level. We utilized a patch-masking scheduler training strategy to enable the model to learn various histopathological patterns of EGFR. This study included 868 WSI samples from lung adenocarcinoma patients collected from three medical institutions: Hallym University Medical Center, Inha University Hospital, and Chungnam National University Hospital. For the test dataset, 197 WSIs were collected from Ajou University Medical Center to evaluate the presence of <i>EGFR</i> mutations. Our model demonstrated prediction performance with an area under the receiver operating characteristic curve of 0.7680 (0.7607–0.7720) and an area under the precision-recall curve of 0.8391 (0.8326–0.8430). The EMP score showed Spearman correlation coefficients of 0.4705 (<i>p</i> = 0.0087) for p.L858R and 0.5918 (<i>p</i> = 0.0037) for exon 19 deletions in 64 samples subjected to next-generation sequencing analysis. Additionally, high EMP scores were associated with papillary and acinar patterns (<i>p</i> = 0.0038 and <i>p</i> = 0.0255, respectively), whereas low EMP scores were associated with solid patterns (<i>p</i> = 0.0001). These results validate the reliability of our model and suggest that it can provide crucial information for rapid screening and treatment plans.</p>","PeriodicalId":48612,"journal":{"name":"Journal of Pathology Clinical Research","volume":null,"pages":null},"PeriodicalIF":3.4000,"publicationDate":"2024-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11446692/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Pathology Clinical Research","FirstCategoryId":"3","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/2056-4538.70004","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PATHOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

EGFR mutations are a major prognostic factor in lung adenocarcinoma. However, current detection methods require sufficient samples and are costly. Deep learning is promising for mutation prediction in histopathological image analysis but has limitations in that it does not sufficiently reflect tumor heterogeneity and lacks interpretability. In this study, we developed a deep learning model to predict the presence of EGFR mutations by analyzing histopathological patterns in whole slide images (WSIs). We also introduced the EGFR mutation prevalence (EMP) score, which quantifies EGFR prevalence in WSIs based on patch-level predictions, and evaluated its interpretability and utility. Our model estimates the probability of EGFR prevalence in each patch by partitioning the WSI based on multiple-instance learning and predicts the presence of EGFR mutations at the slide level. We utilized a patch-masking scheduler training strategy to enable the model to learn various histopathological patterns of EGFR. This study included 868 WSI samples from lung adenocarcinoma patients collected from three medical institutions: Hallym University Medical Center, Inha University Hospital, and Chungnam National University Hospital. For the test dataset, 197 WSIs were collected from Ajou University Medical Center to evaluate the presence of EGFR mutations. Our model demonstrated prediction performance with an area under the receiver operating characteristic curve of 0.7680 (0.7607–0.7720) and an area under the precision-recall curve of 0.8391 (0.8326–0.8430). The EMP score showed Spearman correlation coefficients of 0.4705 (p = 0.0087) for p.L858R and 0.5918 (p = 0.0037) for exon 19 deletions in 64 samples subjected to next-generation sequencing analysis. Additionally, high EMP scores were associated with papillary and acinar patterns (p = 0.0038 and p = 0.0255, respectively), whereas low EMP scores were associated with solid patterns (p = 0.0001). These results validate the reliability of our model and suggest that it can provide crucial information for rapid screening and treatment plans.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于深度学习的肺腺癌 H&E 全切片图像中表皮生长因子受体突变率分析。
表皮生长因子受体突变是肺腺癌的一个主要预后因素。然而,目前的检测方法需要足够的样本且成本高昂。在组织病理图像分析中,深度学习有望用于突变预测,但其局限性在于不能充分反映肿瘤的异质性,并且缺乏可解释性。在这项研究中,我们开发了一种深度学习模型,通过分析全切片图像(WSI)中的组织病理学模式来预测表皮生长因子受体突变的存在。我们还引入了表皮生长因子受体突变流行率(EGFR mutation prevalence,EMP)评分,该评分基于斑块级预测量化 WSI 中的表皮生长因子受体流行率,并评估了其可解释性和实用性。我们的模型通过基于多实例学习的 WSI 分区来估算每个斑块的表皮生长因子受体突变率概率,并在切片水平上预测表皮生长因子受体突变的存在。我们采用了斑块屏蔽调度器训练策略,使模型能够学习 EGFR 的各种组织病理学模式。这项研究包括从三家医疗机构收集的 868 份肺腺癌患者 WSI 样本:这些样本分别来自韩国韩林大学医学中心、仁荷大学医院和忠南大学医院。在测试数据集中,197 份 WSI 样本来自 Ajou 大学医学中心,用于评估表皮生长因子受体突变的存在。我们的模型具有良好的预测性能,接收者操作特征曲线下面积为 0.7680(0.7607-0.7720),精确度-召回曲线下面积为 0.8391(0.8326-0.8430)。在进行下一代测序分析的 64 个样本中,p.L858R 和 19 号外显子缺失的 EMP 得分的 Spearman 相关系数分别为 0.4705(p = 0.0087)和 0.5918(p = 0.0037)。此外,高 EMP 分数与乳头状和针状模式相关(分别为 p = 0.0038 和 p = 0.0255),而低 EMP 分数与实性模式相关(p = 0.0001)。这些结果验证了我们模型的可靠性,并表明它能为快速筛查和治疗计划提供重要信息。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Journal of Pathology Clinical Research
Journal of Pathology Clinical Research Medicine-Pathology and Forensic Medicine
CiteScore
7.40
自引率
2.40%
发文量
47
审稿时长
20 weeks
期刊介绍: The Journal of Pathology: Clinical Research and The Journal of Pathology serve as translational bridges between basic biomedical science and clinical medicine with particular emphasis on, but not restricted to, tissue based studies. The focus of The Journal of Pathology: Clinical Research is the publication of studies that illuminate the clinical relevance of research in the broad area of the study of disease. Appropriately powered and validated studies with novel diagnostic, prognostic and predictive significance, and biomarker discover and validation, will be welcomed. Studies with a predominantly mechanistic basis will be more appropriate for the companion Journal of Pathology.
期刊最新文献
Large language models as a diagnostic support tool in neuropathology. Homologous recombination deficiency score is an independent prognostic factor in esophageal squamous cell carcinoma Breast cancer survival prediction using an automated mitosis detection pipeline Assessing the impact of deep-learning assistance on the histopathological diagnosis of serous tubal intraepithelial carcinoma (STIC) in fallopian tubes A clinically feasible algorithm for the parallel detection of glioma-associated copy number variation markers based on shallow whole genome sequencing
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1