Development and prognostic validation of a three-level NHG-like deep learning-based model for histological grading of breast cancer.

IF 5.6 1区医学 Q1 Medicine Breast Cancer Research Pub Date : 2024-01-29 DOI:10.1186/s13058-024-01770-4

Abhinav Sharma, Philippe Weitz, Yinxi Wang, Bojing Liu, Johan Vallon-Christersson, Johan Hartman, Mattias Rantalainen

{"title":"Development and prognostic validation of a three-level NHG-like deep learning-based model for histological grading of breast cancer.","authors":"Abhinav Sharma, Philippe Weitz, Yinxi Wang, Bojing Liu, Johan Vallon-Christersson, Johan Hartman, Mattias Rantalainen","doi":"10.1186/s13058-024-01770-4","DOIUrl":null,"url":null,"abstract":"Background: Histological grade is a well-known prognostic factor that is routinely assessed in breast tumours. However, manual assessment of Nottingham Histological Grade (NHG) has high inter-assessor and inter-laboratory variability, causing uncertainty in grade assignments. To address this challenge, we developed and validated a three-level NHG-like deep learning-based histological grade model (predGrade). The primary performance evaluation focuses on prognostic performance.Methods: This observational study is based on two patient cohorts (SöS-BC-4, N = 2421 (training and internal test); SCAN-B-Lund, N = 1262 (test)) that include routine histological whole-slide images (WSIs) together with patient outcomes. A deep convolutional neural network (CNN) model with an attention mechanism was optimised for the classification of the three-level histological grading (NHG) from haematoxylin and eosin-stained WSIs. The prognostic performance was evaluated by time-to-event analysis of recurrence-free survival and compared to clinical NHG grade assignments in the internal test set as well as in the fully independent external test cohort.Results: We observed effect sizes (hazard ratio) for grade 3 versus 1, for the conventional NHG method (HR = 2.60 (1.18-5.70 95%CI, p-value = 0.017)) and the deep learning model (HR = 2.27, 95%CI 1.07-4.82, p-value = 0.033) on the internal test set after adjusting for established clinicopathological risk factors. In the external test set, the unadjusted HR for clinical NHG 2 versus 1 was estimated to be 2.59 (p-value = 0.004) and clinical NHG 3 versus 1 was estimated to be 3.58 (p-value < 0.001). For predGrade, the unadjusted HR for predGrade 2 versus 1 HR = 2.52 (p-value = 0.030), and 4.07 (p-value = 0.001) for preGrade 3 versus 1 was observed in the independent external test set. In multivariable analysis, HR estimates for neither clinical NHG nor predGrade were found to be significant (p-value > 0.05). We tested for differences in HR estimates between NHG and predGrade in the independent test set and found no significant difference between the two classification models (p-value > 0.05), confirming similar prognostic performance between conventional NHG and predGrade.Conclusion: Routine histopathology assessment of NHG has a high degree of inter-assessor variability, motivating the development of model-based decision support to improve reproducibility in histological grading. We found that the proposed model (predGrade) provides a similar prognostic performance as clinical NHG. The results indicate that deep CNN-based models can be applied for breast cancer histological grading.","PeriodicalId":49227,"journal":{"name":"Breast Cancer Research","volume":"26 1","pages":"17"},"PeriodicalIF":5.6000,"publicationDate":"2024-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10823657/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Breast Cancer Research","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s13058-024-01770-4","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Medicine","Score":null,"Total":0}

引用次数: 0

Abstract

Background: Histological grade is a well-known prognostic factor that is routinely assessed in breast tumours. However, manual assessment of Nottingham Histological Grade (NHG) has high inter-assessor and inter-laboratory variability, causing uncertainty in grade assignments. To address this challenge, we developed and validated a three-level NHG-like deep learning-based histological grade model (predGrade). The primary performance evaluation focuses on prognostic performance.

Methods: This observational study is based on two patient cohorts (SöS-BC-4, N = 2421 (training and internal test); SCAN-B-Lund, N = 1262 (test)) that include routine histological whole-slide images (WSIs) together with patient outcomes. A deep convolutional neural network (CNN) model with an attention mechanism was optimised for the classification of the three-level histological grading (NHG) from haematoxylin and eosin-stained WSIs. The prognostic performance was evaluated by time-to-event analysis of recurrence-free survival and compared to clinical NHG grade assignments in the internal test set as well as in the fully independent external test cohort.

Results: We observed effect sizes (hazard ratio) for grade 3 versus 1, for the conventional NHG method (HR = 2.60 (1.18-5.70 95%CI, p-value = 0.017)) and the deep learning model (HR = 2.27, 95%CI 1.07-4.82, p-value = 0.033) on the internal test set after adjusting for established clinicopathological risk factors. In the external test set, the unadjusted HR for clinical NHG 2 versus 1 was estimated to be 2.59 (p-value = 0.004) and clinical NHG 3 versus 1 was estimated to be 3.58 (p-value < 0.001). For predGrade, the unadjusted HR for predGrade 2 versus 1 HR = 2.52 (p-value = 0.030), and 4.07 (p-value = 0.001) for preGrade 3 versus 1 was observed in the independent external test set. In multivariable analysis, HR estimates for neither clinical NHG nor predGrade were found to be significant (p-value > 0.05). We tested for differences in HR estimates between NHG and predGrade in the independent test set and found no significant difference between the two classification models (p-value > 0.05), confirming similar prognostic performance between conventional NHG and predGrade.

Conclusion: Routine histopathology assessment of NHG has a high degree of inter-assessor variability, motivating the development of model-based decision support to improve reproducibility in histological grading. We found that the proposed model (predGrade) provides a similar prognostic performance as clinical NHG. The results indicate that deep CNN-based models can be applied for breast cancer histological grading.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于深度学习的乳腺癌组织学分级三级 NHG 类模型的开发与预后验证。

背景：组织学分级是众所周知的预后因素，是乳腺肿瘤的常规评估指标。然而，诺丁汉组织学分级（NHG）的人工评估在评估者之间和实验室之间存在很大差异，导致分级结果不确定。为了应对这一挑战，我们开发并验证了一种基于深度学习的三级 NHG 样组织学分级模型（predGrade）。主要性能评估侧重于预后性能：这项观察性研究基于两个患者队列（SöS-BC-4，N = 2421（训练和内部测试）；SCAN-B-Lund，N = 1262（测试）），其中包括常规组织学全切片图像（WSI）和患者预后。我们优化了具有注意力机制的深度卷积神经网络（CNN）模型，用于对血色素和伊红染色的 WSIs 进行三级组织学分级（NHG）。通过对无复发生存期的时间到事件分析评估了该模型的预后性能，并与内部测试集和完全独立的外部测试队列中的临床 NHG 分级进行了比较：在调整了既定的临床病理学风险因素后，我们在内部测试集上观察到了传统 NHG 方法（HR = 2.60 (1.18-5.70 95%CI, p-value = 0.017)）和深度学习模型（HR = 2.27, 95%CI 1.07-4.82, p-value = 0.033）对 3 级与 1 级的效应大小（危险比）。在外部测试集中，临床 NHG 2 与 1 的未调整 HR 估计为 2.59（p 值 = 0.004），临床 NHG 3 与 1 的未调整 HR 估计为 3.58（p 值 0.05）。我们检测了独立测试集中NHG和predGrade之间的HR估计值差异，发现两种分类模型之间没有显著差异（p值>0.05），证实了传统NHG和predGrade之间相似的预后性能：NHG的常规组织病理学评估在评估者之间存在很大的变异性，因此需要开发基于模型的决策支持，以提高组织学分级的可重复性。我们发现，所提出的模型（predGrade）具有与临床 NHG 相似的预后性能。结果表明，基于深度 CNN 的模型可用于乳腺癌组织学分级。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Breast Cancer Research ONCOLOGY-

CiteScore

12.00

自引率

0.00%

发文量

审稿时长

12 weeks

期刊介绍： Breast Cancer Research, an international, peer-reviewed online journal, publishes original research, reviews, editorials, and reports. It features open-access research articles of exceptional interest across all areas of biology and medicine relevant to breast cancer. This includes normal mammary gland biology, with a special emphasis on the genetic, biochemical, and cellular basis of breast cancer. In addition to basic research, the journal covers preclinical, translational, and clinical studies with a biological basis, including Phase I and Phase II trials.