Semi-supervised ViT knowledge distillation network with style transfer normalization for colorectal liver metastases survival prediction

IF 10.7 1区 医学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Medical image analysis Pub Date : 2024-09-16 DOI:10.1016/j.media.2024.103346
Mohamed El Amine Elforaici , Emmanuel Montagnon , Francisco Perdigón Romero , William Trung Le , Feryel Azzi , Dominique Trudel , Bich Nguyen , Simon Turcotte , An Tang , Samuel Kadoury
{"title":"Semi-supervised ViT knowledge distillation network with style transfer normalization for colorectal liver metastases survival prediction","authors":"Mohamed El Amine Elforaici ,&nbsp;Emmanuel Montagnon ,&nbsp;Francisco Perdigón Romero ,&nbsp;William Trung Le ,&nbsp;Feryel Azzi ,&nbsp;Dominique Trudel ,&nbsp;Bich Nguyen ,&nbsp;Simon Turcotte ,&nbsp;An Tang ,&nbsp;Samuel Kadoury","doi":"10.1016/j.media.2024.103346","DOIUrl":null,"url":null,"abstract":"<div><div>Colorectal liver metastases (CLM) affect almost half of all colon cancer patients and the response to systemic chemotherapy plays a crucial role in patient survival. While oncologists typically use tumor grading scores, such as tumor regression grade (TRG), to establish an accurate prognosis on patient outcomes, including overall survival (OS) and time-to-recurrence (TTR), these traditional methods have several limitations. They are subjective, time-consuming, and require extensive expertise, which limits their scalability and reliability. Additionally, existing approaches for prognosis prediction using machine learning mostly rely on radiological imaging data, but recently histological images have been shown to be relevant for survival predictions by allowing to fully capture the complex microenvironmental and cellular characteristics of the tumor. To address these limitations, we propose an end-to-end approach for automated prognosis prediction using histology slides stained with Hematoxylin and Eosin (H&amp;E) and Hematoxylin Phloxine Saffron (HPS). We first employ a Generative Adversarial Network (GAN) for slide normalization to reduce staining variations and improve the overall quality of the images that are used as input to our prediction pipeline. We propose a semi-supervised model to perform tissue classification from sparse annotations, producing segmentation and feature maps. Specifically, we use an attention-based approach that weighs the importance of different slide regions in producing the final classification results. Finally, we exploit the extracted features for the metastatic nodules and surrounding tissue to train a prognosis model. In parallel, we train a vision Transformer model in a knowledge distillation framework to replicate and enhance the performance of the prognosis prediction. We evaluate our approach on an in-house clinical dataset of 258 CLM patients, achieving superior performance compared to other comparative models with a c-index of 0.804 (0.014) for OS and 0.735 (0.016) for TTR, as well as on two public datasets. The proposed approach achieves an accuracy of 86.9% to 90.3% in predicting TRG dichotomization. For the 3-class TRG classification task, the proposed approach yields an accuracy of 78.5% to 82.1%, outperforming the comparative methods. Our proposed pipeline can provide automated prognosis for pathologists and oncologists, and can greatly promote precision medicine progress in managing CLM patients.</div></div>","PeriodicalId":18328,"journal":{"name":"Medical image analysis","volume":null,"pages":null},"PeriodicalIF":10.7000,"publicationDate":"2024-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Medical image analysis","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1361841524002718","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Colorectal liver metastases (CLM) affect almost half of all colon cancer patients and the response to systemic chemotherapy plays a crucial role in patient survival. While oncologists typically use tumor grading scores, such as tumor regression grade (TRG), to establish an accurate prognosis on patient outcomes, including overall survival (OS) and time-to-recurrence (TTR), these traditional methods have several limitations. They are subjective, time-consuming, and require extensive expertise, which limits their scalability and reliability. Additionally, existing approaches for prognosis prediction using machine learning mostly rely on radiological imaging data, but recently histological images have been shown to be relevant for survival predictions by allowing to fully capture the complex microenvironmental and cellular characteristics of the tumor. To address these limitations, we propose an end-to-end approach for automated prognosis prediction using histology slides stained with Hematoxylin and Eosin (H&E) and Hematoxylin Phloxine Saffron (HPS). We first employ a Generative Adversarial Network (GAN) for slide normalization to reduce staining variations and improve the overall quality of the images that are used as input to our prediction pipeline. We propose a semi-supervised model to perform tissue classification from sparse annotations, producing segmentation and feature maps. Specifically, we use an attention-based approach that weighs the importance of different slide regions in producing the final classification results. Finally, we exploit the extracted features for the metastatic nodules and surrounding tissue to train a prognosis model. In parallel, we train a vision Transformer model in a knowledge distillation framework to replicate and enhance the performance of the prognosis prediction. We evaluate our approach on an in-house clinical dataset of 258 CLM patients, achieving superior performance compared to other comparative models with a c-index of 0.804 (0.014) for OS and 0.735 (0.016) for TTR, as well as on two public datasets. The proposed approach achieves an accuracy of 86.9% to 90.3% in predicting TRG dichotomization. For the 3-class TRG classification task, the proposed approach yields an accuracy of 78.5% to 82.1%, outperforming the comparative methods. Our proposed pipeline can provide automated prognosis for pathologists and oncologists, and can greatly promote precision medicine progress in managing CLM patients.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
半监督 ViT 知识蒸馏网络与风格转移归一化用于结直肠肝转移生存率预测
结直肠肝转移(CLM)影响着近一半的结肠癌患者,对全身化疗的反应对患者的生存起着至关重要的作用。虽然肿瘤学家通常使用肿瘤分级评分(如肿瘤回归分级(TRG))来准确预测患者的预后,包括总生存期(OS)和复发时间(TTR),但这些传统方法存在一些局限性。它们主观、耗时,需要大量的专业知识,这限制了它们的可扩展性和可靠性。此外,利用机器学习进行预后预测的现有方法大多依赖于放射成像数据,但最近的研究表明,组织学图像可以充分捕捉肿瘤复杂的微环境和细胞特征,因此与生存预测相关。为了解决这些局限性,我们提出了一种端到端方法,利用经苏木精和伊红(H&E)及红花苷(HPS)染色的组织学切片进行自动预后预测。我们首先采用生成对抗网络(GAN)对玻片进行归一化处理,以减少染色差异,提高作为预测管道输入的图像的整体质量。我们提出了一种半监督模型,从稀疏注释中进行组织分类,生成分割图和特征图。具体来说,我们使用一种基于注意力的方法,在生成最终分类结果时权衡不同切片区域的重要性。最后,我们利用提取的转移结节和周围组织特征来训练预后模型。与此同时,我们还在知识提炼框架中训练了一个视觉转换器模型,以复制和提高预后预测的性能。我们在一个包含 258 名 CLM 患者的内部临床数据集上对我们的方法进行了评估,与其他比较模型相比,我们的方法取得了优异的性能,OS 的 c 指数为 0.804 (0.014),TTR 为 0.735 (0.016),同时我们还在两个公共数据集上对我们的方法进行了评估。所提出的方法在预测 TRG 二分法方面达到了 86.9% 到 90.3% 的准确率。对于 TRG 的三类分类任务,所提出的方法获得了 78.5% 至 82.1% 的准确率,优于其他比较方法。我们提出的方法可以为病理学家和肿瘤学家提供自动预后分析,并能极大地促进精准医疗在管理CLM患者方面的进展。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Medical image analysis
Medical image analysis 工程技术-工程:生物医学
CiteScore
22.10
自引率
6.40%
发文量
309
审稿时长
6.6 months
期刊介绍: Medical Image Analysis serves as a platform for sharing new research findings in the realm of medical and biological image analysis, with a focus on applications of computer vision, virtual reality, and robotics to biomedical imaging challenges. The journal prioritizes the publication of high-quality, original papers contributing to the fundamental science of processing, analyzing, and utilizing medical and biological images. It welcomes approaches utilizing biomedical image datasets across all spatial scales, from molecular/cellular imaging to tissue/organ imaging.
期刊最新文献
Beyond strong labels: Weakly-supervised learning based on Gaussian pseudo labels for the segmentation of ellipse-like vascular structures in non-contrast CTs A cross-attention-based deep learning approach for predicting functional stroke outcomes using 4D CTP imaging and clinical metadata DACG: Dual Attention and Context Guidance model for radiology report generation Simulation-free prediction of atrial fibrillation inducibility with the fibrotic kernel signature An objective comparison of methods for augmented reality in laparoscopic liver resection by preoperative-to-intraoperative image fusion from the MICCAI2022 challenge
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1