Hagnifinder: Recovering magnification information of digital histological images using deep learning

Q2 Medicine Journal of Pathology Informatics Pub Date : 2023-01-01 DOI:10.1016/j.jpi.2023.100302

Hongtai Zhang , Zaiyi Liu , Mingli Song , Cheng Lu

{"title":"Hagnifinder: Recovering magnification information of digital histological images using deep learning","authors":"Hongtai Zhang , Zaiyi Liu , Mingli Song , Cheng Lu","doi":"10.1016/j.jpi.2023.100302","DOIUrl":null,"url":null,"abstract":"<div><h3>Background and objective</h3><p>Training a robust cancer diagnostic or prognostic artificial intelligent model using histology images requires a large number of representative cases with labels or annotations, which are difficult to obtain. The histology snapshots available in published papers or case reports can be used to enrich the training dataset. However, the magnifications of these invaluable snapshots are generally unknown, which limits their usage. Therefore, a robust magnification predictor is required for utilizing those diverse snapshot repositories consisting of different diseases. This paper presents a magnification prediction model named Hagnifinder for H&E-stained histological images.</p></div><div><h3>Methods</h3><p>Hagnifinder is a regression model based on a modified convolutional neural network (CNN) that contains 3 modules: Feature Extraction Module, Regression Module, and Adaptive Scaling Module (ASM). In the training phase, the Feature Extraction Module first extracts the image features. Secondly, the ASM is proposed to address the learned feature values uneven distribution problem. Finally, the Regression Module estimates the mapping between the regularized extracted features and the magnifications. We construct a new dataset for training a robust model, named Hagni40, consisting of 94 643 H&E-stained histology image patches at 40 different magnifications of 13 types of cancer based on The Cancer Genome Atlas. To verify the performance of the Hagnifinder, we measure the accuracy of the predictions by setting the maximum allowable difference values (0.5, 1, and 5) between the predicted magnification and the actual magnification. We compare Hagnifinder with state-of-the-art methods on a public dataset BreakHis and the Hagni40.</p></div><div><h3>Results</h3><p>The Hagnifinder provides consistent prediction accuracy, with a mean accuracy of 98.9%, across 40 different magnifications and 13 different cancer types when Resnet50 is used as the feature extractor. Compared with the state-of-the-art methods focusing on 4–5 levels of magnification classification, the Hagnifinder achieve<strong>s</strong> the best and most comparable performance in the BreakHis and Hagni40 datasets.</p></div><div><h3>Conclusions</h3><p>The experimental results suggest that Hagnifinder can be a valuable tool for predicting the associated magnification of any given histology image.</p></div>","PeriodicalId":37769,"journal":{"name":"Journal of Pathology Informatics","volume":"14 ","pages":"Article 100302"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/79/f5/main.PMC10009300.pdf","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Pathology Informatics","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2153353923001165","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Medicine","Score":null,"Total":0}

引用次数: 1

Abstract

Background and objective

Training a robust cancer diagnostic or prognostic artificial intelligent model using histology images requires a large number of representative cases with labels or annotations, which are difficult to obtain. The histology snapshots available in published papers or case reports can be used to enrich the training dataset. However, the magnifications of these invaluable snapshots are generally unknown, which limits their usage. Therefore, a robust magnification predictor is required for utilizing those diverse snapshot repositories consisting of different diseases. This paper presents a magnification prediction model named Hagnifinder for H&E-stained histological images.

Methods

Hagnifinder is a regression model based on a modified convolutional neural network (CNN) that contains 3 modules: Feature Extraction Module, Regression Module, and Adaptive Scaling Module (ASM). In the training phase, the Feature Extraction Module first extracts the image features. Secondly, the ASM is proposed to address the learned feature values uneven distribution problem. Finally, the Regression Module estimates the mapping between the regularized extracted features and the magnifications. We construct a new dataset for training a robust model, named Hagni40, consisting of 94 643 H&E-stained histology image patches at 40 different magnifications of 13 types of cancer based on The Cancer Genome Atlas. To verify the performance of the Hagnifinder, we measure the accuracy of the predictions by setting the maximum allowable difference values (0.5, 1, and 5) between the predicted magnification and the actual magnification. We compare Hagnifinder with state-of-the-art methods on a public dataset BreakHis and the Hagni40.

Results

The Hagnifinder provides consistent prediction accuracy, with a mean accuracy of 98.9%, across 40 different magnifications and 13 different cancer types when Resnet50 is used as the feature extractor. Compared with the state-of-the-art methods focusing on 4–5 levels of magnification classification, the Hagnifinder achieves the best and most comparable performance in the BreakHis and Hagni40 datasets.

Conclusions

The experimental results suggest that Hagnifinder can be a valuable tool for predicting the associated magnification of any given histology image.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Hagnifinder:利用深度学习恢复数字组织学图像的放大信息

背景与目的利用组织学图像训练一个鲁棒的癌症诊断或预后人工智能模型需要大量带有标签或注释的代表性病例，而这些病例很难获得。在已发表的论文或病例报告中可用的组织学快照可用于丰富训练数据集。然而，这些宝贵的快照的放大倍数通常是未知的，这限制了它们的使用。因此，需要一个健壮的放大预测器来利用由不同疾病组成的不同快照库。本文提出了一种用于H& e染色组织学图像的放大预测模型Hagnifinder。shagnifinder是一个基于改进卷积神经网络(CNN)的回归模型，该模型包含3个模块:特征提取模块、回归模块和自适应缩放模块(ASM)。在训练阶段，特征提取模块首先提取图像特征。其次，针对学习到的特征值分布不均匀的问题，提出了ASM算法。最后，回归模块估计正则化提取的特征与放大之间的映射关系。我们构建了一个新的数据集来训练一个名为Hagni40的鲁棒模型，该模型由94个 643个H& e染色的组织学图像斑块组成，基于癌症基因组图谱，在40种不同的放大倍率下，包含13种癌症。为了验证Hagnifinder的性能，我们通过设置预测放大倍率与实际放大倍率之间的最大允许差值(0.5、1和5)来测量预测的准确性。我们将Hagnifinder与公共数据集BreakHis和Hagni40上最先进的方法进行比较。结果当使用Resnet50作为特征提取器时，Hagnifinder在40种不同的放大倍数和13种不同的癌症类型上提供了一致的预测准确率，平均准确率为98.9%。与专注于4-5级放大分类的最先进方法相比，Hagnifinder在BreakHis和Hagni40数据集中实现了最佳和最具可比性的性能。结论Hagnifinder可作为预测任意组织学图像相关放大倍数的有效工具。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Journal of Pathology Informatics Medicine-Pathology and Forensic Medicine

CiteScore

3.70

自引率

0.00%

发文量

审稿时长

18 weeks

期刊介绍： The Journal of Pathology Informatics (JPI) is an open access peer-reviewed journal dedicated to the advancement of pathology informatics. This is the official journal of the Association for Pathology Informatics (API). The journal aims to publish broadly about pathology informatics and freely disseminate all articles worldwide. This journal is of interest to pathologists, informaticians, academics, researchers, health IT specialists, information officers, IT staff, vendors, and anyone with an interest in informatics. We encourage submissions from anyone with an interest in the field of pathology informatics. We publish all types of papers related to pathology informatics including original research articles, technical notes, reviews, viewpoints, commentaries, editorials, symposia, meeting abstracts, book reviews, and correspondence to the editors. All submissions are subject to rigorous peer review by the well-regarded editorial board and by expert referees in appropriate specialties.