首页 > 最新文献

Journal of Pathology Informatics最新文献

英文 中文
Pan-tumor T-lymphocyte detection using deep neural networks: Recommendations for transfer learning in immunohistochemistry 使用深度神经网络检测泛肿瘤t淋巴细胞:免疫组织化学迁移学习的建议
Q2 Medicine Pub Date : 2023-01-01 DOI: 10.1016/j.jpi.2023.100301
Frauke Wilm , Christian Ihling , Gábor Méhes , Luigi Terracciano , Chloé Puget , Robert Klopfleisch , Peter Schüffler , Marc Aubreville , Andreas Maier , Thomas Mrowiec , Katharina Breininger

The success of immuno-oncology treatments promises long-term cancer remission for an increasing number of patients. The response to checkpoint inhibitor drugs has shown a correlation with the presence of immune cells in the tumor and tumor microenvironment. An in-depth understanding of the spatial localization of immune cells is therefore critical for understanding the tumor’s immune landscape and predicting drug response. Computer-aided systems are well suited for efficiently quantifying immune cells in their spatial context. Conventional image analysis approaches are often based on color features and therefore require a high level of manual interaction. More robust image analysis methods based on deep learning are expected to decrease this reliance on human interaction and improve the reproducibility of immune cell scoring. However, these methods require sufficient training data and previous work has reported low robustness of these algorithms when they are tested on out-of-distribution data from different pathology labs or samples from different organs. In this work, we used a new image analysis pipeline to explicitly evaluate the robustness of marker-labeled lymphocyte quantification algorithms depending on the number of training samples before and after being transferred to a new tumor indication. For these experiments, we adapted the RetinaNet architecture for the task of T-lymphocyte detection and employed transfer learning to bridge the domain gap between tumor indications and reduce the annotation costs for unseen domains. On our test set, we achieved human-level performance for almost all tumor indications with an average precision of 0.74 in-domain and 0.72–0.74 cross-domain. From our results, we derive recommendations for model development regarding annotation extent, training sample selection, and label extraction for the development of robust algorithms for immune cell scoring. By extending the task of marker-labeled lymphocyte quantification to a multi-class detection task, the pre-requisite for subsequent analyses, e.g., distinguishing lymphocytes in the tumor stroma from tumor-infiltrating lymphocytes, is met.

免疫肿瘤学治疗的成功有望为越来越多的患者带来长期的癌症缓解。对检查点抑制剂药物的反应显示与肿瘤和肿瘤微环境中免疫细胞的存在相关。因此,深入了解免疫细胞的空间定位对于理解肿瘤的免疫景观和预测药物反应至关重要。计算机辅助系统非常适合于有效地定量免疫细胞的空间环境。传统的图像分析方法通常基于颜色特征,因此需要高水平的人工交互。基于深度学习的更稳健的图像分析方法有望减少对人类交互的依赖,并提高免疫细胞评分的可重复性。然而,这些方法需要足够的训练数据,并且先前的工作报告了当这些算法在来自不同病理实验室的分布外数据或来自不同器官的样本上进行测试时,这些算法的鲁棒性较低。在这项工作中,我们使用了一种新的图像分析管道来明确评估标记标记淋巴细胞量化算法的鲁棒性,这取决于转移到新的肿瘤适应症之前和之后的训练样本数量。在这些实验中,我们将retanet架构用于t淋巴细胞检测任务,并使用迁移学习来弥合肿瘤适应症之间的区域差距,降低未见区域的注释成本。在我们的测试集中,我们在几乎所有肿瘤适应症上都达到了人类水平的表现,平均精度为0.74域内和0.72-0.74跨域。从我们的结果中,我们得出了关于注释程度、训练样本选择和标记提取的模型开发建议,以开发用于免疫细胞评分的鲁棒算法。通过将标记淋巴细胞定量任务扩展到多类检测任务,满足了后续分析的先决条件,例如区分肿瘤基质中的淋巴细胞和肿瘤浸润淋巴细胞。
{"title":"Pan-tumor T-lymphocyte detection using deep neural networks: Recommendations for transfer learning in immunohistochemistry","authors":"Frauke Wilm ,&nbsp;Christian Ihling ,&nbsp;Gábor Méhes ,&nbsp;Luigi Terracciano ,&nbsp;Chloé Puget ,&nbsp;Robert Klopfleisch ,&nbsp;Peter Schüffler ,&nbsp;Marc Aubreville ,&nbsp;Andreas Maier ,&nbsp;Thomas Mrowiec ,&nbsp;Katharina Breininger","doi":"10.1016/j.jpi.2023.100301","DOIUrl":"10.1016/j.jpi.2023.100301","url":null,"abstract":"<div><p>The success of immuno-oncology treatments promises long-term cancer remission for an increasing number of patients. The response to checkpoint inhibitor drugs has shown a correlation with the presence of immune cells in the tumor and tumor microenvironment. An in-depth understanding of the spatial localization of immune cells is therefore critical for understanding the tumor’s immune landscape and predicting drug response. Computer-aided systems are well suited for efficiently quantifying immune cells in their spatial context. Conventional image analysis approaches are often based on color features and therefore require a high level of manual interaction. More robust image analysis methods based on deep learning are expected to decrease this reliance on human interaction and improve the reproducibility of immune cell scoring. However, these methods require sufficient training data and previous work has reported low robustness of these algorithms when they are tested on out-of-distribution data from different pathology labs or samples from different organs. In this work, we used a new image analysis pipeline to explicitly evaluate the robustness of marker-labeled lymphocyte quantification algorithms depending on the number of training samples before and after being transferred to a new tumor indication. For these experiments, we adapted the RetinaNet architecture for the task of T-lymphocyte detection and employed transfer learning to bridge the domain gap between tumor indications and reduce the annotation costs for unseen domains. On our test set, we achieved human-level performance for almost all tumor indications with an average precision of 0.74 in-domain and 0.72–0.74 cross-domain. From our results, we derive recommendations for model development regarding annotation extent, training sample selection, and label extraction for the development of robust algorithms for immune cell scoring. By extending the task of marker-labeled lymphocyte quantification to a multi-class detection task, the pre-requisite for subsequent analyses, e.g., distinguishing lymphocytes in the tumor stroma from tumor-infiltrating lymphocytes, is met.</p></div>","PeriodicalId":37769,"journal":{"name":"Journal of Pathology Informatics","volume":"14 ","pages":"Article 100301"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10040882/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9219568","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Application of digital pathology and machine learning in the liver, kidney and lung diseases 数字病理学和机器学习在肝脏、肾脏和肺部疾病中的应用
Q2 Medicine Pub Date : 2023-01-01 DOI: 10.1016/j.jpi.2022.100184
Benjamin Wu , Gilbert Moeckel

The development of rapid and accurate Whole Slide Imaging (WSI) has paved the way for the application of Artificial Intelligence (AI) to digital pathology. The availability of WSI in the recent years allowed the rapid development of various AI technologies to blossom. WSI-based digital pathology combined with neural networks can automate arduous and time-consuming tasks of slide evaluation. Machine Learning (ML)-based AI has been demonstrated to outperform pathologists by eliminating inter- and intra-observer subjectivity, obtaining quantitative data from slide images, and extracting hidden image patterns that are relevant to disease subtype and progression. In this review, we outline the functionality of different AI technologies such as neural networks and deep learning and discover how aspects of different diseases make them benefit from the implementation of AI. AI has proven to be valuable in many different organs, with this review focusing on the liver, kidney, and lungs. We also discuss how AI and image analysis not only can grade diseases objectively but also discover aspects of diseases that have prognostic value. In the end, we review the current status of the integration of AI in pathology and share our vision on the future of digital pathology.

快速准确的全玻片成像(WSI)的发展为人工智能(AI)在数字病理学中的应用铺平了道路。近年来,WSI的出现使各种人工智能技术的快速发展蓬勃发展。基于WSI的数字病理学与神经网络相结合,可以自动化艰巨而耗时的幻灯片评估任务。基于机器学习(ML)的人工智能已被证明优于病理学家,因为它消除了观察者之间和观察者内部的主观性,从幻灯片图像中获得定量数据,并提取了与疾病亚型和进展相关的隐藏图像模式。在这篇综述中,我们概述了神经网络和深度学习等不同人工智能技术的功能,并发现不同疾病的各个方面如何使它们从人工智能的实施中受益。人工智能已被证明在许多不同的器官中有价值,这篇综述的重点是肝、肾和肺。我们还讨论了人工智能和图像分析如何不仅可以客观地对疾病进行分级,还可以发现具有预后价值的疾病方面。最后,我们回顾了人工智能在病理学中的整合现状,并分享了我们对数字病理学未来的愿景。
{"title":"Application of digital pathology and machine learning in the liver, kidney and lung diseases","authors":"Benjamin Wu ,&nbsp;Gilbert Moeckel","doi":"10.1016/j.jpi.2022.100184","DOIUrl":"10.1016/j.jpi.2022.100184","url":null,"abstract":"<div><p>The development of rapid and accurate Whole Slide Imaging (WSI) has paved the way for the application of Artificial Intelligence (AI) to digital pathology. The availability of WSI in the recent years allowed the rapid development of various AI technologies to blossom. WSI-based digital pathology combined with neural networks can automate arduous and time-consuming tasks of slide evaluation. Machine Learning (ML)-based AI has been demonstrated to outperform pathologists by eliminating inter- and intra-observer subjectivity, obtaining quantitative data from slide images, and extracting hidden image patterns that are relevant to disease subtype and progression. In this review, we outline the functionality of different AI technologies such as neural networks and deep learning and discover how aspects of different diseases make them benefit from the implementation of AI. AI has proven to be valuable in many different organs, with this review focusing on the liver, kidney, and lungs. We also discuss how AI and image analysis not only can grade diseases objectively but also discover aspects of diseases that have prognostic value. In the end, we review the current status of the integration of AI in pathology and share our vision on the future of digital pathology.</p></div>","PeriodicalId":37769,"journal":{"name":"Journal of Pathology Informatics","volume":"14 ","pages":"Article 100184"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/d4/30/main.PMC9874068.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10584262","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Artificial intelligence-based triage of large bowel biopsies can improve workflow 基于人工智能的大肠活检分诊可以改善工作流程
Q2 Medicine Pub Date : 2023-01-01 DOI: 10.1016/j.jpi.2022.100181
Frederick George Mayall , Mark David Goodhead , Louis de Mendonça , Sarah Eleanor Brownlie , Azka Anees , Stephen Perring

Background

Large bowel biopsies are one of the commonest types of biopsy specimen. We describe a service evaluation study to test the feasibility of using artificial intelligence (AI) to triage large bowel biopsies from a reporting backlog and prioritize those that require more urgent reporting.

Methods

The pathway was developed in the UK by National Health Service (NHS) laboratory staff working in a medium-sized general hospital.   The AI platform was interfaced with the slide scanner software and the reporting platform’s software, so that pathologists could correct the AI label and reinforce the training set as they reported the cases.

Results

The AI classifier achieved a sensitivity of 97.56% and specificity of 93.02% for the case-level-diagnosis of neoplasia (adenoma and adenocarcinoma) and for an AI diagnosis of any significant pathology (i.e., adenomas, adenocarcinomas, inflammation, hyperplastic polyps, and sessile serrated lesions) sensitivity was 95.65% and specificity 92.96%. The automated AI diagnostic classification pathway took approximately 175 s per slide to download and process the scanned whole slide image (WSI) and return an AI diagnostic classification. Biopsies with an AI diagnosis of neoplasia or inflammation were prioritized for reporting while the remainder followed the routine reporting pathway. The AI triaged pathway resulted in a significantly shorter reporting turnaround time for pathologist verified neoplastic cases (P < 0.001) and inflammation (P < 0.05). The project’s costs amounted to  £14800, excluding laboratory staff salaries. More time and resources were spent on developing the interface between the AI platform and laboratory IT systems than on the development of the AI platform itself.

Conclusions

NHS laboratory staff were able to implement an AI solution to accurately triage large bowel biopsies into several diagnostic classes and this improved reporting turnaround times for cases with neoplasia or with inflammation.

背景:大肠活检是最常见的活检标本之一。我们描述了一项服务评估研究,以测试使用人工智能(AI)从报告积压中分类大肠活检并优先考虑那些需要更紧急报告的可行性。方法该途径由英国一家中型综合医院的国家卫生服务(NHS)实验室工作人员开发。 人工智能平台与载玻片扫描软件和报告平台软件进行接口,病理学家在报告病例时可以纠正人工智能标签并强化训练集。结果人工智能分类器对肿瘤(腺瘤和腺癌)的病例级诊断敏感性为97.56%,特异性为93.02%,对任何重大病理(腺瘤、腺癌、炎症、增殖性息肉、无根锯齿状病变)的人工智能诊断敏感性为95.65%,特异性为92.96%。自动AI诊断分类路径每张幻灯片大约需要175秒的时间来下载和处理扫描的整张幻灯片图像(WSI)并返回AI诊断分类。人工智能诊断为肿瘤或炎症的活检优先报告,其余的则遵循常规报告途径。人工智能分类途径显著缩短了病理学家证实的肿瘤病例的报告周转时间(P <0.001)和炎症(P <0.05)。该项目的成本为 £14800,不包括实验室工作人员的工资。人工智能平台与实验室IT系统之间的接口开发比人工智能平台本身的开发花费了更多的时间和资源。结论snhs实验室工作人员能够实施人工智能解决方案,准确地将大肠活检分类为几个诊断类别,这缩短了肿瘤或炎症病例的报告时间。
{"title":"Artificial intelligence-based triage of large bowel biopsies can improve workflow","authors":"Frederick George Mayall ,&nbsp;Mark David Goodhead ,&nbsp;Louis de Mendonça ,&nbsp;Sarah Eleanor Brownlie ,&nbsp;Azka Anees ,&nbsp;Stephen Perring","doi":"10.1016/j.jpi.2022.100181","DOIUrl":"10.1016/j.jpi.2022.100181","url":null,"abstract":"<div><h3>Background</h3><p>Large bowel biopsies are one of the commonest types of biopsy specimen. We describe a service evaluation study to test the feasibility of using artificial intelligence (AI) to triage large bowel biopsies from a reporting backlog and prioritize those that require more urgent reporting.</p></div><div><h3>Methods</h3><p>The pathway was developed in the UK by National Health Service (NHS) laboratory staff working in a medium-sized general hospital.   The AI platform was interfaced with the slide scanner software and the reporting platform’s software, so that pathologists could correct the AI label and reinforce the training set as they reported the cases.</p></div><div><h3>Results</h3><p>The AI classifier achieved a sensitivity of 97.56% and specificity of 93.02% for the case-level-diagnosis of neoplasia (adenoma and adenocarcinoma) and for an AI diagnosis of any significant pathology (i.e., adenomas, adenocarcinomas, inflammation, hyperplastic polyps, and sessile serrated lesions) sensitivity was 95.65% and specificity 92.96%. The automated AI diagnostic classification pathway took approximately 175 s per slide to download and process the scanned whole slide image (WSI) and return an AI diagnostic classification. Biopsies with an AI diagnosis of neoplasia or inflammation were prioritized for reporting while the remainder followed the routine reporting pathway. The AI triaged pathway resulted in a significantly shorter reporting turnaround time for pathologist verified neoplastic cases (P &lt; 0.001) and inflammation (P &lt; 0.05). The project’s costs amounted to  £14800, excluding laboratory staff salaries. More time and resources were spent on developing the interface between the AI platform and laboratory IT systems than on the development of the AI platform itself.</p></div><div><h3>Conclusions</h3><p>NHS laboratory staff were able to implement an AI solution to accurately triage large bowel biopsies into several diagnostic classes and this improved reporting turnaround times for cases with neoplasia or with inflammation.</p></div>","PeriodicalId":37769,"journal":{"name":"Journal of Pathology Informatics","volume":"14 ","pages":"Article 100181"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9852686/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10584108","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Validation of automated positive cell and region detection of immunohistochemically stained laryngeal tumor tissue using digital image analysis 使用数字图像分析验证免疫组织化学染色喉部肿瘤组织自动阳性细胞和区域检测
Q2 Medicine Pub Date : 2023-01-01 DOI: 10.1016/j.jpi.2023.100198
Hilde J.G. Smits , Justin E. Swartz , Marielle E.P. Philippens , Remco de Bree , Johannes H.A.M. Kaanders , Sjors A. Koppes , Gerben E. Breimer , Stefan M. Willems

Objectives

This study aimed to validate a digital image analysis (DIA) workflow for automatic positive cell detection and positive region delineation for immunohistochemical hypoxia markers with a nuclear (hypoxia-inducible factor 1α [HIF-1α]) and a cytoplasmic (pimonidazole [PIMO]) staining pattern.

Materials and methods

101 tissue fragments from 44 laryngeal tumor biopsies were immunohistochemically stained for HIF-1α and PIMO. QuPath was used to determine the percentage of positive cells and to delineate positive regions automatically. For HIF-1α, only cells with strong staining were considered positive. Three dedicated head and neck pathologists scored the percentage of positive cells using three categories (0: <1%; 1: 1%–33%; 2: >33%;). The pathologists also delineated the positive regions on 14 corresponding PIMO and HIF-1α-stained fragments. The consensus between observers was used as the reference standard and was compared to the automatic delineation.

Results

Agreement between categorical positivity scores was 76.2% and 65.4% for PIMO and HIF-1α, respectively. In all cases of disagreement in HIF-1α fragments, the DIA underestimated the percentage of positive cells. As for the region detection, the DIA correctly detected most positive regions on PIMO fragments (false positive area=3.1%, false negative area=0.7%). In HIF-1α, the DIA missed some positive regions (false positive area=1.3%, false negative area=9.7%).

Conclusions

Positive cell and region detection on biopsy material is feasible, but further optimization is needed before unsupervised use. Validation at varying DAB staining intensities is hampered by lack of reliability of the gold standard (i.e., visual human interpretation). Nevertheless, the DIA method has the potential to be used as a tool to assist pathologists in the analysis of IHC staining.

目的:通过核(缺氧诱导因子1α [HIF-1α])和细胞质(吡咪唑[PIMO])染色模式,验证数字图像分析(DIA)工作流程对免疫组织化学缺氧标志物的自动阳性细胞检测和阳性区域描绘。材料与方法对44例喉部肿瘤活检101个组织片段进行HIF-1α和PIMO免疫组化染色。使用QuPath来确定阳性细胞的百分比,并自动划定阳性区域。对于HIF-1α,只有染色强烈的细胞被认为是阳性的。三位专门的头颈部病理学家使用三个类别(0:<1%;1: 1% - -33%;2:在33%;)。病理学家还在14个相应的PIMO和hif -1α染色片段上划定了阳性区域。观测者之间的一致意见作为参考标准,并与自动划定进行比较。结果PIMO和HIF-1α分类阳性评分的符合率分别为76.2%和65.4%。在所有HIF-1α片段不一致的情况下,DIA低估了阳性细胞的百分比。在区域检测方面,DIA正确检测出PIMO片段上大部分阳性区域(假阳性面积3.1%,假阴性面积0.7%)。在HIF-1α中,DIA遗漏了部分阳性区域(假阳性面积=1.3%,假阴性面积=9.7%)。结论活检材料的阳性细胞和区域检测是可行的,但在无监督使用前需要进一步优化。在不同DAB染色强度下的验证由于缺乏金标准的可靠性而受到阻碍(即视觉人类解释)。尽管如此,DIA方法有可能被用作辅助病理学家分析免疫组化染色的工具。
{"title":"Validation of automated positive cell and region detection of immunohistochemically stained laryngeal tumor tissue using digital image analysis","authors":"Hilde J.G. Smits ,&nbsp;Justin E. Swartz ,&nbsp;Marielle E.P. Philippens ,&nbsp;Remco de Bree ,&nbsp;Johannes H.A.M. Kaanders ,&nbsp;Sjors A. Koppes ,&nbsp;Gerben E. Breimer ,&nbsp;Stefan M. Willems","doi":"10.1016/j.jpi.2023.100198","DOIUrl":"10.1016/j.jpi.2023.100198","url":null,"abstract":"<div><h3>Objectives</h3><p>This study aimed to validate a digital image analysis (DIA) workflow for automatic positive cell detection and positive region delineation for immunohistochemical hypoxia markers with a nuclear (hypoxia-inducible factor 1α [HIF-1α]) and a cytoplasmic (pimonidazole [PIMO]) staining pattern.</p></div><div><h3>Materials and methods</h3><p>101 tissue fragments from 44 laryngeal tumor biopsies were immunohistochemically stained for HIF-1α and PIMO. QuPath was used to determine the percentage of positive cells and to delineate positive regions automatically. For HIF-1α, only cells with strong staining were considered positive. Three dedicated head and neck pathologists scored the percentage of positive cells using three categories (0: &lt;1%; 1: 1%–33%; 2: &gt;33%;). The pathologists also delineated the positive regions on 14 corresponding PIMO and HIF-1α-stained fragments. The consensus between observers was used as the reference standard and was compared to the automatic delineation.</p></div><div><h3>Results</h3><p>Agreement between categorical positivity scores was 76.2% and 65.4% for PIMO and HIF-1α, respectively. In all cases of disagreement in HIF-1α fragments, the DIA underestimated the percentage of positive cells. As for the region detection, the DIA correctly detected most positive regions on PIMO fragments (false positive area=3.1%, false negative area=0.7%). In HIF-1α, the DIA missed some positive regions (false positive area=1.3%, false negative area=9.7%).</p></div><div><h3>Conclusions</h3><p>Positive cell and region detection on biopsy material is feasible, but further optimization is needed before unsupervised use. Validation at varying DAB staining intensities is hampered by lack of reliability of the gold standard (i.e., visual human interpretation). Nevertheless, the DIA method has the potential to be used as a tool to assist pathologists in the analysis of IHC staining.</p></div>","PeriodicalId":37769,"journal":{"name":"Journal of Pathology Informatics","volume":"14 ","pages":"Article 100198"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9930147/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10772692","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Applications of discriminative and deep learning feature extraction methods for whole slide image analysis: A survey 判别和深度学习特征提取方法在全幻灯片图像分析中的应用综述
Q2 Medicine Pub Date : 2023-01-01 DOI: 10.1016/j.jpi.2023.100335
Khaled Al-Thelaya, Nauman Ullah Gilal, Mahmood Alzubaidi, Fahad Majeed, Marco Agus, Jens Schneider, Mowafa Househ

Digital pathology technologies, including whole slide imaging (WSI), have significantly improved modern clinical practices by facilitating storing, viewing, processing, and sharing digital scans of tissue glass slides. Researchers have proposed various artificial intelligence (AI) solutions for digital pathology applications, such as automated image analysis, to extract diagnostic information from WSI for improving pathology productivity, accuracy, and reproducibility. Feature extraction methods play a crucial role in transforming raw image data into meaningful representations for analysis, facilitating the characterization of tissue structures, cellular properties, and pathological patterns. These features have diverse applications in several digital pathology applications, such as cancer prognosis and diagnosis. Deep learning-based feature extraction methods have emerged as a promising approach to accurately represent WSI contents and have demonstrated superior performance in histology-related tasks. In this survey, we provide a comprehensive overview of feature extraction methods, including both manual and deep learning-based techniques, for the analysis of WSIs. We review relevant literature, analyze the discriminative and geometric features of WSIs (i.e., features suited to support the diagnostic process and extracted by “engineered” methods as opposed to AI), and explore predictive modeling techniques using AI and deep learning. This survey examines the advances, challenges, and opportunities in this rapidly evolving field, emphasizing the potential for accurate diagnosis, prognosis, and decision-making in digital pathology.

数字病理学技术,包括全玻片成像(WSI),通过促进组织玻片的存储、查看、处理和共享数字扫描,显著改善了现代临床实践。研究人员为数字病理学应用提出了各种人工智能(AI)解决方案,例如自动图像分析,以从WSI中提取诊断信息,以提高病理生产力、准确性和可重复性。特征提取方法在将原始图像数据转换为有意义的表征以供分析,促进组织结构、细胞特性和病理模式的表征方面发挥着至关重要的作用。这些特征在一些数字病理学应用中有不同的应用,如癌症预后和诊断。基于深度学习的特征提取方法已经成为准确表示WSI内容的一种有前途的方法,并且在组织学相关任务中表现出优越的性能。在本调查中,我们提供了特征提取方法的全面概述,包括手动和基于深度学习的技术,用于分析wsi。我们回顾了相关文献,分析了wsi的判别和几何特征(即适合支持诊断过程的特征,并通过与人工智能相反的“工程”方法提取),并探索了使用人工智能和深度学习的预测建模技术。本调查探讨了这一快速发展领域的进展、挑战和机遇,强调了数字病理学准确诊断、预后和决策的潜力。
{"title":"Applications of discriminative and deep learning feature extraction methods for whole slide image analysis: A survey","authors":"Khaled Al-Thelaya,&nbsp;Nauman Ullah Gilal,&nbsp;Mahmood Alzubaidi,&nbsp;Fahad Majeed,&nbsp;Marco Agus,&nbsp;Jens Schneider,&nbsp;Mowafa Househ","doi":"10.1016/j.jpi.2023.100335","DOIUrl":"https://doi.org/10.1016/j.jpi.2023.100335","url":null,"abstract":"<div><p>Digital pathology technologies, including whole slide imaging (WSI), have significantly improved modern clinical practices by facilitating storing, viewing, processing, and sharing digital scans of tissue glass slides. Researchers have proposed various artificial intelligence (AI) solutions for digital pathology applications, such as automated image analysis, to extract diagnostic information from WSI for improving pathology productivity, accuracy, and reproducibility. Feature extraction methods play a crucial role in transforming raw image data into meaningful representations for analysis, facilitating the characterization of tissue structures, cellular properties, and pathological patterns. These features have diverse applications in several digital pathology applications, such as cancer prognosis and diagnosis. Deep learning-based feature extraction methods have emerged as a promising approach to accurately represent WSI contents and have demonstrated superior performance in histology-related tasks. In this survey, we provide a comprehensive overview of feature extraction methods, including both manual and deep learning-based techniques, for the analysis of WSIs. We review relevant literature, analyze the discriminative and geometric features of WSIs (i.e., features suited to support the diagnostic process and extracted by “engineered” methods as opposed to AI), and explore predictive modeling techniques using AI and deep learning. This survey examines the advances, challenges, and opportunities in this rapidly evolving field, emphasizing the potential for accurate diagnosis, prognosis, and decision-making in digital pathology.</p></div>","PeriodicalId":37769,"journal":{"name":"Journal of Pathology Informatics","volume":"14 ","pages":"Article 100335"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49858503","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Bioinformatics evaluation of anticancer properties of GP63 protein-derived peptides on MMP2 protein of melanoma cancer GP63蛋白衍生肽对黑色素瘤MMP2蛋白抗癌特性的生物信息学评价
Q2 Medicine Pub Date : 2023-01-01 DOI: 10.1016/j.jpi.2023.100190
Fatemeh Sharifi , Iraj Sharifi , Zahra Babaei , Sodabeh Alahdin , Ali Afgar

Background

GP63, also known as Leishmanolysin, is a multifunctional virulence factor abundant on the surface of Leishmania spp. small peptides with anticancer capabilities that are selective and toxic to cancer cells are known as anticancer peptides. We aimed to demonstrate the activity of GP63 and its anticancer properties on melanoma using a range of in silico tools and screening methods to identify predicted and designed anticancer peptides.

Methods

Various in silico modeling methodologies are used to establish the three-dimensional (3D) structure of GP63. Refinement and re-evaluation of the modeled structures and the built models' quality evaluated using the different docking used to find the interacting amino acids between MMP2 and GP63 and its anticancer peptides. AntiCP2.0 is used for screening anticancer peptides. 2D interaction plots of protein–ligand complexes evaluated by Protein–Ligand Interaction Profiler server. It is for the first time that used anticancer peptides of GP63 and the predicted and designed peptides.

Results

We used 3 peptides of GP63 based on the AntiCP 2.0 server with scores of 0.63, 0.53, and 0.49, and common peptides of GP63/MMP2 (continues peptide: mean the completely selected peptide after docking with non-anticancer effect, predicted with 0.58 score and designed peptides with 0.47 and 0.45 scores by AntiCP 2.0 server).

Conclusions

The antileishmanial and anticancer peptide research topics exemplify the multidisciplinary nature of peptide research. The advancement of therapeutics targeting cancer and/or Leishmania requires an interconnected research strategy shown in this work.

dgp63,又称利什曼溶素,是利什曼原虫表面大量存在的一种多功能毒力因子,对癌细胞具有选择性和毒性的小肽被称为抗癌肽。我们的目的是利用一系列的计算机工具和筛选方法来鉴定预测和设计的抗癌肽,证明GP63的活性及其对黑色素瘤的抗癌特性。方法采用多种计算机建模方法建立GP63的三维结构。通过对MMP2与GP63及其抗癌肽之间的相互作用氨基酸的不同对接,对模型结构进行改进和重新评估,并对构建的模型质量进行评估。antip2.0用于筛选抗癌肽。利用protein-ligand interaction Profiler server评估蛋白质-配体复合物的二维相互作用图。这是首次使用GP63的抗癌肽以及预测和设计的肽。结果我们选取了基于antip 2.0服务器的GP63的3个多肽,评分分别为0.63、0.53和0.49,以及GP63/MMP2的常用多肽(连续多肽:指对接后完全选择的无抗癌作用的多肽,预测评分为0.58分,设计肽评分为0.47和0.45分)。结论抗利什曼原虫和抗癌肽的研究主题体现了肽研究的多学科性质。针对癌症和/或利什曼原虫的治疗方法的进步需要一种相互关联的研究策略。
{"title":"Bioinformatics evaluation of anticancer properties of GP63 protein-derived peptides on MMP2 protein of melanoma cancer","authors":"Fatemeh Sharifi ,&nbsp;Iraj Sharifi ,&nbsp;Zahra Babaei ,&nbsp;Sodabeh Alahdin ,&nbsp;Ali Afgar","doi":"10.1016/j.jpi.2023.100190","DOIUrl":"10.1016/j.jpi.2023.100190","url":null,"abstract":"<div><h3>Background</h3><p>GP63, also known as Leishmanolysin, is a multifunctional virulence factor abundant on the surface of <em>Leishmania</em> spp. small peptides with anticancer capabilities that are selective and toxic to cancer cells are known as anticancer peptides. We aimed to demonstrate the activity of GP63 and its anticancer properties on melanoma using a range of <em>in silico</em> tools and screening methods to identify predicted and designed anticancer peptides.</p></div><div><h3>Methods</h3><p>Various <em>in silico</em> modeling methodologies are used to establish the three-dimensional (3D) structure of GP63. Refinement and re-evaluation of the modeled structures and the built models' quality evaluated using the different docking used to find the interacting amino acids between MMP2 and GP63 and its anticancer peptides. AntiCP2.0 is used for screening anticancer peptides. 2D interaction plots of protein–ligand complexes evaluated by Protein–Ligand Interaction Profiler server. It is for the first time that used anticancer peptides of GP63 and the predicted and designed peptides.</p></div><div><h3>Results</h3><p>We used 3 peptides of GP63 based on the AntiCP 2.0 server with scores of 0.63, 0.53, and 0.49, and common peptides of GP63/MMP2 (continues peptide: mean the completely selected peptide after docking with non-anticancer effect, predicted with 0.58 score and designed peptides with 0.47 and 0.45 scores by AntiCP 2.0 server).</p></div><div><h3>Conclusions</h3><p>The antileishmanial and anticancer peptide research topics exemplify the multidisciplinary nature of peptide research. The advancement of therapeutics targeting cancer and/or <em>Leishmania</em> requires an interconnected research strategy shown in this work.</p></div>","PeriodicalId":37769,"journal":{"name":"Journal of Pathology Informatics","volume":"14 ","pages":"Article 100190"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9867975/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10621742","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep learning based tumor–stroma ratio scoring in colon cancer correlates with microscopic assessment 基于深度学习的结肠癌肿瘤-基质比率评分与显微评估相关
Q2 Medicine Pub Date : 2023-01-01 DOI: 10.1016/j.jpi.2023.100191
Marloes A. Smit , Francesco Ciompi , John-Melle Bokhorst , Gabi W. van Pelt , Oscar G.F. Geessink , Hein Putter , Rob A.E.M. Tollenaar , J. Han J.M. van Krieken , Wilma E. Mesker , Jeroen A.W.M. van der Laak

Background

The amount of stroma within the primary tumor is a prognostic parameter for colon cancer patients. This phenomenon can be assessed using the tumor–stroma ratio (TSR), which classifies tumors in stroma-low (≤50% stroma) and stroma-high (>50% stroma). Although the reproducibility for TSR determination is good, improvement might be expected from automation. The aim of this study was to investigate whether the scoring of the TSR in a semi- and fully automated method using deep learning algorithms is feasible.

Methods

A series of 75 colon cancer slides were selected from a trial series of the UNITED study. For the standard determination of the TSR, 3 observers scored the histological slides. Next, the slides were digitized, color normalized, and the stroma percentages were scored using semi- and fully automated deep learning algorithms. Correlations were determined using intraclass correlation coefficients (ICCs) and Spearman rank correlations.

Results

37 (49%) cases were classified as stroma-low and 38 (51%) as stroma-high by visual estimation. A high level of concordance between the 3 observers was reached, with ICCs of 0.91, 0.89, and 0.94 (all P < .001). Between visual and semi-automated assessment the ICC was 0.78 (95% CI 0.23–0.91, P-value 0.005), with a Spearman correlation of 0.88 (P < .001). Spearman correlation coefficients above 0.70 (N=3) were observed for visual estimation versus the fully automated scoring procedures.

Conclusion

Good correlations were observed between standard visual TSR determination and semi- and fully automated TSR scores. At this point, visual examination has the highest observer agreement, but semi-automated scoring could be helpful to support pathologists.

原发肿瘤内基质的数量是结肠癌患者预后的一个参数。这种现象可以通过肿瘤-基质比率(TSR)来评估,TSR将肿瘤分为基质低(≤50%基质)和基质高(>50%基质)。虽然TSR测定的重现性很好,但自动化可能会有所改善。本研究的目的是研究使用深度学习算法在半自动和全自动方法中对TSR评分是否可行。方法从UNITED研究的一系列试验中选择了一系列75张结肠癌载玻片。为标准测定TSR, 3名观察员对组织学切片评分。接下来,对幻灯片进行数字化,颜色归一化,并使用半自动和全自动深度学习算法对基质百分比进行评分。使用类内相关系数(ICCs)和Spearman秩相关来确定相关性。结果经目测,低基质37例(49%),高基质38例(51%)。3位观察者之间达到了高度的一致性,ICCs分别为0.91、0.89和0.94(均P <措施)。在目视评估和半自动化评估之间,ICC为0.78 (95% CI 0.23-0.91, P值0.005),Spearman相关性为0.88 (P <措施)。与全自动评分程序相比,视觉估计的Spearman相关系数大于0.70 (N=3)。结论标准目测TSR与半自动和全自动TSR评分有良好的相关性。在这一点上,视觉检查具有最高的观察者协议,但半自动评分可能有助于支持病理学家。
{"title":"Deep learning based tumor–stroma ratio scoring in colon cancer correlates with microscopic assessment","authors":"Marloes A. Smit ,&nbsp;Francesco Ciompi ,&nbsp;John-Melle Bokhorst ,&nbsp;Gabi W. van Pelt ,&nbsp;Oscar G.F. Geessink ,&nbsp;Hein Putter ,&nbsp;Rob A.E.M. Tollenaar ,&nbsp;J. Han J.M. van Krieken ,&nbsp;Wilma E. Mesker ,&nbsp;Jeroen A.W.M. van der Laak","doi":"10.1016/j.jpi.2023.100191","DOIUrl":"10.1016/j.jpi.2023.100191","url":null,"abstract":"<div><h3>Background</h3><p>The amount of stroma within the primary tumor is a prognostic parameter for colon cancer patients. This phenomenon can be assessed using the tumor–stroma ratio (TSR), which classifies tumors in stroma-low (≤50% stroma) and stroma-high (&gt;50% stroma). Although the reproducibility for TSR determination is good, improvement might be expected from automation. The aim of this study was to investigate whether the scoring of the TSR in a semi- and fully automated method using deep learning algorithms is feasible.</p></div><div><h3>Methods</h3><p>A series of 75 colon cancer slides were selected from a trial series of the UNITED study. For the standard determination of the TSR, 3 observers scored the histological slides. Next, the slides were digitized, color normalized, and the stroma percentages were scored using semi- and fully automated deep learning algorithms. Correlations were determined using intraclass correlation coefficients (ICCs) and Spearman rank correlations.</p></div><div><h3>Results</h3><p>37 (49%) cases were classified as stroma-low and 38 (51%) as stroma-high by visual estimation. A high level of concordance between the 3 observers was reached, with ICCs of 0.91, 0.89, and 0.94 (all P &lt; .001). Between visual and semi-automated assessment the ICC was 0.78 (95% CI 0.23–0.91, P-value 0.005), with a Spearman correlation of 0.88 (P &lt; .001). Spearman correlation coefficients above 0.70 (N=3) were observed for visual estimation versus the fully automated scoring procedures.</p></div><div><h3>Conclusion</h3><p>Good correlations were observed between standard visual TSR determination and semi- and fully automated TSR scores. At this point, visual examination has the highest observer agreement, but semi-automated scoring could be helpful to support pathologists.</p></div>","PeriodicalId":37769,"journal":{"name":"Journal of Pathology Informatics","volume":"14 ","pages":"Article 100191"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9922811/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10745510","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Improving Lyme disease testing with data driven test design in pediatrics 用数据驱动测试设计改进儿科莱姆病测试
Q2 Medicine Pub Date : 2023-01-01 DOI: 10.1016/j.jpi.2023.100300
Mahmoud Elkhadrawi , Oscar Lopez-Nunez , Murat Akcakaya , Sarah E. Wheeler

Diagnostic advances have not kept pace with the expansion of Lyme disease caused by Borrelia burgdorferi and transmitted by ticks. Lyme disease clinical manifestations can overlap with many other diagnoses making Lyme disease a critical part of many differential diagnoses in endemic areas. Current diagnostic blood tests rely on a 2-tiered algorithm for which the second step is either a time-consuming western blot or a whole cell lysate immunoassay. Neither of these second step tests allow for rapid results of this critical rule out test. We hypothesized that using western blot confirmation information, we could create computational models to propose recombinant second-tier tests that would allow for more rapid, automated, and specific testing algorithms. We propose here a framework for assessing retrospective data to determine putative recombinant assay components. A retrospective pediatric cohort of 2755 samples submitted for Lyme disease screening was assessed using support vector machine learning algorithms to optimize tier 1 diagnostic thresholds for the Vidas IgG II assay and determine optimal tier 2 components for both a positive and negative confirmation test. In cases where the tier 1 screen was negative, but clinical suspicion was high, we found that 1 protein (L58) could be used to reduce false-negative results. For second-tier testing of screen positive cases, we found that 6 proteins could be used to reduce false-positive results (L18, L39M, L39, L41, L45, and L58) with a final machine learning classifier or 2 proteins using a final rules-based approach (L41, L18). This led to an overall accuracy of 92.36% for the proposed algorithm without a final machine learning classifier and 92.12% with integration of the machine learning classifier in the final algorithm when compared to the IgG western blot as the gold-standard. Use of this framework across multiple assays and institutions will allow for a data-driven approach to assay development to provide laboratories and patients with the improvements in turnaround time needed for this testing.

诊断的进步没有跟上由伯氏疏螺旋体引起并由蜱虫传播的莱姆病的扩大。莱姆病的临床表现可与许多其他诊断重叠,使莱姆病成为流行地区许多鉴别诊断的重要组成部分。目前的诊断性血液检测依赖于两层算法,第二步要么是耗时的western blot,要么是全细胞裂解免疫测定。这两种第二步测试都不能快速得出这个关键的排除测试的结果。我们假设使用western blot确认信息,我们可以创建计算模型来提出重组二级测试,这将允许更快速、自动化和特定的测试算法。我们在这里提出了一个评估回顾性数据的框架,以确定推定的重组分析成分。使用支持向量机器学习算法对2755份提交用于莱姆病筛查的样本进行回顾性儿科队列评估,以优化Vidas IgG II检测的一级诊断阈值,并确定阳性和阴性确认试验的最佳二级成分。在一级筛查为阴性,但临床怀疑度高的情况下,我们发现1蛋白(L58)可用于减少假阴性结果。对于筛选阳性病例的第二级测试,我们发现6种蛋白质可以使用最终机器学习分类器来减少假阳性结果(L18, L39M, L39, L41, L45和L58)或2种蛋白质使用最终基于规则的方法(L41, L18)。与IgG western blot作为金标准相比,在没有最终机器学习分类器的情况下,所提出的算法的总体准确率为92.36%,在最终算法中集成机器学习分类器的情况下,总体准确率为92.12%。在多个检测方法和机构中使用该框架将允许采用数据驱动的方法进行检测开发,从而为实验室和患者提供该检测所需的周转时间的改进。
{"title":"Improving Lyme disease testing with data driven test design in pediatrics","authors":"Mahmoud Elkhadrawi ,&nbsp;Oscar Lopez-Nunez ,&nbsp;Murat Akcakaya ,&nbsp;Sarah E. Wheeler","doi":"10.1016/j.jpi.2023.100300","DOIUrl":"10.1016/j.jpi.2023.100300","url":null,"abstract":"<div><p>Diagnostic advances have not kept pace with the expansion of Lyme disease caused by <em>Borrelia burgdorferi</em> and transmitted by ticks. Lyme disease clinical manifestations can overlap with many other diagnoses making Lyme disease a critical part of many differential diagnoses in endemic areas. Current diagnostic blood tests rely on a 2-tiered algorithm for which the second step is either a time-consuming western blot or a whole cell lysate immunoassay. Neither of these second step tests allow for rapid results of this critical rule out test. We hypothesized that using western blot confirmation information, we could create computational models to propose recombinant second-tier tests that would allow for more rapid, automated, and specific testing algorithms. We propose here a framework for assessing retrospective data to determine putative recombinant assay components. A retrospective pediatric cohort of 2755 samples submitted for Lyme disease screening was assessed using support vector machine learning algorithms to optimize tier 1 diagnostic thresholds for the Vidas IgG II assay and determine optimal tier 2 components for both a positive and negative confirmation test. In cases where the tier 1 screen was negative, but clinical suspicion was high, we found that 1 protein (L58) could be used to reduce false-negative results. For second-tier testing of screen positive cases, we found that 6 proteins could be used to reduce false-positive results (L18, L39M, L39, L41, L45, and L58) with a final machine learning classifier or 2 proteins using a final rules-based approach (L41, L18). This led to an overall accuracy of 92.36% for the proposed algorithm without a final machine learning classifier and 92.12% with integration of the machine learning classifier in the final algorithm when compared to the IgG western blot as the gold-standard. Use of this framework across multiple assays and institutions will allow for a data-driven approach to assay development to provide laboratories and patients with the improvements in turnaround time needed for this testing.</p></div>","PeriodicalId":37769,"journal":{"name":"Journal of Pathology Informatics","volume":"14 ","pages":"Article 100300"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/f9/b6/main.PMC9985057.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9424496","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hagnifinder: Recovering magnification information of digital histological images using deep learning Hagnifinder:利用深度学习恢复数字组织学图像的放大信息
Q2 Medicine Pub Date : 2023-01-01 DOI: 10.1016/j.jpi.2023.100302
Hongtai Zhang , Zaiyi Liu , Mingli Song , Cheng Lu

Background and objective

Training a robust cancer diagnostic or prognostic artificial intelligent model using histology images requires a large number of representative cases with labels or annotations, which are difficult to obtain. The histology snapshots available in published papers or case reports can be used to enrich the training dataset. However, the magnifications of these invaluable snapshots are generally unknown, which limits their usage. Therefore, a robust magnification predictor is required for utilizing those diverse snapshot repositories consisting of different diseases. This paper presents a magnification prediction model named Hagnifinder for H&E-stained histological images.

Methods

Hagnifinder is a regression model based on a modified convolutional neural network (CNN) that contains 3 modules: Feature Extraction Module, Regression Module, and Adaptive Scaling Module (ASM). In the training phase, the Feature Extraction Module first extracts the image features. Secondly, the ASM is proposed to address the learned feature values uneven distribution problem. Finally, the Regression Module estimates the mapping between the regularized extracted features and the magnifications. We construct a new dataset for training a robust model, named Hagni40, consisting of 94 643 H&E-stained histology image patches at 40 different magnifications of 13 types of cancer based on The Cancer Genome Atlas. To verify the performance of the Hagnifinder, we measure the accuracy of the predictions by setting the maximum allowable difference values (0.5, 1, and 5) between the predicted magnification and the actual magnification. We compare Hagnifinder with state-of-the-art methods on a public dataset BreakHis and the Hagni40.

Results

The Hagnifinder provides consistent prediction accuracy, with a mean accuracy of 98.9%, across 40 different magnifications and 13 different cancer types when Resnet50 is used as the feature extractor. Compared with the state-of-the-art methods focusing on 4–5 levels of magnification classification, the Hagnifinder achieves the best and most comparable performance in the BreakHis and Hagni40 datasets.

Conclusions

The experimental results suggest that Hagnifinder can be a valuable tool for predicting the associated magnification of any given histology image.

背景与目的利用组织学图像训练一个鲁棒的癌症诊断或预后人工智能模型需要大量带有标签或注释的代表性病例,而这些病例很难获得。在已发表的论文或病例报告中可用的组织学快照可用于丰富训练数据集。然而,这些宝贵的快照的放大倍数通常是未知的,这限制了它们的使用。因此,需要一个健壮的放大预测器来利用由不同疾病组成的不同快照库。本文提出了一种用于H& e染色组织学图像的放大预测模型Hagnifinder。shagnifinder是一个基于改进卷积神经网络(CNN)的回归模型,该模型包含3个模块:特征提取模块、回归模块和自适应缩放模块(ASM)。在训练阶段,特征提取模块首先提取图像特征。其次,针对学习到的特征值分布不均匀的问题,提出了ASM算法。最后,回归模块估计正则化提取的特征与放大之间的映射关系。我们构建了一个新的数据集来训练一个名为Hagni40的鲁棒模型,该模型由94个 643个H& e染色的组织学图像斑块组成,基于癌症基因组图谱,在40种不同的放大倍率下,包含13种癌症。为了验证Hagnifinder的性能,我们通过设置预测放大倍率与实际放大倍率之间的最大允许差值(0.5、1和5)来测量预测的准确性。我们将Hagnifinder与公共数据集BreakHis和Hagni40上最先进的方法进行比较。结果当使用Resnet50作为特征提取器时,Hagnifinder在40种不同的放大倍数和13种不同的癌症类型上提供了一致的预测准确率,平均准确率为98.9%。与专注于4-5级放大分类的最先进方法相比,Hagnifinder在BreakHis和Hagni40数据集中实现了最佳和最具可比性的性能。结论Hagnifinder可作为预测任意组织学图像相关放大倍数的有效工具。
{"title":"Hagnifinder: Recovering magnification information of digital histological images using deep learning","authors":"Hongtai Zhang ,&nbsp;Zaiyi Liu ,&nbsp;Mingli Song ,&nbsp;Cheng Lu","doi":"10.1016/j.jpi.2023.100302","DOIUrl":"10.1016/j.jpi.2023.100302","url":null,"abstract":"<div><h3>Background and objective</h3><p>Training a robust cancer diagnostic or prognostic artificial intelligent model using histology images requires a large number of representative cases with labels or annotations, which are difficult to obtain. The histology snapshots available in published papers or case reports can be used to enrich the training dataset. However, the magnifications of these invaluable snapshots are generally unknown, which limits their usage. Therefore, a robust magnification predictor is required for utilizing those diverse snapshot repositories consisting of different diseases. This paper presents a magnification prediction model named Hagnifinder for H&amp;E-stained histological images.</p></div><div><h3>Methods</h3><p>Hagnifinder is a regression model based on a modified convolutional neural network (CNN) that contains 3 modules: Feature Extraction Module, Regression Module, and Adaptive Scaling Module (ASM). In the training phase, the Feature Extraction Module first extracts the image features. Secondly, the ASM is proposed to address the learned feature values uneven distribution problem. Finally, the Regression Module estimates the mapping between the regularized extracted features and the magnifications. We construct a new dataset for training a robust model, named Hagni40, consisting of 94 643 H&amp;E-stained histology image patches at 40 different magnifications of 13 types of cancer based on The Cancer Genome Atlas. To verify the performance of the Hagnifinder, we measure the accuracy of the predictions by setting the maximum allowable difference values (0.5, 1, and 5) between the predicted magnification and the actual magnification. We compare Hagnifinder with state-of-the-art methods on a public dataset BreakHis and the Hagni40.</p></div><div><h3>Results</h3><p>The Hagnifinder provides consistent prediction accuracy, with a mean accuracy of 98.9%, across 40 different magnifications and 13 different cancer types when Resnet50 is used as the feature extractor. Compared with the state-of-the-art methods focusing on 4–5 levels of magnification classification, the Hagnifinder achieve<strong>s</strong> the best and most comparable performance in the BreakHis and Hagni40 datasets.</p></div><div><h3>Conclusions</h3><p>The experimental results suggest that Hagnifinder can be a valuable tool for predicting the associated magnification of any given histology image.</p></div>","PeriodicalId":37769,"journal":{"name":"Journal of Pathology Informatics","volume":"14 ","pages":"Article 100302"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/79/f5/main.PMC10009300.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9475747","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Automated detection and delineation of lymph nodes in haematoxylin & eosin stained digitised slides 在苏木精和伊红染色的数字化玻片上自动检测和描绘淋巴结
Q2 Medicine Pub Date : 2023-01-01 DOI: 10.1016/j.jpi.2023.100192
Manon Beuque , Derek R. Magee , Avishek Chatterjee , Henry C. Woodruff , Ruth E. Langley , William Allum , Matthew G. Nankivell , David Cunningham , Philippe Lambin , Heike I. Grabsch

Treatment of patients with oesophageal and gastric cancer (OeGC) is guided by disease stage, patient performance status and preferences. Lymph node (LN) status is one of the strongest prognostic factors for OeGC patients. However, survival varies between patients with the same disease stage and LN status. We recently showed that LN size from patients with OeGC might also have prognostic value, thus making delineations of LNs essential for size estimation and the extraction of other imaging biomarkers.

We hypothesized that a machine learning workflow is able to: (1) find digital H&E stained slides containing LNs, (2) create a scoring system providing degrees of certainty for the results, and (3) delineate LNs in those images.

To train and validate the pipeline, we used 1695 H&E slides from the OE02 trial. The dataset was divided into training (80%) and validation (20%). The model was tested on an external dataset of 826 H&E slides from the OE05 trial. U-Net architecture was used to generate prediction maps from which predefined features were extracted. These features were subsequently used to train an XGBoost model to determine if a region truly contained a LN. With our innovative method, the balanced accuracies of the LN detection were 0.93 on the validation dataset (0.83 on the test dataset) compared to 0.81 (0.81) on the validation (test) datasets when using the standard method of thresholding U-Net predictions to arrive at a binary mask. Our method allowed for the creation of an “uncertain” category, and partly limited false-positive predictions on the external dataset. The mean Dice score was 0.73 (0.60) per-image and 0.66 (0.48) per-LN for the validation (test) datasets.

Our pipeline detects images with LNs more accurately than conventional methods, and high-throughput delineation of LNs can facilitate future LN content analyses of large datasets.

食管癌和胃癌(OeGC)患者的治疗是根据疾病分期、患者的身体状况和偏好来指导的。淋巴结(LN)状态是OeGC患者最重要的预后因素之一。然而,同一疾病分期和LN状态的患者的生存率不同。我们最近发现,OeGC患者的淋巴结大小也可能具有预后价值,因此对淋巴结的描绘对于大小估计和其他成像生物标志物的提取至关重要。我们假设机器学习工作流程能够:(1)找到包含ln的数字H&E染色玻片,(2)创建一个评分系统,为结果提供确定性程度,(3)在这些图像中描绘ln。为了训练和验证管道,我们使用了来自OE02试验的1695张H&E幻灯片。数据集分为训练(80%)和验证(20%)。该模型在OE05试验的826张H&E幻灯片的外部数据集上进行了测试。采用U-Net体系结构生成预测图,从中提取预定义特征。这些特征随后被用于训练XGBoost模型,以确定一个区域是否真正包含LN。使用我们的创新方法,当使用阈值U-Net预测的标准方法来获得二进制掩码时,LN检测的平衡精度在验证数据集上为0.93(在测试数据集上为0.83),而在验证(测试)数据集中为0.81(0.81)。我们的方法允许创建“不确定”类别,并在一定程度上限制了外部数据集上的假阳性预测。对于验证(测试)数据集,平均Dice评分为每张图像0.73(0.60)和每张ln 0.66(0.48)。我们的管道比传统方法更准确地检测具有LN的图像,并且对LN的高通量描绘可以促进未来大型数据集的LN内容分析。
{"title":"Automated detection and delineation of lymph nodes in haematoxylin & eosin stained digitised slides","authors":"Manon Beuque ,&nbsp;Derek R. Magee ,&nbsp;Avishek Chatterjee ,&nbsp;Henry C. Woodruff ,&nbsp;Ruth E. Langley ,&nbsp;William Allum ,&nbsp;Matthew G. Nankivell ,&nbsp;David Cunningham ,&nbsp;Philippe Lambin ,&nbsp;Heike I. Grabsch","doi":"10.1016/j.jpi.2023.100192","DOIUrl":"10.1016/j.jpi.2023.100192","url":null,"abstract":"<div><p>Treatment of patients with oesophageal and gastric cancer (OeGC) is guided by disease stage, patient performance status and preferences. Lymph node (LN) status is one of the strongest prognostic factors for OeGC patients. However, survival varies between patients with the same disease stage and LN status. We recently showed that LN size from patients with OeGC might also have prognostic value, thus making delineations of LNs essential for size estimation and the extraction of other imaging biomarkers.</p><p>We hypothesized that a machine learning workflow is able to: (1) find digital H&amp;E stained slides containing LNs, (2) create a scoring system providing degrees of certainty for the results, and (3) delineate LNs in those images.</p><p>To train and validate the pipeline, we used 1695 H&amp;E slides from the OE02 trial. The dataset was divided into training (80%) and validation (20%). The model was tested on an external dataset of 826 H&amp;E slides from the OE05 trial. U-Net architecture was used to generate prediction maps from which predefined features were extracted. These features were subsequently used to train an XGBoost model to determine if a region truly contained a LN. With our innovative method, the balanced accuracies of the LN detection were 0.93 on the validation dataset (0.83 on the test dataset) compared to 0.81 (0.81) on the validation (test) datasets when using the standard method of thresholding U-Net predictions to arrive at a binary mask. Our method allowed for the creation of an “uncertain” category, and partly limited false-positive predictions on the external dataset. The mean Dice score was 0.73 (0.60) per-image and 0.66 (0.48) per-LN for the validation (test) datasets.</p><p>Our pipeline detects images with LNs more accurately than conventional methods, and high-throughput delineation of LNs can facilitate future LN content analyses of large datasets.</p></div>","PeriodicalId":37769,"journal":{"name":"Journal of Pathology Informatics","volume":"14 ","pages":"Article 100192"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/87/43/main.PMC9932489.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9162066","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of Pathology Informatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1