首页 > 最新文献

Journal of Pathology Informatics最新文献

英文 中文
Technical considerations during validation of the Genius® Digital Diagnostic System Genius®数字诊断系统验证期间的技术考虑
Q2 Medicine Pub Date : 2026-01-01 Epub Date: 2025-11-19 DOI: 10.1016/j.jpi.2025.100532
Lakshmi Harinath , Sarah Harrington , Jonee Matsko , Amy Colaizzi , Esther Elishaev , Samer Khader , Rohit Bhargava , Chengquan Zhao , Liron Pantanowitz

Background

The aim of this study was to document technical errors encountered during validation of the Genius Digital Diagnostics System (GDDS).

Materials and methods

A total of 909 cases of archived ThinPrep Pap slides with follow-up biopsies were retrieved. Slides were cleaned, relabeled, and scanned with GDDS. Digital imager errors, including slide events and imager errors, were documented and evaluated.

Results

Of the 909 slides scanned, 21 (2.3 %) demonstrated slide events. For 5 cases, the slides had cell focus errors, 12 failed due to quality control (QC) errors, 2 had barcode issues, 1 showed an oversaturated frame, and 1 presented a problem because it was a duplicate. Some errors could be corrected, of which 8 cases with various diagnostic cytology interpretations were successfully rescanned. There were 13 (1.4%) cases that could not be scanned and thus were excluded from the study, predominantly because of focus QC errors due to scratched coverslips from long-term storage. There were 43 imager errors including failure of motor movement, cancellation of slide handling action, and failure to pick slides from the carrier station for which the scanning process had to be paused. Imager errors were solved by rebooting the system, correcting the positioning of the slide on the system, and technical help provided by the vendor.

Conclusion

Minor errors are to be expected when digitizing large volume of Pap slides. Total number of rescanned cases to address such technical problems were low in number and did not compromise the interpretation of Pap test slides using GDDS.
本研究的目的是记录天才数字诊断系统(GDDS)验证过程中遇到的技术错误。材料与方法共检索909例已存档的薄prep Pap切片并随访活检。切片清洗,重新贴上标签,并用GDDS扫描。数字成像仪错误,包括滑动事件和成像仪错误,被记录和评估。结果在扫描的909张幻灯片中,21张(2.3%)出现滑动事件。在5例中,载玻片有细胞聚焦错误,12例由于质量控制(QC)错误而失败,2例有条形码问题,1例显示过饱和帧,1例因为重复而出现问题。有一些错误是可以纠正的,其中有8例诊断细胞学解释不同的病例成功重新扫描。有13例(1.4%)病例无法扫描,因此被排除在研究之外,主要是因为长期储存造成的盖子划伤导致焦点QC错误。有43个成像仪错误,包括电机运动失败,取消载玻片处理动作,未能从载体站取玻片,扫描过程必须暂停。成像仪错误通过重新启动系统、纠正系统上载玻片的定位以及供应商提供的技术帮助来解决。结论对大量巴氏涂片进行数字化处理,误差较小。为解决此类技术问题而重新扫描病例的总数较少,并且不影响使用GDDS对巴氏涂片的解释。
{"title":"Technical considerations during validation of the Genius® Digital Diagnostic System","authors":"Lakshmi Harinath ,&nbsp;Sarah Harrington ,&nbsp;Jonee Matsko ,&nbsp;Amy Colaizzi ,&nbsp;Esther Elishaev ,&nbsp;Samer Khader ,&nbsp;Rohit Bhargava ,&nbsp;Chengquan Zhao ,&nbsp;Liron Pantanowitz","doi":"10.1016/j.jpi.2025.100532","DOIUrl":"10.1016/j.jpi.2025.100532","url":null,"abstract":"<div><h3>Background</h3><div>The aim of this study was to document technical errors encountered during validation of the Genius Digital Diagnostics System (GDDS).</div></div><div><h3>Materials and methods</h3><div>A total of 909 cases of archived ThinPrep Pap slides with follow-up biopsies were retrieved. Slides were cleaned, relabeled, and scanned with GDDS. Digital imager errors, including slide events and imager errors, were documented and evaluated.</div></div><div><h3>Results</h3><div>Of the 909 slides scanned, 21 (2.3<!--> <!-->%) demonstrated slide events. For 5 cases, the slides had cell focus errors, 12 failed due to quality control (QC) errors, 2 had barcode issues, 1 showed an oversaturated frame, and 1 presented a problem because it was a duplicate. Some errors could be corrected, of which 8 cases with various diagnostic cytology interpretations were successfully rescanned. There were 13 (1.4%) cases that could not be scanned and thus were excluded from the study, predominantly because of focus QC errors due to scratched coverslips from long-term storage. There were 43 imager errors including failure of motor movement, cancellation of slide handling action, and failure to pick slides from the carrier station for which the scanning process had to be paused. Imager errors were solved by rebooting the system, correcting the positioning of the slide on the system, and technical help provided by the vendor.</div></div><div><h3>Conclusion</h3><div>Minor errors are to be expected when digitizing large volume of Pap slides. Total number of rescanned cases to address such technical problems were low in number and did not compromise the interpretation of Pap test slides using GDDS.</div></div>","PeriodicalId":37769,"journal":{"name":"Journal of Pathology Informatics","volume":"20 ","pages":"Article 100532"},"PeriodicalIF":0.0,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145705589","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Digital pathology imaging artificial intelligence in cancer research and clinical trials: An NCI workshop report 数字病理成像人工智能在癌症研究和临床试验:NCI研讨会报告
Q2 Medicine Pub Date : 2026-01-01 Epub Date: 2025-11-14 DOI: 10.1016/j.jpi.2025.100531
Hala R. Makhlouf , Miguel R. Ossandon , Keyvan Farahani , Irina Lubensky , Lyndsay N. Harris
Digital pathology imaging (DPI) is a rapidly advancing field with increasing relevance to cancer diagnosis, research, and clinical trials through large-scale image analysis and artificial intelligence (AI) integration. Despite these advances, regulatory adoption in digital pathology (DP) has lagged; to date, only three AI/ML Software as a Medical Device tool have received FDA clearance, highlighting a validation dataset gap rather than an absence of regulatory pathways. On March 6–7, 2024, the National Cancer Institute held a virtual workshop titled “Digital Pathology Imaging-Artificial Intelligence in Cancer Research and Clinical Trials,” bringing together experts in pathology, radiology, oncology, data science, and regulatory fields to assess current challenges, practical solutions, and future directions. This report summarizes expert opinions on key issues related to the use of DPI in cancer research and clinical trials, including data standardization, de-identification, and the application of Digital Imaging and Communication in Medicine (DICOM) standards. Key topics included data standardization, image quality assurance, validation strategies, AI applications, integration in clinical trials, biobanking, intellectual property, investigators' needs, and lessons from digital cytology and radiology domains. Solutions discussed included adoption of open standards such as DICOM, centralized imaging portals, and scalable cloud-based platforms. The expert consensus outlined in this report is intended to guide the development of DPI infrastructure, standardization, support AI validation, and align regulatory and data-sharing practices to advance precision oncology.
数字病理成像(DPI)是一个快速发展的领域,通过大规模图像分析和人工智能(AI)集成,与癌症诊断、研究和临床试验的相关性越来越大。尽管取得了这些进步,但数字病理学(DP)的监管采用却滞后;到目前为止,只有三个AI/ML软件作为医疗器械工具获得了FDA的许可,这突出了验证数据集的差距,而不是缺乏监管途径。2024年3月6日至7日,美国国家癌症研究所举办了一场名为“数字病理成像——癌症研究和临床试验中的人工智能”的虚拟研讨会,汇集了病理学、放射学、肿瘤学、数据科学和监管领域的专家,以评估当前的挑战、实用的解决方案和未来的方向。本报告总结了与DPI在癌症研究和临床试验中使用相关的关键问题的专家意见,包括数据标准化、去识别和医学数字成像和通信(DICOM)标准的应用。主要议题包括数据标准化、图像质量保证、验证策略、人工智能应用、临床试验集成、生物银行、知识产权、研究者需求以及数字细胞学和放射学领域的经验教训。讨论的解决方案包括采用开放标准,如DICOM、集中式成像门户和可扩展的基于云的平台。本报告中概述的专家共识旨在指导DPI基础设施的发展,标准化,支持人工智能验证,并协调监管和数据共享实践,以推进精准肿瘤学。
{"title":"Digital pathology imaging artificial intelligence in cancer research and clinical trials: An NCI workshop report","authors":"Hala R. Makhlouf ,&nbsp;Miguel R. Ossandon ,&nbsp;Keyvan Farahani ,&nbsp;Irina Lubensky ,&nbsp;Lyndsay N. Harris","doi":"10.1016/j.jpi.2025.100531","DOIUrl":"10.1016/j.jpi.2025.100531","url":null,"abstract":"<div><div>Digital pathology imaging (DPI) is a rapidly advancing field with increasing relevance to cancer diagnosis, research, and clinical trials through large-scale image analysis and artificial intelligence (AI) integration. Despite these advances, regulatory adoption in digital pathology (DP) has lagged; to date, only three AI/ML Software as a Medical Device tool have received FDA clearance, highlighting a validation dataset gap rather than an absence of regulatory pathways. On March 6–7, 2024, the National Cancer Institute held a virtual workshop titled “Digital Pathology Imaging-Artificial Intelligence in Cancer Research and Clinical Trials,” bringing together experts in pathology, radiology, oncology, data science, and regulatory fields to assess current challenges, practical solutions, and future directions. This report summarizes expert opinions on key issues related to the use of DPI in cancer research and clinical trials, including data standardization, de-identification, and the application of Digital Imaging and Communication in Medicine (DICOM) standards. Key topics included data standardization, image quality assurance, validation strategies, AI applications, integration in clinical trials, biobanking, intellectual property, investigators' needs, and lessons from digital cytology and radiology domains. Solutions discussed included adoption of open standards such as DICOM, centralized imaging portals, and scalable cloud-based platforms. The expert consensus outlined in this report is intended to guide the development of DPI infrastructure, standardization, support AI validation, and align regulatory and data-sharing practices to advance precision oncology.</div></div>","PeriodicalId":37769,"journal":{"name":"Journal of Pathology Informatics","volume":"20 ","pages":"Article 100531"},"PeriodicalIF":0.0,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145705637","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comparison of digital displays for efficient diagnosis and accuracy 比较数字显示的诊断效率和准确性
Q2 Medicine Pub Date : 2026-01-01 Epub Date: 2026-01-14 DOI: 10.1016/j.jpi.2026.100544
Monika Lamba Saini , Amanda Hemmerich , Anna Banach-Ovens , Hannah Manssens , Robbie Dougan , Tom Kimpe , John D. Cochran
Digital pathology is transforming diagnostic workflows by enabling the interpretation of whole-slide images on digital displays, offering advantages in remote diagnostics, efficiency, and integration with computational tools. However, diagnostic reliability depends not only on scanner and software quality but also on the performance of the display systems used for image review, as well as on environmental and ergonomic factors. This study evaluated the impact of display quality on diagnostic accuracy by comparing a medical-grade, professional-grade, and consumer-grade display against conventional light microscopy. Nineteen hematoxylin and eosin (H&E) and 19 immunohistochemistry (IHC) slides were assessed by two board-certified pathologists across all modalities. The medical-grade display achieved perfect concordance with microscopy for H&E slides and was rated highest for image clarity, color fidelity, and luminance stability. In contrast, the professional- and consumer-grade displays showed lower concordance (85–95%) and received inferior subjective evaluation. For IHC slides, both observers achieved 100% concordance for HER2 and AMACR markers across all displays. One discordant MLH1 case was noted on the medical-grade display for one observer out of five cases. Although MLH1 is consistently analyzed alongside other MSI markers, this isolated discrepancy precludes firm conclusions about display performance for this marker. These findings highlight the importance of clinically validated displays with automated calibration and quality assurance mechanisms to ensure consistent diagnostic performance. As digital pathology becomes increasingly integrated with remote workflows, establishing minimum performance standards for display systems will be essential to safeguard diagnostic accuracy, reproducibility, and patient safety in clinical practice.
数字病理学正在改变诊断工作流程,通过在数字显示器上解释整个幻灯片图像,提供远程诊断、效率和与计算工具集成的优势。然而,诊断的可靠性不仅取决于扫描仪和软件的质量,还取决于用于图像审查的显示系统的性能,以及环境和人体工程学因素。本研究通过比较医疗级、专业级和消费级显示器与传统光学显微镜,评估了显示质量对诊断准确性的影响。19张苏木精和伊红(H&;E)和19张免疫组织化学(IHC)载玻片由两名委员会认证的病理学家对所有模式进行评估。该医疗级显示器与H&;E载玻片的显微镜实现了完美的一致性,并在图像清晰度、色彩保真度和亮度稳定性方面被评为最高。相比之下,专业级和消费级显示器的一致性较低(85-95%),主观评价较差。对于IHC玻片,两名观察者在所有显示器上的HER2和AMACR标记均达到100%的一致性。在医学级显示中,每5例中有1例观察到MLH1病例不一致。虽然MLH1一直与其他MSI标记一起分析,但这种孤立的差异排除了关于该标记显示性能的确切结论。这些发现强调了临床验证显示器的重要性,该显示器具有自动校准和质量保证机制,以确保一致的诊断性能。随着数字病理学越来越多地与远程工作流程集成,建立显示系统的最低性能标准对于保障临床实践中的诊断准确性、可重复性和患者安全至关重要。
{"title":"Comparison of digital displays for efficient diagnosis and accuracy","authors":"Monika Lamba Saini ,&nbsp;Amanda Hemmerich ,&nbsp;Anna Banach-Ovens ,&nbsp;Hannah Manssens ,&nbsp;Robbie Dougan ,&nbsp;Tom Kimpe ,&nbsp;John D. Cochran","doi":"10.1016/j.jpi.2026.100544","DOIUrl":"10.1016/j.jpi.2026.100544","url":null,"abstract":"<div><div>Digital pathology is transforming diagnostic workflows by enabling the interpretation of whole-slide images on digital displays, offering advantages in remote diagnostics, efficiency, and integration with computational tools. However, diagnostic reliability depends not only on scanner and software quality but also on the performance of the display systems used for image review, as well as on environmental and ergonomic factors. This study evaluated the impact of display quality on diagnostic accuracy by comparing a medical-grade, professional-grade, and consumer-grade display against conventional light microscopy. Nineteen hematoxylin and eosin (H&amp;E) and 19 immunohistochemistry (IHC) slides were assessed by two board-certified pathologists across all modalities. The medical-grade display achieved perfect concordance with microscopy for H&amp;E slides and was rated highest for image clarity, color fidelity, and luminance stability. In contrast, the professional- and consumer-grade displays showed lower concordance (85–95%) and received inferior subjective evaluation. For IHC slides, both observers achieved 100% concordance for HER2 and AMACR markers across all displays. One discordant MLH1 case was noted on the medical-grade display for one observer out of five cases. Although MLH1 is consistently analyzed alongside other MSI markers, this isolated discrepancy precludes firm conclusions about display performance for this marker. These findings highlight the importance of clinically validated displays with automated calibration and quality assurance mechanisms to ensure consistent diagnostic performance. As digital pathology becomes increasingly integrated with remote workflows, establishing minimum performance standards for display systems will be essential to safeguard diagnostic accuracy, reproducibility, and patient safety in clinical practice.</div></div>","PeriodicalId":37769,"journal":{"name":"Journal of Pathology Informatics","volume":"20 ","pages":"Article 100544"},"PeriodicalIF":0.0,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146188858","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MuCoSA: Multi-contextual similarity assessment for histopathology image search 粘膜:组织病理学图像搜索的多上下文相似性评估
Q2 Medicine Pub Date : 2026-01-01 Epub Date: 2025-12-04 DOI: 10.1016/j.jpi.2025.100533
Gyu Yeong Kim , Yongjun Jeon , Hoyeon Jeong , Seungkyun Lee , Hyungbin Kim , Boram Lee , Kyu-Hwan Jung , David Joon Ho , Yoon-La Choi
Histological diagnosis in pathology often begins with visual identification of morphological patterns that resemble known disease entities. Whereas digital pathology and augmented intelligence have enabled reverse image search in histopathology, most existing frameworks rely on single-magnification image patches and slide-level labels, limiting their ability to capture tissue morphologies with at various levels of granularity and scale. To address this, we propose Progressive Regional Image Sequence by Magnification (PRISM), a sequence of images that captures contextual information across multiple magnifications. Building on this, we introduce Multi-Contextual Similarity Assessment (MuCoSA), a PRISM-based image retrieval framework that leverages pre-trained feature encoders without additional fine-tuning. To evaluate MuCoSA's performance, we constructed reference and query PRISM datasets utilizing histological patterns of lung adenocarcinoma. Whole-slide image (WSIs) from Samsung Medical Center were used as reference and internal query, whereas those from The Cancer Genome Atlas served as external query. For each WSI, representative regions of interest were selected and converted into PRISM structure. Similarity between two PRISMs was calculated by averaging the cosine similarities of corresponding patches at the same magnification level across all magnifications. We conducted a comprehensive evaluation across different feature encoders, magnification levels, receptive field sizes, and both internal/external query conditions. As a result, MuCoSA with a multi-magnification outperformed single-magnification methods. For instance, MuCoSA using UNI2-h encoder with eight magnifications achieved an F1-score of 0.8051 (95% CI: 0.7843–0.8267), mAP@5 of 0.7065 (95% CI: 0.6893–0.7250), and mMV@5 of 0.8232 (95% CI: 0.8038–0.8434), all significantly higher than the single-magnification baseline (p < 0.0001). Furthermore, confusion matrices and visual inspection confirmed that search results were closely aligned with the morphological perceptions of pathologists. In conclusion, our study demonstrates that using a simple and efficient framework like MuCoSA with multi-magnification PRISM without fine-tuning can significantly improve image search in histopathology. We anticipate that MuCoSA can assist pathologists in making more accurate and consistent diagnoses, thereby reducing observer variability in histopathological interpretation.
病理学中的组织学诊断通常从与已知疾病实体相似的形态学模式的视觉识别开始。尽管数字病理学和增强智能已经在组织病理学中实现了反向图像搜索,但大多数现有框架依赖于单倍放大图像补丁和幻灯片级标签,限制了它们在不同粒度和尺度上捕获组织形态的能力。为了解决这个问题,我们提出了通过放大的渐进区域图像序列(PRISM),这是一个图像序列,可以在多个放大倍数中捕获上下文信息。在此基础上,我们引入了多上下文相似性评估(黏膜),这是一种基于prism的图像检索框架,它利用预训练的特征编码器而无需额外的微调。为了评估粘膜的性能,我们利用肺腺癌的组织学模式构建了参考和查询PRISM数据集。以三星医疗中心的全幻灯片图像(WSIs)作为参考和内部查询,以癌症基因组图谱(The Cancer Genome Atlas)作为外部查询。对于每个WSI,选择有代表性的感兴趣区域并转换为PRISM结构。两个棱镜之间的相似性是通过平均余弦相似度对应的补丁在相同的放大水平在所有放大计算。我们在不同的特征编码器、放大水平、接受域大小和内部/外部查询条件下进行了全面的评估。结果,粘膜与多倍放大优于单倍放大方法。例如,使用8倍放大UNI2-h编码器的粘膜的f1评分为0.8051 (95% CI: 0.7843-0.8267), mAP@5为0.7065 (95% CI: 0.6893-0.7250), mMV@5为0.8232 (95% CI: 0.8038-0.8434),均显著高于单倍放大基线(p <; 0.0001)。此外,混淆矩阵和目视检查证实,搜索结果与病理学家的形态学感知密切相关。综上所述,我们的研究表明,使用简单高效的黏膜多倍棱镜框架,无需微调,可以显著提高组织病理学中的图像搜索。我们预计粘膜可以帮助病理学家做出更准确和一致的诊断,从而减少组织病理学解释的观察者差异。
{"title":"MuCoSA: Multi-contextual similarity assessment for histopathology image search","authors":"Gyu Yeong Kim ,&nbsp;Yongjun Jeon ,&nbsp;Hoyeon Jeong ,&nbsp;Seungkyun Lee ,&nbsp;Hyungbin Kim ,&nbsp;Boram Lee ,&nbsp;Kyu-Hwan Jung ,&nbsp;David Joon Ho ,&nbsp;Yoon-La Choi","doi":"10.1016/j.jpi.2025.100533","DOIUrl":"10.1016/j.jpi.2025.100533","url":null,"abstract":"<div><div>Histological diagnosis in pathology often begins with visual identification of morphological patterns that resemble known disease entities. Whereas digital pathology and augmented intelligence have enabled reverse image search in histopathology, most existing frameworks rely on single-magnification image patches and slide-level labels, limiting their ability to capture tissue morphologies with at various levels of granularity and scale. To address this, we propose Progressive Regional Image Sequence by Magnification (PRISM), a sequence of images that captures contextual information across multiple magnifications. Building on this, we introduce Multi-Contextual Similarity Assessment (MuCoSA), a PRISM-based image retrieval framework that leverages pre-trained feature encoders without additional fine-tuning. To evaluate MuCoSA's performance, we constructed reference and query PRISM datasets utilizing histological patterns of lung adenocarcinoma. Whole-slide image (WSIs) from Samsung Medical Center were used as reference and internal query, whereas those from The Cancer Genome Atlas served as external query. For each WSI, representative regions of interest were selected and converted into PRISM structure. Similarity between two PRISMs was calculated by averaging the cosine similarities of corresponding patches at the same magnification level across all magnifications. We conducted a comprehensive evaluation across different feature encoders, magnification levels, receptive field sizes, and both internal/external query conditions. As a result, MuCoSA with a multi-magnification outperformed single-magnification methods. For instance, MuCoSA using UNI2-h encoder with eight magnifications achieved an F1-score of 0.8051 (95% CI: 0.7843–0.8267), mAP@5 of 0.7065 (95% CI: 0.6893–0.7250), and mMV@5 of 0.8232 (95% CI: 0.8038–0.8434), all significantly higher than the single-magnification baseline (<em>p</em> &lt; 0.0001). Furthermore, confusion matrices and visual inspection confirmed that search results were closely aligned with the morphological perceptions of pathologists. In conclusion, our study demonstrates that using a simple and efficient framework like MuCoSA with multi-magnification PRISM without fine-tuning can significantly improve image search in histopathology. We anticipate that MuCoSA can assist pathologists in making more accurate and consistent diagnoses, thereby reducing observer variability in histopathological interpretation.</div></div>","PeriodicalId":37769,"journal":{"name":"Journal of Pathology Informatics","volume":"20 ","pages":"Article 100533"},"PeriodicalIF":0.0,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145926432","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Applications and challenges of utilizing digital pathology and AI-enabled workflows in clinical trials 在临床试验中利用数字病理学和人工智能工作流程的应用和挑战
Q2 Medicine Pub Date : 2026-01-01 Epub Date: 2026-01-02 DOI: 10.1016/j.jpi.2025.100542
Manu Sebastian , Harsh Batra , Monika Lamba Saini , Staci Kearney , Lorcan Sherry , Serge Alexanian , Michael Cohen , William Weber , Joe Lennerz , Anil V. Parwani
This is a comprehensive review on current utilization and challenges of digital pathology adoption in clinical trials and aims to provide a broad view on its impact on pathology review processes in clinical trials. It provides an overview of current pathology review practices in clinical trials and unique advantages digital pathology adoption can offer. The key areas including existing workflows, use case scenarios in different disease areas in clinical trials, including but not limited to patient identification and pre-screening, and regulatory aspects have been described with relevance. In addition, the review delves into the integration of genomics, AI, image analysis, radiology, and advanced computational pathology, to propose measures to enhance clinical trial outcomes. The current regulatory landscape around digital pathology adoption and potential future advancements in this field are also discussed as appropriate.
本文对数字病理学在临床试验中的应用现状和面临的挑战进行了全面的综述,旨在就其对临床试验病理审查过程的影响提供一个广泛的观点。它提供了当前病理审查实践的概述,在临床试验和独特的优势,数字化病理采用可以提供。关键领域包括现有工作流程、临床试验中不同疾病领域的用例情景,包括但不限于患者识别和预筛选,以及相关的监管方面。此外,本文还深入探讨了基因组学、人工智能、图像分析、放射学和先进计算病理学的整合,提出了提高临床试验结果的措施。此外,本文还讨论了当前围绕数字病理学采用的监管格局以及该领域未来的潜在进展。
{"title":"Applications and challenges of utilizing digital pathology and AI-enabled workflows in clinical trials","authors":"Manu Sebastian ,&nbsp;Harsh Batra ,&nbsp;Monika Lamba Saini ,&nbsp;Staci Kearney ,&nbsp;Lorcan Sherry ,&nbsp;Serge Alexanian ,&nbsp;Michael Cohen ,&nbsp;William Weber ,&nbsp;Joe Lennerz ,&nbsp;Anil V. Parwani","doi":"10.1016/j.jpi.2025.100542","DOIUrl":"10.1016/j.jpi.2025.100542","url":null,"abstract":"<div><div>This is a comprehensive review on current utilization and challenges of digital pathology adoption in clinical trials and aims to provide a broad view on its impact on pathology review processes in clinical trials. It provides an overview of current pathology review practices in clinical trials and unique advantages digital pathology adoption can offer. The key areas including existing workflows, use case scenarios in different disease areas in clinical trials, including but not limited to patient identification and pre-screening, and regulatory aspects have been described with relevance. In addition, the review delves into the integration of genomics, AI, image analysis, radiology, and advanced computational pathology, to propose measures to enhance clinical trial outcomes. The current regulatory landscape around digital pathology adoption and potential future advancements in this field are also discussed as appropriate.</div></div>","PeriodicalId":37769,"journal":{"name":"Journal of Pathology Informatics","volume":"20 ","pages":"Article 100542"},"PeriodicalIF":0.0,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146037903","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Leveraging sequence-to-sequence models for semantic annotation of Dutch pathology reports 利用序列到序列模型对荷兰病理学报告进行语义注释
Q2 Medicine Pub Date : 2026-01-01 Epub Date: 2025-12-05 DOI: 10.1016/j.jpi.2025.100534
M. Siepel , G.T.N. Burger , Q.J.M. Voorham , R. Cornet , I. Calixto , I. Vagliano
Palga Foundation is responsible for indexing Dutch pathology data across the Netherlands, which relies on annotations of pathology reports. These annotations, derived from the conclusion text, consist of codes from the Palga thesaurus, serving patient care and scientific research. However, manual annotation by pathologists is both labor-intensive and prone to errors. Therefore, in this study, we seek to leverage sequence-to-sequence transformer models, particularly Text-To-Text Transfer Transformer (T5)-based models, to generate these annotations. Additionally, we investigate a constrained decoding (CD) approach that encodes domain knowledge. We compare a standard multilingual T5 model (mT5) with our own T5 model (PaTh5.NL) pre-trained using Palga data with the goal of better aligning the model's learned representations with the specific structure, terminology, and annotation conventions used in Dutch pathology reports. We fine-tune both pre-trained models using default (DD) and CD and compare both decoding strategies. Performance is assessed using Bilingual Evaluation Understudy (BLEU) scores for quantitative evaluation and case-based evaluations for qualitative assessment, where we use the generated codes to retrieve patients from the Palga database. Quantitative evaluations indicated that our two fine-tuned PaTh5.NL models significantly outperformed the fine-tuned mT5 model, particularly for shorter histology and cytology reports, but performance of all models declined on longer or complex reports. The case-based evaluation revealed that, despite higher BLEU scores, the PaTh5.NL models did not consistently outperform the mT5 model in retrieving relevant patients. This study demonstrates that fine-tuned T5-based models can enhance the annotation process for Dutch pathology reports, though challenges remain regarding complex conclusion texts, especially in histology and autopsy reports. Future research should focus on expanding gold-standard datasets and developing post-processing algorithms to improve annotations' generalization.
帕尔加基金会负责索引荷兰病理数据在整个荷兰,这依赖于病理报告的注释。这些注解,源自结论文本,由来自帕尔加同义词典的代码组成,服务于病人护理和科学研究。然而,病理学家的手工注释既费力又容易出错。因此,在本研究中,我们寻求利用序列到序列转换器模型,特别是基于文本到文本传输转换器(T5)的模型,来生成这些注释。此外,我们研究了一种对领域知识进行编码的约束解码(CD)方法。我们将标准的多语种T5模型(mT5)与我们自己的T5模型(PaTh5.NL)进行比较,该模型使用Palga数据进行预训练,目的是更好地将模型的学习表征与荷兰语病理报告中使用的特定结构、术语和注释惯例相一致。我们使用默认(DD)和CD对预训练模型进行微调,并比较两种解码策略。使用双语评估替代研究(BLEU)分数进行定量评估,使用基于病例的评估进行定性评估,其中我们使用生成的代码从Palga数据库检索患者。定量评估表明,我们的两个微调PaTh5。NL模型的表现明显优于经过微调的mT5模型,特别是对于较短的组织学和细胞学报告,但所有模型的表现在较长或复杂的报告中都有所下降。基于病例的评估显示,尽管BLEU得分较高,但PaTh5。在检索相关患者时,NL模型的表现并不总是优于mT5模型。本研究表明,微调的基于t5的模型可以增强荷兰病理学报告的注释过程,尽管在复杂的结论文本方面仍然存在挑战,特别是在组织学和尸检报告中。未来的研究应该集中在扩展金标准数据集和开发后处理算法以提高注释的泛化。
{"title":"Leveraging sequence-to-sequence models for semantic annotation of Dutch pathology reports","authors":"M. Siepel ,&nbsp;G.T.N. Burger ,&nbsp;Q.J.M. Voorham ,&nbsp;R. Cornet ,&nbsp;I. Calixto ,&nbsp;I. Vagliano","doi":"10.1016/j.jpi.2025.100534","DOIUrl":"10.1016/j.jpi.2025.100534","url":null,"abstract":"<div><div>Palga Foundation is responsible for indexing Dutch pathology data across the Netherlands, which relies on annotations of pathology reports. These annotations, derived from the conclusion text, consist of codes from the Palga thesaurus, serving patient care and scientific research. However, manual annotation by pathologists is both labor-intensive and prone to errors. Therefore, in this study, we seek to leverage sequence-to-sequence transformer models, particularly Text-To-Text Transfer Transformer (T5)-based models, to generate these annotations. Additionally, we investigate a constrained decoding (CD) approach that encodes domain knowledge. We compare a standard multilingual T5 model (mT5) with our own T5 model (PaTh5.NL) pre-trained using Palga data with the goal of better aligning the model's learned representations with the specific structure, terminology, and annotation conventions used in Dutch pathology reports. We fine-tune both pre-trained models using default (DD) and CD and compare both decoding strategies. Performance is assessed using Bilingual Evaluation Understudy (BLEU) scores for quantitative evaluation and case-based evaluations for qualitative assessment, where we use the generated codes to retrieve patients from the Palga database. Quantitative evaluations indicated that our two fine-tuned PaTh5.NL models significantly outperformed the fine-tuned mT5 model, particularly for shorter histology and cytology reports, but performance of all models declined on longer or complex reports. The case-based evaluation revealed that, despite higher BLEU scores, the PaTh5.NL models did not consistently outperform the mT5 model in retrieving relevant patients. This study demonstrates that fine-tuned T5-based models can enhance the annotation process for Dutch pathology reports, though challenges remain regarding complex conclusion texts, especially in histology and autopsy reports. Future research should focus on expanding gold-standard datasets and developing post-processing algorithms to improve annotations' generalization.</div></div>","PeriodicalId":37769,"journal":{"name":"Journal of Pathology Informatics","volume":"20 ","pages":"Article 100534"},"PeriodicalIF":0.0,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145841456","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Iris RESTful Server and IrisTileSource: An Iris implementation for existing OpenSeaDragon viewers Iris RESTful Server和IrisTileSource:现有opensedragon查看器的Iris实现
Q2 Medicine Pub Date : 2026-01-01 Epub Date: 2025-11-17 DOI: 10.1016/j.jpi.2025.100530
Ryan Erik Landvater, Navin Kathawa, Mustafa Yousif, Ulysses Balis
The Iris File Extension (IFE) is a low overhead performance-oriented whole-slide image (WSI) file format designed to improve the image rendering experience for pathologists and simplify image management for system administrators. However, static hypertext transfer protocol (HTTP) file servers cannot natively stream subregions of high-resolution image files, such as the IFE. The majority of contemporary WSI viewer systems are designed as browser-based web applications and leverage OpenSeaDragon as the tile-based rendering framework. These systems convert WSI files to Deep Zoom Images (DZI) for compatibility with simple static HTTP file servers. To address this limitation, we have developed the Iris RESTful Server, a low-overhead HTTP server with a RESTful application programming interface (API) that is natively compatible with the DICOMweb WADO-RS API. Written in C++ with Boost Beast HTTP and Asio networking libraries atop the public IFE libraries, the server offers both security and high performance. Testing shows that a single Raspberry Pi equivalent system can handle a peak of 5061 req./s (average 3883 req./s) with a median latency of 21 ms on a private (i.e., hospital) network. We also developed and merged a new OpenSeaDragon TileSource, compatible with the Iris RESTful API, into the next OpenSeaDragon release, enabling simple and immediate drop-in replacement of DZI images within WSI viewer stacks. Designed as a secure cross-origin resource sharing microservice, this architecture includes detailed deployment instructions for new or existing WSI workflows, and the public examples.restful.irisdigitalpathology.org subdomain is provided as a development tool to accelerate WSI web viewer development. All relevant Iris software is available under the open-source MIT software license.
虹膜文件扩展(IFE)是一种低开销、以性能为导向的全幻灯片图像(WSI)文件格式,旨在改善病理学家的图像渲染体验,简化系统管理员的图像管理。但是,静态超文本传输协议(HTTP)文件服务器不能本地流式传输高分辨率图像文件的子区域,例如IFE。大多数当代WSI查看器系统被设计为基于浏览器的web应用程序,并利用opensedragon作为基于磁贴的渲染框架。这些系统将WSI文件转换为深度缩放图像(DZI),以便与简单的静态HTTP文件服务器兼容。为了解决这个限制,我们开发了Iris RESTful Server,这是一个低开销的HTTP服务器,具有RESTful应用程序编程接口(API),与DICOMweb WADO-RS API本地兼容。该服务器使用c++编写,在公共IFE库之上使用Boost Beast HTTP和Asio网络库,提供了安全性和高性能。测试表明,一个等效的树莓派系统可以处理5061 req的峰值。/s(平均3883 要求。/s),在私有(即医院)网络上的中位延迟为21 ms。我们还开发并合并了一个新的openseaddragon tile资源,与Iris RESTful API兼容,到下一个openseaddragon版本中,可以在WSI查看器堆栈中简单而直接地替换DZI图像。该架构被设计为安全的跨域资源共享微服务,包括新的或现有的WSI工作流的详细部署说明,并提供公共example.restful.irisdigitalpathology.org子域作为加速WSI web查看器开发的开发工具。所有相关的虹膜软件都是在开源的MIT软件许可下提供的。
{"title":"Iris RESTful Server and IrisTileSource: An Iris implementation for existing OpenSeaDragon viewers","authors":"Ryan Erik Landvater,&nbsp;Navin Kathawa,&nbsp;Mustafa Yousif,&nbsp;Ulysses Balis","doi":"10.1016/j.jpi.2025.100530","DOIUrl":"10.1016/j.jpi.2025.100530","url":null,"abstract":"<div><div>The Iris File Extension (IFE) is a low overhead performance-oriented whole-slide image (WSI) file format designed to improve the image rendering experience for pathologists and simplify image management for system administrators. However, static hypertext transfer protocol (HTTP) file servers cannot natively stream subregions of high-resolution image files, such as the IFE. The majority of contemporary WSI viewer systems are designed as browser-based web applications and leverage OpenSeaDragon as the tile-based rendering framework. These systems convert WSI files to Deep Zoom Images (DZI) for compatibility with simple static HTTP file servers. To address this limitation, we have developed the Iris RESTful Server, a low-overhead HTTP server with a RESTful application programming interface (API) that is natively compatible with the DICOMweb WADO-RS API. Written in C++ with Boost Beast HTTP and Asio networking libraries atop the public IFE libraries, the server offers both security and high performance. Testing shows that a single Raspberry Pi equivalent system can handle a peak of 5061 req./s (average 3883 req./s) with a median latency of 21 ms on a private (i.e., hospital) network. We also developed and merged a new OpenSeaDragon TileSource, compatible with the Iris RESTful API, into the next OpenSeaDragon release, enabling simple and immediate drop-in replacement of DZI images within WSI viewer stacks. Designed as a secure cross-origin resource sharing microservice, this architecture includes detailed deployment instructions for new or existing WSI workflows, and the public <span><span>examples.restful.irisdigitalpathology.org</span><svg><path></path></svg></span> subdomain is provided as a development tool to accelerate WSI web viewer development. All relevant Iris software is available under the open-source MIT software license.</div></div>","PeriodicalId":37769,"journal":{"name":"Journal of Pathology Informatics","volume":"20 ","pages":"Article 100530"},"PeriodicalIF":0.0,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145977811","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An integrated system from microscopy to AI for real-time object detection in endometrial cytology 从显微镜到人工智能的集成系统,用于子宫内膜细胞学中的实时目标检测
Q2 Medicine Pub Date : 2026-01-01 Epub Date: 2025-12-31 DOI: 10.1016/j.jpi.2025.100541
Mika Terasaki , Shun Tanaka , Ichito Shimokawa , Etsuko Toda , Shoichiro Takakuma , Yusuke Kajimoto , Shinobu Kunugi , Akira Shimizu , Yasuhiro Terasaki
Endometrial cytology, which is minimally invasive and available as an outpatient procedure, is widely used in Japan for early detection of endometrial cancer, but its diagnostic process is time-consuming and requires expert diagnosticians. We developed a real-time artificial intelligence (AI)-assisted system using a standard microscope, a charge-coupled device (CCD) camera, and the You-Only-Look-Once version 5x (YOLOv5x) (a well-established object detection model) to support endometrial cytology screening in resource-limited settings. A total of 146 pre-operative cytology cases were collected, and the model was trained to detect abnormal cell clusters. The system was evaluated in real-time using a CCD camera, and its diagnostic performance was compared with that of three pathologists and four medical students. In an independent test of 20 cases, the AI model achieved an accuracy of 85%, showing promising performance comparable to the average accuracy of 75% among human evaluators. Furthermore, the median diagnostic time was reduced by approximately 45% with AI assistance. The impact of AI support varied by user expertise, with notable improvements among non-specialists. This proof-of-concept study demonstrates the feasibility and potential of affordable, real-time AI support for endometrial cytology using widely available equipment. Further validation with larger, multicenter datasets is warranted to confirm the generalizability and clinical utility of this approach.
子宫内膜细胞学检查是一种微创的门诊手术,在日本广泛用于子宫内膜癌的早期检测,但其诊断过程耗时且需要专家诊断。我们开发了一种实时人工智能(AI)辅助系统,使用标准显微镜,电荷耦耦器(CCD)相机和You-Only-Look-Once version 5x (YOLOv5x)(一种完善的对象检测模型)来支持资源有限环境下的子宫内膜细胞学筛查。收集术前细胞学检查病例146例,训练模型检测异常细胞簇。利用CCD摄像机对该系统进行实时评价,并与3名病理学家和4名医学生的诊断效果进行比较。在20个案例的独立测试中,人工智能模型达到了85%的准确率,与人类评估者75%的平均准确率相当。此外,在人工智能的帮助下,中位诊断时间减少了约45%。人工智能支持的影响因用户的专业知识而异,在非专业人士中有显著改善。这项概念验证研究表明,使用广泛可用的设备,可负担得起的实时人工智能支持子宫内膜细胞学的可行性和潜力。在更大的、多中心的数据集上进一步验证是有必要的,以确认该方法的普遍性和临床实用性。
{"title":"An integrated system from microscopy to AI for real-time object detection in endometrial cytology","authors":"Mika Terasaki ,&nbsp;Shun Tanaka ,&nbsp;Ichito Shimokawa ,&nbsp;Etsuko Toda ,&nbsp;Shoichiro Takakuma ,&nbsp;Yusuke Kajimoto ,&nbsp;Shinobu Kunugi ,&nbsp;Akira Shimizu ,&nbsp;Yasuhiro Terasaki","doi":"10.1016/j.jpi.2025.100541","DOIUrl":"10.1016/j.jpi.2025.100541","url":null,"abstract":"<div><div>Endometrial cytology, which is minimally invasive and available as an outpatient procedure, is widely used in Japan for early detection of endometrial cancer, but its diagnostic process is time-consuming and requires expert diagnosticians. We developed a real-time artificial intelligence (AI)-assisted system using a standard microscope, a charge-coupled device (CCD) camera, and the You-Only-Look-Once version 5x (YOLOv5x) (a well-established object detection model) to support endometrial cytology screening in resource-limited settings. A total of 146 pre-operative cytology cases were collected, and the model was trained to detect abnormal cell clusters. The system was evaluated in real-time using a CCD camera, and its diagnostic performance was compared with that of three pathologists and four medical students. In an independent test of 20 cases, the AI model achieved an accuracy of 85%, showing promising performance comparable to the average accuracy of 75% among human evaluators. Furthermore, the median diagnostic time was reduced by approximately 45% with AI assistance. The impact of AI support varied by user expertise, with notable improvements among non-specialists. This proof-of-concept study demonstrates the feasibility and potential of affordable, real-time AI support for endometrial cytology using widely available equipment. Further validation with larger, multicenter datasets is warranted to confirm the generalizability and clinical utility of this approach.</div></div>","PeriodicalId":37769,"journal":{"name":"Journal of Pathology Informatics","volume":"20 ","pages":"Article 100541"},"PeriodicalIF":0.0,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146037908","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ADPv2: A hierarchical histological tissue type-annotated dataset for potential biomarker discovery of colorectal disease ADPv2:一个分层组织类型注释数据集,用于发现结直肠疾病的潜在生物标志物
Q2 Medicine Pub Date : 2026-01-01 Epub Date: 2025-12-23 DOI: 10.1016/j.jpi.2025.100537
Zhiyuan Yang , Kai Li , Sophia Ghamoshi Ramandi , Patricia Brassard , Abdelhakim Khellaf , Vincent Quoc-Huy Trinh , Jennifer Zhang , Lina Chen , Corwyn Rowsell , Sonal Varma , Kostas Plataniotis , Mahdi S. Hosseini
Computational pathology (CPath) leverages histopathology images to enhance diagnostic precision and reproducibility in clinical pathology. However, publicly available datasets for CPath that are annotated with extensive histological tissue type (HTT) taxonomies at a granular level remain scarce due to the significant expertise and high annotation costs required. Existing datasets, such as the Atlas of Digital Pathology (ADP), address this by offering diverse HTT annotations generalized to multiple organs, but limit the capability for in-depth studies on specific organ diseases. Building upon this foundation, we introduce ADPv2, a novel dataset focused on gastrointestinal histopathology. Our dataset comprises 20,004 image patches derived from healthy colon biopsy slides, annotated according to a hierarchical taxonomy of 32 distinct HTTs of 3 levels. Furthermore, we train a multilabel representation learning model following a two-stage training procedure on our ADPv2 dataset. By leveraging the VMamba model architecture, we achieve a mean average precision of 0.88 in multilabel colon HTT classification.. Finally, we show that our dataset is capable of an organ-specific in-depth study for potential biomarker discovery by analyzing the model's prediction behavior on tissues affected by different colon diseases, which reveals statistical patterns that confirm the two pathological pathways of colon cancer development. Our dataset is publicly available here: Part 1, Part 2, and Part 3.
计算病理学(CPath)利用组织病理学图像来提高临床病理学诊断的准确性和可重复性。然而,由于需要大量的专业知识和高昂的注释成本,在粒度级别上用广泛的组织学组织类型(HTT)分类法注释的CPath公开可用的数据集仍然很少。现有的数据集,如数字病理图谱(ADP),通过提供不同的HTT注释来解决这个问题,但限制了对特定器官疾病进行深入研究的能力。在此基础上,我们介绍了ADPv2,一个专注于胃肠道组织病理学的新数据集。我们的数据集包括来自健康结肠活检切片的200004个图像补丁,根据3个级别的32个不同的HTTs的分层分类法进行注释。此外,我们根据ADPv2数据集的两阶段训练过程训练了一个多标签表示学习模型。通过利用vamba模型架构,我们在多标签冒号HTT分类中实现了0.88的平均精度。最后,我们通过分析模型对受不同结肠疾病影响的组织的预测行为,表明我们的数据集能够对潜在的生物标志物发现进行器官特异性的深入研究,这揭示了确认结肠癌发展的两种病理途径的统计模式。我们的数据集可以在这里公开获取:第1部分、第2部分和第3部分。
{"title":"ADPv2: A hierarchical histological tissue type-annotated dataset for potential biomarker discovery of colorectal disease","authors":"Zhiyuan Yang ,&nbsp;Kai Li ,&nbsp;Sophia Ghamoshi Ramandi ,&nbsp;Patricia Brassard ,&nbsp;Abdelhakim Khellaf ,&nbsp;Vincent Quoc-Huy Trinh ,&nbsp;Jennifer Zhang ,&nbsp;Lina Chen ,&nbsp;Corwyn Rowsell ,&nbsp;Sonal Varma ,&nbsp;Kostas Plataniotis ,&nbsp;Mahdi S. Hosseini","doi":"10.1016/j.jpi.2025.100537","DOIUrl":"10.1016/j.jpi.2025.100537","url":null,"abstract":"<div><div>Computational pathology (CPath) leverages histopathology images to enhance diagnostic precision and reproducibility in clinical pathology. However, publicly available datasets for CPath that are annotated with extensive histological tissue type (HTT) taxonomies at a granular level remain scarce due to the significant expertise and high annotation costs required. Existing datasets, such as the Atlas of Digital Pathology (ADP), address this by offering diverse HTT annotations generalized to multiple organs, but limit the capability for in-depth studies on specific organ diseases. Building upon this foundation, we introduce ADPv2, a novel dataset focused on gastrointestinal histopathology. Our dataset comprises 20,004 image patches derived from healthy colon biopsy slides, annotated according to a hierarchical taxonomy of 32 distinct HTTs of 3 levels. Furthermore, we train a multilabel representation learning model following a two-stage training procedure on our ADPv2 dataset. By leveraging the VMamba model architecture, we achieve a mean average precision of 0.88 in multilabel colon HTT classification.. Finally, we show that our dataset is capable of an organ-specific in-depth study for potential biomarker discovery by analyzing the model's prediction behavior on tissues affected by different colon diseases, which reveals statistical patterns that confirm the two pathological pathways of colon cancer development. Our dataset is publicly available here: Part 1, Part 2, and Part 3.</div></div>","PeriodicalId":37769,"journal":{"name":"Journal of Pathology Informatics","volume":"20 ","pages":"Article 100537"},"PeriodicalIF":0.0,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146077926","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
PANDA-PLUS: Improved dataset of prostate whole slide images from PANDA Challenge with pixel-level expert annotations PANDA- plus:改进的来自PANDA Challenge的前列腺整张幻灯片图像数据集,具有像素级专家注释
Q2 Medicine Pub Date : 2026-01-01 Epub Date: 2025-12-30 DOI: 10.1016/j.jpi.2025.100540
Spencer Hopson, Carson Mildon, Corbyn Kubalek, Joshua Ebbert, Ryan Vance, Lauren Laverty, Paul Urie, Dennis Della Corte
Artificial intelligence (AI)-based prostate cancer detection through whole slide images (WSIs) offers promising potential to address the global pathologist shortage while improving clinical consistency. Digital slides and improving image analysis methods encourage the creation of tools to aid in WSI classification. Despite promising advances, these tools are still limited by available training data. Current publicly available datasets, such as Kaggle's PANDA Challenge, while large in scale, rely on slide-level labels that may introduce noise and limit model reliability. Others contain detailed annotations, but are smaller in size due to manual processing efforts. In this work, we introduce PANDA-PLUS, a 546-image dataset derived from PANDA images with improved pixel-level annotations, as well as an accompanying annotation pipeline that reduces pathologists' time commitment. We present a detailed comparative analysis between PANDA-PLUS and PANDA using Gleason score and ISUP grade, supported by agreement values, κ, and PABAK under multiple weighting schemes. The results demonstrate consistently lower grading in PANDA-PLUS, with disagreement patterns especially pronounced at higher grades. We also demonstrate through single rater grading of various annotation granularities how slide- and patch-level labels may distort grading proportions and alter image scores. PANDA-PLUS not only improves annotation granularity and reduces label noise but also exposes potential grading errors in the original PANDA dataset. We present PANDA-PLUS's annotations as an improved alternative to the PANDA labels and conclude that it represents a step forward in the development of higher-quality public datasets for clinical AI applications in prostate cancer pathology.
基于人工智能(AI)的全幻灯片图像前列腺癌检测在解决全球病理学家短缺问题的同时提高了临床一致性。数字幻灯片和改进的图像分析方法鼓励创建工具来帮助WSI分类。尽管有了很大的进步,但这些工具仍然受到现有训练数据的限制。目前可公开获得的数据集,如Kaggle的PANDA挑战,虽然规模很大,但依赖于可能引入噪声和限制模型可靠性的滑动级标签。其他一些包含详细的注释,但由于手工处理工作,其大小较小。在这项工作中,我们介绍了PANDA- plus,这是一个来自PANDA图像的546张图像数据集,具有改进的像素级注释,以及附带的注释管道,可以减少病理学家的时间投入。我们使用Gleason评分和ISUP评分,并在多重加权方案下使用协议值、κ和PABAK支持,对PANDA- plus和PANDA进行了详细的比较分析。结果显示,PANDA-PLUS的评分始终较低,在较高的评分中,差异模式尤为明显。我们还通过单个分级器对各种标注粒度进行分级,演示了幻灯片级和补丁级标签如何扭曲分级比例并改变图像分数。PANDA- plus不仅提高了标注粒度,减少了标注噪声,而且暴露了原始PANDA数据集中潜在的分级错误。我们提出PANDA- plus的注释作为PANDA标签的改进替代品,并得出结论,它代表了在开发更高质量的公共数据集方面向前迈进了一步,用于临床人工智能在前列腺癌病理学中的应用。
{"title":"PANDA-PLUS: Improved dataset of prostate whole slide images from PANDA Challenge with pixel-level expert annotations","authors":"Spencer Hopson,&nbsp;Carson Mildon,&nbsp;Corbyn Kubalek,&nbsp;Joshua Ebbert,&nbsp;Ryan Vance,&nbsp;Lauren Laverty,&nbsp;Paul Urie,&nbsp;Dennis Della Corte","doi":"10.1016/j.jpi.2025.100540","DOIUrl":"10.1016/j.jpi.2025.100540","url":null,"abstract":"<div><div>Artificial intelligence (AI)-based prostate cancer detection through whole slide images (WSIs) offers promising potential to address the global pathologist shortage while improving clinical consistency. Digital slides and improving image analysis methods encourage the creation of tools to aid in WSI classification. Despite promising advances, these tools are still limited by available training data. Current publicly available datasets, such as Kaggle's PANDA Challenge, while large in scale, rely on slide-level labels that may introduce noise and limit model reliability. Others contain detailed annotations, but are smaller in size due to manual processing efforts. In this work, we introduce PANDA-PLUS, a 546-image dataset derived from PANDA images with improved pixel-level annotations, as well as an accompanying annotation pipeline that reduces pathologists' time commitment. We present a detailed comparative analysis between PANDA-PLUS and PANDA using Gleason score and ISUP grade, supported by agreement values, κ, and PABAK under multiple weighting schemes. The results demonstrate consistently lower grading in PANDA-PLUS, with disagreement patterns especially pronounced at higher grades. We also demonstrate through single rater grading of various annotation granularities how slide- and patch-level labels may distort grading proportions and alter image scores. PANDA-PLUS not only improves annotation granularity and reduces label noise but also exposes potential grading errors in the original PANDA dataset. We present PANDA-PLUS's annotations as an improved alternative to the PANDA labels and conclude that it represents a step forward in the development of higher-quality public datasets for clinical AI applications in prostate cancer pathology.</div></div>","PeriodicalId":37769,"journal":{"name":"Journal of Pathology Informatics","volume":"20 ","pages":"Article 100540"},"PeriodicalIF":0.0,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146037910","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of Pathology Informatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1