Abnormality-aware multimodal learning for WSI classification.

IF 3.1 3区 医学 Q1 MEDICINE, GENERAL & INTERNAL Frontiers in Medicine Pub Date : 2025-02-25 eCollection Date: 2025-01-01 DOI:10.3389/fmed.2025.1546452
Thao M Dang, Qifeng Zhou, Yuzhi Guo, Hehuan Ma, Saiyang Na, Thao Bich Dang, Jean Gao, Junzhou Huang
{"title":"Abnormality-aware multimodal learning for WSI classification.","authors":"Thao M Dang, Qifeng Zhou, Yuzhi Guo, Hehuan Ma, Saiyang Na, Thao Bich Dang, Jean Gao, Junzhou Huang","doi":"10.3389/fmed.2025.1546452","DOIUrl":null,"url":null,"abstract":"<p><p>Whole slide images (WSIs) play a vital role in cancer diagnosis and prognosis. However, their gigapixel resolution, lack of pixel-level annotations, and reliance on unimodal visual data present challenges for accurate and efficient computational analysis. Existing methods typically divide WSIs into thousands of patches, which increases computational demands and makes it challenging to effectively focus on diagnostically relevant regions. Furthermore, these methods frequently rely on feature extractors pretrained on natural images, which are not optimized for pathology tasks, and overlook multimodal data sources such as cellular and textual information that can provide critical insights. To address these limitations, we propose the <b>A</b>bnormality-<b>A</b>ware <b>M</b>ulti<b>M</b>odal (AAMM) learning framework, which integrates abnormality detection and multimodal feature learning for WSI classification. AAMM incorporates a Gaussian Mixture Variational Autoencoder (GMVAE) to identify and select the most informative patches, reducing computational complexity while retaining critical diagnostic information. It further integrates multimodal features from pathology-specific foundation models, combining patch-level, cell-level, and text-level representations through cross-attention mechanisms. This approach enhances the ability to comprehensively analyze WSIs for cancer diagnosis and subtyping. Extensive experiments on normal-tumor classification and cancer subtyping demonstrate that AAMM achieves superior performance compared to state-of-the-art methods. By combining abnormal detection with multimodal feature integration, our framework offers an efficient and scalable solution for advancing computational pathology.</p>","PeriodicalId":12488,"journal":{"name":"Frontiers in Medicine","volume":"12 ","pages":"1546452"},"PeriodicalIF":3.1000,"publicationDate":"2025-02-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11893561/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3389/fmed.2025.1546452","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"MEDICINE, GENERAL & INTERNAL","Score":null,"Total":0}
引用次数: 0

Abstract

Whole slide images (WSIs) play a vital role in cancer diagnosis and prognosis. However, their gigapixel resolution, lack of pixel-level annotations, and reliance on unimodal visual data present challenges for accurate and efficient computational analysis. Existing methods typically divide WSIs into thousands of patches, which increases computational demands and makes it challenging to effectively focus on diagnostically relevant regions. Furthermore, these methods frequently rely on feature extractors pretrained on natural images, which are not optimized for pathology tasks, and overlook multimodal data sources such as cellular and textual information that can provide critical insights. To address these limitations, we propose the Abnormality-Aware MultiModal (AAMM) learning framework, which integrates abnormality detection and multimodal feature learning for WSI classification. AAMM incorporates a Gaussian Mixture Variational Autoencoder (GMVAE) to identify and select the most informative patches, reducing computational complexity while retaining critical diagnostic information. It further integrates multimodal features from pathology-specific foundation models, combining patch-level, cell-level, and text-level representations through cross-attention mechanisms. This approach enhances the ability to comprehensively analyze WSIs for cancer diagnosis and subtyping. Extensive experiments on normal-tumor classification and cancer subtyping demonstrate that AAMM achieves superior performance compared to state-of-the-art methods. By combining abnormal detection with multimodal feature integration, our framework offers an efficient and scalable solution for advancing computational pathology.

Abstract Image

Abstract Image

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
异常感知多模态学习的WSI分类。
全幻灯片图像在肿瘤诊断和预后中起着至关重要的作用。然而,它们的十亿像素分辨率,缺乏像素级注释,以及对单模态视觉数据的依赖,为准确和高效的计算分析带来了挑战。现有方法通常将wsi划分为数千个补丁,这增加了计算量,并且难以有效地关注诊断相关区域。此外,这些方法经常依赖于对自然图像进行预训练的特征提取器,而这些特征提取器并没有针对病理任务进行优化,并且忽略了可以提供关键见解的多模态数据源,如细胞和文本信息。为了解决这些限制,我们提出了异常感知多模态(AAMM)学习框架,该框架将异常检测和多模态特征学习集成到WSI分类中。AAMM结合了一个高斯混合变分自编码器(GMVAE)来识别和选择信息量最大的补丁,在保留关键诊断信息的同时降低了计算复杂性。它进一步集成了来自病理特异性基础模型的多模态特征,通过交叉注意机制结合了补丁级、细胞级和文本级的表示。该方法提高了综合分析wsi用于癌症诊断和分型的能力。在正常肿瘤分类和癌症亚型分型方面的大量实验表明,与最先进的方法相比,AAMM具有优越的性能。通过将异常检测与多模态特征集成相结合,我们的框架为推进计算病理学提供了高效且可扩展的解决方案。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Frontiers in Medicine
Frontiers in Medicine Medicine-General Medicine
CiteScore
5.10
自引率
5.10%
发文量
3710
审稿时长
12 weeks
期刊介绍: Frontiers in Medicine publishes rigorously peer-reviewed research linking basic research to clinical practice and patient care, as well as translating scientific advances into new therapies and diagnostic tools. Led by an outstanding Editorial Board of international experts, this multidisciplinary open-access journal is at the forefront of disseminating and communicating scientific knowledge and impactful discoveries to researchers, academics, clinicians and the public worldwide. In addition to papers that provide a link between basic research and clinical practice, a particular emphasis is given to studies that are directly relevant to patient care. In this spirit, the journal publishes the latest research results and medical knowledge that facilitate the translation of scientific advances into new therapies or diagnostic tools. The full listing of the Specialty Sections represented by Frontiers in Medicine is as listed below. As well as the established medical disciplines, Frontiers in Medicine is launching new sections that together will facilitate - the use of patient-reported outcomes under real world conditions - the exploitation of big data and the use of novel information and communication tools in the assessment of new medicines - the scientific bases for guidelines and decisions from regulatory authorities - access to medicinal products and medical devices worldwide - addressing the grand health challenges around the world
期刊最新文献
Aspirin vs. enoxaparin for thromboprophylaxis after total hip arthroplasty, total knee arthroplasty, or hip fracture surgery-a systematic review and meta-analysis. Diabetes mellitus is associated with an increased risk of postoperative neurocognitive disorders: a systematic review. Vimentin expression as a prognostic marker in pancreatic cancer: a systematic review and meta-analysis. Eye-tracking biomarkers of clinical expertise in ECG interpretation: statistical and machine learning evidence. A case report of intramural pregnancy with uterine rupture: a warning signal from ineffective evacuation.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1