NaMA-Mamba: Foundation model for generalizable nasal disease detection using masked autoencoder with Mamba on endoscopic images

IF 4.9 2区医学 Q1 ENGINEERING, BIOMEDICAL Computerized Medical Imaging and Graphics Pub Date : 2025-03-12 DOI:10.1016/j.compmedimag.2025.102524

Wensheng Wang , Zewen Jin , Xueli Liu , Xinrong Chen

{"title":"NaMA-Mamba: Foundation model for generalizable nasal disease detection using masked autoencoder with Mamba on endoscopic images","authors":"Wensheng Wang , Zewen Jin , Xueli Liu , Xinrong Chen","doi":"10.1016/j.compmedimag.2025.102524","DOIUrl":null,"url":null,"abstract":"<div><div>Artificial intelligence (AI) has shown great promise in analyzing nasal endoscopic images for disease detection. However, current AI systems require extensive expert-labeled data for each specific medical condition, limiting their applications. In this work, the challenge is addressed through two key innovations, the creation of the first large-scale pre-training dataset of nasal endoscopic images, and the development of a novel self-learning AI system specifically designed for nasal endoscopy, named NaMA-Mamba. In the proposed NaMA-Mamba model, two key technologies are utilized, which are the nasal endoscopic state space model (NE-SSM) for analyzing sequences of images and an enhanced learning mechanism (CoMAE) for capturing fine details in nasal tissues. These innovations enable the system to learn effectively from unlabeled images while maintaining high accuracy across different diagnostic tasks. In extensive testing, NaMA-Mamba achieved remarkable results using minimal labeled data, matching the performance of traditional systems that require full expert labeling while needing only 1% of the labeled data for tasks such as detecting nasal polyps and identifying nasopharyngeal cancer. These results demonstrate the potential of NaMA-Mamba to significantly improve the efficiency and accessibility of AI-assisted nasal disease diagnosis in clinical practice.</div></div>","PeriodicalId":50631,"journal":{"name":"Computerized Medical Imaging and Graphics","volume":"122 ","pages":"Article 102524"},"PeriodicalIF":4.9000,"publicationDate":"2025-03-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computerized Medical Imaging and Graphics","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0895611125000333","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}

引用次数: 0

Abstract

Artificial intelligence (AI) has shown great promise in analyzing nasal endoscopic images for disease detection. However, current AI systems require extensive expert-labeled data for each specific medical condition, limiting their applications. In this work, the challenge is addressed through two key innovations, the creation of the first large-scale pre-training dataset of nasal endoscopic images, and the development of a novel self-learning AI system specifically designed for nasal endoscopy, named NaMA-Mamba. In the proposed NaMA-Mamba model, two key technologies are utilized, which are the nasal endoscopic state space model (NE-SSM) for analyzing sequences of images and an enhanced learning mechanism (CoMAE) for capturing fine details in nasal tissues. These innovations enable the system to learn effectively from unlabeled images while maintaining high accuracy across different diagnostic tasks. In extensive testing, NaMA-Mamba achieved remarkable results using minimal labeled data, matching the performance of traditional systems that require full expert labeling while needing only 1% of the labeled data for tasks such as detecting nasal polyps and identifying nasopharyngeal cancer. These results demonstrate the potential of NaMA-Mamba to significantly improve the efficiency and accessibility of AI-assisted nasal disease diagnosis in clinical practice.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

NaMA-Mamba：在内窥镜图像上使用蒙面自动编码器与Mamba进行通用鼻疾病检测的基础模型

人工智能（AI）在分析鼻内窥镜图像以检测疾病方面显示出巨大的前景。然而，目前的人工智能系统需要针对每种特定医疗状况的大量专家标记数据，这限制了它们的应用。在这项工作中，通过两项关键创新来解决这一挑战，即创建第一个大规模鼻内窥镜图像预训练数据集，以及开发一种专门为鼻内窥镜设计的新型自学习人工智能系统，名为NaMA-Mamba。在NaMA-Mamba模型中，使用了两项关键技术，即用于分析图像序列的鼻内镜状态空间模型（NE-SSM）和用于捕获鼻组织精细细节的增强学习机制（CoMAE）。这些创新使系统能够有效地从未标记的图像中学习，同时在不同的诊断任务中保持高精度。在广泛的测试中，NaMA-Mamba使用最少的标记数据取得了显著的结果，与传统系统的性能相当，传统系统需要完整的专家标记，而检测鼻息肉和识别鼻咽癌等任务只需要1%的标记数据。这些结果证明了NaMA-Mamba在临床实践中显著提高人工智能辅助鼻部疾病诊断的效率和可及性的潜力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Computerized Medical Imaging and Graphics 医学-核医学

CiteScore

10.70

自引率

3.50%

发文量

审稿时长

26 days

期刊介绍： The purpose of the journal Computerized Medical Imaging and Graphics is to act as a source for the exchange of research results concerning algorithmic advances, development, and application of digital imaging in disease detection, diagnosis, intervention, prevention, precision medicine, and population health. Included in the journal will be articles on novel computerized imaging or visualization techniques, including artificial intelligence and machine learning, augmented reality for surgical planning and guidance, big biomedical data visualization, computer-aided diagnosis, computerized-robotic surgery, image-guided therapy, imaging scanning and reconstruction, mobile and tele-imaging, radiomics, and imaging integration and modeling with other information relevant to digital health. The types of biomedical imaging include: magnetic resonance, computed tomography, ultrasound, nuclear medicine, X-ray, microwave, optical and multi-photon microscopy, video and sensory imaging, and the convergence of biomedical images with other non-imaging datasets.