Mamba-DDPM-BSA: Diffusion model based boundary sampling algorithm for imbalanced classification

IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Expert Systems with Applications Pub Date : 2025-05-15 Epub Date: 2025-02-23 DOI:10.1016/j.eswa.2025.126926
Fan Zhang , Quan Yuan , Xinhong Zhang
{"title":"Mamba-DDPM-BSA: Diffusion model based boundary sampling algorithm for imbalanced classification","authors":"Fan Zhang ,&nbsp;Quan Yuan ,&nbsp;Xinhong Zhang","doi":"10.1016/j.eswa.2025.126926","DOIUrl":null,"url":null,"abstract":"<div><div>Data category imbalance is one of the major challenges in the field of medical image classification. This imbalance seriously affects the accuracy and reliability of the classification model, posing potential risks to doctors’ diagnosis and treatment. This paper proposes a Mamba-DDPM-BSA method to address the imbalanced classification issue of medical image. Firstly, the generative model Mamba-DDPM is designed for the synthesis of medical image samples. It utilizes Mamba’s global modeling capability and linear computational efficiency to improve the quality of generated samples by improving DDPM (Denoising Diffusion Probabilistic Model). Secondly, by oversampling training samples in boundary regions, the proposed Boundary Sampling Algorithm (BSA) enables synthesizer focuses more on decision boundary areas when fitting sample distributions. This approach generates more samples near the decision boundary, pushing the decision boundary that originally intrudes into the minority class distribution towards the true distribution. Finally, a Mamba-DDPM-BSA method is proposed, which adopts an interactive synthesis method and makes full use of diffusion generation model and Boundary Sampling Algorithm to interact with the classification model, aiming to synthesize images that target the defects of the classification model to improve the discriminative ability and robustness of the classifier. Experiments based on HAM10000 data set show that Mamba-DDPM-BSA reaches 81.03%, 82.14%, and 82.71% on Matthew’s correlation coefficient, Balanced Accuracy, and Macro F1, respectively. The proposed method is superior to the traditional imbalanced classification method.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"274 ","pages":"Article 126926"},"PeriodicalIF":7.5000,"publicationDate":"2025-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417425005482","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/2/23 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Data category imbalance is one of the major challenges in the field of medical image classification. This imbalance seriously affects the accuracy and reliability of the classification model, posing potential risks to doctors’ diagnosis and treatment. This paper proposes a Mamba-DDPM-BSA method to address the imbalanced classification issue of medical image. Firstly, the generative model Mamba-DDPM is designed for the synthesis of medical image samples. It utilizes Mamba’s global modeling capability and linear computational efficiency to improve the quality of generated samples by improving DDPM (Denoising Diffusion Probabilistic Model). Secondly, by oversampling training samples in boundary regions, the proposed Boundary Sampling Algorithm (BSA) enables synthesizer focuses more on decision boundary areas when fitting sample distributions. This approach generates more samples near the decision boundary, pushing the decision boundary that originally intrudes into the minority class distribution towards the true distribution. Finally, a Mamba-DDPM-BSA method is proposed, which adopts an interactive synthesis method and makes full use of diffusion generation model and Boundary Sampling Algorithm to interact with the classification model, aiming to synthesize images that target the defects of the classification model to improve the discriminative ability and robustness of the classifier. Experiments based on HAM10000 data set show that Mamba-DDPM-BSA reaches 81.03%, 82.14%, and 82.71% on Matthew’s correlation coefficient, Balanced Accuracy, and Macro F1, respectively. The proposed method is superior to the traditional imbalanced classification method.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Mamba-DDPM-BSA:基于扩散模型的不平衡分类边界采样算法
数据分类不平衡是医学图像分类领域面临的主要挑战之一。这种不平衡严重影响了分类模型的准确性和可靠性,给医生的诊断和治疗带来了潜在风险。针对医学图像分类不平衡问题,提出了一种Mamba-DDPM-BSA方法。首先,设计了用于医学图像样本合成的生成模型Mamba-DDPM。它利用Mamba的全局建模能力和线性计算效率,通过改进DDPM(去噪扩散概率模型)来提高生成样本的质量。其次,本文提出的边界采样算法(boundary Sampling Algorithm, BSA)通过在边界区域对训练样本进行过采样,使合成器在拟合样本分布时更加关注决策边界区域。这种方法在决策边界附近产生更多的样本,将原本侵入少数类分布的决策边界推向真实分布。最后,提出了Mamba-DDPM-BSA方法,该方法采用交互式合成方法,充分利用扩散生成模型和边界采样算法与分类模型进行交互,旨在合成针对分类模型缺陷的图像,提高分类器的判别能力和鲁棒性。基于HAM10000数据集的实验表明,mamba - ddppm - bsa在Matthew’s correlation coefficient、Balanced Accuracy和Macro F1上分别达到81.03%、82.14%和82.71%。该方法优于传统的不平衡分类方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Expert Systems with Applications
Expert Systems with Applications 工程技术-工程:电子与电气
CiteScore
13.80
自引率
10.60%
发文量
2045
审稿时长
8.7 months
期刊介绍: Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.
期刊最新文献
Topology-inspired metric for detecting potential defects in lithography An interpretable intrusion detection framework based on ensemble neural networks for dynamic network environments Multi-sequence parotid gland lesion segmentation via expert text-guided segment anything model Federated learning of diffusion networks A collaborative optimization framework for efficient long-sequence Audio-Visual understanding
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1