A TransUNet model with an adaptive fuzzy focal loss for medical image segmentation

IF 2.5 3区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Soft Computing Pub Date : 2024-07-24 DOI:10.1007/s00500-024-09953-z
Adrian Talamantes-Roman, Graciela Ramirez-Alonso, Fernando Gaxiola, Olanda Prieto-Ordaz, David R. Lopez-Flores
{"title":"A TransUNet model with an adaptive fuzzy focal loss for medical image segmentation","authors":"Adrian Talamantes-Roman, Graciela Ramirez-Alonso, Fernando Gaxiola, Olanda Prieto-Ordaz, David R. Lopez-Flores","doi":"10.1007/s00500-024-09953-z","DOIUrl":null,"url":null,"abstract":"<p>Segmentation of medical images is a critical step in assisting doctors in making accurate diagnoses and planning appropriate treatments. Deep learning architectures often serve as the basis for computer models used for this task. However, a common challenge faced by segmentation models is class imbalance, which leads to a bias towards classes with a larger number of pixels, resulting in reduced accuracy for the minority-class regions. To address this problem, the <span>\\(\\alpha \\)</span>-balanced variant of the focal loss function introduces a <span>\\(\\alpha \\)</span> modulation factor that reduces the weight assigned to majority classes and gives greater weight to minority classes. This study proposes the use of a fuzzy inference system to automatically adjust the <span>\\(\\alpha \\)</span> factor, rather than maintaining a fixed value as commonly implemented. The adaptive fuzzy focal loss (AFFL) achieves an appropriate adjustment in <span>\\(\\alpha \\)</span> by employing fifteen fuzzy rules. To evaluate the effectiveness of AFFL, we implement an encoder-decoder segmentation model based on the UNet and Transformer architectures (AFFL-TransUNet) using the CHAOS dataset. We compare the performance of seven segmentation models implemented using the same data partition and hardware equipment. A statistical analysis, considering the DICE coefficient metric, demonstrates that AFFL-TransUNet outperforms four baseline models and performs comparably to the remaining models. Remarkably, AFFL-TransUNet achieves this high performance while significantly reducing training processing time by 66.31–72.39%. This reduction is attributed to the fuzzy system that effectively adapts the <span>\\(\\alpha \\)</span> value of the loss function, stabilizing the model within just a few epochs.</p>","PeriodicalId":22039,"journal":{"name":"Soft Computing","volume":"62 1","pages":""},"PeriodicalIF":2.5000,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Soft Computing","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s00500-024-09953-z","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Segmentation of medical images is a critical step in assisting doctors in making accurate diagnoses and planning appropriate treatments. Deep learning architectures often serve as the basis for computer models used for this task. However, a common challenge faced by segmentation models is class imbalance, which leads to a bias towards classes with a larger number of pixels, resulting in reduced accuracy for the minority-class regions. To address this problem, the \(\alpha \)-balanced variant of the focal loss function introduces a \(\alpha \) modulation factor that reduces the weight assigned to majority classes and gives greater weight to minority classes. This study proposes the use of a fuzzy inference system to automatically adjust the \(\alpha \) factor, rather than maintaining a fixed value as commonly implemented. The adaptive fuzzy focal loss (AFFL) achieves an appropriate adjustment in \(\alpha \) by employing fifteen fuzzy rules. To evaluate the effectiveness of AFFL, we implement an encoder-decoder segmentation model based on the UNet and Transformer architectures (AFFL-TransUNet) using the CHAOS dataset. We compare the performance of seven segmentation models implemented using the same data partition and hardware equipment. A statistical analysis, considering the DICE coefficient metric, demonstrates that AFFL-TransUNet outperforms four baseline models and performs comparably to the remaining models. Remarkably, AFFL-TransUNet achieves this high performance while significantly reducing training processing time by 66.31–72.39%. This reduction is attributed to the fuzzy system that effectively adapts the \(\alpha \) value of the loss function, stabilizing the model within just a few epochs.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
带有自适应模糊焦点损失的 TransUNet 模型用于医学图像分割
医学图像分割是协助医生做出准确诊断和规划适当治疗的关键步骤。深度学习架构通常是用于这一任务的计算机模型的基础。然而,分割模型面临的一个常见挑战是类不平衡,这会导致偏向于像素数量较多的类,从而降低少数类区域的准确性。为了解决这个问题,焦点损失函数的(α)平衡变体引入了一个(α)调制因子,降低了分配给多数类的权重,给少数类更大的权重。本研究建议使用模糊推理系统来自动调整系数,而不是像通常的方法那样保持一个固定值。自适应模糊焦点损失(AFFL)通过采用 15 条模糊规则实现了对\(\alpha \)的适当调整。为了评估 AFFL 的有效性,我们使用 CHAOS 数据集实现了基于 UNet 和 Transformer 架构的编码器-解码器分割模型(AFFL-TransUNet)。我们比较了使用相同数据分区和硬件设备实施的七个分割模型的性能。根据 DICE 系数指标进行的统计分析表明,AFFL-TransUNet 的性能优于四个基准模型,与其余模型的性能相当。值得注意的是,AFFL-TransUNet 在实现这一高性能的同时,还将训练处理时间大幅缩短了 66.31%-72.39%。这种缩短归功于模糊系统,它能有效地调整损失函数的 \(α \) 值,从而在短短几个历时内稳定模型。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Soft Computing
Soft Computing 工程技术-计算机:跨学科应用
CiteScore
8.10
自引率
9.80%
发文量
927
审稿时长
7.3 months
期刊介绍: Soft Computing is dedicated to system solutions based on soft computing techniques. It provides rapid dissemination of important results in soft computing technologies, a fusion of research in evolutionary algorithms and genetic programming, neural science and neural net systems, fuzzy set theory and fuzzy systems, and chaos theory and chaotic systems. Soft Computing encourages the integration of soft computing techniques and tools into both everyday and advanced applications. By linking the ideas and techniques of soft computing with other disciplines, the journal serves as a unifying platform that fosters comparisons, extensions, and new applications. As a result, the journal is an international forum for all scientists and engineers engaged in research and development in this fast growing field.
期刊最新文献
Retraction Note: A probabilistic approach toward evaluation of Internet rumor on COVID. RETRACTED ARTICLE: A cognitive approach for blockchain-based cryptographic curve hash signature (BC-CCHS) technique to secure healthcare data in Data Lake. Handwritten text recognition and information extraction from ancient manuscripts using deep convolutional and recurrent neural network Optimizing green solid transportation with carbon cap and trade: a multi-objective two-stage approach in a type-2 Pythagorean fuzzy context Production chain modeling based on learning flow stochastic petri nets
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1