A TransUNet model with an adaptive fuzzy focal loss for medical image segmentation

IF 2.5 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Soft Computing Pub Date : 2024-07-24 DOI:10.1007/s00500-024-09953-z

Adrian Talamantes-Roman, Graciela Ramirez-Alonso, Fernando Gaxiola, Olanda Prieto-Ordaz, David R. Lopez-Flores

{"title":"A TransUNet model with an adaptive fuzzy focal loss for medical image segmentation","authors":"Adrian Talamantes-Roman, Graciela Ramirez-Alonso, Fernando Gaxiola, Olanda Prieto-Ordaz, David R. Lopez-Flores","doi":"10.1007/s00500-024-09953-z","DOIUrl":null,"url":null,"abstract":"Segmentation of medical images is a critical step in assisting doctors in making accurate diagnoses and planning appropriate treatments. Deep learning architectures often serve as the basis for computer models used for this task. However, a common challenge faced by segmentation models is class imbalance, which leads to a bias towards classes with a larger number of pixels, resulting in reduced accuracy for the minority-class regions. To address this problem, the \\(\\alpha \\)-balanced variant of the focal loss function introduces a \\(\\alpha \\) modulation factor that reduces the weight assigned to majority classes and gives greater weight to minority classes. This study proposes the use of a fuzzy inference system to automatically adjust the \\(\\alpha \\) factor, rather than maintaining a fixed value as commonly implemented. The adaptive fuzzy focal loss (AFFL) achieves an appropriate adjustment in \\(\\alpha \\) by employing fifteen fuzzy rules. To evaluate the effectiveness of AFFL, we implement an encoder-decoder segmentation model based on the UNet and Transformer architectures (AFFL-TransUNet) using the CHAOS dataset. We compare the performance of seven segmentation models implemented using the same data partition and hardware equipment. A statistical analysis, considering the DICE coefficient metric, demonstrates that AFFL-TransUNet outperforms four baseline models and performs comparably to the remaining models. Remarkably, AFFL-TransUNet achieves this high performance while significantly reducing training processing time by 66.31–72.39%. This reduction is attributed to the fuzzy system that effectively adapts the \\(\\alpha \\) value of the loss function, stabilizing the model within just a few epochs.","PeriodicalId":22039,"journal":{"name":"Soft Computing","volume":"62 1","pages":""},"PeriodicalIF":2.5000,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Soft Computing","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s00500-024-09953-z","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Segmentation of medical images is a critical step in assisting doctors in making accurate diagnoses and planning appropriate treatments. Deep learning architectures often serve as the basis for computer models used for this task. However, a common challenge faced by segmentation models is class imbalance, which leads to a bias towards classes with a larger number of pixels, resulting in reduced accuracy for the minority-class regions. To address this problem, the \(\alpha \)-balanced variant of the focal loss function introduces a \(\alpha \) modulation factor that reduces the weight assigned to majority classes and gives greater weight to minority classes. This study proposes the use of a fuzzy inference system to automatically adjust the \(\alpha \) factor, rather than maintaining a fixed value as commonly implemented. The adaptive fuzzy focal loss (AFFL) achieves an appropriate adjustment in \(\alpha \) by employing fifteen fuzzy rules. To evaluate the effectiveness of AFFL, we implement an encoder-decoder segmentation model based on the UNet and Transformer architectures (AFFL-TransUNet) using the CHAOS dataset. We compare the performance of seven segmentation models implemented using the same data partition and hardware equipment. A statistical analysis, considering the DICE coefficient metric, demonstrates that AFFL-TransUNet outperforms four baseline models and performs comparably to the remaining models. Remarkably, AFFL-TransUNet achieves this high performance while significantly reducing training processing time by 66.31–72.39%. This reduction is attributed to the fuzzy system that effectively adapts the \(\alpha \) value of the loss function, stabilizing the model within just a few epochs.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

带有自适应模糊焦点损失的 TransUNet 模型用于医学图像分割

医学图像分割是协助医生做出准确诊断和规划适当治疗的关键步骤。深度学习架构通常是用于这一任务的计算机模型的基础。然而，分割模型面临的一个常见挑战是类不平衡，这会导致偏向于像素数量较多的类，从而降低少数类区域的准确性。为了解决这个问题，焦点损失函数的（α）平衡变体引入了一个（α）调制因子，降低了分配给多数类的权重，给少数类更大的权重。本研究建议使用模糊推理系统来自动调整系数，而不是像通常的方法那样保持一个固定值。自适应模糊焦点损失（AFFL）通过采用 15 条模糊规则实现了对\(\alpha \)的适当调整。为了评估 AFFL 的有效性，我们使用 CHAOS 数据集实现了基于 UNet 和 Transformer 架构的编码器-解码器分割模型（AFFL-TransUNet）。我们比较了使用相同数据分区和硬件设备实施的七个分割模型的性能。根据 DICE 系数指标进行的统计分析表明，AFFL-TransUNet 的性能优于四个基准模型，与其余模型的性能相当。值得注意的是，AFFL-TransUNet 在实现这一高性能的同时，还将训练处理时间大幅缩短了 66.31%-72.39%。这种缩短归功于模糊系统，它能有效地调整损失函数的 \(α \) 值，从而在短短几个历时内稳定模型。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Soft Computing 工程技术-计算机：跨学科应用

CiteScore

8.10

自引率

9.80%

发文量

927

审稿时长

7.3 months

期刊介绍： Soft Computing is dedicated to system solutions based on soft computing techniques. It provides rapid dissemination of important results in soft computing technologies, a fusion of research in evolutionary algorithms and genetic programming, neural science and neural net systems, fuzzy set theory and fuzzy systems, and chaos theory and chaotic systems. Soft Computing encourages the integration of soft computing techniques and tools into both everyday and advanced applications. By linking the ideas and techniques of soft computing with other disciplines, the journal serves as a unifying platform that fosters comparisons, extensions, and new applications. As a result, the journal is an international forum for all scientists and engineers engaged in research and development in this fast growing field.