HMedCaps: a new hybrid capsule network architecture for complex medical images

Neural Computing and Applications Pub Date : 2024-08-14 DOI:10.1007/s00521-024-10147-9

Sumeyra Busra Sengul, Ilker Ali Ozkan

{"title":"HMedCaps: a new hybrid capsule network architecture for complex medical images","authors":"Sumeyra Busra Sengul, Ilker Ali Ozkan","doi":"10.1007/s00521-024-10147-9","DOIUrl":null,"url":null,"abstract":"<p>Recognizing and analyzing medical images is crucial for disease early detection and treatment planning with appropriate treatment options based on the patient's individual needs and disease history. Deep learning technologies are widely used in the field of healthcare because they can analyze images rapidly and precisely. However, because each object on the image has the potential to hold illness information in medical images, it is critical to analyze the images with minimal information loss. In this context, Capsule Network (CapsNet) architecture is an important approach that aims to reduce information loss by storing the location and properties of objects in images as capsules. However, because CapsNet maintains information on each object in the image, the existence of several objects in complicated images can impair CapsNet's performance. This work proposes a new model called HMedCaps to improve the performance of CapsNet. In the proposed model, it is aimed to develop a deeper and hybrid structure by using Residual Block and FractalNet module together in the feature extraction layer. While it is aimed to obtain rich feature maps by increasing the number of features extracted by deepening the network, it is aimed to prevent the vanishing gradient problem that may occur in the network with increasing depth with these modules with skip connections. Furthermore, a new squash function is proposed to make distinctive capsules more prominent by customizing capsule activation. The CIFAR10 dataset of complex images, RFMiD dataset of retinal images, and Blood Cell Count Dataset dataset of blood cell images were used to evaluate the study. When the proposed model was compared with the basic CapsNet and studies in the literature, it was observed that the performance in complex images was improved and more accurate classification results were obtained in the field of medical image analysis. The proposed hybrid HMedCaps architecture has the potential to make more accurate diagnoses in the field of medical image analysis.</p>","PeriodicalId":18925,"journal":{"name":"Neural Computing and Applications","volume":"37 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neural Computing and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s00521-024-10147-9","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Recognizing and analyzing medical images is crucial for disease early detection and treatment planning with appropriate treatment options based on the patient's individual needs and disease history. Deep learning technologies are widely used in the field of healthcare because they can analyze images rapidly and precisely. However, because each object on the image has the potential to hold illness information in medical images, it is critical to analyze the images with minimal information loss. In this context, Capsule Network (CapsNet) architecture is an important approach that aims to reduce information loss by storing the location and properties of objects in images as capsules. However, because CapsNet maintains information on each object in the image, the existence of several objects in complicated images can impair CapsNet's performance. This work proposes a new model called HMedCaps to improve the performance of CapsNet. In the proposed model, it is aimed to develop a deeper and hybrid structure by using Residual Block and FractalNet module together in the feature extraction layer. While it is aimed to obtain rich feature maps by increasing the number of features extracted by deepening the network, it is aimed to prevent the vanishing gradient problem that may occur in the network with increasing depth with these modules with skip connections. Furthermore, a new squash function is proposed to make distinctive capsules more prominent by customizing capsule activation. The CIFAR10 dataset of complex images, RFMiD dataset of retinal images, and Blood Cell Count Dataset dataset of blood cell images were used to evaluate the study. When the proposed model was compared with the basic CapsNet and studies in the literature, it was observed that the performance in complex images was improved and more accurate classification results were obtained in the field of medical image analysis. The proposed hybrid HMedCaps architecture has the potential to make more accurate diagnoses in the field of medical image analysis.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

HMedCaps：用于复杂医学图像的新型混合胶囊网络架构

识别和分析医学影像对于疾病的早期检测以及根据患者的个人需求和病史制定适当的治疗方案至关重要。深度学习技术能够快速、精确地分析图像，因此在医疗保健领域得到了广泛应用。然而，由于图像上的每个物体都有可能包含医疗图像中的疾病信息，因此在分析图像时尽量减少信息丢失至关重要。在这种情况下，胶囊网络（CapsNet）架构是一种重要的方法，旨在通过将图像中物体的位置和属性存储为胶囊来减少信息丢失。然而，由于 CapsNet 维护图像中每个物体的信息，复杂图像中存在多个物体会影响 CapsNet 的性能。本研究提出了一种名为 HMedCaps 的新模型来提高 CapsNet 的性能。在提出的模型中，旨在通过在特征提取层中同时使用残差块和分形网模块来开发一种更深层次的混合结构。其目的是通过加深网络来增加特征提取的数量，从而获得丰富的特征图，同时也是为了防止随着网络深度的增加，这些具有跳转连接的模块可能会出现梯度消失的问题。此外，还提出了一种新的压扁函数，通过定制胶囊激活，使独特的胶囊更加突出。研究使用了复杂图像的 CIFAR10 数据集、视网膜图像的 RFMiD 数据集和血细胞计数数据集来进行评估。将所提出的模型与基本的 CapsNet 和文献中的研究进行比较后发现，在医学图像分析领域，所提出的模型在复杂图像中的性能得到了提高，并获得了更准确的分类结果。所提出的混合 HMedCaps 架构有望在医学图像分析领域做出更准确的诊断。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Neural Computing and Applications

自引率

0.00%

发文量