[Skin lesion classification with multi-level fusion of Swin-T and ConvNeXt].

Zetong Wang, Junhua Zhang, Xiao Wang
{"title":"[Skin lesion classification with multi-level fusion of Swin-T and ConvNeXt].","authors":"Zetong Wang, Junhua Zhang, Xiao Wang","doi":"10.7507/1001-5515.202305025","DOIUrl":null,"url":null,"abstract":"<p><p>Skin cancer is a significant public health issue, and computer-aided diagnosis technology can effectively alleviate this burden. Accurate identification of skin lesion types is crucial when employing computer-aided diagnosis. This study proposes a multi-level attention cascaded fusion model based on Swin-T and ConvNeXt. It employed hierarchical Swin-T and ConvNeXt to extract global and local features, respectively, and introduced residual channel attention and spatial attention modules for further feature extraction. Multi-level attention mechanisms were utilized to process multi-scale global and local features. To address the problem of shallow features being lost due to their distance from the classifier, a hierarchical inverted residual fusion module was proposed to dynamically adjust the extracted feature information. Balanced sampling strategies and focal loss were employed to tackle the issue of imbalanced categories of skin lesions. Experimental testing on the ISIC2018 and ISIC2019 datasets yielded accuracy, precision, recall, and F1-Score of 96.01%, 93.67%, 92.65%, and 93.11%, respectively, and 92.79%, 91.52%, 88.90%, and 90.15%, respectively. Compared to Swin-T, the proposed method achieved an accuracy improvement of 3.60% and 1.66%, and compared to ConvNeXt, it achieved an accuracy improvement of 2.87% and 3.45%. The experiments demonstrate that the proposed method accurately classifies skin lesion images, providing a new solution for skin cancer diagnosis.</p>","PeriodicalId":39324,"journal":{"name":"生物医学工程学杂志","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11208655/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"生物医学工程学杂志","FirstCategoryId":"1087","ListUrlMain":"https://doi.org/10.7507/1001-5515.202305025","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 0

Abstract

Skin cancer is a significant public health issue, and computer-aided diagnosis technology can effectively alleviate this burden. Accurate identification of skin lesion types is crucial when employing computer-aided diagnosis. This study proposes a multi-level attention cascaded fusion model based on Swin-T and ConvNeXt. It employed hierarchical Swin-T and ConvNeXt to extract global and local features, respectively, and introduced residual channel attention and spatial attention modules for further feature extraction. Multi-level attention mechanisms were utilized to process multi-scale global and local features. To address the problem of shallow features being lost due to their distance from the classifier, a hierarchical inverted residual fusion module was proposed to dynamically adjust the extracted feature information. Balanced sampling strategies and focal loss were employed to tackle the issue of imbalanced categories of skin lesions. Experimental testing on the ISIC2018 and ISIC2019 datasets yielded accuracy, precision, recall, and F1-Score of 96.01%, 93.67%, 92.65%, and 93.11%, respectively, and 92.79%, 91.52%, 88.90%, and 90.15%, respectively. Compared to Swin-T, the proposed method achieved an accuracy improvement of 3.60% and 1.66%, and compared to ConvNeXt, it achieved an accuracy improvement of 2.87% and 3.45%. The experiments demonstrate that the proposed method accurately classifies skin lesion images, providing a new solution for skin cancer diagnosis.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
[利用 Swin-T 和 ConvNeXt 的多级融合进行皮肤病变分类]。
皮肤癌是一个重大的公共卫生问题,而计算机辅助诊断技术可以有效减轻这一负担。在使用计算机辅助诊断时,准确识别皮肤病变类型至关重要。本研究提出了一种基于 Swin-T 和 ConvNeXt 的多级注意级联融合模型。它采用分层 Swin-T 和 ConvNeXt 分别提取全局和局部特征,并引入剩余通道注意和空间注意模块以进一步提取特征。多级注意机制用于处理多尺度的全局和局部特征。为解决浅层特征因距离分类器较远而丢失的问题,提出了分层倒残差融合模块,以动态调整提取的特征信息。针对皮损类别不平衡的问题,采用了平衡采样策略和病灶损失。在 ISIC2018 和 ISIC2019 数据集上进行的实验测试得出的准确率、精确度、召回率和 F1-Score 分别为 96.01%、93.67%、92.65% 和 93.11%,以及 92.79%、91.52%、88.90% 和 90.15%。与 Swin-T 相比,拟议方法的准确率分别提高了 3.60% 和 1.66%;与 ConvNeXt 相比,拟议方法的准确率分别提高了 2.87% 和 3.45%。实验证明,所提出的方法能准确地对皮肤病变图像进行分类,为皮肤癌诊断提供了一种新的解决方案。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
生物医学工程学杂志
生物医学工程学杂志 Medicine-Medicine (all)
CiteScore
0.80
自引率
0.00%
发文量
4868
期刊介绍:
期刊最新文献
[An ensemble model for assisting early Alzheimer's disease diagnosis based on structural magnetic resonance imaging with dual-time-point fusion]. [An identification method of chromatin topological associated domains based on spatial density clustering]. [Design and performance study of bone trabecular scaffolds based on triply periodic minimal surface method]. [Development of flexible multi-phase barium titanate piezoelectric sensor for physiological health and action behavior monitoring]. [Dielectric properties of tidal volume changes in rabbit lung tissue in the 100 MHz~1 GHz band].
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1