STC-UNet: renal tumor segmentation based on enhanced feature extraction at different network levels.

IF 2.9 3区 医学 Q2 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING BMC Medical Imaging Pub Date : 2024-07-19 DOI:10.1186/s12880-024-01359-5
Wei Hu, Shouyi Yang, Weifeng Guo, Na Xiao, Xiaopeng Yang, Xiangyang Ren
{"title":"STC-UNet: renal tumor segmentation based on enhanced feature extraction at different network levels.","authors":"Wei Hu, Shouyi Yang, Weifeng Guo, Na Xiao, Xiaopeng Yang, Xiangyang Ren","doi":"10.1186/s12880-024-01359-5","DOIUrl":null,"url":null,"abstract":"<p><p>Renal tumors are one of the common diseases of urology, and precise segmentation of these tumors plays a crucial role in aiding physicians to improve diagnostic accuracy and treatment effectiveness. Nevertheless, inherent challenges associated with renal tumors, such as indistinct boundaries, morphological variations, and uncertainties in size and location, segmenting renal tumors accurately remains a significant challenge in the field of medical image segmentation. With the development of deep learning, substantial achievements have been made in the domain of medical image segmentation. However, existing models lack specificity in extracting features of renal tumors across different network hierarchies, which results in insufficient extraction of renal tumor features and subsequently affects the accuracy of renal tumor segmentation. To address this issue, we propose the Selective Kernel, Vision Transformer, and Coordinate Attention Enhanced U-Net (STC-UNet). This model aims to enhance feature extraction, adapting to the distinctive characteristics of renal tumors across various network levels. Specifically, the Selective Kernel modules are introduced in the shallow layers of the U-Net, where detailed features are more abundant. By selectively employing convolutional kernels of different scales, the model enhances its capability to extract detailed features of renal tumors across multiple scales. Subsequently, in the deeper layers of the network, where feature maps are smaller yet contain rich semantic information, the Vision Transformer modules are integrated in a non-patch manner. These assist the model in capturing long-range contextual information globally. Their non-patch implementation facilitates the capture of fine-grained features, thereby achieving collaborative enhancement of global-local information and ultimately strengthening the model's extraction of semantic features of renal tumors. Finally, in the decoder segment, the Coordinate Attention modules embedding positional information are proposed aiming to enhance the model's feature recovery and tumor region localization capabilities. Our model is validated on the KiTS19 dataset, and experimental results indicate that compared to the baseline model, STC-UNet shows improvements of 1.60%, 2.02%, 2.27%, 1.18%, 1.52%, and 1.35% in IoU, Dice, Accuracy, Precision, Recall, and F1-score, respectively. Furthermore, the experimental results demonstrate that the proposed STC-UNet method surpasses other advanced algorithms in both visual effectiveness and objective evaluation metrics.</p>","PeriodicalId":9020,"journal":{"name":"BMC Medical Imaging","volume":null,"pages":null},"PeriodicalIF":2.9000,"publicationDate":"2024-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11264758/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Medical Imaging","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12880-024-01359-5","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}
引用次数: 0

Abstract

Renal tumors are one of the common diseases of urology, and precise segmentation of these tumors plays a crucial role in aiding physicians to improve diagnostic accuracy and treatment effectiveness. Nevertheless, inherent challenges associated with renal tumors, such as indistinct boundaries, morphological variations, and uncertainties in size and location, segmenting renal tumors accurately remains a significant challenge in the field of medical image segmentation. With the development of deep learning, substantial achievements have been made in the domain of medical image segmentation. However, existing models lack specificity in extracting features of renal tumors across different network hierarchies, which results in insufficient extraction of renal tumor features and subsequently affects the accuracy of renal tumor segmentation. To address this issue, we propose the Selective Kernel, Vision Transformer, and Coordinate Attention Enhanced U-Net (STC-UNet). This model aims to enhance feature extraction, adapting to the distinctive characteristics of renal tumors across various network levels. Specifically, the Selective Kernel modules are introduced in the shallow layers of the U-Net, where detailed features are more abundant. By selectively employing convolutional kernels of different scales, the model enhances its capability to extract detailed features of renal tumors across multiple scales. Subsequently, in the deeper layers of the network, where feature maps are smaller yet contain rich semantic information, the Vision Transformer modules are integrated in a non-patch manner. These assist the model in capturing long-range contextual information globally. Their non-patch implementation facilitates the capture of fine-grained features, thereby achieving collaborative enhancement of global-local information and ultimately strengthening the model's extraction of semantic features of renal tumors. Finally, in the decoder segment, the Coordinate Attention modules embedding positional information are proposed aiming to enhance the model's feature recovery and tumor region localization capabilities. Our model is validated on the KiTS19 dataset, and experimental results indicate that compared to the baseline model, STC-UNet shows improvements of 1.60%, 2.02%, 2.27%, 1.18%, 1.52%, and 1.35% in IoU, Dice, Accuracy, Precision, Recall, and F1-score, respectively. Furthermore, the experimental results demonstrate that the proposed STC-UNet method surpasses other advanced algorithms in both visual effectiveness and objective evaluation metrics.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
STC-UNet:基于不同网络级别的增强特征提取的肾肿瘤分割。
肾脏肿瘤是泌尿外科常见疾病之一,对这些肿瘤进行精确分割对帮助医生提高诊断准确性和治疗效果起着至关重要的作用。然而,由于肾脏肿瘤存在边界不清、形态变化、大小和位置不确定等固有挑战,因此准确分割肾脏肿瘤仍是医学图像分割领域的一项重大挑战。随着深度学习的发展,医学图像分割领域取得了巨大成就。然而,现有模型在提取不同网络层次的肾脏肿瘤特征时缺乏特异性,导致肾脏肿瘤特征提取不足,进而影响肾脏肿瘤分割的准确性。针对这一问题,我们提出了选择性内核、视觉变换器和坐标注意增强型 U-Net(STC-UNet)。该模型旨在增强特征提取,以适应不同网络层次上肾肿瘤的显著特征。具体来说,选择性内核模块被引入到 U-Net 的浅层,这里的细节特征更为丰富。通过选择性地使用不同尺度的卷积核,该模型增强了在多个尺度上提取肾肿瘤细节特征的能力。随后,在特征图较小但包含丰富语义信息的网络深层,以非补丁方式集成了视觉转换器模块。这些模块有助于模型在全局范围内捕捉远距离上下文信息。它们的非补丁实施有利于捕捉细粒度特征,从而实现全局-本地信息的协同增强,最终加强模型对肾脏肿瘤语义特征的提取。最后,在解码器部分,提出了嵌入位置信息的坐标注意模块,旨在增强模型的特征恢复和肿瘤区域定位能力。我们的模型在 KiTS19 数据集上进行了验证,实验结果表明,与基线模型相比,STC-UNet 在 IoU、Dice、Accuracy、Precision、Recall 和 F1-score 方面分别提高了 1.60%、2.02%、2.27%、1.18%、1.52% 和 1.35%。此外,实验结果表明,所提出的 STC-UNet 方法在视觉效果和客观评价指标上都超越了其他先进算法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
BMC Medical Imaging
BMC Medical Imaging RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING-
CiteScore
4.60
自引率
3.70%
发文量
198
审稿时长
27 weeks
期刊介绍: BMC Medical Imaging is an open access journal publishing original peer-reviewed research articles in the development, evaluation, and use of imaging techniques and image processing tools to diagnose and manage disease.
期刊最新文献
Computed tomography enterography radiomics and machine learning for identification of Crohn's disease. Computer vision applications for the detection or analysis of tuberculosis using digitised human lung tissue images - a systematic review. Computerized tomography features acting as predictors for invasive therapy in the management of Crohn's disease-related spontaneous intra-abdominal abscess: experience from long-term follow-up. Correction: CT coronary fractional flow reserve based on artificial intelligence using different software: a repeatability study. Diagnostic significance of ultrasound characteristics in discriminating follicular thyroid carcinoma from adenoma.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1