Self-supervised Semantic Segmentation: Consistency over Transformation.

Sanaz Karimijafarbigloo, Reza Azad, Amirhossein Kazerouni, Yury Velichko, Ulas Bagci, Dorit Merhof
DOI: 10.1109/ICCVW60793.2023.00280 (https://doi.org/10.1109/ICCVW60793.2023.00280)
Published in: ... IEEE International Conference on Computer Vision workshops. IEEE International Conference on Computer Vision, volume 2023, pages 2646-2655
Publication date: 2023-10-01 (Epub 2023/12/25)
Publication type: Journal Article
Full text (PDF): https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10829429/pdf/

Abstract

Accurate medical image segmentation is of utmost importance for enabling automated clinical decision procedures. However, prevailing supervised deep learning approaches for medical image segmentation encounter significant challenges due to their heavy dependence on extensive labeled training data. To tackle this issue, we propose a novel self-supervised algorithm, S³-Net, which integrates a robust framework based on the proposed Inception Large Kernel Attention (I-LKA) modules. This architectural enhancement makes it possible to comprehensively capture contextual information while preserving local intricacies, thereby enabling precise semantic segmentation. Furthermore, considering that lesions in medical images often exhibit deformations, we leverage deformable convolution as an integral component to effectively capture and delineate lesion deformations for superior object boundary definition. Additionally, our self-supervised strategy emphasizes the acquisition of invariance to affine transformations, which are commonly encountered in medical scenarios. This emphasis on robustness to geometric distortions significantly enhances the model's ability to handle such distortions. To enforce spatial consistency and promote the grouping of spatially connected image pixels with similar feature representations, we introduce a spatial consistency loss term. This aids the network in effectively capturing the relationships among neighboring pixels and enhances the overall segmentation quality. The S³-Net approach iteratively learns pixel-level feature representations for image content clustering in an end-to-end manner. Our experimental results on skin lesion and lung organ segmentation tasks show the superior performance of our method compared to state-of-the-art (SOTA) approaches.
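
The two self-supervised ingredients named in the abstract, consistency under a geometric transformation and spatial consistency among neighboring pixels, can be illustrated with a small NumPy sketch. This is not the S³-Net implementation: the abstract does not give the loss formulas, so the mean-squared gap, the L1 neighbor difference, every function name, and the toy per-pixel "model" below are assumptions chosen only to make the idea concrete.

```python
import numpy as np


def spatial_consistency_loss(features):
    """Mean L1 difference between adjacent pixel features (assumed form).

    features: array of shape (H, W, C). Low values mean that neighboring
    pixels carry similar feature vectors.
    """
    dy = np.abs(features[1:, :, :] - features[:-1, :, :]).mean()
    dx = np.abs(features[:, 1:, :] - features[:, :-1, :]).mean()
    return dx + dy


def transform_consistency_loss(model, image, transform):
    """Mean squared gap between model(T(x)) and T(model(x)) (assumed form)."""
    out_of_transformed = model(transform(image))
    transformed_output = transform(model(image))
    return np.mean((out_of_transformed - transformed_output) ** 2)


def toy_pixelwise_model(image):
    """Hypothetical stand-in for a network: per-pixel softmax over 2 clusters."""
    logits = np.stack([image, 1.0 - image], axis=-1)  # shape (H, W, 2)
    logits = logits - logits.max(axis=-1, keepdims=True)
    exp = np.exp(logits)
    return exp / exp.sum(axis=-1, keepdims=True)


def rotate_90(array):
    """A simple affine transform: rotate the two spatial axes by 90 degrees."""
    return np.rot90(array, k=1, axes=(0, 1))


rng = np.random.default_rng(0)
image = rng.random((8, 8))

# A purely per-pixel model commutes with rotation, so this gap is ~0.
loss_equivariance = transform_consistency_loss(toy_pixelwise_model, image, rotate_90)

# Constant features are perfectly smooth; features from a noisy image are not.
loss_smooth = spatial_consistency_loss(np.ones((8, 8, 2)))
loss_noisy = spatial_consistency_loss(toy_pixelwise_model(image))
```

In a real network with convolutions and attention, the transformation gap is generally nonzero, which is why penalizing it can act as a training signal; the per-pixel toy model is used here only because its behavior is easy to verify.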
